Gurwinder - AI & GPU

All Stories

DirectML platform architecture showing Direct3D 12 integration for GPU-accelerated machine learning

Deep Learning for Graphics Programmers: Performing Tensor Operations with DirectML and Direct3D 12

In the rapidly evolving landscape of machine learning and artificial intelligence, harnessing the …

Gurwinder Jul 14, 2024 · 9 min read

Side-by-side code comparison of matrix multiplication in SYCL, OpenCL, and CUDA

Comparing SYCL, OpenCL, and CUDA: Matrix Multiplication Example

Matrix multiplication is a core operation in scientific and engineering applications, often …

Gurwinder Jul 5, 2024 · 7 min read

DirectX 12 graphics pipeline diagram showing command lists, PSOs, root signatures, and descriptor heaps

Game Development

Intro to DirectX 12 Pipeline

DirectX 12 organizes graphics rendering into pipelines. Components of the DirectX 12 pipeline: Command …

Gurwinder Jul 3, 2024 · 4 min read

PyTorch Profiler chrome trace view showing operator execution timeline for ResNet model

Profiling ResNet Models with PyTorch Profiler for Performance Optimization

In the realm of deep learning, model performance is paramount. Whether you’re working on image …

Gurwinder Apr 2, 2024 · 3 min read

Benchmark comparison chart of PyTorch, TorchScript, and ONNX inference throughput on Intel Arc 770

Accelerating Deep Learning Inference on Intel Arc 770: ONNX and PyTorch Go Head-to-Head

When deploying deep learning models, the choice of framework can significantly impact performance. …

Gurwinder Mar 1, 2024 · 5 min read

PyTorch benchmark graph showing warmup iteration overhead vs stabilized inference time

Warmup Wisdom: Accurate PyTorch Benchmarking Made Simple!

In the realm of PyTorch model benchmarking, achieving accurate results is paramount for gauging …

Gurwinder Feb 10, 2024 · 3 min read

PresentMon frame time graph showing per-frame performance metrics for FPS calculation

Game Development

Mastering Frame Rates: Discover the True FPS with PresentMon

PresentMon is a tool used for capturing frame time data during application runtime, which can then …

Gurwinder Jan 10, 2024 · 3 min read

ONNX computation graph diagram showing nodes, tensors, and data flow between operations

Delving into ONNX: Comprehending Computation Graphs and Structure

ONNX (Open Neural Network Exchange) is an open-source format designed to represent machine learning …

Gurwinder Jun 13, 2023 · 5 min read

Unreal Engine UI widget showing health and mana progress bars bound to Blueprint variables

Game Development

Designing Dynamic RPG Systems: Quick Solutions for Health and Mana in Unreal Engine

Add two float variables, ‘Health’ and ‘Mana’, to the Third Person Controller Blueprint.
Create a new …

Gurwinder Jan 7, 2023 · 2 min read