Posts

A thumbnail image

Deep Learning for Graphics Programmers: Performing Tensor Operations with DirectML and Direct3D 12

In the rapidly evolving landscape of machine learning and artificial intelligence, harnessing the …

A thumbnail image

Comparing SYCL, OpenCL, and CUDA: Matrix Multiplication Example

Matrix multiplication is a core operation in scientific and engineering applications, often …

A thumbnail image

Intro to DirectX 12 Pipeline

DirectX 12 organizes graphics rendering into pipelines.

Components of DirectX 12 Pipeline:

  1. Command …

A thumbnail image

The Simple Path to PyTorch Graphs: Dynamo and AOT Autograd Explained

Graph acquisition in PyTorch refers to the process of creating and managing the computational graph …

A thumbnail image

Profiling ResNet Models with PyTorch Profiler for Performance Optimization

In the realm of deep learning, model performance is paramount. Whether you’re working on image …

A thumbnail image

Accelerating Deep Learning Inference on Intel Arc 770: ONNX and PyTorch Go Head-to-Head

When deploying deep learning models, the choice of framework can significantly impact performance. …