Posts by Tags

Building Blocks for Deep Learning: Crafting an MLP with Triton

14 minute read

Published: December 26, 2024

In our previous post, we dipped our toes into the world of Triton by writing simple custom kernels. We learned how to add a scalar to a tensor and perform element-wise multiplication, understanding the fundamental concepts of launching kernels and managing GPU memory. Read more

Building Blocks for Deep Learning: Crafting an MLP with Triton

14 minute read

Published: December 26, 2024

In our previous post, we dipped our toes into the world of Triton by writing simple custom kernels. We learned how to add a scalar to a tensor and perform element-wise multiplication, understanding the fundamental concepts of launching kernels and managing GPU memory. Read more

Building Blocks for Deep Learning: Your First Custom Kernel with Triton

14 minute read

Published: December 25, 2024

So, you’ve heard the buzz about OpenAI’s Triton and its ability to let you craft custom GPU kernels? Maybe you’re facing performance bottlenecks with standard deep learning operations, or perhaps you have a unique algorithm that existing libraries don’t quite cover. Whatever your reason, you’re in the right place! This winter break, I decided to dive into the fascinating world of high-performance GPU programming, and one of the most exciting tools I’ve discovered is Triton. Read more

Building Blocks for Deep Learning: Crafting an MLP with Triton

14 minute read

Published: December 26, 2024

In our previous post, we dipped our toes into the world of Triton by writing simple custom kernels. We learned how to add a scalar to a tensor and perform element-wise multiplication, understanding the fundamental concepts of launching kernels and managing GPU memory. Read more

Building Blocks for Deep Learning: Crafting an MLP with Triton

14 minute read

Published: December 26, 2024

In our previous post, we dipped our toes into the world of Triton by writing simple custom kernels. We learned how to add a scalar to a tensor and perform element-wise multiplication, understanding the fundamental concepts of launching kernels and managing GPU memory. Read more

Building Blocks for Deep Learning: Your First Custom Kernel with Triton

14 minute read

Published: December 25, 2024

So, you’ve heard the buzz about OpenAI’s Triton and its ability to let you craft custom GPU kernels? Maybe you’re facing performance bottlenecks with standard deep learning operations, or perhaps you have a unique algorithm that existing libraries don’t quite cover. Whatever your reason, you’re in the right place! This winter break, I decided to dive into the fascinating world of high-performance GPU programming, and one of the most exciting tools I’ve discovered is Triton. Read more

Danial Kamali

Posts by Tags

Deep Learning

Building Blocks for Deep Learning: Crafting an MLP with Triton

GPU Kernels

Building Blocks for Deep Learning: Crafting an MLP with Triton

GPU programming

Building Blocks for Deep Learning: Your First Custom Kernel with Triton

Matrix Multiplication

Building Blocks for Deep Learning: Crafting an MLP with Triton

Triton

Building Blocks for Deep Learning: Crafting an MLP with Triton

Building Blocks for Deep Learning: Your First Custom Kernel with Triton