Building Blocks for Deep Learning: Crafting an MLP with Triton
Published:
In our previous post, we dipped our toes into the world of Triton by writing simple custom kernels. We learned how to add a scalar to a tensor and perform element-wise multiplication, understanding the fundamental concepts of launching kernels and managing GPU memory. Read more