Chat with Philippe Tillet, Author of Triton

MLSys Reading Group.

Summary. Triton is an open source python library for writing highly efficient GPU code. It requires no CUDA expertise and its performance is on-par with expert hand-tuned kernel libraries.

We document a summary of our exchange with Philippe. We write this summary based on a combination of notes & memory so this is definitely not a precise characterization of our conversation, but we try our best.

There are a few suggestions Philippe gave for making future compilers. The editor of this summary believes that much of these lessons can generalize to building any user-facing research tools/products.

Q/A Highlights: