
Coding agents like Claude Code excel at generating GPU kernels when given performance bottlenecks, but they struggle to diagnose them independently. The nCompass agent bridges this gap by identifying performance bottlenecks in code and suggesting strategies to resolve them. By supercharging Claude Code with our agent, we generated matrix multiplication kernels 3% faster than NVIDIA's CUTLASS kernels in a single session. Your turn.
Code/Dev