Staff Software Engineer, ML Performance & Systems
Designs and implements novel model serving architectures on in-house inference engine to maximize throughput and minimize latency for generative media models. Develops performance tools and collaborates with ML teams on Nvidia-based systems optimizations.