
[📢 Upcoming AI Entrepreneurship Series Talk]
Title:
Running AI is Harder Than Training It: The Engineering Behind Inference
Speaker:
Sidharth Shanker
Location:
Davis Auditorium
Date/Time:
Thursday, April 2, 2026, 11:30 AM ET
Bio:
Sidharth Shanker leads the Core Product Engineering team at Baseten, where he focuses on building a robust and scalable platform for deploying and serving machine learning models in production. With over a decade of experience in software engineering, he has worked across a range of industries, including e-commerce, genomics, and social media, developing systems that power real-world applications at scale.
At Baseten, Sidharth is particularly interested in the challenges of inference infrastructure—ensuring models are served securely, reliably, and efficiently to end users. His work sits at the intersection of machine learning systems and developer experience, with an emphasis on making advanced AI capabilities accessible in production environments.
Abstract:
In this talk, Sidharth will explore why deploying AI systems in real-world applications presents challenges distinct from model training. He will discuss the engineering complexities behind inference, including serving models securely, reliably, and at scale, and provide insight into the hidden systems work behind a single request to a large language model.

English







