Staff Software Engineer
WalmartMigrated Walmart Search clusters from single-tenant to secure multi-tenant Kubernetes model serving, improving availability to 99.9% and reducing inference latency by 25%. Designed multi-tier inference routing across multi-tenant, multi-locale environments supporting hundreds of millions of daily queries. Engineered query normalization and tokenization adopted across Search teams, and established architecture governance and distributed systems training programs.