Making AI work in the real world

Serving Infernce, Surviving Production (Reliably)

Rochester, New York

Currently at

Previously worked at

About

Systems thinking grounded by deployment reality

Ajay is a Machine Learning Engineer and researcher focused on LLM inference, ML systems in production, and performance optimization, with over five years of experience building and scaling AI products at B2B startups.

A large part of that work involved edge-based computer vision deployments with hard hardware limits, distributed camera fleets, and the kind of operational failures that do not show up in tidy benchmark papers. That experience now informs a broader interest in inference systems, serving constraints, and model efficiency across modern ML workloads.

Current work at Rochester Institute of Technology builds on those production constraints, with emphasis on efficient inference, deployment-aware system design, and making advanced models practical in production environments.

5+ years Production ML and computer vision engineering
225 sites Largest edge deployment footprint referenced in the portfolio copy
RIT M.S. Graduate research in artificial intelligence, currently in progress
News

Recent updates

Updates on research, publications, and production-facing ML work.

Jan 2026

Paper accepted to ICLR 2026: ProReGen: Progressive Residual Generation Under Attribute Correlations.

May 2025

First submission to NeurIPS, details will be posted soon! (Oops, rejection)

Mar 2025

Acceptance to AWARE-AI Research Traineeship (NRT)