Applied AI Evaluation & ML Infrastructure
Independent consulting on applied AI evaluation, ML infrastructure, and production AI systems. Engagements are typically scoped to 10–80 hours, depending on complexity. For rates and availability, reach out via the contact form.
What I Do
Production-grade AI engineering for teams that need to ship, not just prototype.
LLM Evaluation
Eval frameworks that surface the failure modes that matter — not just benchmark scores.
ML Infrastructure
Data pipelines, MLflow/Prefect workflows, and internal tooling built for long-term maintainability.
Voice AI & ASR
Speech pipeline architecture, API integration, and production deployment of voice systems that hold up under real-world load.
AI Training & Workshops
Facilitated training for engineering and product teams — in-person or remote, with reusable materials.
Start a conversation
Send a brief description of the work and I’ll come back with scope and pricing. I typically respond within 24 hours.
Contact Me →