0%

Anmol Mann

AI Engineering Leader | Production ML Systems & MLOps

WHO I AM

Professional Profile

Hands-on AI Engineering Leader with deep expertise in building and operationalizing production-grade ML systems and scalable AI infrastructure. Proven track record leading global engineering teams, driving MLOps practices (monitoring, automation, deployment support, incident pipelines), and delivering high-impact solutions for frontier AI customers.

Excel at technical architecture decisions, model evaluation through rigorous metrics and quality processes, cross-functional collaboration with Product/Engineering stakeholders, and fostering high-performance team environments. Passionate about translating AI capabilities into reliable, scalable, business-critical systems.

CAREER HISTORY

Professional Experience

Manager, AI Global Customer Support Engineering

CoreWeave — Remote / Toronto Area  |  2023 – Present

  • Provided technical leadership for production AI infrastructure, including database migration strategies, rate-limiting implementations, security hardening, and large-scale storage management (63TB capacity, 1TB/day growth).
  • Built and led a global team of 8+ engineers (NA, EMEA, APAC), scaling from 2–3 to 16+ while establishing 24/7 operations, performance management, career development, and 2026 headcount planning (15 total roles).
  • Drove MLOps practices through W&B and monitoring (DataDog), automation systems (JiraGinie, ReproBot, Support Bot handling 70%+ tickets), incident response pipelines (9 incidents managed via new INCI bot), and platform migrations.
  • Partnered with Engineering, Product, and Revenue teams on roadmap input, P0/P1 escalations for multi-million-dollar deals, executive stakeholder management, and proactive customer success for Tier 0/1 accounts.
  • Conducted tool evaluations and integrations (Rootly vs PagerDuty) and enforced quality standards (ticket reviews, DataDog usage, handoff protocols).
  • Managed budget, resource allocation, and capacity planning for cloud-native AI services (GCP project costs, tool licensing, shift balancing).
  • Implemented and deployed inci bot for better incident management at Weights & Biases.

Technologies: Python, Golang, SQL, Docker, DataDog, Zendesk, PagerDuty, large-scale distributed systems, GenAI integrations (Codex, Cursor).

Manager, AI Customer Support Engineering

Weights & Biases (acquired by CoreWeave in 2025) — Remote  |  2022 – 2023

  • Delivered technical leadership and hands-on support for enterprise ML/GenAI workflows, collaborating with Product and Engineering on feature feedback, roadmap alignment, and customer onboarding.
  • Mentored and managed Tier 1/2 ML engineers, drove user adoption, monitored customer health, and achieved 98% CSAT with 22.1-hour resolution times.
  • Implemented model evaluation and quality processes through QA testing (Cypress), issue reproduction, metrics tracking, and knowledge base development.

Technologies: PyTorch, TensorFlow, GenAI, W&B platform, Python, SQL, Docker, Cypress.

Machine Learning Support Engineer — Tier 2

Weights & Biases (acquired by CoreWeave in 2025) — Remote  |  2021 – 2022

  • Provided hands-on troubleshooting and resolution for complex production ML workflows across frameworks and distributed systems.
  • Served as primary escalation point, mentored junior engineers, and contributed to internal documentation and engineering feedback loops.

Technologies: PyTorch, TensorFlow, Hugging Face, W&B, Docker, Python, TypeScript, SQL.

Software Developer / Project Release Lead

Pacific Geotech Systems Ltd. — Victoria, BC  |  2019 – 2021

  • Designed, developed, and shipped web-based RESTful applications while leading project delivery and cross-team coordination.

Technologies: Java, Angular, Spring Boot, Jasper Reports.

EXPERTISE & EDUCATION

Skills & Education

Skills

  • AI/ML & Frameworks
    PyTorch, TensorFlow, Hugging Face, GenAI, W&B, end-to-end ML lifecycle support
  • MLOps & Infrastructure
    Monitoring (DataDog), automation pipelines, incident response, deployment support, scalable cloud systems (GCP), Docker
  • Programming
    Python (primary), Java, TypeScript, SQL
  • Leadership & Delivery
    Team scaling & mentorship, cross-functional stakeholder management, performance analytics, budget oversight, quality enforcement

Education

M.Sc. Computer Science  |  2018 – 2021

University of Victoria, Victoria, BC, Canada  —  CGPA: 8.5/9

Coursework: Deep Learning (Computer Vision), Data Mining, Distributed Systems. Research: Novel randomized algorithms for streaming graph clique estimation.

B.E. Computer Science  |  2014 – 2018

Chitkara University, Punjab, India  —  CGPA: 9.47/10

PEEK into my PROJECTS

Academic & Personal Projects

CONTACT US

Let's talk about the project