The second round of the screen lives inside this section, the final gate ahead of any
interview going on offer. A recruiter takes more time at this point, and even so the chair
you sit in now carries roughly 95% of the outcome.
That fits: nothing demonstrates your shipped production work as plainly as the seat you're sitting in this quarter. To pull a yes, the block has to land each entry from the
ML Engineer role profile, one bullet per area named under Domain
Expertise. And each bullet has to come off something you actually held in production,
never a ticket that brushed past your queue.
1
Model Training & Development
The visible work behind this role, and the opening checkbox the recruiter clears. Detail the
model you built, the training pipeline behind it, plus the metric it lifted in offline
evaluation. State the architecture and the training corpus, never "trained a model".
Techniques
Deep learning & transformers
Gradient boosting
Distributed training
Hyperparameter tuning
Tools
PyTorch, TensorFlow
Hugging Face Transformers
Ray, Horovod
Metrics
Models trained at scale
Offline metric lift
Training cost cut
2
Model Serving & Inference
Where the model meets traffic. Lay out the serving runtime you stood up, the latency budget
you held, plus the request throughput it carries daily. A model behind a p99 SLO at scale
reads as senior; "deployed models" alone does not.
Techniques
Online vs batch inference
Request batching
Quantization & distillation
A/B model routing
Tools
Triton, TorchServe
KServe, BentoML
ONNX, TensorRT
Metrics
Latency (p95 / p99)
Requests per second
Inference cost per request
3
Feature Engineering & Pipelines
The piping that feeds every model. Describe the feature pipeline you built, the consistency
you held between training and serving, and the feature store you maintain. Name the feature
and the model it powers, not "built feature pipelines".
Techniques
Train-serve consistency
Online vs offline features
Backfills & replays
Embedding pipelines
Tools
Feast, Tecton
Spark, Beam
Kafka, Flink
Metrics
Features in production
Feature freshness
Skew incidents cut
4
MLOps & Deployment
The bridge from notebook to production. Cover the CI/CD you wired for models, the registry
you ship through, and the rollback story you have when a launch goes wrong. Cite the deploy
cadence and what it unlocked, not "deployed via MLflow".
Techniques
Model registry & versioning
Canary & shadow deploys
CI/CD for models
Automated retraining
Tools
MLflow / W&B
SageMaker / Vertex AI
GitHub Actions, ArgoCD
Metrics
Deploys per week
Iteration time cut
Rollback MTTR
5
Monitoring, Drift & Reliability
Models in production silently rot if nobody watches them. Describe the monitoring you wired
up, the drift you caught ahead of users feeling it, plus the SLO you held under load. Figures
carry the weight here: drift incidents caught, on-call pages avoided, SLO hit rate.
Techniques
Data & concept drift
Latency & error budgets
Shadow scoring
Alerting & runbooks
Tools
Prometheus, Grafana
Evidently, WhyLabs
Datadog, Sentry
Metrics
SLO hit rate
Drift incidents caught
On-call MTTR
6
ML Infrastructure & Compute
The platform every model runs on. Show the training cluster you sized, the GPU utilization
you raised, and the compute bill you brought down. Name the workload and the savings, not
"managed GPU infra".
Techniques
GPU scheduling
Mixed-precision training
Multi-node distributed
Cost attribution
Tools
Kubernetes, Kubeflow
Ray, SLURM
Terraform, Pulumi
Metrics
GPU utilization
Training $ saved
Time-to-train cut
7
Cross-Functional Collaboration
ML Engineers carry nothing alone. Describe how you partnered with Data Science on the
model handoff, Backend on the API contract, plus Product on the launch criteria. Show what
the partnership produced, never simply the teams in the room.
Techniques
Research-to-prod handoffs
Model launch reviews
SLO negotiation
Office hours
Tools
Notion, Confluence
Slack, Linear
Jira, GitHub
Metrics
Models shipped jointly
Handoff time cut
Squads supported
8
Tooling & Workflow
The everyday setup that lets you ship without yak-shaving. Cover the environment you keep
reproducible, the tests you wrap around models, plus the review patterns that catch a bug
before it reaches production. Spell out what you actually use, never "a modern stack".
Techniques
Reproducible environments
Unit & integration tests for ML
Code review for model PRs
Experiment tracking
Tools
Git, GitHub
Docker, Poetry, uv
pytest, Great Expectations
Metrics
Repos maintained
Pipeline reproducibility rate
Onboarding ramp time cut