The second pass of the screen happens here, the closing checkpoint before an interview slot
opens up. A recruiter does take more time at this point, and even then, your present role
still carries roughly 95% of the call.
That holds: nothing speaks to what you can deliver today like the seat you occupy this quarter.
To win a "yes", the block must touch every entry on the Data Scientist role
profile, one line per area named under Domain Expertise. And every line has to come
from work you genuinely owned in production, never a Jira card that brushed past your queue.
1
Modeling & ML Development
The core of this position, and the first checkbox a recruiter clears. Spell out the model you
trained, the question it addressed, and the bet that rode on its output. Cite the model and
the call it shifted, never "built ML models".
Techniques
Classification & regression
Gradient boosting
Time-series forecasting
Recommenders
Tools
scikit-learn, XGBoost
PyTorch, TensorFlow
LightGBM
Metrics
Models in production
AUC / RMSE lift
Decisions driven
2
Experimentation & A/B Testing
Where opinions become measured bets. Walk through the test you designed, the metric it nudged,
and the roadmap call the company made on the result. A test that pushed a product change to ship
reads senior; "ran A/B tests" by itself does not.
Techniques
A/B & multivariate
Switchback & geo tests
Power & sample sizing
CUPED variance reduction
Tools
Statsig, Eppo, Optimizely
Bayesian frameworks
Internal A/B platform
Metrics
Tests shipped
Metric lift driven
Bad ideas killed early
3
Statistical Analysis & Inference
The math underneath the model, and the line that separates a data scientist from a notebook
hobbyist. Cover the question you scoped, the method you reached for, and the confidence interval
you put around the answer. Name the approach and what you concluded, not "did statistical
analysis".
Techniques
Hypothesis testing
Causal inference
Survival analysis
Bayesian methods
Tools
statsmodels, SciPy
R, PyMC
DoWhy, EconML
Metrics
Studies delivered
Reproducibility rate
Decisions reversed by evidence
4
Feature Engineering & EDA
The unglamorous craft that decides whether a model is worth anything. Cover the dataset you
wrestled into shape, the features you engineered, and the signal you turned up by exploring.
Cite the feature and the lift it bought, not "cleaned the data".
Techniques
Feature creation
Encoding & scaling
Missing-data strategy
Cohort & funnel EDA
Tools
pandas, Polars
Feast, Tecton
Jupyter, DuckDB
Metrics
Features in production
Model lift from new features
Data prep time cut
5
Productionization & MLOps
A model living in a notebook is a draft; a model that serves live traffic is a product. Cover
what you moved from prototype into production: the serving pattern, the monitoring you wired up,
and the retraining cadence. Numbers do the work here: latency, freshness, drift caught.
Techniques
Batch vs online serving
Model registry & versioning
Drift & performance monitoring
Retraining pipelines
Tools
MLflow, Weights & Biases
SageMaker, Vertex AI
FastAPI, Docker
Metrics
Latency (p95)
Models retrained on cadence
Drift caught before degradation
6
Data Storytelling & Communication
A finding nobody acts on may as well never have happened. Walk through the readout you wrote,
who it was pitched at, and the call the room made because of it. Cite the writeup and what it
unblocked, never "presented findings to stakeholders".
Techniques
Executive memos
Readout decks
Annotated dashboards
Recommendation framing
Tools
Looker, Tableau
Plotly, matplotlib
Notion, Google Docs
Metrics
Memos shipped
Roadmap calls informed
Stakeholder NPS
7
Cross-Functional Collaboration
Data scientists deliver nothing solo. Cover how you teamed with Product around the question,
Engineering around the integration, and Data Engineering around the feature inputs. Spell out
the joint work and what it freed up further along, not just which teams were in the room.
Techniques
Joint scoping
Model-product handoffs
Office hours
Roadmap planning
Tools
Notion, Confluence
Figma, Miro
Linear, Jira
Metrics
Squads embedded with
Models adopted by Product
Quarterly bets shaped
8
Tooling & Workflow
The setup that lets you go from blank notebook to shipped model without yak-shaving. Cover the
environment you keep reproducible, the version control you treat seriously, and the review
patterns that catch a bug before it reaches a stakeholder. Name what you actually use, not
"modern tooling".
Techniques
Notebook hygiene
Reproducible environments
Code review for analyses
Version-controlled SQL
Tools
Git, GitHub
Hex, Deepnote, JupyterHub
Poetry, uv
Metrics
Repos maintained
Analyses reproducible end-to-end
Onboarding time cut