Quant Research Interview Practice Guide
This conversation was generated using Cook'd AI's career assistant and reviewed by Fallon Han from our editorial team for accuracy.
I'm a master's student in data science on STEM OPT targeting quant research in New York. Test me on a research problem that mixes statistics with market intuition.

Great. Imagine a hedge fund wants you to build a short-horizon equity signal using daily returns, earnings revisions, and sector data. Start by telling me how you would check whether the signal is real versus just noise, and what statistical pitfalls you would watch for.
I would start with an out-of-sample framework, like a rolling time split, because random CV would leak future information. Then I'd check IC and rank IC stability across time, and use t-stats with Newey-West adjustment since returns are autocorrelated and overlapping labels can inflate significance. I'd also watch for look-ahead bias, survivorship bias, and multiple testing from trying too many features.

Good. Now suppose the raw signal is weak but improves after you neutralize sector exposure and market beta. Why might that happen, and how would you implement the neutralization in practice?
It may happen because the raw signal was partly just loading on broad risk factors, not true stock selection skill. To neutralize it, I would cross-sectionally regress the signal on sector dummies and maybe beta, then use the residual as the adjusted feature. In practice, I'd do this each day in the training window, or build a model that includes those controls so the alpha signal is isolated from systematic exposures.

Now let's go one level deeper. If you had earnings revisions, analyst coverage, and volatility as candidate features, how would you engineer them so the model is robust and not dominated by scale or outliers?
I'd first lag everything properly to avoid using information before it was available. Then I'd winsorize extreme values, z-score features cross-sectionally by date, and possibly rank-transform them if the distribution is heavy-tailed. For analyst coverage, I might use log coverage or coverage changes, and for revisions I'd scale by price or by past earnings magnitude so the feature is comparable across firms.

Final question. You notice the model performs well in backtests but the live paper-trading Sharpe drops sharply. Give me your diagnosis order and what you'd do next, briefly but concretely.
First I'd check data pipeline parity to make sure live features match backtest features exactly. Then I'd inspect transaction costs, turnover, and any delay in execution assumptions because those often explain the gap. After that, I'd test regime sensitivity and feature decay over time. If needed, I'd simplify the model, reduce turnover, and retrain on a more recent window with stricter validation.

Evaluation Report: Strong answer overall. You showed good judgment on time-aware validation, bias control, and factor-neutral feature design. Strengths: you correctly used rolling splits, Newey-West reasoning, residualization, cross-sectional normalization, and live-vs-backtest debugging. One improvement: be more explicit about how you would measure economic value, not just statistical significance, such as turnover-adjusted PnL, drawdown, and capacity. Interview readiness: 8.5 out of 10. To reach a top-tier hedge fund level, practice turning each answer into a crisp hypothesis, test, and result structure.
