RT Journal Article SR 00 ID 10.48550/arXiv.2011.06158 A1 Chen, Daniel L. A1 Chen, Jiafeng A1 Lewis, Gregory T1 Mostly harmless machine learning: learning optimal instruments in linear IV models JF Journal of Machine Learning Research YR 2024 FD 2024 AB We provide some simple theoretical results that justify incorporating machinelearning in a standard linear instrumental variable setting, prevalent in empiri-cal research in economics. Machine learning techniques, combined with sample-splitting, extract nonlinear variation in the instrument that may dramatically im-prove estimation precision and robustness by boosting instrument strength. Theanalysis is straightforward in the absence of covariates. The presence of linearlyincluded exogenous covariates complicates identification, as the researcher wouldlike to prevent nonlinearities in the covariates from providing the identifying vari-ation. Our procedure can be effectively adapted to account for this complication,based on an argument byChamberlain(1992). Our method preserves standard in-tuitions and interpretations of linear instrumental variable methods and provides asimple, user-friendly upgrade to the applied economics toolbox. We illustrate ourmethod with an example in law and criminal justice, examining the causal effectof appellate court reversals on district court sentencing decisions. NO En att DP. AT, le 06/01/2025 PB MIT Press SN 1533-7928 LK https://publications.ut-capitole.fr/id/eprint/42824/ UL http://tse-fr.eu/pub/125387