BEGIN:VCALENDAR VERSION:2.0 PRODID:-//132.216.98.100//NONSGML kigkonsult.se iCalcreator 2.20.4// BEGIN:VEVENT UID:20260602T174643EDT-6216Uw1PD0@132.216.98.100 DTSTAMP:20260602T214643Z DESCRIPTION:Title: To split or not to split that is the question: From cros s validation to debiased machine learning.\n\n\n Abstract:\n\n\nData splitt ing is an ubiquitous method in statistics with examples ranging from cross validation to cross-fitting. However\, despite its prevalence\, theoretic al guidance regarding its use is still lacking. In this talk we will explo re two examples and establish an asymptotic theory for it. In the first pa rt of this talk\, we study the cross-validation method\, a ubiquitous meth od for risk estimation\, and establish its asymptotic properties for a lar ge class of models and with an arbitrary number of folds. Under stability conditions\, we establish a central limit theorem and Berry-Esseen bounds for the cross-validated risk\, which enable us to compute asymptotically a ccurate confidence intervals. Using our results\, we study the statistical speed-up offered by cross validation compared to a train-test split proce dure. We reveal some surprising behavior of the cross-validated risk and e stablish the statistically optimal choice for the number of folds. In the second part of this talk\, we study the role of cross fitting in the gener alized method of moments with moments that also depend on some auxiliary f unctions. Recent lines of work show how one can use generic machine learni ng estimators for these auxiliary problems\, while maintaining asymptotic normality and root-n consistency of the target parameter of interest. The literature typically requires that these auxiliary problems are fitted on a separate sample or in a cross-fitting manner. We show that when these au xiliary estimation algorithms satisfy natural leave-one-out stability prop erties\, then sample splitting is not required. This allows for sample re- use\, which can be beneficial in moderately sized sample regimes.\n\n\n Spe aker\n\n\nMorgane Austern is an Assistant Professor of Statistics at Harva rd University. She obtained her PhD in statistics from Columbia University \, working with Peter Orbanz and Arian Maleki on limit theorems for depend ent and structured data. After that\, she was a postdoctoral researcher at Microsoft Research New England. Broadly\, she interested in developing pr obability tools for modern machine learning and in establishing the proper ties of learning algorithms in structured and dependent data contexts. Her current work is motivated by generalization and concentration bounds\, st able matching problems and random matrix theory.\n\n\n \n \n https://mcgill.z oom.us/j/83436686293?pwd=b0RmWmlXRXE3OWR6NlNIcWF5d0dJQT09\n\n Meeting ID: 8 34 3668 6293\n\n Passcode: 12345\n \n \n\n\n \n DTSTART:20230113T203000Z DTEND:20230113T213000Z SUMMARY:Morgane Austern (Harvard University) URL:/mathstat/channels/event/morgane-austern-harvard-u niversity-344874 END:VEVENT END:VCALENDAR