Professor of Econometrics and Statistics
Booth School of Business, University of Chicago

5807 South Woodlawn Avenue, Chicago, IL 60637
Google Scholar Profile
Curriculum Vitae

Dacheng Xiu’s research interests include developing statistical methodologies and applying them to financial data, while exploring their economic implications. His earlier research involved risk measurement and portfolio management with high-frequency data and econometric modeling of derivatives. His current work focuses on developing machine learning solutions to big-data problems in empirical asset pricing.

Xiu’s work has appeared in Econometrica, Journal of Econometrics, Journal of the American Statistical Association, Annals of Statistics, Review of Financial Studies, and Journal of Finance. He is a Co-Editor for the Journal of Financial Econometrics, an Associate Editor for the Journal of Econometrics, Journal of Business & Economic Statistics, Management Science, Journal of Applied Econometrics, Journal of Empirical Finance, and Statistica Sinica. He has received several recognitions for his research, including the Fellow of the Society for Financial Econometrics, the Fellow of the Journal of Econometrics, the 2018 Swiss Finance Institute Outstanding Paper Award, the 2018 AQR Insight Award, and the Best Conference Paper Prize at the 2017 Annual Meeting of the European Finance Association. In 2017, Xiu launched a website that provides up-to-date realized volatilities of individual stocks, as well as equity, currency, and commodity futures. These daily volatilities are calculated from intraday transactions and the methodologies are based on his research of high-frequency data.

Xiu earned his PhD and MA in applied mathematics from Princeton University, where he was also a student at the Bendheim Center for Finance. Prior to his graduate studies, he obtained a BS in mathematics from the University of Science and Technology of China.
"Autoencoder Asset Pricing Models", with Shihao Gu and Bryan Kelly, forthcoming in the Journal of Econometrics.
Machine Learning Time-Series and Cross-Section of Expected Returns Nonlinear Factor Model Neural Networks Big Data Return Predictability
"Empirical Asset Pricing via Machine Learning", with Shihao Gu and Bryan Kelly, Review of Financial Studies, Vol. 33, Issue 5, (2020), 2223-2273. Winner of the 2018 Swiss Finance Institute Outstanding Paper Award.
Chicago Booth Review GitHub Empirical Data Machine Learning Time-Series and Cross-Section of Expected Returns Neural Networks Big Data Return Predictability
"Taming the Factor Zoo: A Test of New Factors", with Guanhao Feng and Stefano Giglio, forthcoming in the Journal of Finance. First Prize Winner of the 2018 AQR Insight Award.
Chicago Booth Review Internet Appendix.pdf Slides Machine Learning Model Selection Cross-Section of Expected Returns Big Data
"High-Frequency Factor Models and Regressions", with Yacine Aït-Sahalia and Ilze Kalnina, Journal of Econometrics 216 (2020), 86-105.
High Frequency Fama-French Factors Volatility Estimation Factor Model
"A Hausman Test for the Presence of Market Microstructure Noise in High Frequency Data", with Yacine Aït-Sahalia, Journal of Econometrics 211 (2019), 176-205.
Matlab Codes.zip Microstructure Noise QMLE Volatility Estimation
Factor Model Covariance Estimation Microstructure Noise MSCI Barra Model PCA Portfolio Allocation Big Data
"Efficient Estimation of Integrated Volatility Functionals via Multiscale Jackknife", with Jia Li and Yunxiao Liu, Annals of Statistics Vol. 47, No. 1, (2019), 156-176.
Spot (Co)Variance Jackknife Bootstrap
"Principal Component Analysis of High Frequency Data", with Yacine Aït-Sahalia, Journal of the American Statistical Association 114 (2019), 287-303.
Biplots.mp4 PCA Spot (Co)Variance
"Resolution of Policy Uncertainty and Sudden Declines in Volatility", with Dante Amengual, Journal of Econometrics 203 (2018), 297-315.
Chicago Booth Review Supplemental Material.pdf Variance Swaps Pricing Non-affine Models Downward Volatility Jumps VIX
Matlab Codes.zip QMLE Factor Model Covariance Estimation Microstructure Noise Kalman Filtering, Smoothing, and EM Algorithm Missing Data
"Nonparametric Estimation of the Leverage Effect: A Trade-off between Robustness and Efficiency", with Ilze Kalnina, Journal of the American Statistical Association 112 (2017), 384-396.
Covariance Estimation VIX Spot (Co)Variance
"Using Principal Component Analysis to Estimate a High Dimensional Factor Model with High-Frequency Data", with Yacine Aït-Sahalia, Journal of Econometrics 201 (2017), 384-399.
High Frequency Fama-French Factors Big Data Factor Model PCA Portfolio Allocation Covariance Estimation
"Increased Correlation Among Asset Classes: Are Volatility or Jumps to Blame, or Both?" , with Yacine Aït-Sahalia, Journal of Econometrics 194 (2016), 205-219.
Co-Jumps Covariance Estimation Microstructure Noise
"Incorporating Global Industrial Classification Standard into Portfolio Allocation: A Simple Factor-Based Large Covariance Matrix Estimator with High Frequency Data", with Jianqing Fan and Alex Furger, Journal of Business & Economic Statistics 34 (2016), 489-503, (Special Issue on Big Data).
Chicago Booth Review Matlab Codes.zip High Frequency Fama-French Factors Portfolio Allocation Big Data Covariance Estimation
"A Tale of Two Option Markets: Pricing Kernels and Volatility Risk", with Zhaogang Song, Journal of Econometrics 190 (2016), 176-196. Honorable mention of the 2017 Dennis J. Aigner Award.
Nonparametric Option Pricing Closed-form Pricing of SPX and VIX Options Particle Filtering
"Quasi-Maximum Likelihood Estimation of GARCH Models with Heavy-Tailed Likelihoods", with Jianqing Fan and Lei Qi, Journal of Business & Economic Statistics 32 (2014), 178-191. Invited Paper with Discussion.
Rejoinder.pdf Matlab Codes.zip Volatility Estimation
"Hermite Polynomial Based Expansion of European Option Prices", Journal of Econometrics 179 (2014), 158-177.
Option Pricing Non-affine Models
"High-Frequency Covariance Estimates with Noisy and Asynchronous Data", with Yacine Aït-Sahalia and Jianqing Fan, Journal of the American Statistical Association 105 (2010), 1504-1517.
Matlab Codes.zip QMLE Covariance Estimation Microstructure Noise
QMLE Volatility Estimation Microstructure Noise
Chicago Booth Review Supplemental Material QMLE Volatility Estimation Microstructure Noise Kalman Filtering Model Selection
"Asset Pricing with Omitted Factors", with Stefano Giglio, Sep. 2019. Submitted. Winner of the Best Conference Paper Prize at the 44th EFA.
Chicago Booth Review Slides Matlab Codes.zip Three-Pass Estimator Factor Model PCA Cross-Section of Expected Returns Big Data
"Thousands of Alpha Tests", with Stefano Giglio and Yuan Liao, Mar. 2020. Submitted.
Supplemental Material Data Snooping Hedge Funds Factor Model PCA Matrix Completion Cross-Section of Expected Returns Machine Learning Big Data Missing Data
"Predicting Returns with Text Data", with Tracy Ke and Bryan Kelly, Apr. 2020. Winner of the 2019 CICF Best Paper Award.
Chicago Booth Review Text Mining Machine Learning Cross-Section of Expected Returns Big Data Return Predictability Sentiment Analysis Topic Model
"The Structure of Economic News", with Leland Bybee, Bryan Kelly, and Asaf Manela, Jan. 2020.
The Structure of Economic News Website Text Mining Machine Learning Big Data Topic Model
"Comment on: Limit of Random Measures Associated with the Increments of a Brownian Semimartingale", with Jia Li, Journal of Financial Econometrics 16(4) (2018), 570-582.
"Likelihood-Based Volatility Estimators in the Presence of Market Microstructure Noise: A Review", with Yacine Aït-Sahalia, Handbook of Volatility Models and their Applications, Chapter 14, Wiley 2012.
The 2008 meltdown of the financial system has led to tremendous interest in understanding and controlling the systemic risk of the financial market, which further hinges on the assessment of the risks of individual assets on the market and their interdependencies. The increasing availability of transaction-level data on a growing cross-section of tradable assets presents a unique opportunity and yet substantial challenges in estimating these quantities. The overarching theme of the lab is to design methodologies that exploit information embedded in the big data to better ascertain and manage the risk.
Realized Volatility V 1.0
Loading data ...
Electronic trades in GLOBEX hours
BOSoybean Oil Futures
CCCocoa Futures
CCorn Futures
CTCotton No.2 Futures
FCFeeder Cattle Futures
KCCoffee C Futures
LBLumber Futures
LCLive Cattle Futures
LHLean Hogs Futures
OJOrange Juice Futures
OOats Futures
SBSugar #11 Futures
SMSoybean Meal Futures
SSoybean Futures
WWheat Futures CBOT
CLLight Crude Oil Futures ...
HOHeating Oil #2 Futures N...
NGNatural Gas Futures NYMEX
Equities/Equity Index
DMS&P 400 MidCap E-Mini Fu...
DXDollar Index Futures ICE
ESSP 500 E-Mini Futures
NKNikkei 225 Futures CME
NQNASDAQ 100 E-Mini Futures
SPSP 500 Futures
SXFS&P Canada 60 Futures
YMDow Jones ($5) E-mini Fu...
ADAustralian Dollar Futures
BPBritish Pound Futures
CDCanadian Dollar Futures
JYJapanese Yen Futures
JYNMJapanese Yen E-Mini Futures
NENew Zealand Dollar Futures
SFSwiss Franc Futures
UROEuro FX Futures
UROMEuro FX E-mini Futures
Interest Rates
CGBCanadian 10-Year Futures
EDEurodollar Futures CME
FVUS 5-Year T-Note Futures
TUUS 2-Year T-Note Futures
TYUS 10-Year T-Note Futures
USUS 30-Year T-Bond Futures
GCGold Futures COMEX
HGCopper High Grade Future...
PAPalladium Futures NYMEX
PLPlatinum Futures NYMEX
SISilver Futures COMEX
BTCCME Bitcoin Futures
Trades in PIT market hours
CLLight Crude Oil Futures ...
Add RV
0Market S&P500SPY12.05% ±0.3%15.33% ±0.4%12.44% ±0.4%16.31% ±0.6%13.39% ±0.3%
MaterialXLB13.40% ±1.7%18.78% ±2.1%17.59% ±1.2%19.12% ±3.0%18.20% ±1.6%
IndustrialXLI15.59% ±1.0%19.53% ±1.6%17.59% ±1.3%24.12% ±2.2%19.34% ±1.2%
Consumer DiscretionaryXLY14.97% ±1.4%20.45% ±1.8%15.64% ±1.0%19.67% ±1.9%16.33% ±1.0%
Consumer StaplesXLP11.11% ±1.0%14.22% ±1.0%14.01% ±0.9%17.11% ±0.8%14.60% ±1.0%
Health CareXLV12.82% ±0.8%15.48% ±1.3%14.87% ±0.9%17.01% ±1.0%16.01% ±0.7%
FinancialXLF17.53% ±0.7%20.84% ±1.0%20.82% ±1.1%26.88% ±1.9%22.04% ±0.9%
TechnologyXLK14.76% ±1.1%18.34% ±0.9%15.52% ±0.9%19.12% ±1.3%16.21% ±0.8%
UtilitiesXLU15.92% ±1.1%17.99% ±0.9%19.91% ±1.7%22.73% ±2.1%22.20% ±1.1%
EnergyXLE28.67% ±1.9%29.09% ±2.5%31.04% ±1.6%38.06% ±3.0%35.64% ±2.1%
We provide up-to-date daily annualized realized volatilities for individual stocks, ETFs, and future contracts, which are estimated from high-frequency data. We are in the process of incorporating equities from global markets.

We collect trades at their highest frequencies available (up to every millisecond for US equities after 2007), and clean them using the prevalent national best bid and offer (NBBO) that are available up to every second. The mid-quotes are calculated based on the NBBOs, so their highest sampling frequencies are also up to every second.

We provide quasi-maximum likelihood estimates of volatility (QMLE) based on moving-average models MA(q), using non-zero returns of transaction prices (or mid-quotes if available) sampled up to their highest frequency available, for days with at least 12 observations. We select the best model (q) using Akaike Information Criterion (AIC). For comparison, we report realized volatility (RV) estimates using 5-minute and 15-minute subsampled returns.

1. “When Moving-Average Models Meet High-Frequency Data: Uniform Inference on Volatility”, by Rui Da and Dacheng Xiu. 2017.
2. “Quasi-Maximum Likelihood Estimation of Volatility with High Frequency Data”, by Dacheng Xiu. Journal of Econometrics, 159 (2010), 235-250.
3. “How Often to Sample A Continuous-time Process in the Presence of Market Microstructure Noise”, by Yacine Aït-Sahalia, Per Mykland, and Lan Zhang. Review of Financial Studies, 18 (2005), 351–416.
4. “The Distribution of Exchange Rate Volatility”, by Torben Andersen, Tim Bollerslev, Francis X. Diebold, and Paul Labys. Journal of the American Statistical Association, 96 (2001), 42-55.
5. “Econometric Analysis of Realized Volatility and Its Use in Estimating Stochastic Volatility Models”, by Ole E Barndorff‐Nielsen and Neil Shephard. Journal of the Royal Statistical Society: Series B, 64 (2002), 253-280.
B41100 Applied Regression Analysis (MBA)
This course is about regression, a powerful and widely used data analysis technique wherein we seek to understand how different random quantities relate to one another. Students will learn how to use regression to analyze a variety of complex real world problems, with the aim of understanding data and prediction of future events. Focus is placed on understanding of fundamental concepts and development of the skills necessary for robust application of regression techniques. Examples are used throughout to illustrate application of the tools.
B41902 Statistical Inference (PhD)
The focus of this course will be methods to draw inferences in econometric models. We will cover linear regression models, GMM, nonlinear models, and time series models. The majority of the discussion will cover frequentist methods focusing on the use of approximations to finite-sample sampling distributions as a means for obtaining inference. We will cover methods that are appropriate for independent data as well as dependent data. We will discuss intuition for how and when to use the econometric tools developed in the class in addition to deriving some of the relevant theoretical properties.
2018 SoFiE Summer School
Machine Learning and Finance: The New Empirical Asset Pricing. Program is here.
This website and the information presented on it are for research purposes only and should not be used for investment or other commercial purposes. You may use this website and the information presented on it solely for research purposes and not for any commercial purpose. The University of Chicago (the “University”) does not endorse this website and hereby disclaims all representations and warranties about it and the information presented on it, whether express or implied, including any implied warranties of merchantability, fitness for a particular purpose, and non-infringement. The University specifically disclaims any warranties as to the accuracy, usefulness, truthfulness, and availability of the information presented on this website. By using this website, you agree that you will not make any claim against the University or any of its trustees, officers, employees, agents, or other representatives relating to it or the information presented thereon. Neither the University nor any of its trustees, officers, employees, agents, or other representatives will have any liability to you or anyone else in connection with your use of this website or any information you receive through your use of it.
©2017, Dacheng Xiu at the University of Chicago.