Bayesian Learning 2021
Course: BAYESIAN LEARNING 2021 – Professional Master in Economics
Professor: Hedibert Freitas Lopes – www.hedibert.org
Teaching assistant: Igor Ferreira Batista Martins – igorfbm@al.insper.edu.br
Syllabus
Midterm exam (take-home) (solution) +
Homework assignments
- HW1 (Solution) (Code)
- HW2 (Code with solution)
- HW3
- HW4
Examples developed in class:
Course notes (+ R code & references)
Additional supporting material
Advanced Bayesian Econometrics 2021
Course: ADVANCED BAYESIAN ECONOMETRICS 2021 – Doctoral Program in Business Economics
Professor: Hedibert Freitas Lopes – www.hedibert.org
Objective: The end of the course goal is to allow the student to critically decide between a Bayesian, a frequentist or Bayesian-frequentist compromise when facing real world problems in the fields of micro- and macro-econometrics and finance, as well as in quantitative marketing, strategy and business administration. With this end in mind, we will visit well known Bayesian issues, such as prior specification and model comparison and model averaging, but also study regularization via Bayesian LASSO, Spike-and-Slab and related schemes, “small n, large p” issues, Bayesian statistical learning via additive regression trees, random forests, large-scale VAR and (dynamic) factor models.
Course description: Basic ingredients: prior, posterior, and predictive distributions, sequential Bayes, conjugate analysis, exchangeability, principles of data reduction and decision theory. Model criticism: Bayes factor, computing marginal likelihoods, Savage-Dickey ratio, reversible jump MCMC, Bayesian model averaging and deviance information criterion. Modern computation via (Markov chain) Monte Carlo methods: Monte Carlo integration, sampling-importance resampling, Gibbs sampler, Metropolis-Hastings algorithms. Mixture models, Hierarchical models, Bayesian regularization, Instrumental variables modeling, Large-scale (sparse) factor modeling, Bayesian additive regression trees (BART) and related topics, Dynamic models, Sequential Monte Carlo algorithms, Bayesian methods in microeconometrics, macroeconometrics, marketing and finance.
- Part I Bayesian ingredients: i) Inference: likelihood, prior, predictive and posterior distributions; ii) Model criticism: Marginal likelihoods, Bayes factor, model averaging and decision theory; and iii) Computation: An introduction (Markov chain and sequencial) Monte Carlo methods.
- Part II Multivariate models: i) Large-scale vector autoregressive models; ii) Factor models and other dimension reduction models; and iii) Time-varying high-dimensional covariance models.
- Part III Modern Bayesian statistical learning: i) Mixture models and the Dirichlet process: handling non-Gaussian models; ii) Regularization: sparsity via shrinkage and variable selection; iii) Large vector-autoregressive and factor models: combining sparsity and parsimony; iv) Classification and support vector machines; v) Regression trees and random forests; and vi) Latent Dirichlet allocation: Text as data, text mining.
Take-home midterm exam: 9am February 18th to 12pm February 20th. (data) (solution)
Homework assignments
- HW1 – Due date: February 4th (12pm) (solution)
- HW2 – Due date: February 11th (12pm) (solution)
- HW3 – Due date: February 18th (9am) (solution + full conditionals + R code)
- HW4 – Due date: March 25th (9am) – GPA data + some code: Use the GPA data to fit classical and Bayesian model choice strategies, similar to the ones we discussed in class for the Stock and Waton’s macro data, the Stine & Foster’s automation data and Wooldridge’s wage data.
Paper presentations: Your task is to read the manuscript very carefully (several times, possibly) and prepare a 20-minute video presentation (no more than 25 minutos!) plus a concise 5-page summary of the manuscript (up to additional 5 pages for tables and graphs) to be sent to me no later than 12pm, April 8th 2021.
Examples developed in class:
COURSE NOTES
PART I: Bayesian ingredients
- Basic Bayes
- Exchangeability
- Principles of data reduction
- More on estimators
- Decision theory
- Bayesian model criticism (pages 1-6 & 32-34)
- Additional reading material:
- Chapter 2 of Gamerman and Lopes (2006) – Compact, but easy to read.
- Chapters 2-4 of Migon, Gamerman and Louzada (2014) – Integrates classical and Bayesian inference.
- Chapter 1 and 2 of Gelman et al. (2013) – Application-oriented.
- Chapter 4 (Sections 4.1-4.4) of Berger (1985) – More technical.
- Discussion about p-values
PART II: Bayesian Computation
- Monte Carlo (MC) methods
- Markov chain: a brief review
- Markov chain Monte Carlo (MCMC) algorithms
- Hamiltonian Monte Carlo: A toy example
- Stan/rstan for posterior inference: Hamiltonian MC (HMC) methods
- MC and MCMC: Key References
- More on Bayesian model criticism
PART III: Bayesian Learning
- Fundamentos de Aprendizagem Estatística + R code + MC exercise
- Multiple linear regression: selection, shrinkage, sparsity
- Classification: logistic regression and discriminant analysis
- Bayesian factor analysis (BFA)
- Principal components analysis (PCA), PCA-based and FA-based regressions
- Classification and regression trees (CART)
- Bayesian CART
- Bootstrap aggregating (bagging)
- Bayesian additive regression trees (BART)
- Latent Dirichlet Allocation (LDA)
- Neural Networks
Complementary material to PART III
- Boosting (weak/stronger learners)
- Random forests
- Bayesian instrumental variables
- General linear and hierarchical models
- Limited dependent variable models
- Finite mixture of distributions
- Spatial models
- P.Richard Hahn’s top 25 books on Statistics, Causal Inference, Statistical Computing, Machine Learning and Data Science
MATERIAL FROM PREVIOUS YEARS (2018-2020)
Homework assignments and take-home exams
- Take-home midterm exam – 2020 (Bruno Levy’s solution + Igor Martins’ solution)
- Take-home midterm exam – 2019
- Take-home midterm exam – 2018 (Rafael Pucci’s solution + Raphael Gondo’s solution)
- HW1 2019: Problems 2.26(a) and 2.26(c) (page 79), 3.1 (pages 106-107), 3.12 (page 110), 5.7 (page 185) and Example 5.1 (pages 143-146) – Gamerman and Lopes (2006) MCMC: Stochastic Simulation for Bayesian Inference. Errata of the numerator of the Bayes Factor of problem 2.26(a) + Errata of the denominator of the Bayes Factor of problem 2.26(a)
- HW2 2019: Apply “Machine Learning” tools to the communities data from UCI Machine Learning Repository (http://archive.ics.uci.edu/ml/datasets.html). See my simple exploratory analysis of the data as a start-up here.
Examples developed in class
- Week of 01/13/2020: More on Bernoulli trials
- Week of 01/13/2020: Monte Carlo integration and MC via importance sampling
- Week of 01/20/2020: Student’s t: learning degree of freedom (Rmarkdown code)
- Week of 01/20/2020: Student’s t regression (Rmarkdown code)
- Week of 01/20/2020: Tobit linear regression (Dataset)
- Week of 01/27/2020: Two-component mixture of univariate Gaussians
- Week of 01/27/2020: SIR or Gibbs? The simple N-IG iid case
- Week of 01/27/2020: Our first Metropolis-Hastings algorithm
- Week of 01/27/2020: Linear model with the normal-gamma (NG) prior
- Week of 01/27/2020: Linear model with NG prior: comparing MCMC schemes
- Week of 02/03/2020: Bayesian skewed normal regression
- Week of 02/03/2020: Model comparison via prior predictives: Normal vs t exercise
- Week of 02/17/2020: A few R packages for Bayesian inference in linear models
- Week of 02/17/2020: Discussion about the fallacies of p-values
- Class of 01/15/2019: Posterior inference for the proportion of Bernoulli trial (R code)
- Week of 01/13/2019: Posterior of proportion of Bernoulli trial (R code)
- Class of 01/17/2019: Model comparison: Gaussian vs Student’s t
- Class of 01/24/2019: Multivariate Gaussian: joint vs univariate Gibbs sampling
- Class of 01/31/2019: Our 1st Metropolis-Hastings (MH) algorithm
- Class of 01/31/2019: Our 2nd MH algorithm – comparing proposals
- Class of 01/31/2019: Linear model with AR(1) errors (graphs)
- Class of 03/19/2019: Human development index (by municipaliity in Brazil)
- Class of May 3rd, 2018: Posterior inference for the variance in the normal case (R markdown code)
- Class of May 8th, 2018: Bayesian model comparison (R markdown code)
- Class of May 10th, 2018: Poisson model (Count data)(PDF with results)
- Class of May 15th, 2018: Beta regression (PDF with results) + Ex 3.6-3.7 from GL(2006)
- Class of May 17th, 2018: Comparing MC integration, SIR and Raoblackwellization
- Class of May 22nd, 2018: Bayesian linear regression
- Class of May 24th, 2018: Sparse Bayesian linear regression (ridge, Bayesian lasso & horseshoe) – R code
- Class of June 14th, 2018: My book chapter on Modern Bayesian Factor Analysis
- Class of June 15th, 2018: Mixture of Poisson distributions (Rmd code)
- Class of July 2nd, 2018: PCA and FA for term-structure data (dataset)
- Class of July 10th, 2018: Bootstrap: sampling distribution of correlation coefficient
Bibliography: Bayesian econometrics
- Zellner (1971) An Introduction to Bayesian Inference in Econometrics
- Goel and Iyngar (1992) Bayesian Analysis in Statistics and Econometrics
- West and Harrison (1997) Bayesian Forecasting and Dynamic Models (2nd edition)
-
Dorfman (1997) Bayesian Economics Through Numerical Methods
- Bauwens, Lubrano and Richard (2000) Bayesian Inference in Dynamic Econometric Models
- Koop (2003) Bayesian Econometrics
- Geweke (2005) Contemporary Bayesian Econometrics and Statistics
- Lancaster (2004) Introduction to Modern Bayesian Econometrics
- Rossi, Allenby and McCulloch (2005) Bayesian Statistics and Marketing
- Prado and West (2010) Time Series: Modeling, Computation and Inference
- Geweke, Koop and Van Dijk (2011) The Oxford Handbook of Bayesian Econometrics
- Greenberg (2013) Introduction to Bayesian Econometrics
- Herbst and Schorfheide (2015) Bayesian Estimation of DSGE Models
- Chan, Koop, Poirier and Tobias (2019) Bayesian Econometric Methods (2nd edition)
- Broemeling (2019) Bayesian Analysis of Time Series
- Bernardi, Grassi and Ravazzolo (2020) Bayesian Econometrics
Bibliography: Bayesian statistics
- Berger (1985) Statistical Decision Theory and Bayesian Analysis
- Bernardo and Smith (2000) Bayesian Theory
- Gelman and Hill (2006) Data Analysis Using Regression and Multilevel/Hierarchical Models
- Robert (2007) The Bayesian Choice: From Decision-Theoretic Foundations to Computational Implementation
- Hoff (2009) A First Course in Bayesian Statistical Methods
- Carlin and Louis (2009) Bayesian Methods for Data Analysis (3rd edition)
- Gelman, Carlin, Stern, Dunson, Vehtari and Rubin (2016) Bayesian Data Analysis
- Migon, Gamerman and Louzada (2015) Statistical Inference: An Integrated Approach (2nd edition)
- Reich and Ghosh (2019) Bayesian Statistical Methods
- Held and Sabanes-Bove (2020) Likelihood and Bayesian Inference: With Applications in Biology and Medicine
Bibliography: Bayesian computation
-
Gilks, Richardson and Spiegelhalter (1995) Markov Chain Monte Carlo in Practice
- Doucet, de Freitas and Gordon (2001) Sequential Monte Carlo Methods in Practice
- Robert and Casella (2004) Monte Carlo Statistical Methods (2nd edition)
- Gamerman and Lopes (2006) MCMC: Stochastic Simulation for Bayesian Inference, Second Edition
- Marin and Robert (2007) Bayesian Core: A Practical Approach to Computational Bayesian Statistics
- Albert (2009) Bayesian Computation with R
- Brooks, Gelman, Jones and Meng (2011) Handbook of Markov Chain Monte Carlo
- Givens and Hoeting (2012) Computational Statistics (2nd edition)
- Marin and Robert (2014) Bayesian Essentials with R (complete solution manual)
- Turkman, Paulino and Mueller (2019) Computational Bayesian Statistics: An Introduction
- McElreath (2020) Statistical Rethinking: A Bayesian course with Examples in R and STAN
- Chopin and Papaspiliopoulos (2020) An Introduction to Sequential Monte Carlo
Bibliography: (Bayesian) statistical learning
- Bishop (2006) Pattern Recognition and Machine Learning
- Hastie, Tibshirani and Friedman (2008) The Elements of Statistical Learning, 2nd edition
- Murphy (2012) Machine Learning: A Probabilistic Perspective
- Barber (2012) Bayesian Reasoning and Machine Learning
- James, Witten, Hastie and Tibshirani (2013) An Introduction to Statistical Learning
- Hastie, Tibshirani and Wainwright (2015) Statistical Learning with Sparsity
- Efron and Hastie (2016) Computer Age Statistical Inference: Algorithms, Evidence and Data Science
- Fernandez and Marques (2018) Data Science, Marketing and Business
- Izbicki & Santos (2020) Aprendizado de máquina: uma abordagem estatística
Bibliography: Classical Monte Carlo papers
- Metropolis and Ulam (1949) The Monte Carlo method. JASA, 44, 335-341.
- Metropolis, Rosenbluth, Rosenbluth, Teller and Teller (1953) Equation of state calculations by fast computing machines. Journal of Chemical Physics, 21, 2087-1092.
- Hastings (1970) Monte Carlo sampling methods using Markov chains and their applications. Biometrika, 57, 97-109.
- Peskun (1973) Optimum Monte Carlo sampling using Markov chains. Biometrika, 60, 607-612.
- Besag (1974) Spatial Interaction and the Statistical Analysis of Lattice Systems. JRSS-B, 36, 192-236.
- Kirkpatrick, Gelatt and Vecchi (1983) Optimization by Simulated Annealing. Science, 220 (4598), 671-680.
- Geman and Geman (1984) Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Analysis and Machine Intelligence, 6, 721-741.
- Pearl (1987) Evidential reasoning using stochastic simulation of causal models. Artificial intelligence, 32, 245-257.
- Tanner and Wong (1987) The Calculation of Posterior Distributions by Data Augmentation. JASA, 82, 528-540.
- Geweke (1989) Bayesian Inference in Econometric Models Using Monte Carlo Integration. Econometrica, 57, 1317-1339.
- Gelfand and Smith (1990) Sampling-Based Approaches to Calculating Marginal Densities. JASA, 85, 398-409.
- Casella and George (1992) Explaining the Gibbs Sampler. The American Statistician, 46, 167-174.
- Gilks and Wild (1992) Adaptive Rejection Sampling for Gibbs Sampling. Applied Statistics, 41, 337-348.
- Smith and Gelfand (1992) Bayesian Statistics without Tears: A Sampling-Resampling Perspective. The American Statistician, 46, 84-88.
- Chib and Greenberg (1995) Understanding the Metropolis-Hastings algorithm. The American Statistician, 49, 327-335.
Econometrics III 2021 (Time Series)
Course: ECONOMETRICS III 2021 (TIME SERIES) – Doctoral Program in Business Economics
Professor: Hedibert Freitas Lopes – www.hedibert.org
Objective: The main goal of the course is to make the student familiar with and able to implement univariate and multivariate time series models by using both frequentist and Bayesian approaches. All classroom examples and implementations as well as projects will be carried out by the open-source statistical software R.
Course description: Brief review of frequentist inference followed by the introduction of key ingredients of Bayesian inference, model selection and criticism. An introduction to the main Monte Carlo methods for Bayesian inference: MC integration, resampling, MCMC and sequential MC. Univariate time series models, including AR(F)IMA models, state-space models, Markov switching models, GARCH and stochastic volatility models. Multivariate time series models, including Bayesian VARs and factor-augmented VARs, dynamic factor models, time-varying covariance models.
- PART I: Basic univariate time series models: AR, MA and ARMA models; Unit-root non-stationarity and long-memory processes; Seasonal models.
- PART II: Bayesian ingredients (prior, likelihood, posterior, predictive, Bayes factor and posterior model probability); Monte Carlo (MC) methods (MC integration, sampling importance resampling (SIR)) and Markov chain Monte Carlo (MCMC) methods (Gibbs sampler and Metropolis-Hastings (MH) algorithms).
- PART III: More univariate time series: ARCH/GARCH models; EGARCH, GARCH-M, TGARCH; Bayesian GARCH; Bayesian inference in the local level model; Dynamic models; Stochastic volatility models. We will use MCMC as well as sequential Monte Carlo (SMC) schemes to perform batch and online posterior inference.
- PART IV: Multivariate time series models: Vector autoregressive (VAR) models; Large Bayesian VAR (BVAR) models, factor augmented VAR (FAVAR) models, time-varying parameter BVAR (TVP-BVAR) models, Bayesian FAVAR (BFAVAR) models; Factor models and time-varying covariance models.
Bibliography
Teaching assistant: Bruno do Prado Costa Levy (brunopcl at al dot insper dot edu dot br) – Wednesdays from 10am to 11:30am
Take-home midterm exam: 9am of Tuesday, February 23th to 12pm of Thursday, February 25th. (data)
Homework assignments
- HW1 – Due date: January 26th – Problems 1,2,3,8,19,20 and 21, chapter 1 of Shumway and Stoffer’s book (4th edition).
- HW2 – Due date: February 2nd – Problem 2.15 (page 107) of Tsay’s (2010) book. However, download the up-to-date quarterly gross domestic implicit price deflator time series from the Federal Reserve Bank of St Louis. Fit the ARIMA models with data up to the 4th quarter of 2018 and use 2019.I to 2020.III (7 quarters) for forecasting comparisons.
- HW3 – Due date: Tuesday, February 23rd 2021 at 9am
- HW4 – Due date: Tuesday, March 9th 2021 at 9am – Fit Gaussian and Student’s t GARCH(1,1) to Vale S.A. (VALE) using the R packages garchFit, bayesGARCH and RSTAN that I have provided when we studied Petrobras (PBR). Feel free to add other (non-Bayesian) GARCH-type fits based on the ARCH-glossary that we have discussed in class.
- HW5 – Due Date: Tuesday, March 23rd 2021 at 9am – Collect meaningful time-series (suggestion: Apple & SP500 for US market or Vale & Ibovespa for the Brazilian market). Use the time series in a simple CAP-M model which allows for time-varying slope. Repeat it by allowing ONLY time-varying slope. Then, allow both time-varying intercept and slope. Compare the three models to the benchmark OLS fit. You can use whatever Bayesian R package (such as bsts) that produces posterior summaries of the latent variables (intercepts and slopes) as well as static parameters. Comment your findings. There are lots of papers out there, and I recommend a 16-year old one here: Jostova and Philipov (2005) Bayesian analysis of stochastic betas, Journal of Financial and Quantitative Analysis, 747-778.
Paper presentations: Prepare a 20min video plus 5-page summary of the paper to be sent to me and Bruno by 12pm on Tuesday, April 13th 2021.
- Pedro: Graziadei, Lopes and Marques (2020) Bayesian generalizations of the integer-valued autoregressive model, Journal of Applied Statistics.
- Victoria: Silva, Lopes and Migon (2006) The extended generalized inverse Gaussian distribution for log-linear and stochastic volatility models, Brazilian Journal of Probability and Statistics, 67-91.
- Alexandre: Carvalho and Lopes (2006) Simulation-based sequential analysis of Markov switching stochastic volatility models, Computational Statistics and Data Analysis, 51 (9), 4526-4542.
- Vinicius: Warty, Lopes and Polson (2018) Sequential Bayesian learning for stochastic volatility with variance-gamma jumps in returns, Applied Stochastic Models in Business and Industry, 2018, 34, 460-483.
- Nathalia: Prado and Lopes (2013) Sequential parameter learning and filtering in structured autoregressive state-space models, Statistics and Computing, 23 (1), 43-57.
- Livia: Primiceri (2005) Time Varying Structural Vector Autoregressions and Monetary Policy, The Review of Economic Studies, Vol. 72, No. 3, 821-852.
- Thaline: Carriero, Todd and Massimiliano (2019) Large Bayesian vector autoregressions with stochastic volatility and non-conjugate priors, Journal of Econometrics, 212(1), 137-154.
- Guilherme: Shirota, Omori, Piao and Lopes (2017) Cholesky realized stochastic volatility model, Econometrics and Statistics, 2017, 3, 34-59.
- Giovanna: Kastner, Fruehwirth-Schnatter and Lopes (2017) Efficient Bayesian inference for multivariate factor SV models, Journal of Computational and Graphical Statistics, 26, 905-917.
- Rafael: Kastner and Huber (2020) Sparse Bayesian vector auto-regressions in huge dimensions, Journal of Forecasting, 30(7), 1142-1165.
- Thayla: Lopes, McCulloch and Tsay (2020) Parsimony inducing priors for large scale state-space models, Journal of Econometrics (Revised & Resubmitted).
- Octavio: Levy and Lopes (2021) Dynamic ordering learning in multivariate forecasting.
Examples developed in class
- Class of 01/12/2021 – Brief introduction to time series in R
- Class of 01/19/2021 – AR(1), random walk and AR(p) models
- Class of 01/19/2021 – ARMA & ARIMA models
- Class of 01/26/2021 – ARFIMA models
- Class of 01/26/2021 – Bayesian AR(1)
- Class of 02/02/2021 – Bayesian AR(1) with Normal and t priors
- Class of 02/02/2021 – Bayesian AR(2) with Normal and t priors
- Class of 02/02/2021 – Bayesian AR(2) with Normal and t models
- Class of 02/02/2021 – Bayesian nonlinear regression – SIR and RW-MH
- Class of 02/09/2021 – Bayesian AR(p) – conjugate analysis vs Gibbs sampler
- Class of 02/09/2021 – Comparing MCMC strategies – Gibbs, MH, block/single
- Class of 02/09/2021 – Nonlinear regression – comparing SIR and Gibbs+RWMH
- Class 0f 02/09/2021 – Bimodal posterior: comparing random-walk MH and independent MH + R code
- Class of 02/09/2021 – Linear Gaussian regression with Normal-Half-Cauchy prior – MCMC with Gibbs and RWMH steps + R code
- Class of 03/02/2021 – Petrobras (PBR): ARCH(1,1) + GARCH(1,1)
- Class of 03/02/2021 – Petrobras (PBR): garchFit – bayesGARCH – rstan (stan file)
- Class of 03/02/2021 – Modeling S&P 500 realized volatility & log returns (stan code + graphs + data)
- Class of 03/09/2021 – Modeling COVID-19 death: an exercise in state-space modeling
- Class of 03/09/2021 – Hamilton’s (2017) paper “Why you should never use the HP filter”
- Class of 03/23/2021 – SV-AR(1) for PBR: MCMC, SMC/particle filter and sequential MCMC (data)
- Class of 04/06/2021 – Univariate stochastic volatility, factor analysis, & factor stochastic volatility
- Class of 04/06/2021 – Bayesian time-varying covariance: DCC and FSV models (R code)
TEACHING MATERIAL
PART I: Basic univariate time series
- Autoregressive (AR) models and moving average (MA) models (HTML output)
- Unit-root nonstationarity and long-memory processes (HTML output)
- Seasonal models
PART II: Basic Bayes
- Bayesian ingredients
- Monte Carlo (MC) methods
- Markov chain Monte Carlo (MCMC) methods
- Using stan/rstan for approximate Bayesian inference via Hamiltonian MC (HMC) methods
- MC and MCMC: Key References
PART III: Garch-type, dynamic linear and stochastic volatility models – MCMC and SMC
- ARCH/GARCH-type models
- Dynamic linear models (DLMs)
- Nonlinear dynamic model – MCMC sampling individual states conditional on all other states
- Sequential Monte Carlo – pure filter
- Sequential Monte Carlo – parameter learning
- Stochastic volatility models
- Using R packages “stochvol” & “rstan” for SV with Gaussian or Student’s t errors
PART IV: Multivariate time series
- Vector autoregressive models
- Large BVAR, FAVAR, TVP-BVAR & BFAVAR
- Factor models (Standard factor analysis, Spatial dynamic factors, Factor stochastic volatility)
- Time-varying covariance models
Additional reading material on Bayesian time series
- Bayesian Statistics (a very brief introduction) – Ken Rice, April, 2014
- Lopes and Salazar (2006) Bayesian model uncertainty in smooth transition autoregressions, Journal of Time Series Analysis, 27, 99-117.
- Huerta and Lopes (2000) Bayesian forecasting and inference in latent structure for the Brazilian industrial production index, Brazilian Review of Econometrics, 20, 1-26.
- Kleibergen and Hoek (2000) Bayesian Analysis of ARMA Models. Tinbergen Institute Discussion Paper.
- Marriott, Ravishanker, Gelfand and Pai (1995) Bayesian Analysis of ARMA Processes: Complete Sampling Based Inference under Exact Likelihoods. Bayesian Statistics and Econometrics: Essays in honor of Arnold Zellner. Berry, Chaloner and Geweke, eds., John Wiley & sons, 241-254.
R stuff
Radford Neal’s 13 lectures about R
McLeod, Yu and Mahdi’s (2012) Time Series Analysis with R
MATERIAL FROM PREVIOUS YEARS (2018-2020)
Homework assignments and take-home exams
- Take-home midterm exam 2020
- Take-home midterm exam 2019
- Midterm exam 2017: solution
- HW1 2020
- HW2 2020
- HW3 2020: R code
- HW4 2020: VAR e BVAR para dados de consumo de energia em 7 estados brasileiros (R code)
- HW1 2019: Simple MA model + Exercises 1.1 to 1.5 and 2.1 to 2.4 of Tsay (2010).
- HW2 2019: dataset
- HW3 2019: VAR and BVAR (problems 2.4, 2.5 and 2.6 of Tsay (2014) Multivariate Time Series)
- HW2 2018: Turn in exercises 3.1 and 3.2 of Hamilton (1994) and exercises 2.7, 2.8 and 2.9 of Tsay (2010).
- HW3 2018: MC Integration, SIR & Gibbs sampler
- HW1 2017: Solution to 2b and 2c + Additional MC exercise + Solution to 2d.
- HW2 2017: dataset
Paper presentations
- Del Negro and Schorfheide (2004) Priors from General Equilibrium Models for VARS. IER, 45, 643-673.
- Banbura, Giannone and Reichlin (2010) Large BVARs, JAE, 25(1), 71-92.
- Koop and Korobilis (2013) Large TVP VARs. JoE, 177, 185-198.
- Giannone, Lenza and Primiceri (2015) Prior selection for VARs. The Review of Economics and Statistics, 97,436-451.
- Carriero, Clark and Marcellino (2015) BVARs: Specification choices and forecast accuracy. JAE, 30, 46-73.
- Chan and Eisenstat (2018) Bayesian model comparison for TVP VARs with SV. JAE, 33, 509-532.
- Carriero, Clark and Marcellino (2019) Large BVARs with SV and flexible priors. JoE, 212, 137-154.
- Kastner and Huber (2020) Sparse Bayesian vector auto-regressions in huge dimensions, JoF, 30(7), 1142-1165.
- Korobilis and Pettenuzzo (2020) Adaptive hierarchical priors for high-dimensional VARs. JoE, 212(1), 241-271.
- Koop, Korobilis and Pettenuzzo (2019) Bayesian Compressed VARs. JoE, 210, 135-154.
Examples developed in class
- Week of 01/13/2020: ACF of white noise and random walk processes
- Week of 01/13/2020: AR(3) simulation exercise(R markdown code)
- Week of 01/20/2020: AR(1) models: predictive analysis (R markdown code)
- Week of 01/27/2020: AR(3) models: Gibbs sampler (html) (Rmarkdown code )
- Week of 01/27/2020: Our first Metropolis-Hastings algorithm
- Week of 01/27/2020: Bayesian regression with the normal-gamma prior
- Class of 01/15/2019: ACF of white noise and random walk processes
- Class of 01/22/2019: AR(3) simulation exercise(R markdown code)
- Class of 01/24/2019: Gaussian and non-Gaussian GARCH models + Rmarkdown + Petrobras data
- Class of 01/29/2019: Our first state-space model: AR(1) plus noise model
- Class of 01/31/2019: Linear regression with AR(1) errors (graphs)
- Class of 02/05/2019: AR(1) plus noise model: FFBS
- Class of 02/07/2019: AR(1) plus noise model: block-move vs single-move
- Class of 02/26/2019: My first particle filter
- Class of 02/28/2019: SV-AR(1): MCMC & SMC + (R code)
- Class of 03/01/2019: SV & FSV + (R code) + (dados)
- Class of 03/26/2019: DCC-GARCH & FSV + (R code)
- Class of April 19th, 2018: ACF of white noise and random walk processes
- Class of May 3rd, 2018: AR(3) simulation exercise(R markdown code)
- Class of May 8th, 2018: Sampling distribution of the Dickey-Fuller ratio
- Class of May 11th, 2018: SARIMA for unemployment rate in Sao Paulo (R code)
- Class of May 15th, 2018: Sequential Bayesian learning
- Class of May 17th, 2018: Monte Carlo integration/simulation
- Class of May 22nd, 2018: Gaussian AR(2) model with conditionally conjugate priors: Gibbs Sampler
- Class of May 24th, 2018: Bayesian linear regression
- Class of May 24th, 2018: Binomial model and mixture of betas prior: comparing SIR and Metropolis-Hastings schemes
- Class of May 25th, 2018: Bayesian CAPM
- Class of May 29th, 2018: Gaussian and non-Gaussian GARCH models + Rmarkdown + Petrobras
- Class of June 5th, 2018: Modeling time-varying variances via stochastic volatility (SV) models
- Class of June 14th, 2018: Hidden Markov model: forward filtering, backward sampling (Rmd code)
- Class of June 21st, 2018: VAR homework (Due date: July 5th 10:30 am)
- Class of April 18th, 2017: R code for the AR(2) example worked in class
- Class of April 18th, 2017: R for Shumway and Stoffer’s chapter 1
- Class of April 18th, 2017: More R code for the AR(1) and AR(2) processes (Slides)
- Class of April 18th, 2017: R markdown script (run via Rstudio) (PDF output or HTML output)
- Class of April 25th, 2017: Monte Carlo exercise: studying the sampling behavior of the t test under unit root
- Class of May 1st, 2017: Bayesian inference for the Gaussian AR(2) model (R code)
- Class of May 8th, 2017: Computing pi via rejection sampling: our first MC sampling scheme
- Class of May 8th, 2017: MC integration for a simple normal-normal example
- Class of May 8th, 2017: Gibbs sampler for AR(1) model with a changing point (changing in the intercept)
- Class of May 16th, 2017: Brazilian monthly production of cement (January 2002 to February 2017)
- Class of May 16th, 2017: AR(1) plus noise + Figure 1 +Figure 2.
- Class of May 16th, 2017: AR(1) plus noise – Kalman filter and smoother + Figures.
- Class of May 16th, 2017: AR(1) plus noise – Bayesian inference via MCMC/FFBS + Figures.
- Class of May 16th, 2017: AR(1) plus noise – Modeling Alcoa realized volatilities via 1st order DLM + Data.
- Class of May 24th, 2017: AR(1) plus noise – Comparing block move (FFBS) with single move MCMC schemes
- Class of June 6th, 2017: Linear regression with Markov switching intercept – R code + Figures
Time-Series-PhD-ASU
Course: TIME SERIES (Spring 2023)
Professor: Hedibert Freitas Lopes – www.hedibert.org
Lectures: Mondays and Wednesdays, from 10:30am to 11:45am (January 10th to April 27th)
Office hours: Wednesdays, from 11am to 12pm (by appointment only)
Classroom: Social Sciences 205
ATTENTION: This is an advanced time series course (see course description below!). A strong background in calculus, probability, statistics and matrix algebra is highly beneficial.
Syllabus
Course description: The main goal of the course is to make the student familiar with and able to implement univariate and multivariate modern time series models. Univariate time series models we will consider include the family of autoregressive (fractionally) integrated moving average (ARIMA) models, dynamic linear models (aka state-space) models, Markov switching models, generalized autoregressive conditionally heteroskedastic (GARCH) and stochastic volatility (SV) models. Multivariate time series models we will considere include vector autoregressive (VAR) models, factor-augmented VARs, dynamic factor models and various time-varying covariance models. The inferential approach of this course is predominantly Bayesian, so we will briefly introduce key ingredients of Bayesian inference, model selection and criticism. An introduction to the main Monte Carlo methods for Bayesian inference, such as MC integration, sampling-importance-resampling (SIR), Markov chain Monte Carlo (MCMC) and sequential MC (SMC), will also be introduced. All classroom examples and implementations as well as projects will be carried out by the open-source statistical software R.
Key topics covered will be:
- PART I: Basic univariate time series models: AR, MA and ARMA models; unit-root non-stationarity and long-memory processes; seasonal models.
- PART II: Bayesian ingredients (prior, likelihood, posterior, predictive, Bayes factor and posterior model probability); Monte Carlo (MC) methods (MC integration, sampling importance resampling (SIR)) and Markov chain Monte Carlo (MCMC) methods (Gibbs sampler and Metropolis-Hastings (MH) algorithms).
- PART III: More univariate time series: ARCH/GARCH models; EGARCH, GARCH-M, TGARCH; Bayesian GARCH; Bayesian inference in the local level model; Dynamic models; Stochastic volatility models. We will use MCMC as well as sequential Monte Carlo (SMC) schemes to perform batch and online posterior inference.
- PART IV: Multivariate time series models: Vector autoregressive (VAR) models; Large Bayesian VAR (BVAR) models, factor augmented VAR (FAVAR) models, time-varying parameter BVAR (TVP-BVAR) models, Bayesian FAVAR (BFAVAR) models; Factor models and time-varying covariance models.
Useful textbooks:
Homework assignments: HW are to be delivered via a single PDF file to hedibert@gmail.com
- HW1 – Due date: At the beginning of class on Tuesday, January 31st, 2023
- HW2 – Due date: At the beginning of class on Thursday, February 23rd, 2023.
- HW3 – Due date: At the beginning of class on Thursday, March 16th, 2023.
- Fit Gaussian and Student’s t GARCH(1,1) to your favorite returns (Coke, Apple, Amazon, S&P, etc) using the R packages garchFit and bayesGARCH that I have used in class. Feel free to add other (non-Bayesian) GARCH-type fits based on the ARCH-glossary that we have discussed in class. Use data between January 2005 and December 2022, so you are including the 2007-2008 financial crisis, as well as the 2020-2021 COVID pandemic. Comment your findings.
- HW4 – Due date: At the beginning of class on Thursday, April 6th, 2023.
- Inspired by HW3 (above), fit Gaussian and Student’s t SV-AR(1) models, as well their extended versions that contemplate leverage effect (skewed effect between large positive returns and large negative returns), to your favorite returns (Coke, Apple, Amazon, S&P, etc) using the R packages stochvol (by Gregor Kastner). Use data between January 2005 and December 2022, so you are including the 2007-2008 financial crisis, as well as the 2020-2021 COVID pandemic. Comment your findings, including comparisons with the GARCH-type models from HW3. Hint: We basically perform this task in Section 5 of the following example: sv-ar(1) for S&P500 returns. Have fun!
List of papers for final presentation – Due date: 12pm, May 2nd 2023.
Your final evaluation has two parts AND both need to be turned on May 2nd 2023 at noon: a) A five-page summary of the paper, and b) recorded 15-minute presentation of the paper plus slides.
- Stefano Chiaradonna – Cyber risk measurement via loss distribution approach and GARCH model, Communications for Statistical Applications and Methods, 2023, Vol. 30, No. 1, 75–94. By Sanghee Kim and Seongjoo Song. https://doi.org/10.29220/CSAM.2023.30.1.075
- John Schiele – On the long run volatility of stocks: time-varying predictive systems, Journal of the American Statistical Association, 2018, 113, 1050-1069. By Carlos Carvalho, Hedibert Lopes & Robert McCulloch.
- Lydia Gabric – Bayesian prediction of risk measurements using copulas, in Bocker, K. (Ed.) Rethinking Risk Measurement and Reporting: Uncertainty, Bayesian Analysis and Expert Judgement, 2010, 553-578. By Ausin and Lopes. https://hedibert.org/wp-content/uploads/2013/12/ausin-lopes-2010.pdf
- Shuai Zhu – Bayesian generalizations of the integer-valued autoregressive model, Journal of Applied Statistics. By Graziadei, Lopes and Marques (2020)
- Chukwudi Obite – Simulation-based sequential analysis of Markov switching stochastic volatility models, Computational Statistics and Data Analysis, 51 (9), 4526-4542. By Carvalho and Lopes (2006)
- Fan Wu – Time Varying Structural Vector Autoregressions and Monetary Policy, The Review of Economic Studies, Vol. 72, No. 3, 821-852. By Primiceri (2005)
- Xianjian Xie – Sparse Bayesian vector auto-regressions in huge dimensions, Journal of Forecasting, 30(7), 1142-1165. By Kastner and Huber (2020)
TEACHING MATERIAL
PART I: Basic univariate time series
- Autoregressive (AR) models and moving average (MA) models (HTML output)
- Unit-root nonstationarity and long-memory processes (HTML output)
- Seasonal models
PART II: Basic Bayes
- Bayesian ingredients
- Bayesian computation
PART III: Garch-type, dynamic linear and stochastic volatility models
- Glossary of ARCH models
- Dynamic models
- Sequential Monte Carlo – pure filter
PART IV: Multivariate time series
- Vector autoregressive models (VAR) part one
- VAR part two: Large BVAR, FAVAR, TVP-BVAR & BFAVAR
- Bayesian factor analysis (BFA)
- Time-varying covariance modeling
Bonus topic: Time series meet machine learning
Old homework assignments (spring 2022): HW1 (Solution) + HW2 + HW3 (Derivations + R code) + HW4: Fit Gaussian and Student’s t GARCH(1,1) to your favorite returns (Coke, Apple, Amazon, S&P, etc) using the R packages garchFit and bayesGARCH that I have used in class. Feel free to add other (non-Bayesian) GARCH-type fits based on the ARCH-glossary that we have discussed in class. Use data between January 2005 and December 2021, so you are including the 2007-2008 financial crisis, as well as the 2020-2021 COVID pandemic. Comment your findings.
Advanced-Bayes-PhD-ASU
Course: ADVANCED BAYESIAN STATISTICAL LEARNING (Spring 2023)
Professor: Hedibert Freitas Lopes – www.hedibert.org
Lectures: Tuesdays and Thursdays, from 9:00am to 10:15am (January 10th to April 27th)
Office hours: Wednesdays, from 10am to 11am (by appointment only)
Classroom: Social Sciences 205
ATTENTION: This is an advanced Bayesian course (see course description below!). A strong background in calculus, probability, statistics and matrix algebra is highly beneficial.
Syllabus
Course description: The end of the course goal is to expose the student to modern Bayesian solutions to highly structured and stochastic real world problems. We will visit well known Bayesian issues, such as prior specification/sensitivity, model comparison/criticism and model averaging, as well as Bayesian computation via various Monte Carlo methods. We approach regularization in linear and log-linear models via Bayesian LASSO, Spike-and-Slab priors and related sparsity-inducing priors. We cover decoupling shrinkage and selection strategies in a fully Bayesian decision framework. Other topics covered are finite and infinite mixtures for Bayesian semi- and non-parametric modeling, large-scale (dynamic/spatial) factor models, Bayesian additive regression trees (BART), Bayesian text modeling and modeling large-scale time-varying covariance matrices. All classroom examples and implementations as well as projects will be carried out by the open-source statistical software R.
Useful textbooks:
- Gamerman and Lopes (2006) MCMC: Stochastic Simulation for Bayesian Inference, Second Edition. Chapman & Hall/CRC. http://www.dme.ufrj.br/mcmc.
- Gelman, Carlin, Stern, Dunson, Vehtari and Rubin (2020) Bayesian Data Analysis, Third Edition. Chapman & Hall/CRC. http://www.stat.columbia.edu/~gelman/book/BDA3.pdf
- Hoff (2009) A First Course in Bayesian Statistical Methods. Springer.
- Migon, Gamerman and Louzada (2015) Statistical Inference: An Integrated Approach, Second Edition, Chapman & Hall/CRC.
Homework assignments: HW are to be delivered via a single PDF file to hedibert@gmail.com
- HW1 – Due at the beginning of class, February 7th, 2023.
- HW2 – Due at the beginning of class, February 23rd, 2023.
- HW3 – Due at the beginning of class, March 16th, 2023
- HW4 – Due at the beginning of class, April 13th, 2023
List of papers for final presentation – Due date: 12pm, May 2nd 2023.
Your final evaluation has two parts AND both need to be turned on May 2nd 2023 at noon: a) A five-page summary of the paper, and b) recorded 15-minute presentation of the paper plus slides.
- The illusion of the illusion of sparsity – Fava and Lopes (2021), Brazilian Journal of Probability and Statistics, 35(4), 699-720 https://arxiv.org/abs/2009.14296 (Bryan Lietz)
- A weakly informative default prior distribution for logistic and other regression models – Gelman, Jakulin, Pittau and Zu (2008), Annals of Applied Statistics, 2(4), 1360-1383. https://doi.org/10.1214/08-AOAS191 (Tyler Hoffman)
- Do forecasts of bankruptcy cause bankruptcy? A machine learning sensitivity analysis – Papakostas, Hahn, Murray, Zhou and Gerakos (2023), Annals of Applied Statistics, 17(1), 711-739.
https://doi.org/10.1214/22-AOAS1648 (Yang Ba)
- Forecasting with many predictors using Bayesian additive regression trees – Pruser (2019), Journal of Forecasting, Volume38, Issue7, November 2019, Pages 621-631. https://doi.org/10.1002/for.2587 (Mina Jiang)
TEACHING MATERIAL
Bayesian ingredients
Old homework assignments (spring 2022): HW1 (Solution) + HW2 (Solution) + HW3 (Derivations + R code) + HW4 (Solution)
Pool of papers from final presentation (spring 2022)
Aprendizagem Bayesiana
Course: APRENDIZAGEM BAYESIANA – Mestrado Profissional em Economia (MPE)
Professor: Hedibert Freitas Lopes – www.hedibert.org
Monitoria: Henrique Bolfarine
Syllabus: Baixe aqui
Homework assignments
- HW1: Use the your own y and X in the attached R code (Due: 7:30pm, Monday, March 9th, 2020)
- HW2: PCA e FA para dados de consumo de energia em 7 estados brasileiros
Course notes (+ R code & references)
Additional supporting material
Econometrics III 2018
Course: ECONOMETRICS III 2018 – Doctoral Program in Business Economics
Professor: Hedibert Freitas Lopes – www.hedibert.org
We have 26 1.5h lectures between April 17th and July 12
Objective
The main goal of the course is to make the student familiar with and able to implement univariate and multivariate time series models by using both frequentist and Bayesian approaches. All classroom examples and implementations as well as projects will be carried out by the open-source statistical software R.
Course description
Brief review of frequentist inference followed by the introduction of key ingredients of Bayesian inference, model selection and criticism. An introduction to the main Monte Carlo methods for Bayesian inference: MC integration, resampling, MCMC and sequential MC. Univariate time series models, including AR(F)IMA models, state-space models, Markov switching models, GARCH and stochastic volatility models. Multivariate time series models, including Bayesian VARs and factor-augmented VARs, dynamic factor models, time-varying covariance models.
Course outline
- PART I: Basic univariate time series models
- AR, MA and ARMA models
- Unit-root nonstationarity and long-memory processes
- Seasonal models
- PART II: Basic Bayes
- Bayesian ingredients
- Monte Carlo methods
- Markov chain Monte Carlo methods
- PART III: More univariate time series
- ARCH/GARCH models
- EGARCH, GARCH-M, TGARCH
- Bayesian GARCH
- Bayesian inference in the local level model
- Dynamic models
- Stochastic volatility models
- PART IV: Multivariate time series models
- Vector autoregressive models
- Large BVAR, FAVAR, TVP-BVAR & BFAVAR models
- Factor models
- Time-varying covariance models
Bibliography
Teaching assistant: William Rojas (williamrojas1212 at gmail dot com)
Office hours: Wednesdays, from 8am to 9:30am, room 201. Except on May 9th, room Mario Haberfeld.
Homework assignments
- HW1: Due date: April 19th, 10:30am – Read chapter one and chapter two (Sections 2.1 to 2.5) of Tsay (2010) and turn in exercises 1.1 to 1.5 (pages 25 to 28) and exercises 2.2 to 2.4 (pages 104 to 105). You can work individually or in pairs.
- HW2: Due date: May 17th, 10:30am – Turn in exercises 3.1 and 3.2 (pages 70-71) of Hamilton’s (1994) Time Series Analysis and exercises 2.7, 2.8 and 2.9 (pages 105 to 106) of Tsay’s (2010) Analysis of Financial Time Series. You can work individually or in pairs.
- HW3: Due date: May 29th, 10:30am – MC Integration, SIR, Gibbs sampler
- HW4: Due date: July 5th, 10:30am – VAR and BVAR (problems 2.4, 2.5 and 2.6)
EXAMPLES DEVELOPED IN CLASS
- Class of April 19th, 2018: ACF of white noise and random walk processes
- Class of May 3rd, 2018: AR(3) simulation exercise(R markdown code)
- Class of May 8th, 2018: Sampling distribution of the Dickey-Fuller ratio
- Class of May 11th, 2018: SARIMA for unemployment rate in Sao Paulo (R code)
- Class of May 15th, 2018: Sequential Bayesian learning
- Class of May 17th, 2018: Monte Carlo integration/simulation
- Class of May 22nd, 2018: Gaussian AR(2) model with conditionally conjugate priors: Gibbs Sampler
- Class of May 24th, 2018: Bayesian linear regression + Binomial model and mixture of betas prior: comparing SIR and Metropolis-Hastings schemes
- Class of May 25th, 2018: Bayesian CAPM
- Class of May 29th, 2018: Gaussian and non-Gaussian GARCH models + Rmarkdown + Petrobras
- Class of June 5th, 2018: Modeling time-varying variances via stochastic volatility (SV) models
- Take home exam
- Class of June 14th, 2018: Hidden Markov model: forward filtering, backward sampling (Rmd code)
- Class of June 19th, 2018: Papers for final presentations
- Class of June 21th, 2019: VAR homework (Due date: July 5th 10:30 am)
TEACHING MATERIAL
PART I: Basic univariate time series
PART II: Basic Bayes
PART III: More univariate time series
- ARCH/GARCH models
- EGARCH, GARCH-M, TGARCH
- Bayesian GARCH
- Dynamic models (aka state-space models) and stochastic volatility (SV) models
PART IV: Multivariate time series
- Vector autoregressive models
- Large BVAR, FAVAR, TVP-BVAR & BFAVAR
- Factor models (Standard factor analysis, Spatial dynamic factors, Factor stochastic volatility)
- Time-varying covariance models
Additional material on Bayesian time series
- Bayesian Statistics (a very brief introduction) – Ken Rice, April, 2014
- Lopes and Salazar (2006) Bayesian model uncertainty in smooth transition autoregressions, Journal of Time Series Analysis, 27, 99-117.
- Huerta and Lopes (2000) Bayesian forecasting and inference in latent structure for the Brazilian industrial production index, Brazilian Review of Econometrics, 20, 1-26.
- Kleibergen and Hoek (2000) Bayesian Analysis of ARMA Models. Tinbergen Institute Discussion Paper.
- Marriott, Ravishanker, Gelfand and Pai (1995) Bayesian Analysis of ARMA Processes: Complete Sampling Based Inference under Exact Likelihoods. Bayesian Statistics and Econometrics: Essays in honor of Arnold Zellner. Berry, Chaloner and Geweke, eds., John Wiley & sons, 241-254.
R stuff
Radford Neal’s 13 lectures about R
McLeod, Yu and Mahdi’s (2012) Time Series Analysis with R
Bayesian Econometrics 2017
Course: BAYESIAN ECONOMETRICS 2017 – Doctoral Program in Business Economics
Professor: Hedibert Freitas Lopes – www.hedibert.org
Syllabus
Objective
The end of the course goal is to allow the student to critically decide between a Bayesian, a frequentist or Bayesian-frequentist compromise when facing real world problems in the fields of micro-econometrics, macro-econometrics, marketing and finance. With this end in mind, we will visit well known Bayesian issues, such as prior specification and model comparison and model averaging, but also study regularization, “small n, large p” issues, Bayesian statistical learning (additive regression trees) and large-scale factor models.
Course description
Basic ingredients: prior, posterior, and predictive distributions, sequential Bayes, conjugate analysis, exchangeability, principles of data reduction and decision theory. Model criticism: Bayes factor, computing marginal likelihoods, Savage-Dickey ratio, reversible jump MCMC, Bayesian model averaging and deviance information criterion. Modern computation via (Markov chain) Monte Carlo methods: Monte Carlo integration, sampling-importance resampling, Gibbs sampler, Metropolis-Hastings algorithms. Mixture models, Hierarchical models, Bayesian regularization, Instrumental variables modeling, Large-scale (sparse) factor modeling, Bayesian additive regression trees (BART) and related topics, Dynamic models, Sequential Monte Carlo algorithms, Bayesian methods in microeconometrics, macroeconometrics, marketing and finance
Course notes (+ R code & references)
Miscellaneous
Econometrics III 2017
Course: ECONOMETRICS III 2017 – Doctoral Program in Business Economics
Professor: Hedibert Freitas Lopes – www.hedibert.org
Syllabus
Objective
The main goal of the course is to make the student familiar with and able to implement univariate and multivariate time series models by using both frequentist and Bayesian approaches. All classroom examples and implementations as well as projects will be carried out by the open-source statistical software R.
Course description
Brief review of frequentist inference followed by the introduction of key ingredients of Bayesian inference, model selection and criticism. An introduction to the main Monte Carlo methods for Bayesian inference: MC integration, resampling, MCMC and sequential MC. Univariate time series models, including AR(F)IMA models, state-space models, Markov switching models, GARCH and stochastic volatility models. Multivariate time series models, including Bayesian VARs and factor-augmented VARs, dynamic factor models, time-varying covariance models.
Bibliography
Teaching assistant: Paloma Uribe (paloma dot uribe at gmail dot com)
Office hours: Thursdays from 1pm to 2pm (Room 604)
Day-to-day announcements
Homework assignments & Exams
April, 18th to 24th 2017:
April, 25th to 30th 2017:
May, 1st to May 7th 2017:
May 8th to May 15th 2017:
This week we started talking about two relatively simple and well-known time series models with time-varying variances: ARCH(1) and GARCH(1,1) models. We also considered a standard Bayesian approach to posterior inference regarding the GARCH(1,1) model. Chapter 3 of Tsay’s (2010) book is good enough as an introduction to the subject, particularly sections 3.4 (ARCH model) and 3.5 (GARCH model). A few exercises from his book are worth trying to fix most the ideas. I recommend the following exercises: 3.1, 3.2, 3.3, 3.4, 3.5 and 3.6.
May, 16th to May 23rd 2017:
May, 24th to June 5th 2017:
- AR(1) plus noise – Comparing block move (FFBS) with single move MCMC schemes: HTML
June 6th to June 15th 2017:
COURSE SLIDES AND TEACHING MATERIAL
PART I: Basic univariate time series
PART II: Basic Bayes
PART III: More univariate time series
- ARCH/GARCH models
- EGARCH, GARCH-M, TGARCH
- Bayesian GARCH
- Dynamic models (aka state-space models) and stochastic volatility (SV) models
PART IV: Multivariate time series
- Vector autoregressive models
- Large BVAR, FAVAR, TVP-BVAR & BFAVAR
- Factor models (Standard factor analysis, Spatial dynamic factors, Factor stochastic volatility)
- Time-varying covariance models
Additional material
- Bayesian Statistics (a very brief introduction) – Ken Rice, April, 2014
- Lopes and Salazar (2006) Bayesian model uncertainty in smooth transition autoregressions, Journal of Time Series Analysis, 27, 99-117.
- Huerta and Lopes (2000) Bayesian forecasting and inference in latent structure for the Brazilian industrial production index, Brazilian Review of Econometrics, 20, 1-26.
- Kleibergen and Hoek (2000) Bayesian Analysis of ARMA Models. Tinbergen Institute Discussion Paper.
- Marriott, Ravishanker, Gelfand and Pai (1995) Bayesian Analysis of ARMA Processes: Complete Sampling Based Inference under Exact Likelihoods. Bayesian Statistics and Econometrics: Essays in honor of Arnold Zellner. Berry, Chaloner and Geweke, eds., John Wiley & sons, 241-254.
R stuff
Introduction to R (by Paloma Uribe, in Portuguese)
Radford Neal’s 13 lectures about R
McLeod, Yu and Mahdi’s (2012) Time Series Analysis with R
Econometrics III 2016
Course: ECONOMETRICS III 2016 – Doctoral Program in Business Economics
Professor: Hedibert Freitas Lopes – www.hedibert.org
Syllabus
Objective
The main goal of the course is to make the student familiar with and able to implement univariate and multivariate time series models by using both frequentist and Bayesian approaches. All classroom examples and implementations as well as projects will be carried out by the open-source statistical software R.
Course description
Brief review of frequentist inference followed by the introduction of key ingredients of Bayesian inference, model selection and criticism. An introduction to the main Monte Carlo methods for Bayesian inference: MC integration, resampling, MCMC and sequential MC. Univariate time series models, including AR(F)IMA models, state-space models, Markov switching models, GARCH and stochastic volatility models. Multivariate time series models, including Bayesian VARs and factor-augmented VARs, dynamic factor models, time-varying covariance models.
Bibliography
Course material
- Bayesian ingredients
- GL2006: Examples 2.1, 2.4, 2.5, section 2.3.2, problems 2.1, 2.2, 2.5, 2.8a, 2.10 and 2.11.
- Monte Carlo methods
- Markov chain Monte Carlo methods
- Autoregressive (AR) models and moving average (MA) models
- Tsay (2010): Sections 2.1-2.6, sections 2.7 and 2.11, and sections 3.1-3.8
- Unit-root nonstationarity and long-memory processes
- Seasonal models
- ARCH/GARCH models (EGARCH, GARCH-M, TGARCH) (Bayesian GARCH)
- Dynamic models (aka state-space models) and stochastic volatility (SV) models
- Homework (due June 1st) (solution)
- Vector autoregressive models
- Large BVAR, FAVAR, TVP-BVAR & BFAVAR
- Sequential Monte Carlo methods
- Factor models (Standard factor analysis, Spatial dynamic factors, Factor stochastic volatility)
- Time-varying covariance models
Additional material
- Bayesian Statistics (a very brief introduction) – Ken Rice, April, 2014
- Lopes and Salazar (2006) Bayesian model uncertainty in smooth transition autoregressions, Journal of Time Series Analysis, 27, 99-117.
- Huerta and Lopes (2000) Bayesian forecasting and inference in latent structure for the Brazilian industrial production index, Brazilian Review of Econometrics, 20, 1-26.
- Kleibergen and Hoek (2000) Bayesian Analysis of ARMA Models. Tinbergen Institute Discussion Paper.
- Marriott, Ravishanker, Gelfand and Pai (1995) Bayesian Analysis of ARMA Processes: Complete Sampling Based Inference under Exact Likelihoods. Bayesian Statistics and Econometrics: Essays in honor of Arnold Zellner. Berry, Chaloner and Geweke, eds., John Wiley & sons, 241-254.
Econometria 2016-2
Disciplina: ECONOMETRIA – Turma 4ECO (Economia)
Período Letivo: 2016/2
Professor: Hedibert Freitas Lopes – www.hedibert.org
Monitora: Paloma Vaissman Uribe – PalomaVU@insper.edu.br
Programa de Ensino
Conteudo das aulas
- Apresentacao do curso
- Regressao Linear Simples – Parte 1: Minimos Quadrados Ordinarios & R2
- Regressao Linear Simples – Parte 2: formas funcionais
- Regressao Linear Simples – Parte 3: Suposicoes, propriedades e teste-t
- Analise de residuos
- Regressao linear multipla
- Regressao linear multipla – Vies de omissao de variable
- Regressao linear multipla – Teste F Parcial
- Heteroscedasticidade
- Endogeneidade: variaveis instrumentais (worked examples)
- Endogeneidade: estimacao
- Endogeneidade: tests
- Basic time series – Codigo R
Listas de exercicios
Lista 1 (Regressao linear simples): Wooldridge – 2.2, 2.3, 2.4, 2.5, 2.7, 2.9, 2.11
Lista 2 (Regressao linear multipla): Wooldridge – 3.3, 3.4, 3.5, 3.7, 3.9, 4.2, 4.3, 4.6, 4.9, 4.11
Lista 3 (Heteroscedasticidade): Wooldridge – 8.1, 8.2, 8.3, 8.4, 8.5 + refazer os exemplos 8.1 (pg 250-251), 8.2 (pg 252), 8.4 (pg 257) e 8.7 (pg 268-269)
Lista 4 (Endogeneidade): Wooldridge – 15.1, 15.2, 15.3, 15.7, 15.8 e 15.10 + refazer os exemplos 5.1 (pg 476-477), 5.2 (pg 477-478), 5.3 (pg 480-481) e 5.4 (pg 484-486).
Lista 5 (Series temporais): Wooldridge – Refazer os exemplos 10.1, 10.2, 10.3, 10.4, 10.7, 10.9, 11.3, 11.4, 11.5, 11.6 e 11.7 + problemas 11.1, 11.2, 11.3, 11.4 e 11.5
Atividade 2
Dados por escola do ENEM2015 – Analise exploratoria dos dados – Codigo R
Econometria em R
- Tutorial de R: aula 1
- Tutorial de R: aula 2
- Tutorial de R: regressao
- Tutorial de R: R2
- Introducao ao uso do R (Paloma Uribe)
- Using R for Introductory Econometrics (Florian Heiss)
- Econometric and time series modeling using R (Cribari-Neto)
- Introduction to programmingEconometrics with R (Bruno Rodrigues)
- Econometrics in R: Past, Present, and Future (Achim Zeileis & Roger Koenker)
- CRAN Task View: Econometrics (Achim Zeileis)
- R-Econometrics – Learn R for applied economics in a comprehensive way
Econometria em outras linguagens/pacotes
- PYTHON: Introductory Econometrics – Jeffrey M. Wooldridge: Capítulos 2 ao 8 usando PYTHON
- Kevin Sheppard’s Python for Econometrics
- STATA: Introductory Econometrics – Jeffrey M. Wooldridge: Capítulos 2 ao 18 usando STATA
- Statistical Analysis in R, MATLAB, SAS, STATA and SPSS
Mais conjuntos de dados
-
-
- VendedoresRH.xls
- salario.txt: Salario vs posicao, anos de experiencia e sexo
- retornos-2014.csv: Retornos de Ambev, Vale, Petrobras, JBS, Natura, Gafisa, Lojas Americanas & Ibovespa
- vs posicao, anos de experiencia e sexo
- bankwages.txt: Salario vs salario inicial, educacao, sexo, minoria e categoria
- wage2-wooldridge.txt: Monthly earnings, education, IQ, etc for 935 men in 1980
- gpa2-wooldridge.txt: Data on 4,137 US college students
- houseprices.txt: House prices vs offers, sqft, brick, bedrooms, bathrooms and neighborhood
- orangejuice-chicagoarea.csv: Weekly sales of 64oz orange juice containers in the Chicago area
- toyotacorolla.csv: Sales prices and vehicle characteristics of 1436 used Toyota Corollas
- peso-altura-idade-sexo-criancas.txt: peso vs altura, idade e sexo
- gpa2.txt: same as gpa2-wooldridge.txt
- temco.txt
- temcoprod.txt
- hprice1.txt: dados sobre 88 residencias.
- dadosmunicipais.csv: dados de 5848 municipios brasileiros (analfabetismo, renda, desigualdade).
- caliescom.xls: Performance media de 420 escolas em um teste padronizado.
- ceosal2.xls: Data on 177 chief executive officers.
- florida2000.xls: Florida 2000 presidential vote as of 5pm ET Saturday, Nov. 11, 2000 (after recount).
- temcoprod.xls: Caracteristicas de funcionarios de uma firma.
- simulados_producao.xls: Modelagem da produção de firmas de um determinado setor.
- wage1.csv: Wooldridge’s dataset.
- smoke.txt: Wooldridge’s dataset.
- card.csv: Wooldridge’s dataset.
- mroz.csv: Wooldridge’s dataset.
- bwght.csv: Wooldridge’s dataset.
Econometria 2016-1
Disciplina: ECONOMETRIA – Turma 4ECO (Economia)
Período Letivo: 2016/1
Professor: Hedibert Freitas Lopes – www.hedibert.org
Monitora: Paloma Vaissman Uribe – PalomaVU@insper.edu.br
Programa de Ensino
Conteudo das aulas
- Apresentacao do curso
- Regressao Linear Simples – Parte 1: Minimos Quadrados Ordinarios & R2
- Regressao Linear Simples – Parte 2: formas funcionais
- Regressao Linear Simples – Parte 3: Suposicoes, propriedades e teste-t
- Analise de residuos
- Atividade 1 – Dados da PNAD 2009 + readme.txt + Analise exploratoria dos dados
- Madalozzo and Mauriz (2012) Does investing in education reduce the gender wage gap? A Brazilian population study. Population Review, Volume 51, Number 2, pp. 59-84.
- Bertrand, Kamenica and Pan (2015) Gender Identity and relative income within households. Quarterly Journal of Economics.
- USA data: Median usual weekly earnings (second quartile), Employed full time, Wage and salary workers. U.S. Bureau of Labor Statistics, United States Department of Labor.
- pnad2009.R: Codigo R para os dados brasileiros.
- earnings.R: Codigo R para os dados americanos.
- Regressao linear multipla
- Regressao linear multipla – Teste F Parcial + Teste RESET
- Heteroscedasticidade
- Dados para atividade 2: sleep75.csv
- Endogeneidade: variaveis instrumentais (worked examples)
- Endogeneidade: estimacao
- Endogeneidade: tests
- Basic time series – Codigo R
Listas de exercicios
- Lista 1
- Lista 2
- Lista 3
- Lista 4
Econometria em R
- Introducao ao uso do R (Paloma Uribe)
- Using R for Introductory Econometrics (Florian Heiss)
- Econometric and time series modeling using R (Cribari-Neto)
- Introduction to programmingEconometrics with R (Bruno Rodrigues)
- Econometrics in R: Past, Present, and Future (Achim Zeileis & Roger Koenker)
- CRAN Task View: Econometrics (Achim Zeileis)
- R-Econometrics – Learn R for applied economics in a comprehensive way
Econometria em outras linguagens/pacotes
- PYTHON: Introductory Econometrics – Jeffrey M. Wooldridge: Capítulos 2 ao 8 usando PYTHON
- Kevin Sheppard’s Python for Econometrics
- STATA: Introductory Econometrics – Jeffrey M. Wooldridge: Capítulos 2 ao 18 usando STATA
- Statistical Analysis in R, MATLAB, SAS, STATA and SPSS
Mais conjuntos de dados
-
-
- VendedoresRH.xls
- salario.txt: Salario vs posicao, anos de experiencia e sexo
- retornos-2014.csv: Retornos de Ambev, Vale, Petrobras, JBS, Natura, Gafisa, Lojas Americanas & Ibovespa
- vs posicao, anos de experiencia e sexo
- bankwages.txt: Salario vs salario inicial, educacao, sexo, minoria e categoria
- wage2-wooldridge.txt: Monthly earnings, education, IQ, etc for 935 men in 1980
- gpa2-wooldridge.txt: Data on 4,137 US college students
- houseprices.txt: House prices vs offers, sqft, brick, bedrooms, bathrooms and neighborhood
- orangejuice-chicagoarea.csv: Weekly sales of 64oz orange juice containers in the Chicago area
- toyotacorolla.csv: Sales prices and vehicle characteristics of 1436 used Toyota Corollas
- peso-altura-idade-sexo-criancas.txt: peso vs altura, idade e sexo
- gpa2.txt: same as gpa2-wooldridge.txt
- temco.txt
- temcoprod.txt
- hprice1.txt: dados sobre 88 residencias.
- dadosmunicipais.csv: dados de 5848 municipios brasileiros (analfabetismo, renda, desigualdade).
- caliescom.xls: Performance media de 420 escolas em um teste padronizado.
- ceosal2.xls: Data on 177 chief executive officers.
- florida2000.xls: Florida 2000 presidential vote as of 5pm ET Saturday, Nov. 11, 2000 (after recount).
- temcoprod.xls: Caracteristicas de funcionarios de uma firma.
- simulados_producao.xls: Modelagem da produção de firmas de um determinado setor.
- wage1.csv: Wooldridge’s dataset.
- smoke.txt: Wooldridge’s dataset.
- card.csv: Wooldridge’s dataset.
- mroz.csv: Wooldridge’s dataset.
- bwght.csv: Wooldridge’s dataset.
Analise Multivariada 2015
Disciplina: ANALISE MULTIVARIADA 2015 (MPA)
Período Letivo: 2015/1
Professor: Hedibert Freitas Lopes – www.hedibert.org
Monitor: Leandro Augusto Ferreira
Objetivo: O objetivo do curso é apresentar os conceitos e métodos de análise multivariada de dados, aplicando-os a dados reais e interpretando os resultados de forma prática. No curso de análise multivariada são utilizados conceitos de estatística básica e inferência, com ênfase na resolução de problemas reais e interpretação dos resultados. Na maioria dos estudos, a complexidade dos fenômenos estudados faz com seja necessário coletar informações sobre um conjunto de variáveis. A análise multivariada permite o estudo simultâneo de um conjunto de variáveis, aproveitando a estrutura de correlação existente entre as mesmas. Nesta disciplina são apresentadas técnicas de análise de dados quantitativos e qualitativos, discutindo aplicações nas áreas de marketing, operações, recursos humanos e finanças.
Programa de Ensino
03/02/2015: Análise Exploratória de Dados Multivariados
06/02/2015: Inferencia Multivariada – MANOVA
10/02/2015: Análise de Componentes Principais
24/02/2015: Análise Fatorial – parte 1 + Análise Fatorial – parte 2
03/03/2015: Regressao logistica + Analise discriminante
10/03/2015: Correlacao canonica
17/03/2015: Cluster analysis
24/03/2015: Trabalho em sala de aula (terminado em casa) 31/03/2015: Correspondence analysis & multidimensional scaling
10/04/2015: Structural equation modeling
Conjunto de dados:
Bayesian Statistical Learning 2016
Course: Readings in Statistics and Econometrics 2016: Bayesian Statistical Learning
Professors: Hedibert Freitas Lopes & Paulo Marques
Objective: In this Second Readings in Statistics and Econometrics we will study and discuss, through a series of well established papers, the broad topic of Statistical Learning with an emphasis on its natural Bayesian solutions. The 5 lectures and 8 seminars will take place on Fridays between 10am and 12pm from January 29th to April 8th 2016. Paulo and I will give lectures discussing traditional Statistical Learning techniques, alternated with seminars given by the participants on papers presenting Bayesian counterparts to the techniques discussed in the lectures.
Outline of the meetings (5 lectures and 8 seminars)
Books
Papers
-
-
- Chipman, George and McCulloch (2010) BART: Bayesian Additive and Regression Trees. AOAS, 4, 266-298.
- Cucala, Marin, Robert and Titterington (2009) A Bayesian Reassessment of Nearest-Neighbor Classification. JASA, 104, 263-273.
- Griffin and Brown (2010) Inference with normal-gamma prior distributions in regression problems. BA, 5, 171-188.
- Griffin and Brown (2012) Structuring shrinkage: some correlated priors for regression. Biometrika, 99, 481-487.
- Griffin and Brown (2013) Some priors for sparse regression modelling. BA, 8, 691-702.
- Holmes and Adams (2002) A Probabilistic Nearest Neighbour Method for Statistical Pattern Recognition. JRSS-B, 64, 295-306.
- Holmes and Adams (2003) Likelihood Inference in Nearest-Neighbour Classification Models. Biometrika, 90, 99-112.
- Kulis and Jordan (2012) Revisiting k-means: New Algorithms via Bayesian Nonparametrics. Proceedings of the 29th International Conference on Machine Learning, Edinburgh, Scotland.
- Polson and Scott (2011) Data Augmentation for Support Vector Machines. BA, 6, 1-24.
- Polson, Scott and Windle (2014) The Bayesian Bridge. JRSS-B, 76, 713-733.
- Tipping (2001) Sparse Bayesian learning and the Relevance Vector Machine. JMLR, 1, 211-244.
- Marques and Pereira (2013) Predictive Analysis of Microarray Data.
Causality 2015
Readings in Statistics and Econometrics 2015: Causality
Organizer: Hedibert Freitas Lopes
Email: hedibertFL at insper.edu.br
In this First Readings in Statistics and Econometrics we will study and discuss, through a series of well established papers, the broad topic of causality. The lectures are held at INSPER on Tuesdays, from 7:30am to 9:30am, from September 29th to December 1st, 2015, at classroom Paulo Renato de Souza, 2nd floor.
Annotated bibliography: Here you will find links to textbooks and edited books, special issues, articles with discussion and web material: slides of lectures, discussion of causality, video lectures and more (in chronological order).
Annotated bibliography: Only articles and book chapters (in alphabetical order).
Outline of the lectures
-
-
- September 29th – Hedibert Lopes – INSPER
Haavelmo (1943) The statistical implications of a system of simultaneous equations. Econometrica, 11, 1-12.
slides of the lecture
- October 6th – Hedibert Lopes – INSPER
Rubin (1974) Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology, 56, 688-701.
slides of the lecture
- October 13th – Andre Yoshizumi, IME/USP
Holland (1986) Statistics and causal inference (with discussion). JASA, 81, 945-970.
slides of the lecture
- October 20th – Paloma Uribe, IME/USP
Pearl (1995) Causal diagrams for empirical research (with discussion). Biometrika, 82, 669-710.
slides of the lecture + slides of Joao M.P.De Mello’s talk
- November 3rd – Sergio Firpo, EESP/FGV
Angrist, Imbens and Rubin (1996) Identication of causal effects using instrumental variables (with discussion). JASA, 91, 444-472.
slides of the lecture
- November 10th – Julio Trecenti, IME/USP
Dawid (2000) Causal inference without counterfactuals (with discussion). JASA, 95, 407-424.
slides of the lecture
- November 24th – Manasses Nobrega, UFABC
Vansteelandt and Goetghebeur (2003) Causal inference with generalized structural mean
models. JRSS-B, 65, 817-835.
- December 1st – Hedibert Lopes – INSPER
Heckman and Pinto (2015) Causal analysis after Haavelmo. Econometric Theory, 31,115-151.
slides of the lecture
Books & special issues
-
-
- Journal of Econometrics (1988), Volume 39, Issues 1-2
- Spirtes, Glymour and Scheines (2001) Causation, Prediction, and Search (2nd edition)
- Gelman and Meng (2004) Applied Bayesian Modeling and Causal Inference from Incomplete-Data Perspectives
- Dawid (2007) Fundamentals of Statistical Causality
- Morgan and Winship (2007) Counterfactuals and Causal Inference: Methods and Principles for Social Research (2nd edition)
- Angrist and Pischke (2008) Mostly Harmless Econometrics: An Empiricist’s Companion
- Pearl (2009) Causality: Models, Reasoning and Inference (2nd Edition)
- Schroeder (2010) Accounting and Causal Effects: Econometric Challenges
- Berzuini, Dawid and Bernardinelli (2012) {\em Causality: Statistical Perspectives and Applications
- Morgan (2013) Handbook of Causal Analysis for Social Research
- Imbens and Rubin (2015) Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction
- Hernan and Robins (2015) Causal Inference
- Econometric Theory (2015), Volume 31, Issue 01
Articles with discussion
-
-
- Holland (1986) Statistics and causal inference. JASA, 81, 945-970.
- Pearl (1995) Causal diagrams for empirical research. Biometrika, 82, 669-710.
- Angrist, Imbens and Rubin (1996) Identification of causal effects using IVs. JASA, 91, 444-472.
- Dawid (2000) Causal inference without counterfactuals. JASA, 95, 407-424.
- Heckman (2005) The scientific model of causality. Sociological Methodology, 35, 1-150.
From the web
Econometria Avancada 2015
Disciplina: ECONOMETRIA AVANCADA 2015
Período Letivo: 2015/1
Professor: Hedibert Freitas Lopes – www.hedibert.org
Monitor: Paloma Vaissman Uribe
Objetivo: O objetivo do curso é apresentar os conceitos e métodos de análise multivariada de dados, aplicando-os a dados reais e interpretando os resultados de forma prática. No curso de análise multivariada são utilizados conceitos de estatística básica e inferência, com ênfase na resolução de problemas reais e interpretação dos resultados. Na maioria dos estudos, a complexidade dos fenômenos estudados faz com seja necessário coletar informações sobre um conjunto de variáveis. A análise multivariada permite o estudo simultâneo de um conjunto de variáveis, aproveitando a estrutura de correlação existente entre as mesmas. Nesta disciplina são apresentadas técnicas de análise de dados quantitativos e qualitativos, discutindo aplicações nas áreas de marketing, operações, recursos humanos e finanças.
Programa de Ensino
Introducao ao R by Paloma Uribe (Material apresentado na 1a monitoria)
Lista de exercicios
Trabalhos em grupo
Prova intermediaria: Prova (solucao)
Prova final: Solucao
Notas de aula
Codigo R
Textos complementares
Conjuntos de dados
Alguns sites interessantes para o curso
Econometria 2014
Disciplina: ECONOMETRIA 2014 – Turma 4ECO (Economia)
Período Letivo: 2014/1
Professor: Hedibert Freitas Lopes – www.hedibert.org
Objetivo: Apresentar uma abordagem introdutória a Econometria dando ênfase tanto à base estatística quanto a aplicações econômicas. Será discutido, em detalhes, o significado e as implicações das suposições do modelo linear geral. Ainda, serão descritos e aplicados testes de violações das hipóteses do modelo linear geral, bem como serão apresentados e aplicados estimadores alternativos ao de mínimos quadrados ordinários (MQO). Ao final desse curso, o aluno deverá ser capaz de utilizar técnicas estatísticas adequadas para mensurar quantidades de interesse e realizar previsões.
Programa de Ensino
12/02/2014: Apresentacao do curso + Primeiro exemplo
14/02/2014: Regressao Linear Simples – Parte 1: Minimos Quadrados Ordinarios & R2
19/02/2014: Regressao Linear Simples – Parte 2: formas funcionais
21/02/2014: Regressao Linear Simples – Parte 3: Suposicoes, propriedades e teste-t
26/02/2014: Regressao Linear Multipla – Parte 1: Estimacao
28/02/2014: Regressao Linear Multipla – Parte 2: R2 ajustado
07/03/2014: Regressao Linear Multipla – Parte 3: Suposicoes e propriedades
-
-
- Data on monthly earnings, education, several demographic variables, and IQ scores for 935 men in 1980. (Dataset) (Codigo R)
- Data on 4,137 US college students.(Dataset) (Codigo R)
12/03/2014: Regressao Linear Multipla – Parte 4: Inferencia
14/03/2014: Regressao Linear Multipla – Parte 5: Interacao e funcao quadratica
19,21&26/03/2014: Regressao Linear Multipla – Parte 6: Informacao qualitativa atraves de variaveis dummy
11/04/2014: Regressao Linear Multipla – Parte 7: Teste F parcial
16/04/2014: Regressao Linear Multipla – Parte 8: Teoria assintotica
23/04/2014: Regressao Linear Multipla – Parte 9: Teste do multiplicador de Lagrange
-
-
- R code for hprice1 example
- Sawyer (2002) The Method of Lagrange Multipliers.
- Buse (1982) The Likelihood Ratio, Wald, and Lagrange Multiplier Tests: An Expository Note. The American Statistician, Vol. 36, No. 3, Part 1 (Aug., 1982), pp. 153-157.
- Engle (1983) Wald, Likelihood Ratio, and Lagrange Multiplier Tests in Econometrics. In Intriligator and Griliches. Handbook of Econometrics II. Elsevier. pp. 796–801.
25/04/2014: Regressao Linear Multipla – Parte 10: Regression specification error test (RESET)
30/04/2014: Real-time analysis of three datasets
07-09/05/2014: Regressao Linear Multipla – Parte 11: Homocedasticidade
14/05/2014: Trabalho em grupo
16 a 30/05/2014: Endogeneidade & Equacoes simultaneas
Listas de exercicios + gabaritos
Conjuntos de dados
-
-
- VendedoresRH.xls
- salario.txt: Salario vs posicao, anos de experiencia e sexo
- bankwages.txt: Salario vs salario inicial, educacao, sexo, minoria e categoria
- wage2-wooldridge.txt: Monthly earnings, education, IQ, etc for 935 men in 1980
- gpa2-wooldridge.txt: Data on 4,137 US college students
- houseprices.txt: House prices vs offers, sqft, brick, bedrooms, bathrooms and neighborhood
- orangejuice-chicagoarea.csv: Weekly sales of 64oz orange juice containers in the Chicago area
- toyotacorolla.csv: Sales prices and vehicle characteristics of 1436 used Toyota Corollas
- peso-altura-idade-sexo-criancas.txt: peso vs altura, idade e sexo
- gpa2.txt: same as gpa2-wooldridge.txt
- temco.txt
- temcoprod.txt
- hprice1.txt: dados sobre 88 residencias.
- dadosmunicipais.csv: dados de 5848 municipios brasileiros (analfabetismo, renda, desigualdade).
- caliescom.xls: Performance media de 420 escolas em um teste padronizado.
- ceosal2.xls: Data on 177 chief executive officers.
- florida2000.xls: Florida 2000 presidential vote as of 5pm ET Saturday, Nov. 11, 2000 (after recount).
- temcoprod.xls: Caracteristicas de funcionarios de uma firma.
- simulados_producao.xls: Modelagem da produção de firmas de um determinado setor.
Business Statistics 2013
Course: Business Statistics 41000-81/82 – Spring Quarter 2013
Professor: Hedibert Freitas Lopes
Teaching Assistant: Samir Warty
Office hours: Sundays 3pm-4:30pm (April 28th and June 9th: 3pm-5pm)
Location: Gleacher 203 (April 28th and June 9th: Gleacher 204)
Course notes
Course syllabus
Course notes (2 per page) (3 per page)
Old exams
Homework assignments
Midterm and final exams
-
-
- 41000-81: Midterm – 04/29/2013 – 6:00pm-8:00pm
- 41000-81: Final – 06/10/2013 – 6:30pm-9:30pm
- 41000-82: Midterm – 04/30/2013 – 6:00pm-8:00pm
- 41000-82: Final – 06/11/2013 – 6:30pm-9:30pm
Additional class material
Class 8: May 20th and 21st
Class 7: May 13th and 14th
Class 5: April 29th and 39th
Class 4: April 22nd and 23rd
Class 3: April 15th and 16th
Class 2: April 8th and 9th
Class 1: April 1st and 2nd
Data sets
-
-
- returns.txt – monthly returns on a broad based portfolio of Canadian assets (class notes page 12)
- volume.txt – daily volume in the cattle pit (class notes page 15)
- mutualfunds.txt – returns on different mutual funds such as the equally weighted market and T-bills (class notes page 18/19)
- beer-production-US.txt – number of beers male MBA students claim they drink without getting drunk (class notes page 20)
- temperature.txt – average daily temperature in Rio de Janeiro, Durham and Chicago for the period between 1/1/1995 and 12/11/2008 (class notes page 83)
- highest-temperatures.txt – highest temperatures per state in the US (class notes page 89)
- unemployment.txt – unemployment rates per state in the US in 2004 (class notes page 92)
- stockreturns-countries.txt – daily returns on EOE, DAX, CAC40, FTSE100, Hang Seng, Nikkei, Singapore All Shares and S&P500 (class notes page 96)
- houseprice.txt – house characteristics, such as price, size, neighborhood, number of bedrooms and bathrooms, etc (class notes page 122)
- US.xls – social indicators per state in the US
- nasdaq-djia.txt – NASDAQ and DJIA daily returns for the period between January 4th 2000 to December 31st 2008
- SP500-dailyreturns.txt – S&P500 daily returns for the period between January 5th 2009 to September 24th 2009
- SP500-monthlyreturns.txt – S&P500 daily returns for the period between February 1950 to August 2009
- dowjones-components.txt – Daily returns for the components of the DJIA for the period between January 2nd 1997 to December 29th 2006
- DJIA-19components-july1980-april2013.txt – Daily returns for 19 components of the DJIA for the period between July 29th 1980 and April 22nd 2013 (Thanks C.R.Keller for sending me the data)
- DJIA-19components-jan1981-dec2012.txt
- sp500-components.txt – Daily returns for the components of the S&P500
- sp500-ge.xls – GE and S&P500 daily returns for the period between January 2nd 1962 and November 19th 2009
- GDP2008.xls – GDP in billions of US dollars
- heights-weights.xls – Height and weight of several male MBA students
- online-investment-portfolios.txt – Value of investiments (in thousands of dollars) for a sample of clients in the 40- and 50- age group
- amsterdam-frankfurt-paris-london-1997.xls – Equally weighted porfolios based on two indices (Amsterdam, Frankfurt, Paris and London)
- profit.txt
- salary.txt
- boston-houseprice.txt
- logwages-yearseducation.txt
- GDPgrowth.txt & GDPgrowth.xls
- pm10-emission.txt & pm10-emission.xls
- mortality-under5.txt & mortality-under5.xls
- Brazil.txt
- nyse73-nasdaq42.txt