CRAN_Status_Badge Travis AppVeyor Build Status

wooldridge: 111 Data Sets for Econometrics

Economics students new to both econometrics and R may find the introduction to both challenging. However, if their text is “Introductory Econometrics: A Modern Approach, 6e” by Jeffrey M. Wooldridge, they are in luck!

The wooldridge data package aims to lighten the task by easily loading any data set from the text. The package contains full documentation for every data set and all data have been compressed to a fraction of their original size. Just install the package, load it, and call the data you wish to work with.

But wait…there’s more! A vignette, Introductory Econometrics Examples:sparkles:, illustrates solutions to examples from each chapter of the text, offering a relevant introduction to econometric modelling with R. The vignette also includes an Appendix of helpful resources, such as Using R for Introductory Econometrics by Florian Hess.

While the original course companion site provides publicly available data sets for Eviews, Excel, and Stata commercial software, this package is the official R open source option. Using R while building a foundation in econometric modeling, not only saves learners a few units of currency, but also introduces them to software capable of scaling with the demands of modern statistical computing.

Note: All data sets are from the 6th edition (Wooldridge 2016, ISBN-13: 978-1-305-27010-7), which is compatible with all other editions.


Install wooldridge v1.3 directly from The Comprehensive R Archive Network (CRAN). The package contains all data sets from the 6th edition and depends on R >= 3.0.0.



Load the wooldridge package and use the data() function to bring the desired data set into the working environment. Data set names match those in the text. Once present in the working environment, modelling data is quick and easy, leaving learners with more time to focus on interpretation.



lm(lwage ~ educ + exper + tenure, data = wage1)


It’s always recommended that one read supporting documentation for data sets of interest. This becomes trivially easy with the wooldridge package:


Documentation includes variable column names, original source of data, and page number(s) where data appear in the text.