We are all being told now that Big Data is the next big thing. It is difficult to argue that the dramatic drops in computer costs and data storage will not have a huge impact. Hard disks larger than five terabytes are becoming an inexpensive consumer item. This was inconceivable for many of us only a few years ago. Now a book on "Big Data" even made the best seller list.
That a book such as Big Data should appear is not a surprise. For those of us who have worked with the large datasets coming from the administration of programs for years, it will seem a long time coming. Although the current dramatic drops in computational costs were foreseen, it was not anticipated that there would be a basic challenge to how we think about statistics.
The need for some kind of shift in statistical thinking is obvious to the applied statisticians working with the big datasets. Those that had access attempted to do statistical analysis on the larger datasets as soon as it became technically feasible. As these databases became larger, the explanatory variables became more significant. By the time regressions with a sample size of over 100K became possible, it was rare that an explanatory variable would not be seen as statistically significant.
Where does that leave program evaluators? I, like many others, always relied on the test of statistical significance to justify the statement that the program has no impact. There was something very convenient about the statement, "No statistically significant evidence was found of program impacts". Now with all variables being statistically significant, where does this leave program evaluators?
In my opinion, big data will lead to a healthy reexamination of how evaluators think about causality. Unfortunately, we may be to some degree alone on this. Early on in the above cited book, the authors state, "society will need to shed some of its obsession for causality in exchange for simple correlation". This is perfectly understandable as the bulk of data analysts across the world spend their days trying to spot emerging trends and predict the future. Program evaluators are fairly rare in their exclusive interest in causality.
How will this play out for evaluators? No one can predict the future with certainty. However, I would suggest that there are two likely outcomes. First off, rather than just focussing on the sign and significance of the estimate of program impact, we will look at the size of the coefficient. As confidence intervals shrink with growing sample sizes, the conversation may move to whether the estimated benefits are large enough to be considered significant from a policy standpoint rather than a statistical. Secondly, we may move away from statistical inference to what is called statistical learning.
Recent books have proposed that we evaluate the quality of statistical models by how well they predict outcomes, rather than the traditional R-square and p-statistics. Books such as An Introduction to Statistical Learning demonstrate how it is not necessary to use all of the sample to estimate the model, but that substantial portions can be omitted for testing purposes. The test of a model consists of seeing how well it can predict the values in the test portion of the data set. With such a procedure, evaluators could describe the state of the world for program participants as versus non participants. This would provide an interpretation of program evaluation results that the clients may even find more intuitive than before.
The old problem of self-selection will still remain. Still with larger samples, approaches such as matching may even work better. Myself, I am optimistic about the future, although it will not be without challenges.
![]() |
| How many extra drives will you need in the age of Big Data? |

