Bias Apparently Hollywood does not recycle action-movie plots. The data said so, so it must be right It pains me to think how many people have used these keywords to build models.
Big Data Counting is hard, especially when you don't have theories Exploring the data about movies, uncovering data issues
Aggregation Good models + Bad data = Bad analysis Example showing how to diagnose bad data in data science models