Aggregation Good models + Bad data = Bad analysis Example showing how to diagnose bad data in data science models
Algorithms Pre-processing data is not just about correcting errors Exploration of IMDB rating data, by Kaiser Fung, founder of Principal Analytics Prep
Big Data Counting is hard, especially when you don't have theories Exploring the data about movies, uncovering data issues
Bias Apparently Hollywood does not recycle action-movie plots. The data said so, so it must be right It pains me to think how many people have used these keywords to build models.