Bias Apparently Hollywood does not recycle action-movie plots. The data said so, so it must be right It pains me to think how many people have used these keywords to build models.
Algorithms Pre-processing data is not just about correcting errors Exploration of IMDB rating data, by Kaiser Fung, founder of Principal Analytics Prep
Aggregation Good models + Bad data = Bad analysis Example showing how to diagnose bad data in data science models