Round 2: 10 Ways to Rank the Rio Summer Olympics

Kaiser finds another set of 10 metrics that can be used to rank any country top of the medals table.

Round 2: 10 Ways to Rank the Rio Summer Olympics

In a recent post, I took data from the Tokyo Olympics to demonstrate how one can find any number of data-driven stories in a sufficiently complex dataset. I outlined 10 different ranking schemes that place a different country first.

A few readers noticed and complained that the metric used for Germany was "not positive". They didn't like the insight that Germany saw the greatest decline in medals from Rio 2016 to Tokyo 2020. While "not positive" metrics have their practical uses, I indeed was hoping to make a purely "positive" post, and ran out of time. (I said the challenge can be completed; I never said it was easy!)

Since today is a new day, and this is a new post, I will now close the gap. Germany's older athletes, those over 45 years and over, were by far the strongest. Incredibly, two out of three earned medals.

Kfung_rio2016_germany

Now, surely no one can question the completeness of my answer.

***

Since a few of you were cheering for me to fail, I decided to double down, and present to you another 10 metrics, proving that if you spend enough time with the data, you can pretty much say anything you want.

Scratch that. In the course of doing this second round, I bumped into one of those awkward data science moments: you see, the exploratory analysis caused me to look up an athlete's bio, and doing so, I discovered that the spreadsheet I snatched online did not show Tokyo athletes as expected but athletes from Rio 2016. So in this second round, what I will deliver is the same challenge but using strictly the data from the Rio 2016 Summer Olympics.

As before, I start with the official top 10 medals table from Rio 2016.

Rio2016_medaltable_bytotalmedals

Nine of the 10 countries are the same as before. The Netherlands was swapped for South Korea. (Russia competed under its proper name rather than ROC.)

The United States won on four metrics, golds, silvers, bronzes and total medals - and plenty of others, if you start digging around.

Kfung_rio2016_usa


China won the second most medals at Rio 2016. But the Chinese team had the highest proportion of female athletes - by a wide margin.

Kfung_rio2016_china

Team GB came in #3. No other team, though, produced a gold winner older than Nick Skelton, who at 59 proved that age is no barrier.

Kfung_rio2016_britain


Russia was 4th in the overall medals table. Its women athletes were the most efficient at earning medals.

Kfung_rio2016_russia

Germany was #5. No other team achieved more success amongst older athletes - those 45 years old and over.

Kfung_rio2016_germany

France came in #6. Among the top 10 countries, the French medal-winners had the highest age.

Kfung_rio2016_france

Japan was #7 overall. They were the mirror image of the French: the average Japanese medal-winner was the youngest.

Kfung_rio2016_japan

Australia showed up in the 8th spot. The Aussie gold medalists were even younger than the Japanese.

Kfung_rio2016_aus

The Azzurri placed 9th overall. Previously, we learned before that the Italians had the most close calls. It turned out their gold medal winners tended to be younger.

Kfung_rio2016_italy

In Rio, South Korea squeaked in at the 10th position. The nation was unique in sending a team in which the women were older than the men.

Kfung_rio2016_korea

I end on a strong note. This last metric was very satisfying as it single-handedly placed South Korea on on side of the ledger with the other nine countries on the other side.

***

There you have it. Take the Rio Olympics, and one can achieve the same result - one can find 10 metrics that rank each of the top 10 countries as #1.

This is the power of statistics. How one uses this power is a different matter. Use it to inform, not to mislead.

(For those who may be snorting at this exercise, notice that every statement I made about the Rio Olympics presented historical facts. I did not generalize these observations to the future. Some of these insights are likely to be artifacts of this historical dataset. It's when someone uses such insights to speak about the future when you should be concerned.)