Can We Predict the Results of Eurovision 2022 Semi-Finals With a Statistical Model?

Background

The ones who know me in person probably know about my background in Math, Statistics and Data models.

code projected over woman
(Image Credit Pexels.com)


As part of ESCBEAT time, I have a great passion for Eurovision.
So the natural question would be: Why not combine both?

Model – Main Ideas

What does the model include? I have included several different variables:

  1. Digital Platforms Data – How popular are the songs on the 2 main digital platforms:
    • YouTube – sum of views of the videos. We consider the videos on the official Eurovision Song Contest YouTube channel, as well as the ones on the artist’s channel and videos on the national broadcaster channel. If an act is chosen by a national selection, the video from the selection will be included. In the case of an act that was chosen internally, we would take one live performance into consideration. Pre-parties videos are not included.
      All views have been updated to Saturday, April 23rd.
    • Spotify – number of streams (updated to Saturday, April 23rd)
  2. Betting Odds – We consider the average position for each country in the semi-final betting odds. The average is calculated on the positions between April 1st and April 23rd.
  3. Qualifying Rate For Country – The Eurovision semi-finals were introduced since 2004 (one semi-final in 2004 – 2007, and 2 semi-finals since 2008). We calculated the percentages of qualifying for the final for each country. For example, if a country has qualified 6 times out of 12 times, the value will be 50% = 0.5. We would like to remind you that during the semi-finals of Eurovision 2008 – 2009, the jury had the option to choose a 10th qualifier which wasn’t necessarily in the 10th place. We have taken this under consideration
  4. Average Rate for Country – We calculated the average positions that each country achieved in the history of the semi-finals. For example, Australia has participated in 5 semi-finals (2016 – 2019, 2021). The places they achieved in the semi-finals were 1, 6, 4, 1, 14 and therefore the average will be = (1+6+4+1+14)/5 = 5.2.
  5. Qualifying Rate for Position – We calculated the probability of qualifying from each spot in the semi-finals. For example, we have had 30 semi-finals events so far. In 13 cases, the country that performed in the 5th position has qualified for the final. Therefore, the probability of qualifying from the 5th position is 13/30 = 43.33%. We would like to remind you that during the semi-finals of Eurovision 2008 – 2009, the jury had the option to choose a 10th qualifier which wasn’t necessarily in the 10th place. We have taken this under consideration
  6. Average Rate for Position – We calculate the average place in the semi-finals for each optional spot. The most successful spot in the semi-finals is the 14th position.
  7. Voting Patterns – Based on the results of each one of the shows since 1975, we built a list of the top 10 countries that tend to vote for each country. Insights are based on each vote: semi-final, final, televote and jury voting. Thereafter, for each country, we counted the number of “friendly countries” that will vote in the same semi-final.

Together they permit us to have a wide point of view of the contest

The Method

analytics text
(Image Credit: Pexels.com)

Based on preliminary research, I’ve given a weight (0-100) to each one of the 8 suggested variables. The sum of the weights together is 100.

Variable (Data Feature)Weight (percentage in final score)
YouTube Views16.666%
Spotify Streams16.666%
Semi-Final Betting Odds33.333%
Qualifying Rate Per Country6.666%
Average Rate Per Country6.666%
Qualifying Rate Per Position6.666%
Average Rate Per Position6.666%
Number of “Friendly Countries” voting in the same semi-final6.666%

We have ranked the obtained values in each variable and scored the country based on a uniform range. This means that if we have 17 countries particiapting in semi-final 1, the one with the highest value in a specific feature will get the score of 17, while the lowest one will get 1 point only.
Thereafter, we multiplied the score with the weight as explanied in the table above.

Semi-Final 1 Analysis

As a reminder, this is the running order of the first semi-final:

Remember: The highest score you get, the highest your chances to qualify.
The table is too wide, so we’ve split it into 2 images:

Example of a score: Let’s see together how Albania has scored 1210 points which puts them in 4th place:

  • YouTube Views = 16.66%: Albania has achieved around 7.8 Million views. The only country to defeat them in this semi-final was Ukraine. Therefore, Albania gets 16 points (as the leading country gets 17 points). So the first component of the score is 16.66 * 16 = 266.666 points
  • Spotify Streams = 16.66%: Albania has around 2.2 million Spotify streams. This puts them in 5th place (after The Netherlands, Norway, Ukraine and Austria). Therefore, 5th place means 13 points (as the first places get 17, 16, 15, 14 points). So it adds them 16.66 * 13 = 216.666 points
  • Semi-Final Betting Odds = 33.33%: The average place of Albania in the semi-final betting odds is 5th. Therefore they will get a score of 13 points. Let’s add 13.33 * 13 = 433.333 points
  • Qualifying Rate Per Country = 6.66%: Albania has participated in 16 semi-final shows to date. They have qualified in 9 times out of 16. So their qualifying probability is 56.25%. This puts them (together with Lithuania and Iceland) in the 9th place regarding this variable. So let’s add another 6.66 * 9 = 60 points to the final score.
  • Average Rate Per Country = 6.66%: If we calculate the average place of Albania in the semi-finals, we get 10.63 which puts them in the 10th place which equals to 8 points. Therefore, it is time to add 6.66 * 8 = 53.333 points.
  • Qualifying Rate Per Position = 6.66%: Let us focus on the cases where a country has competed in the 1st slot in the semi-finals. In 16 cases out of 30, the country qualified for the country. So the qualifying rate from this slot is about 53.33%. This puts the 1st position in the 11 place, therefore Albania will get the rank of 7 points. This leads to adding of 53.33 * 6.66 = 46.66 points.
  • Average Rate Per Position = 6.66%: Let us focus on the cases where a country has competed in the 1st slot in the semi-finals. The average place the country has finished in this case is about 10.17. This puts the 1st slot in 11th place. Therefore let’s add 6.66 * 7 = 46.66 points.
  • The number of “Friendly Countries” voting in the same semi-final = 6.66%. For each country, we’ve built a list of the top10 countries that tend to give them points (both televote and jury vote, in semi-finals and finals). In Albania‘s case, the country will have the option to rely on 5 of its friendly countries in this semi-final: Greece, Italy, Switzerland, Croatia,  Austria. There are 5 countries in the same situation in this semi-final (having 5 out of 10 friendly countries): Albania, Portugal, Denmark, Switzerland, Austria. This puts each one of them in 5th place regarding this category. So each one of them will get a score of 13 points. Therefore, we can add another 6.66 * 13 = 86.66 points.
  • Putting it all together, our model estimated Albania at 1210 points. Comparing this to the other countries in the semi-finals, Albania has the 4th highest ranking.

As you can see, our prediction for the 10 qualifiers is (not to confuse the points with the scale of points at Eurovision!)

  1. 🇺🇦 Ukraine – 1473 points
  2. 🇳🇴 Norway – 1340 points
  3. 🇳🇱 The Netherlands – 1257 points
  4. 🇦🇱 Albania – 1210 points
  5. 🇬🇷 Greece– 1197 points
  6. 🇲🇩 Moldova – 1077 points
  7. 🇦🇹 Austria – 1000 points
  8. 🇵🇹 Portugal – 967 points
  9. 🇦🇲 Armenia – 913 points
  10. 🇱🇹 Lithuania – 803 points

Appendix: You can see the list of friendly countries that voted in the same semi-final

CountryList of Friendly Countries voting in the same semi-final
AlbaniaGreece, Italy, Switzerland, Croatia, Austria
LatviaLithuania, Norway
LithuaniaLatvia, NorwayUkraineItaly
SwitzerlandAustria, Albania, The NetherlandsIceland, Denmark
SloveniaCroatia, Portugal
UkraineArmenia, Moldova
BulgariaGreece, Moldova, Albania, Austria
The NetherlandsDenmarkAustria, Switzerland
MoldovaPortugal, Ukraine, Italy, Greece
PortugalFrance, Switzerland, Iceland, Armenia, Latvia
CroatiaSlovenia, Austria, Switzerland, Ukraine
DenmarkIceland, Norway, The Netherlands, Latvia, Slovenia
AustriaBulgaria, Slovenia, Lithuania, Switzerland, Iceland
IcelandDenmark, Norway, Latvia
GreeceAlbania, Bulgaria, Armenia
NorwayDenmarkIceland, Lithuania, Latvia
ArmeniaFrance, Bulgaria, Greece, The Netherlands

Semi-Final 2 Analysis

As a reminder, this is the running order of the first semi-final:

Remember: The highest score you get, the highest your chances to qualify.
The table is too wide, so we’ve split it into 2 images:

As you can see, our prediction for the 10 qualifiers is (not to confuse the points with the scale of points at Eurovision!):

  1. 🇸🇪 Sweden – 1550 points
  2. 🇵🇱 Poland – 1427 points
  3. 🇫🇮 Finland – 1257 points
  4. 🇦🇺 Australia – 1253 points
  5. 🇨🇾 Cyprus – 1193 points
  6. 🇷🇸 Serbia – 1190 points
  7. 🇪🇪 Estonia – 1147 points
  8. 🇨🇿 Czech Republic – 1090 points.
  9. 🇷🇴 Romania – 1067 points
  10. 🇧🇪 Belgium – 973 points

Appendix: You can see the list of friendly countries that voted in the same semi-final

CountryList of Friendly Countries voting in the same semi-final
FinlandEstonia, Sweden, Poland, Ireland
IsraelAzerbaijan, Finland, Australia, the United Kingdom
SerbiaMontenegro, North Macedonia, Germany, Sweden
AzerbaijanGeorgiaCzech Republic, Malta, Romania, Montenegro
GeorgiaAzerbaijan, Cyprus
MaltaAustralia, Azerbaijan, United Kingdom, Ireland, North Macedonia, San Marino, Montenegro
San MarinoAzerbaijan, Georgia, Montenegro, Malta, and Poland
AustraliaPoland, Sweden, the United Kingdom, FinlandIsrael, SpainBelgium
CyprusMalta, San Marino, United Kingdom, Georgia
IrelandUnited Kingdom, Czech Republic, Sweden, Belgium, EstoniaSpain
North MacedoniaSerbia, Montenegro, Azerbaijan
EstoniaFinland, Sweden, Ireland, Georgia, Israel
RomaniaSpain, Israel, Azerbaijan, Cyprus, Malta
PolandGermany, Ireland, the United Kingdom
MontenegroSerbia, North Macedonia, Azerbaijan, San Marino, Malta
BelgiumAustraliaPoland, Sweden, Ireland
SwedenEstonia, Finland, Australia, Malta
Czech RepublicSpain, Belgium, Israel, Australia, Estonia

Remarks

  • Unsurprisingly, we see a different ranking by Spotify users and YouTube rankings. The main explanation is that users from different countries prefer using different digital platforms. For example, Spotifu is very popular in Western Europe.
  • It is worth mentioning that jury votes are harder to predict. We believe that they can be somehow inferred from the trends that the model’s features give.
  • Different countries have selected their songs at different times. However, when songs are popular, it can be easily seen, no matter how much time has passed since they were released.
  • One can think about the model results as a prediction for the actual ranking, and not only the list of qualifiers.
  • There are many different approaches to solve this challenge. Some of them take more time and are harder to explain. That’s why we chose a simple model.

What do you think about the concept?
Did you find the model’s predictions logical?
Would you be satisfied with this list of qualifiers?

If you share the content, please don’t forget to give a porper credit!

Read all about Eurovision 2022 on our homepage

This image has an empty alt attribute; its file name is Eurovision-2022-Homepage.jpg


Doron Lahav is 30-year-old, who lives in Israel. Doron has been watching the Eurovision Song Contest since early 2000s. Doron works as a Data Scientist and shows interest in singing and writing.

2 Replies to “Can We Predict the Results of Eurovision 2022 Semi-Finals With a Statistical Model?

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

%d bloggers like this: