{"id":60,"date":"2015-02-24T10:27:56","date_gmt":"2015-02-24T10:27:56","guid":{"rendered":"http:\/\/blog.rickstafford.com\/?p=60"},"modified":"2015-02-24T10:30:02","modified_gmt":"2015-02-24T10:30:02","slug":"champions-league","status":"publish","type":"post","link":"http:\/\/blog.rickstafford.com\/?p=60","title":{"rendered":"Champions\u2019 League"},"content":{"rendered":"<p>I\u2019d like to say that I\u2019ve delayed this post to allow for an update on the predictions of the model \u2013 but it really isn\u2019t true, I just haven\u2019t got around to doing it until now. However, all these predictions are made on data available to me at the start of January. Also, since I have very little knowledge of football (since I left Newcastle in 2006), I haven\u2019t been following anything that has happened, and therefore I have an unbiased methodology (a horrible word, possibly actually used correctly here..)<\/p>\n<p>So, the purpose of this work is to predict which teams will finish in the Champions\u2019 league from the UK premiership (the top 4). I\u2019ve already done some work, based on current points and form (see <a href=\"http:\/\/blog.rickstafford.com\/?p=38\">here<\/a>), which is my hard quantitative data. However, things can change \u2013 there is a transfer window in January to buy new players etc. So, expert opinion might be useful too. And, in football, everyone is an expert\u2026 So \u2018public opinion\u2019 might be useful too.<\/p>\n<p>The data I have are:<\/p>\n<p>The previous points and form data (see <a href=\"http:\/\/blog.rickstafford.com\/?p=38\">here<\/a>).<\/p>\n<p>\u2018Expert\u2019 opinion. This was actually difficult to find (as would probably always be the case), so the best I have is a summary of what players each team needs to buy in the transfer window to maximise their success (courtesy of Match magazine \u2013 30<sup>th<\/sup> Dec 2014).<\/p>\n<p>Given that transfers could greatly affect the team, I have then created a \u2018money\u2019 variable, which basically looks at the likelihood of being able to purchase the recommended players.<\/p>\n<p>Finally, I\u2019ve found a public survey, asking which teams will be the top 4 finishers (from quibblo.com on the 4<sup>th<\/sup> Jan 2015).<\/p>\n<p>Of course, these data are in a wide range of forms \u2013 so how do I integrate them?<\/p>\n<p>I\u2019m going to convert each data type into a probability (between 0 and 1) of finishing in the top 4. The previous form data are already in this format.<\/p>\n<p>Expert opinion. I\u2019ve used part formula, part intuition to create this. For example, for both Man City and Chelsea, there were no specific signings recommended \u2013 however, the tone of writing (as judged by me \u2013 so, yes, slightly subjective) was more positive for Chelsea \u2013 Man City\u2019s entry said: \u201c..a big name transfer would be a massive boost for the players and fans\u201d, compared to Chelsea: \u201cwe\u2019d sign a massive star to make their squad even more unstoppable\u201d. Both are clearly high, as no weaknesses were identified \u2013 so Chelsea get a value of 0.95, Man City = 0.9. Other than that, it was relatively simple \u2013 one player identified as vital (in a role, such as defence, striker) = 0.6, two players = 0.5, three players = 0.4.<\/p>\n<p>Money. This was very subjective, and wasn\u2019t really researched here. However, it is well known (even by me) that some teams have more money than others. Hence, this looked at the likelihood of being able to buy the players identified above \u2013 lower scores for poorer clubs and those needing more players.<\/p>\n<p>Public opinion \u2013 This was largely numeric data anyway. Votes were available for each team, in this case, the highest number of votes (26) was for Chelsea \u2013 converting to 0.95 probability. Man City and Arsenal were on 22 and 23 votes resperctively\u00a0 -both getting probabilities of 0.9. Liverpool had 17 votes (p=0.7), Man U 14 (p=0.6), Tottenham had 5 votes (p=0.2), Southampton has 2 votes (p=0.1) and West Ham were not on the list (p=0.1 \u2013 all other teams were low, so it is likely if they were included, they would be low too). Obviously there may be bias in here \u2013 with people voting for teams they support over true opinion, but largely this is the nature of public opinion, it is bias \u2013 and it doesn\u2019t need to necessarily be treated as equal to other data (see below).<\/p>\n<p>The probabilities for each team (as well as the overall probability \u2013 prior to interactions) are shown below:<\/p>\n<p><a href=\"http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/inputData.jpg\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-62\" src=\"http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/inputData.jpg\" alt=\"inputData\" width=\"407\" height=\"182\" srcset=\"http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/inputData.jpg 407w, http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/inputData-300x134.jpg 300w\" sizes=\"(max-width: 407px) 100vw, 407px\" \/><\/a><\/p>\n<p>Integrating the data:<\/p>\n<p>This was done by setting up a Bayesian belief network using JavaBayes (available <a href=\"http:\/\/www.cs.cmu.edu\/~javabayes\/Home\/\">here<\/a>), in the same way as for the political data previously. \u00a0In this case, all four data sets fed into a final posterior distribution (however, as this was to be used in further analysis, it was called the \u2018new prior\u2019). The nice aspect of JavaBayes here is that it becomes intuitive that some parts of the data deserve more weighting than other parts. For example, in the screen shot below, it is clear that the previous form is given more weighting than the other variables (if form suggests the team will not make the top 4 \u2013 i.e. it is FALSE, but all the other variables suggest they will make the top 4, then there is only a 0.4 probability that they will in fact make the top four (in these calculations). Full details of the probabilities used are in the XML file (<a href=\"http:\/\/www.rickstafford.com\/software\/IntegratorNetwork.xml\">here<\/a>\u00a0&#8211; right click and &#8216;Save link as&#8217; to access) which can be loaded into JavaBayes.<\/p>\n<p><a href=\"http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/Importance-of-form.jpg\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-64\" src=\"http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/Importance-of-form.jpg\" alt=\"Importance of form\" width=\"792\" height=\"576\" srcset=\"http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/Importance-of-form.jpg 792w, http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/Importance-of-form-300x218.jpg 300w\" sizes=\"(max-width: 792px) 100vw, 792px\" \/><\/a><\/p>\n<p>Interactions between teams:<\/p>\n<p>From previous form and current (at the start of January) points, we identified 8 teams which could finish in the top 4. However, there are interactions between teams \u2013 if one wins, then by default, the one they play against loses. Equally there are only 4 places in the top 4 (an obvious fact, but perhaps one that needs stating\u2026). So, if one team are in the top 4, then this means the chances of others getting there are decreased.<\/p>\n<p>Incorporating reciprocal interactions in Bayesian networks is difficult statistically, and as such not really done in an intuitive manner by most BBN software. However, it is quite easy computationally (see <a href=\"http:\/\/f1000research.com\/articles\/3-312\/v1\">here <\/a>for details). <a href=\"http:\/\/rickstafford.com\/software\/Premier.xlsm\">This Excel file<\/a>, with associated VBA code, runs reciprocal interactions \u2013 how it works can probably be followed from <a href=\"http:\/\/f1000research.com\/articles\/3-312\/v1\">this paper<\/a>.\u00a0 In this case, the interaction probabilities (the third tab of the worksheet) are key. It is obvious from the data above that Man City and Chelsea will be extremely unlikely to finish outside the top 4. Hence, really the competition is for the remaining two places. Hence interaction strengths between teams and either Chelsea of Man City are weaker than for the others (closer to 0.5, meaning equal chance of being or not being in the top 4).<\/p>\n<p>Running the simulation produces the following predictions:<\/p>\n<p><a href=\"http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/FinalPredict.jpg\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-63\" src=\"http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/FinalPredict.jpg\" alt=\"FinalPredict\" width=\"373\" height=\"183\" srcset=\"http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/FinalPredict.jpg 373w, http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/FinalPredict-300x147.jpg 300w\" sizes=\"(max-width: 373px) 100vw, 373px\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<p>So, top 4 finishes likely for Chelsea, Man City and Man U. The final place is equally likely to be either Southampton or Arsenal (based on predictions and data from the start of 2015).<\/p>\n<p>The current (24<sup>th<\/sup> Feb) positions are (and yes, this is my first look at this, since early Jan):<\/p>\n<p><a href=\"http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/Table24thFeb.jpg\"><img loading=\"lazy\" class=\"alignnone size-full wp-image-61\" src=\"http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/Table24thFeb.jpg\" alt=\"Table24thFeb\" width=\"548\" height=\"241\" srcset=\"http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/Table24thFeb.jpg 548w, http:\/\/blog.rickstafford.com\/wp-content\/uploads\/2015\/02\/Table24thFeb-300x132.jpg 300w\" sizes=\"(max-width: 548px) 100vw, 548px\" \/><\/a><\/p>\n<p>So, these really are looking pretty good at the moment.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I\u2019d like to say that I\u2019ve delayed this post to allow for an update on the predictions of the model \u2013 but it really isn\u2019t true, I just haven\u2019t got around to doing it until now. However, all these predictions &hellip; <a href=\"http:\/\/blog.rickstafford.com\/?p=60\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[],"_links":{"self":[{"href":"http:\/\/blog.rickstafford.com\/index.php?rest_route=\/wp\/v2\/posts\/60"}],"collection":[{"href":"http:\/\/blog.rickstafford.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/blog.rickstafford.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/blog.rickstafford.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/blog.rickstafford.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=60"}],"version-history":[{"count":3,"href":"http:\/\/blog.rickstafford.com\/index.php?rest_route=\/wp\/v2\/posts\/60\/revisions"}],"predecessor-version":[{"id":67,"href":"http:\/\/blog.rickstafford.com\/index.php?rest_route=\/wp\/v2\/posts\/60\/revisions\/67"}],"wp:attachment":[{"href":"http:\/\/blog.rickstafford.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=60"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/blog.rickstafford.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=60"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/blog.rickstafford.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=60"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}