t

t
Showing posts with label Las Palmas. Show all posts
Showing posts with label Las Palmas. Show all posts

Tuesday, 8 November 2016

Best 11 vs. 11 "Best Bang for Your Buck"

With the initial study rounds being a matter of the past, the 2016-2017 season is finally well underway. An appropriate time to assess whose star has been shining. In today's post, I will assemble and compare two line-ups, across all leagues: the best performing one and the one that has so far yielded the best value for money. As performance measure, I will employ the industry standard, i.e. the InStat Index, an overall quality score indicating a player's recent form. 

The InStat Index is based on a detailed breakdown of a player's recent performances along about a dozen relevant performance dimensions, according to the position the player occupies on the field, e.g. finishing for a forward and key passing for a central midfielder. As a measure of a player's value, I use the market value as provided by transfermarkt.com. Both InStat Index scores and market values used are those as of November 8, 2016.

To determine the best-in-form line-up, I checked for each position on the field which of all players on that position has the highest InStat Index score. The resulting team is as follows:
 
Figure 1: Best 11, based on InStat Index per position
(Top, italicized, number in players' squares is their InStat
Index score; bottom number their market value, in €m)

Observation 1: Although players from all professional leagues across the world are being taken into consideration, the best starting eleven come from as few as three national leagues: La Liga, the Premier League and Ligue 1. Moreover, the Spanish league is host to no less than 9 of the 11. Additionally, apart from PSG's Brazilian skipper Thiago Silva, the only representative of a league other than the Spanish one is David Silva, indeed a 2010 World Cup winner with Spain. In total, there are five Spanish players in the team and three Brazilians.

Observation 2: Spanish champions FC Barcelona provide more than half of the best team: the attacking line is entirely Barça's MSN, with Pep Guardiola's incarnation on the field, Sergio Busquets, forming a link between the attack and a defense featuring both the blaugranas Umtiti and Sergi Roberto.

Observation 3: The two players in the team who are no absolute stars (yet) are Sevilla FC's goalkeeper Sergio Rico and Las Palmas' central midfielder Jonathan Viera. With solid performances both in La Liga and the Champions League, Rico, a 23-year-old local boy from Sevilla, who made his debut for Spain earlier this year, is keeping PSG arrival, Italian international Sirigu behind him on the bench. Those readers for whom virtuoso Jonathan Viera, by far the player with the lowest market value to make it into the squad, may or may not ring a bell, I gladly point to the previous post in this blog.

Observation 4: Apart from who made it to the team, it is also interesting to consider some who did not. In spite of them being the players with the third and fourth highest InStat Index score overall, respectively, neither Real Madrid's Gareth Bale nor Cristiano Ronaldo feature in the 11 best. Their misfortune is that they happen to be playing in the same position as the numbers one and two in terms of InStat Index, Messi and Neymar, respectively. Marcelo is thus the only merengue to make it to the best 11.

As, for most clubs and people, money is a limited resource, it is at least as worthwhile to find out which team yields the best value for money. In terms of our analysis, which players represent the most InStat Index points per euro? Simply dividing a player's InStat Index score by his market value, however, would imply that, say, a player with half Messi's InStat Index score and costing less than half of Messi would be "better value for money". To correct for this, I apply a natural-logarithmic transformation to the players' market value. Thus, the measure I consider here is InStat Index score divided by ln(market value). 

For a player to be withheld, in addition to having the best above measure in his position, he should be part of the top 10 in his position according to InStat Index and he must have featured in at least half of his team's domestic league games this season to date. The resulting "best bang for your buck" 11 are as follows:

Figure 2: "Best bang for your buck" 11,
based on InStat Index and market values
(Top, italicized, number in players' squares is their InStat
Index score; bottom number their market value, in €m)

Observation 5: The dominance of La Liga is similar in case of the "best bang for your buck" 11: 8 players play their football in the Spanish league; one of the "foreigners" in the team, Bayern Munich's Xabi Alonso, is actually a World Cup winner for Spain. Together with Schalke 04's Brazilian centre back Naldo, Alonso is one of two Bundesliga representatives coming into the team. With six players, more than half of the team are Spanish; again there are three Brazilians.

Observation 6: With a market value of just one tenth even of Sergio Rico's and with an InStat Index score that is just marginally lower than his Sevilla FC counterpart, Espanyol's on-loan goalkeeper Diego López, the man in whose favor Mourinho famously dropped San Iker Casillas to the bench when at Real Madrid, comes into the team. Sevilla FC does manage to keep a player in the squad, though, as Sergio Escudero replaces Marcelo on left back.

Observation 7: Expensive Barça options on right back and centre back, Sergi Roberto and Umtiti, are being replaced by the very lowly valued Míchel and David García, both from Las Palmas. As, naturally, Jonathan Viera retains his spot in the team, Las Palmas thus become the only club with as many as three players in the "best bang for your buck" 11! Another player coming in from one of La Liga's smaller teams is Villarreal's promising midfielder Manu Trigueros.

Observation 8: Both Messi and Neymar remain in the team, in spite of their astronomical market value. The rationale is simple: both are so much better (as indicated by the InStat Index differential) than the other players in their position that they still provide the best value for money – in this case a lot of value for a lot of money. As mentioned above, Bale and Cristiano Ronaldo are the ones who come closest to the performance of Messi and Neymar, respectively, but their market value is comparable to their blaugrana counterparts.

Observation 9: A big part of bringing down the cost (i.e. market value) of the team without sacrificing performance much has been achieved by bringing aging stars in, whose market value is at a low, but whose performance is still up to standard. Whereas only two players in the best 11 team are over thirty, with Thiago Silva (32) being the oldest, in the "best bang for your buck" 11, five players are aged 34 or 35. The average age of both teams differs by 3 years.

Observation 10: The best 11, corresponding to a total market value of €520m, managed an accumulated InStat Index score of 3,685. This compares to 3,567 for the "best bang for your buck" 11, valued at €259m. In other words, with only half the budget – and still being able to count with both Messi and Neymar – the "best bang for your buck" 11 represent only a 3% loss in InStat Index score compared to the best possible 11.

However, adding players' individual performances may not provide the right basis for comparison. If football is a "weakest link" rather than a "strongest link" game, as advocated by some of its greatest managerial innovators, such as Lobanovskyi and Sacchi, multiplying individual performances may be the more accurate way to go (see Anderson and Sally's The Numbers Game). This would paint quite a different picture as the product of the individual InStat Index scores of the "best bang for your buck" 11 reaches only 69% of the best 11's – hence a 31% rather than just a 3% loss. So one's philosophy of the game will matter a great deal, also when talking "hard" numbers.

Tuesday, 16 August 2016

Offensive Contribution in La Liga: Usual Suspects and One Ugly Duckling


In this post, we look forward to the new season of La Liga by looking back to the previous one. More specifically, we’ll zoom in on the offensive contribution of those forwards who put in stellar performances on this front. A player’s offensive contribution is typically determined "quick and dirty" by considering goals and assists, often by simply adding the two. Naturally, there are much more sophisticated – and accurate – ways to gauge the offensive contribution a player represents to his team. One such method that I will employ here has been proposed by Thomas Severini, Professor of statistics at Northwestern University, and concerns a regression model with multiple predictor variables.

The intuition and rationale behind the statistical model are as follows: For all teams in the Primera División, over the last seasons, we know how many goals they scored and a bunch of other statistics, including how many shots on goal, how many attempts at goal from outside the box, from inside the box, how many dribbles, passes, etc. they executed. What we are interested in to find out, is what are the variables that actually relate (most) to the number of goals scored by a given team. To this extent, in our model, we consider each team’s performance over the last five seasons, leading to 100 (i.e. 5 times 20 teams) observations. The statistical model allows for filtering out those variables that yield important additional information about the dependent variable (here: goals scored).

A concise model that allows us to explain no less than 85 percent (i.e. the R2) of the goals scored in the past five seasons of La Liga turns out to be the following:

– 13.7 – 0.10112*shots_outside_of_box + 0.41682*shots_on_target + 0.0132*successful_passes

Most important when interpreting this model – rather than the actual number, which is hard to interpret – are the variables included and their respective signs: Shots on target has a positive sign, implying that more shots on target tend to coincide with more goals. The sign of the variable shots outside of box is negative as it negatively adjusts the impact of shots on target attempted from outside the box in terms of their success probability. Successful passes turn out to constitute another important aspect of the offensive contribution, whereas, for example, successful dribbles, do not. Also noteworthy is the relative difference between the respective coefficients of variable shots on target and of successful passes: A shot on target will have an impact on the offensive contribution over thirtyfold (i.e. 0.41682/0.0132) the one of a successful pass. In case I would have data on the area of the pitch where the passes took place (e.g. final third), the model could be made even more accurate and I could also include non-forwards.

Following Severini's Analytic Methods in Sports (2015), to apply to above team-level model at the level of the individual player, we merely need to divide the intercept by 10 (because of the ten field players). Thus, for an individual player,

Offensive Contribution = – 1.37 – 0.10112*shots_outside_of_box + 0.41682*shots_on_target + 0.0132*successful_passes

The top 10 of offensive contributors for the 2015-2016 La Liga season is as follows:

Figure 1: Top 10 offensive contributors, La Liga 2015-2016

Naturally, some players received more playing time than others, e.g. due to injury. For comparative purposes, therefore, it is also helpful to consider offensive contribution assuming all players would have played all matches – at the level of their offensive contribution when they were actually fielded:

Figure 2: Top 10 offensive contributors, La Liga 2015-2016,
assuming all players would have played all of their team's games

Key observations:

1.     The number 1 in terms of offensive contribution is Lionel Messi.
2.     Although Cristiano Ronaldo came closest to Messi in terms of offensive contribution, Neymar would have jumped Cristiano, had they both had the same playing time.
3.     All members of MSN as well as of BBC are included in the top ten, assuming all players played the same number of games.
4.     MSN’s combined offensive contribution is larger than BBC’s.
5.     There is one “ugly duckling” in the top 7 (8), otherwise made up entirely of “usual suspect” stars: Jonathan Viera of Las Palmas. The 26-year-old Canary Islander tends to remain under the radar of more traditional measures of offensive contribution.
6.     The top 10 is completed by club topscorers who had an exceptionally prolific season: Depor’s Lucas Pérez, Betis’ Rubén Castro, Bilbao’s Aduriz and Real Sociedad’s Agirretxe – who missed over half the season due to injury.
7.     Relative differences are quite substantial, e.g. Messi's offensive contribution is almost double the rankings' number 10.

While it is difficult to perform even better in the season following an exceptionally good one – among others, due to a principle known in statistics as “regression to the mean” – I particularly look forward to finding out whether Las Palmas’ Viera can really make a name for himself this season and whether Messi and Cristiano Ronaldo can remain at the very top, or whether youngster Neymar will make a move towards absolute supremacy.

Note that by means of our regression model, we are measuring correlations, not causation. This concretely means that, over the past five seasons, the selected variables were accurate predictors of offensive contribution in La Liga. However, it does not imply that a player’s future offensive contribution will move accordingly. Especially if players were to “act upon” the above model, e.g. by dribbling less and shooting or passing more instead, the included predictor variables may (or may not) lose part of their positive correlation with the dependent variable.

Let the new season commence! J