avatarAlex Marin Felices

Summary

The research paper introduces the VPEP metric, a novel approach to quantify the effectiveness of pressing in football by analyzing the risk-reward trade-off between ball recovery and defensive structure.

Abstract

The paper presents a comprehensive analysis of pressing in football, introducing the Value of Pressing Effectiveness (VPEP) metric as a new tool to measure the defensive tactic's efficiency. This metric, inspired by the VAEP metric, evaluates the balance between the benefits of regaining possession and the potential risks to the team's defensive organization. Using data from StatsBomb, the authors analyze matches from the 2018/19 European leagues to objectively rate pressing decisions and use machine learning to estimate the probabilities of ball recovery and conceding opportunities. The VPEP framework is grounded in SPADL language for action description and employs binary classifiers for predicting pressing outcomes. The paper discusses the feature set used to train the classifiers, including SPADL, complex, and game context features, and pressure-related metrics. Experiments with real-world data demonstrate the classifiers' accuracy and calibration, providing insights into team pressing performance. The research reveals the potential of the VPEP metric to help teams optimize their pressing strategies by identifying strengths and weaknesses, and it underscores the impact of fatigue on pressing effectiveness.

Opinions

  • The authors believe that effective pressing is crucial for disrupting opponents' play, despite its inherent risks.
  • They opine that the VPEP metric is superior to traditional metrics like PPDA, as it offers a more nuanced view of pressing effectiveness by considering the quality and context of pressing actions.
  • The paper suggests that teams like SD Eibar, known for their intense wing-oriented pressing, can be highly effective in limiting opponents' shot opportunities.
  • The authors highlight the importance of player fitness and the ability to sustain pressing throughout a match, as evidenced by the decline in pressing effectiveness due to fatigue.
  • They propose that the VPEP metric can be a valuable tool for club analysts to assess their team's defensive strategy and exploit opponents' vulnerabilities.
  • The research emphasizes the potential of machine learning, particularly XGBoost stacked with logistic regression, in providing well-calibrated probability estimates for pressing actions.
  • The authors convey that understanding the trade-offs involved in pressing can lead to more informed decision-making by coaches and players alike.

Understanding the Effectiveness of Pressing in Different Game Scenarios

Quantifying Pressing Effectiveness in Football with A Novel Metric Revealing the Trade-off Between Ball Recovery and Defensive Structure

Introduction

The introduction of the research paper highlights the importance of pressing in football defense and its impact on a team’s success. Effective pressing can disrupt opponents’ play, but it also poses risks. The paper introduces the VPEP metric, inspired by the VAEP metric, as a novel approach to quantify the effectiveness of pressing actions. Using event data from StatsBomb, the authors analyze matches from the 2018/19 seasons of the top European leagues. They discuss the objective rating of pressing decisions as a risk-reward trade-off and propose using machine learning to express the risk and reward as probabilities. The paper concludes by highlighting the potential of the VPEP metric in identifying weaknesses in team and opponent pressing strategies.

The VPEP metric is heavily inspired by VAEP metric according to the authors. If you want to explore this metric you can read any of the articles I posted about it.

VPEP: A framework to assess the risk-reward trade-off of pressing

The section discusses the risk-reward trade-off involved in pressing and introduces the VPEP framework as a method to assess this trade-off. Pressing aims to quickly win the ball back and prevent the opposition from developing an attack. However, an aggressive high press can also leave defensive vulnerabilities and lead to conceding goal-scoring opportunities. The decision to press requires players to evaluate the rewards of regaining possession versus the risks of conceding a goal-scoring opportunity. The framework calculates the probabilities of ball recovery (P_recovery) and conceding a goal-scoring opportunity (P_attack) based on game states (S_i=[a_1,…a_i]) and pressure events (p_i).

They use the SPADL language (also explained in the first article linked above) to describe these actions. SPADL describes each action in terms of nine attributes.

The metrics ∆P_recovery and ∆P_attack represent the changes in these probabilities with and without pressing. The total value of a pressure event (V(Si)) is then calculated by combining these changes, with a factor C (in their case C=5) to account for the higher cost of failure compared to the reward of success.

Estimating the success and failure probabilities of pressing

Next, the authors talk about the process of estimating the success and failure probabilities of pressing. To achieve this, the authors employ binary probabilistic classifiers trained on historical match data. The goal is to estimate the probabilities P_recovery(Si, pi) and P_attack(Si, pi) for a given game state Si and set of pressure events pi by the defending team during that game state. The authors choose XGBoost stacked with a logistic regression model as their classifier, ensuring well-calibrated probability estimates. To train the classifier, each game state is converted into a feature-vector format. The paper then proceeds to explain the computation of labels and features for each game state.

Labels

Here the paper focuses on the assignment of labels for the classification problems related to estimating the probabilities of ball recovery (Precovery) and conceding a shot (Pattack) for a given game state Si and set of pressure events pi. For Precovery the authors assign a positive label (1) to a game state Si if the defending team concedes a shot within the subsequent k actions, and a negative label (0) in all other cases. On the other hand, for Pattack the authors assign a positive label (1) to a game state Si if the defending team recovers the ball within the subsequent k actions, and a negative label (0) in all other cases.

The parameter k represents the lookahead distance in actions, determining how far ahead in the future the effects of an action are considered. For Precovery, the authors choose k = 4 to capture the short-term objective of regaining possession quickly, while for Pattack, k = 7 is chosen. The selection of these values is based on the observed significant impact of pressing on Precovery and Pattack. Additionally, the authors highlight the preference for k = 4 for Precovery to better align with the goal of regaining possession within a short time frame of approximately five seconds.

Features

In this section, the paper discusses the features used to describe a game state Si and their impact on the probabilities of ball recovery and conceding a goal-scoring opportunity. Instead of considering the entire game state, the authors focus on the previous three actions [ai-2, ai-1, ai] as they contain the most relevant information in the current context. Based on the SPADL representation, three categories of features are defined:

Table 1: The difference in probability (%) to recover the ball ( Precovery) and to incur a goal-scoring opportunity (Pattack) in the next k actions after an action that happened under pressure vs after an action that happened free of any pressure.”
  1. SPADL features: Categorical and real-valued features are derived from the explicit information provided in the SPADL representation. This includes features such as the action type, result, body part used, (x, y)-coordinates of the action’s start and end locations, and time elapsed since the start of the game.
  2. Complex features: These features combine information within an action and across consecutive actions. They include the distance and angle to the goal for both the action’s start and end locations, the distance covered during the action in the x and y directions, the distance and elapsed time between consecutive actions, and whether the ball changed possession. These features provide insights into the current speed of play.
  3. Game context features: These features capture the game’s overall context, including the number of goals scored by the team possessing the ball and the defending team after action ai, as well as the goal difference. These features account for the fact that teams often adapt their playing style based on the current scoreline.
  4. Pressure features: These features are associated with each pressure event and include the distance between the defender and the ball, the angle between the defender’s goal, the ball, and the defender, and the time delay between the start of the action and the start of the pressure. These features provide information about the proximity and timing of defensive pressure.

By incorporating these features, the model captures important aspects of the game state that influence the probabilities of ball recovery and conceding a goal-scoring opportunity, enabling a more comprehensive analysis of pressing effectiveness.

Experiments

The experiments conducted to evaluate the proposed metric are explained together with its underlying machine learning models. Since there are no ground truth ratings available for pressing, the evaluation focuses on the performance of the binary probabilistic classifiers that estimate Precovery and Pattack.

The experiments are conducted using StatsBomb event data from the 2018/19 season of the top divisions in English, Spanish, German, Italian, and French leagues. Due to the limited dataset, a separate training set is not used. Instead, the first 80% of the games in each league are used to train the classification models using the XGBoost algorithm and the previously described feature set. The remaining 20% of the games are used as a validation set to prevent overfitting.

The learned models are then applied to all 1826 games in the dataset to generate VPEP ratings for the entire 2018/19 season. The experiments aim to assess the performance of the models and provide insights into how the proposed framework behaves and compares to other metrics. Additionally, the paper presents use cases demonstrating how teams can leverage the VPEP metric to identify weaknesses in their pressing strategies.

By conducting these experiments on real-world match data, the paper validates the effectiveness and applicability of the proposed framework in evaluating pressing strategies and providing valuable insights to teams.

Evaluation of the probabilistic classifiers

The next step is to evaluate the probabilistic classifiers used in the VPEP framework. Again, since there are no objective guidelines for determining the effectiveness of pressing in different game contexts, the evaluation is based on the components of the metric itself. The evaluation considers two aspects: accuracy and calibration.

The accuracy of the classifiers is assessed by measuring their ability to accurately predict whether the defending team will recover the ball or concede a goal-scoring opportunity in the near future. The area under the receiver operating characteristic curve (ROC AUC) is used as the evaluation metric. With a ROC AUC of 0.902 for Precovery and 0.806 for Pattack, the classifiers demonstrate a reasonably high accuracy in distinguishing between positive and negative examples.

The calibration of the predicted probabilities is another important aspect of the evaluation. The paper emphasizes the importance of the predicted probabilities aligning with the true underlying probability distribution of the data. To evaluate this, the authors examine whether the predicted probabilities correspond to the actual occurrence of events in the data. They collect game states for which the model predicts a specific recovery probability and assess whether the observed recovery rate matches the predicted probability. The probability calibration curves shown in Figure 1 demonstrate that the classifiers achieve good calibration.

Fig. 1: Probability calibration curves and histograms of the predicted probabilities. Both the probabilities for Precovery (left) and Pattack (right) are wellcalibrated.”

Overall, the evaluation of the probabilistic classifiers shows that they are capable of accurately predicting recovery and conceding probabilities and exhibit good calibration. These findings validate the reliability and effectiveness of the classifiers within the VPEP framework.

Intuition behind the values

The intuition behind the values calculated by the VPEP framework is described using a specific example of a pressing sequence by Barcelona against Huesca. The sequence involves three pressure events and highlights the varying ratings assigned to each event based on their effectiveness and impact on the opponent’s decision-making.

The example starts with Barcelona’s midfielder Rakitić putting pressure on Huesca’s left-back Luisinho, who is forced to pass the ball backwards to his teammate Etxeita due to the pressure. At this point, Rakitić’s pressure event receives a rating of 0.040, indicating a relatively low risk and a moderate probability of ball recovery. The pass from Etxeita to the goalkeeper Werner is also under pressure from Messi, earning a rating of 0.033. Similar to the previous event, the risk is low, and the recovery probability remains modest.

Fig. 2: A successful pressing sequence of Barcelona in their game against Huesca on September 2, 2018.”

However, the most impactful pressure event in the sequence occurs when Luis Suárez presses the goalkeeper Werner. This event receives a significantly higher rating of 0.181. Suárez’s aggressive pressing strategy effectively corners Werner, leaving him with limited options. Consequently, Werner is forced to kick an inaccurate long pass, which is ultimately recovered by Barcelona’s Coutinho. The high rating for Suárez’s pressing event reflects the substantial value it adds to the team’s pressing strategy.

The example illustrates the differential ratings assigned to pressure events based on their effectiveness and impact on the opponent’s decision-making process. Events that push the opponent backward and to the sideline, while still maintaining a relatively low recovery probability, receive lower ratings. Conversely, events that effectively restrict the opponent’s options and force them into unfavorable decisions receive higher ratings.

This intuition behind the values generated by the VPEP framework provides valuable insights into the effectiveness of specific pressure events within a pressing sequence. By quantifying the impact of each event, coaches and football enthusiasts can gain a better understanding of the value added by different pressing strategies and make informed decisions to optimize their team’s defensive tactics.

Identifying the best pressing teams

In this section, the paper focuses on identifying the best pressing teams based on the VPEP metric and provides comparisons with other existing metrics. The rankings in Table 2 reveal that top teams such as Bayern Munich, Napoli, Manchester City, Barcelona, and Borussia Dortmund are consistently recognized for their pressing style. However, SD Eibar emerges as a surprising leader in the rankings. Eibar’s tactical system stands out with its high and intense wing-oriented pressing, where they crowd one wing, making the opposite end inaccessible.

Table 2: Teams with the highest average VPEP rating per pressure event. For comparison, the number of pressure events per game and the Passes Per Defensive Action (PPDA) are reported too. The rank of each team according to these metrics is written between parentheses.”

Notably, Eibar also boasts the fourth-lowest shots conceded per game among the big-5 leagues in the 2018/19 season, allowing just 8.4 shots per game. Only Manchester City, Bayern Munich, and Liverpool conceded fewer shots. This indicates that Eibar’s high press is highly effective in limiting opponents’ shot opportunities. However, it is mentioned that Eibar is vulnerable to conceding goals once their press is broken, with the shots they do concede being of high quality according to xG/shot data.

The paper also compares the VPEP metric with other commonly used metrics for quantifying pressing, such as “passes per defensive action” (PPDA). PPDA measures the ratio of passes allowed by the defending team to the total number of defensive actions, reflecting the level of defensive intensity and the frequency of pressing. The comparison reveals a strong correlation between PPDA and VPEP, where teams with low PPDA values tend to achieve high VPEP ratings. However, the metrics capture different aspects of pressing. While PPDA measures intensity, VPEP evaluates the intelligent execution of pressing. This means that teams with a high PPDA can still obtain a high VPEP score if they press strategically and effectively. For instance, Borussia Dortmund adopts a more passive defensive strategy but presses intelligently in specific situations, leading to a relatively high PPDA value but a lower rank in terms of VPEP.

By analyzing and comparing these metrics, the paper provides insights into the effectiveness of different pressing strategies and highlights the importance of intelligent pressing in addition to intensity. The VPEP metric offers a comprehensive evaluation of pressing by considering both the frequency and quality of pressure events, enabling a more nuanced understanding of a team’s pressing performance.

How fatigue affects the effectiveness of pressing

Finally, the paper explores how fatigue affects the effectiveness of pressing in football. It emphasizes that pressing requires exceptional fitness levels from players in order to effectively apply pressure and restrict the options of the opposition. The ability to sustain pressing throughout the entire 90 minutes of a match is challenging and typically requires extensive training.

The results presented in Figure 3 demonstrate the impact of fatigue on pressing effectiveness. The average VPEP rating for all teams is analyzed and grouped into 15-minute periods, while the total number of pressure events in each period is also considered. Although the number of pressure events remains relatively consistent throughout the match, the average VPEP value decreases as the game progresses towards the end.

Fig. 3: Average VPEP value during the 2018/19 per fifteen-minute game period for all teams (Left) and the total number of pressing events during each period (right). While the total number of pressing events does not change significantly as the games progress, the VPEP values decrease consistently over the course of a game.”

To illustrate the impact of fatigue on pressing, the paper provides an example of Liverpool under Jürgen Klopp (Figure 4). Klopp is renowned for his pressing tactics; however, the data suggests that Liverpool’s pressing intensity diminishes after the 30-minute mark. This finding contrasts with their Premier League title competitor in the 2018/19 season, Manchester City. Pep Guardiola’s team managed to sustain effective pressing throughout the entire game, with only a slight decline observed during the final fifteen minutes of each half.

Fig. 4: Average VPEP value during the 2018/19 per fifteen-minute game period for Manchester City (Left) and Liverpool (right). While Manchester City’s pressing only tapers off slightly during the final quarter of each half, Liverpool’s pressing already tapers off after the 30 minute mark.”

These observations highlight the significance of fitness levels and training in maintaining a high-intensity pressing style. Teams that can sustain their pressing efforts without significant decline throughout the match are more likely to maintain their effectiveness and disrupt their opponents’ play consistently. Understanding the impact of fatigue on pressing allows coaches and teams to develop appropriate strategies to manage and optimize pressing performance over the course of a game.

Conclusion

In conclusion, this research paper proposes the VPEP (Value of Pressing Effectiveness) metric as a novel approach for quantifying the effectiveness of pressing in various game scenarios. Unlike existing metrics, the VPEP metric considers the contextual factors of pressing, taking into account the trade-off between the benefits of ball recovery and the associated risks to the defensive structure.

The paper highlights the limited availability of pressing-related information in event stream data. However, despite these limitations, the VPEP metric proves valuable for club analysts in assessing their team’s defensive strategy and gaining insights into the weaknesses of their opponents.

By introducing the VPEP metric, the paper contributes to the field of football analytics by providing a more nuanced and context-aware approach to evaluating pressing effectiveness. This metric can aid in decision-making processes related to defensive tactics and help teams optimize their pressing strategies based on a better understanding of the risks and rewards involved.

Overall, the VPEP metric offers a promising avenue for future research and analysis in the field of football analytics, and it has the potential to enhance the assessment and improvement of pressing strategies in the game.

References

Learn More

Football
Soccer
Football Analytics
Data Science
Sports
Recommended from ReadMedium