The purpose of this analysis is to explore the players from the top European competition in order to find useful information that could be used to compare, recommend or evaluate a player.
We have considered players who play in: Serie A (Italy), Ligue 1 (France), Premier League (England), La Liga (Spain). The datasets were taken from https://fbref.com/en/.
The project is organized in two Jupyter notebook in which are done different type of analysis:
- Clustering_portieri: In this notebook are analyzed the goalkeeper, with the aim of:
-
recognize the different types of goalkeepers on the basis of statistics
-
gather goalkeepers based on their characteristics
-
compare the characteristics of two or more goalkeepers
-
recommend a new goalkeeper to a team
-
evaluate and identify the general change in the characteristics of the goalkeepers over time
- AnalisiComplementare: In this notebook are analyzed all the outfield players with the following purpose:
- recognize players who could perform better in a role other than the one assigned to them
- compare the statistics of different role
- Kmeans
- Agglomerative Clustering
- Decision Tree classifier
- Principal Component Analysis
- NumPy
- Pandas
- Plotly
- Sklearn
- Matplotlib