I used the "120-years-of-olympic-history-athletes-and-results" data set from Kaggle, which has information about athletes, their teams, medals, and gender.
The data has 271116 rows and includes:
- Event related:
  - City
  - Year
- Athlete related:
  - Sport
  - Age
  - Weight
  - Height
  - Medal
  - Name
  - Team
  - Region
  - Notes
I focused on the distribution of:
- age
- height
- weight

and grouped the outliers by sport for further analysis, to find sports whose distributions differ from the rest.
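
The outlier step described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the write-up does not say which outlier rule was used, so the sketch assumes Tukey's 1.5 x IQR fences computed over all athletes. The sample rows are made up, and the helper names `iqr_bounds` and `outliers_by_sport` are hypothetical. It uses only the Python standard library, so the same idea would carry over to whatever dataframe tool was actually used.

```python
from collections import defaultdict
from statistics import quantiles

# Made-up sample rows for illustration only; they are NOT taken from the data set.
rows = [
    {"Sport": "Basketball", "Height": 210.0},
    {"Sport": "Basketball", "Height": 175.0},
    {"Sport": "Basketball", "Height": 180.0},
    {"Sport": "Gymnastics", "Height": 140.0},
    {"Sport": "Gymnastics", "Height": 172.0},
    {"Sport": "Gymnastics", "Height": 170.0},
    {"Sport": "Swimming", "Height": 178.0},
    {"Sport": "Swimming", "Height": 176.0},
    {"Sport": "Swimming", "Height": 174.0},
    {"Sport": "Swimming", "Height": None},  # missing values are skipped
]


def iqr_bounds(values, k=1.5):
    """Return the (low, high) Tukey fences for a list of numbers.

    Assumption: the 1.5 x IQR rule; the original analysis may have used another rule.
    """
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    spread = q3 - q1
    return q1 - k * spread, q3 + k * spread


def outliers_by_sport(rows, column):
    """Count, per sport, the rows whose `column` value lies outside the overall fences."""
    values = [r[column] for r in rows if r[column] is not None]
    low, high = iqr_bounds(values)
    counts = defaultdict(int)
    for r in rows:
        value = r[column]
        if value is not None and (value < low or value > high):
            counts[r["Sport"]] += 1
    return dict(counts)


height_outliers = outliers_by_sport(rows, "Height")
print(height_outliers)
```

Sports that collect many outliers for a given measurement are candidates for a closer look at their own distribution.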
I then looked at the frequency of athletes and generated several plots broken down by:
- sex
- year
- medals
- season
- height and weight

These plots led to the following observations:
- women's participation grew to be similar to men's in the winter season
- the ideal height, weight, and age of the athletes who win most often
- sports that are no longer held at the Olympics, such as art competitions
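
The frequency counts behind plots like these can be sketched as below. This is a minimal illustration, not the original analysis code: the column names `Sex` and `Season` are assumed, the sample entries are made up, and the helpers `count_by` and `female_share_by_year` are hypothetical names. Only the Python standard library is used.

```python
from collections import Counter

# Made-up sample entries for illustration only; they are NOT taken from the data set.
entries = [
    {"Year": 1924, "Season": "Winter", "Sex": "M"},
    {"Year": 1924, "Season": "Winter", "Sex": "M"},
    {"Year": 1924, "Season": "Winter", "Sex": "M"},
    {"Year": 1924, "Season": "Winter", "Sex": "F"},
    {"Year": 2014, "Season": "Winter", "Sex": "M"},
    {"Year": 2014, "Season": "Winter", "Sex": "F"},
    {"Year": 2014, "Season": "Winter", "Sex": "F"},
    {"Year": 2014, "Season": "Winter", "Sex": "M"},
    {"Year": 2012, "Season": "Summer", "Sex": "M"},
    {"Year": 2012, "Season": "Summer", "Sex": "F"},
]


def count_by(rows, *keys):
    """Count rows per combination of the given columns, e.g. ("Season", "Sex")."""
    return Counter(tuple(r[k] for k in keys) for r in rows)


def female_share_by_year(rows, season):
    """Return {year: fraction of entries that are women} for one season."""
    totals = Counter()
    women = Counter()
    for r in rows:
        if r["Season"] != season:
            continue
        totals[r["Year"]] += 1
        if r["Sex"] == "F":
            women[r["Year"]] += 1
    return {year: women[year] / totals[year] for year in sorted(totals)}


season_sex_counts = count_by(entries, "Season", "Sex")
winter_share = female_share_by_year(entries, "Winter")
print(season_sex_counts)
print(winter_share)
```

A share that approaches 0.5 over the years would correspond to the women's trend becoming similar to the men's; the counts from `count_by` are the kind of values a bar or line plot per year, season, or medal would show.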
- to reach that: