Data Science Class. Project 1 Task
- get the data (metacritics)
- join the data
- investigate the mojo
- investigate the metacritics
Ideas
- look at distribution/ histogram
- missing values, invalid, duplicates, outliers
- overrall spend trend over time
- rate of return = domestic gross / budget, relationship with other variables target: domestic gross take x: budget len(genre) categorical variables with genre
season: spring: Jan to April summer: (may 01- jul) fall: (aug-oct) christmas: (Nov - Dec )
studios: group them to top 10 and other top_studio_ind = 1 for top 10 studio else 0
director: established_directors_ind = any one with more than 10 films, 1, else 0
- seasonality
- look at director, find money making directors / actors
- genere,
- how important is the first week w/total
- should you pay attention to critics? (critics ~ gross take)