Investigating Movie Datasets
The dataset selected for analysis is a collection of information about movies. It contains 21 columns, each recording one attribute of a movie. Below is a description of each column:
tmdb-movies
id: Unique identifier for each movie.
imdb_id: IMDb identifier for each movie.
popularity: The popularity score of the movie.
budget: The budget allocated for the production of the movie.
revenue: The revenue generated by the movie.
original_title: The original title of the movie.
cast: Names of the main cast members of the movie.
homepage: The URL of the movie's official website.
director: Name of the director of the movie.
tagline: The tagline associated with the movie.
keywords: Keywords associated with the movie.
overview: A brief overview or summary of the movie.
runtime: The duration of the movie in minutes.
genres: The genres associated with the movie.
production_companies: The production companies involved in making the movie.
release_date: The release date of the movie.
vote_count: The number of votes received by the movie.
vote_average: The average rating given to the movie.
release_year: The year in which the movie was released.
budget_adj: The budget adjusted for inflation.
revenue_adj: The revenue adjusted for inflation.
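As a rough illustration of how the columns above might be read in, here is a minimal sketch using only the Python standard library. The choice of which columns are numeric, the helper name load_movies, and the single sample row are assumptions made for this example; the row is made up and does not come from the dataset.

```python
import csv
import io

# Column names as listed in the description above.
EXPECTED_COLUMNS = [
    "id", "imdb_id", "popularity", "budget", "revenue", "original_title",
    "cast", "homepage", "director", "tagline", "keywords", "overview",
    "runtime", "genres", "production_companies", "release_date",
    "vote_count", "vote_average", "release_year", "budget_adj", "revenue_adj",
]

# Assumption: these columns hold numbers; everything else is kept as text.
NUMERIC_COLUMNS = {
    "popularity", "budget", "revenue", "runtime", "vote_count",
    "vote_average", "release_year", "budget_adj", "revenue_adj",
}


def load_movies(handle):
    """Read movie rows from an open CSV file and convert numeric fields."""
    reader = csv.DictReader(handle)
    header = reader.fieldnames or []
    missing = [name for name in EXPECTED_COLUMNS if name not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    movies = []
    for row in reader:
        for name in NUMERIC_COLUMNS:
            text = (row[name] or "").strip()
            # Empty cells become None rather than 0, so they can be filtered out later.
            row[name] = float(text) if text else None
        movies.append(row)
    return movies


# Made-up single row, for illustration only (not taken from the real dataset).
sample_values = [
    "1", "tt0000001", "1.5", "100", "250", "Example Film", "A|B", "",
    "Jane Doe", "", "", "", "90", "Drama", "Studio X", "1/15/2000",
    "10", "6.5", "2000", "120", "300",
]
sample_csv = ",".join(EXPECTED_COLUMNS) + "\n" + ",".join(sample_values) + "\n"

movies = load_movies(io.StringIO(sample_csv))
print(movies[0]["original_title"], movies[0]["budget"], movies[0]["runtime"])
```

With a real file, the same function could be called on an open file handle instead of the in-memory sample. Checking the header first makes a renamed or missing column fail loudly instead of surfacing later as a confusing error.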
List of analysis questions:
1. How does the budget of a movie correlate with its revenue?
- Dependent Variable: Revenue
- Independent Variable: Budget
2. Are certain production companies associated with higher revenue-generating movies?
- Dependent Variable: Revenue
- Independent Variable: Production company
3. Does the runtime of a movie affect its popularity?
- Dependent Variable: Popularity score
- Independent Variable: Runtime
4. Are movies released in certain months more likely to have higher revenues?
- Dependent Variable: Revenue
- Independent Variable: Release month
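The four questions above reduce to two operations: a correlation between two numeric columns (questions 1 and 3) and an average of revenue within groups (questions 2 and 4). The sketch below shows one possible way to compute both with the standard library. The three rows are made up for illustration, and two format details are assumptions: that several production companies share one cell separated by "|", and that release_date starts with the month, as in "6/9/2015".

```python
from collections import defaultdict
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def mean_by_group(rows, key_func, value_name):
    """Average row[value_name] for every group key returned by key_func(row)."""
    totals = defaultdict(lambda: [0.0, 0])
    for row in rows:
        for key in key_func(row):
            totals[key][0] += row[value_name]
            totals[key][1] += 1
    return {key: total / count for key, (total, count) in totals.items()}


def companies(row):
    # Assumption: several companies are separated by "|" in one cell.
    return row["production_companies"].split("|")


def release_month(row):
    # Assumption: release_date begins with the month, e.g. "6/9/2015".
    return [int(row["release_date"].split("/")[0])]


# Made-up rows, for illustration only (not taken from the real dataset).
rows = [
    {"budget": 10.0, "revenue": 30.0, "runtime": 90.0, "popularity": 1.0,
     "production_companies": "Studio A|Studio B", "release_date": "6/9/2015"},
    {"budget": 20.0, "revenue": 50.0, "runtime": 100.0, "popularity": 2.0,
     "production_companies": "Studio A", "release_date": "6/20/2014"},
    {"budget": 30.0, "revenue": 70.0, "runtime": 120.0, "popularity": 2.5,
     "production_companies": "Studio B", "release_date": "12/1/2013"},
]

# Question 1: budget vs. revenue.
budget_revenue_r = pearson([r["budget"] for r in rows], [r["revenue"] for r in rows])
# Question 2: average revenue per production company.
by_company = mean_by_group(rows, companies, "revenue")
# Question 3: runtime vs. popularity.
runtime_popularity_r = pearson([r["runtime"] for r in rows], [r["popularity"] for r in rows])
# Question 4: average revenue per release month.
by_month = mean_by_group(rows, release_month, "revenue")

print(budget_revenue_r, by_company, runtime_popularity_r, by_month)
```

A movie with several production companies counts once toward each of them here, which is one design choice among several. A correlation coefficient only measures linear association, so a high value for question 3 would not by itself show that runtime affects popularity.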













