This project performs exploratory data analysis (EDA) on the Netflix Titles dataset to uncover trends and insights about the platform's global content distribution, genre preferences, and content evolution over time.
- File: netflix_titles.csv
- Total Records: ~8,800+
- Features: Title, Type, Director, Cast, Country, Release Year, Rating, Duration, Genres, and more.
- 📅 Analyze release trends over the years
- 🌍 Identify top countries producing Netflix content
- 🎬 Explore popular genres and content types
- 🎭 Analyze most frequent actors and directors
- 📌 Understand the distribution of Movies vs TV Shows
- 📈 Visualize missing data and key patterns using Python
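
Several of these objectives reduce to counting values in a column. The following is a minimal sketch, not the notebook's actual code. It assumes the CSV exposes `type`, `country`, and `cast` columns, with multiple countries or actors stored as a comma-separated string. The `sample` frame and the helper names `type_counts` and `top_values` are hypothetical and used only for illustration.

```python
import pandas as pd

# Tiny made-up sample standing in for netflix_titles.csv.
# The column names ("type", "country", "cast") are assumptions about the CSV header.
sample = pd.DataFrame(
    {
        "type": ["Movie", "TV Show", "Movie", "Movie"],
        "country": ["United States, India", "India", None, "United States"],
        "cast": ["A, B", "B", "C, A", None],
    }
)


def type_counts(df: pd.DataFrame) -> pd.Series:
    """Count titles per content type (Movie vs TV Show)."""
    return df["type"].value_counts()


def top_values(df: pd.DataFrame, column: str, n: int = 10) -> pd.Series:
    """Count entries in a comma-separated column, one count per listed value."""
    values = (
        df[column]
        .dropna()          # skip titles with no value recorded
        .str.split(",")    # "United States, India" -> ["United States", " India"]
        .explode()         # one row per listed value
        .str.strip()       # remove the whitespace left around each value
    )
    values = values[values != ""]
    return values.value_counts().head(n)


movie_tv = type_counts(sample)
countries = top_values(sample, "country")
actors = top_values(sample, "cast")
```

To run this against the real data, replace `sample` with `pd.read_csv("netflix_titles.csv")`. Splitting before counting matters because a co-production would otherwise be counted as its own "country".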
- `pandas` for data analysis
- `numpy` for numerical operations
- `matplotlib` and `seaborn` for data visualization
- `missingno` for visualizing missing data
- `wordcloud` for visualizing top actors and genres
- Movie vs TV Show distribution pie chart
- Year-wise content addition bar graph
- Top 10 countries by content volume
- Wordclouds for cast and genre
- Heatmaps of missing data
- Rating-wise content analysis
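
The numbers behind some of these charts can be prepared before any plotting. The sketch below is an illustration under assumptions, not the notebook's implementation. It assumes a `date_added` column holding strings such as "September 25, 2021" and a `rating` column. The `sample` data and the helper names `missing_share` and `titles_added_per_year` are hypothetical.

```python
import pandas as pd

# Made-up rows; the column names and the date format are assumptions about the CSV.
sample = pd.DataFrame(
    {
        "title": ["T1", "T2", "T3", "T4"],
        "director": ["D1", None, None, "D2"],
        "date_added": ["September 25, 2021", " August 4, 2017", None, "January 1, 2021"],
        "rating": ["PG-13", "TV-MA", "TV-MA", None],
    }
)


def missing_share(df: pd.DataFrame) -> pd.Series:
    """Fraction of missing values per column, largest first."""
    return df.isna().mean().sort_values(ascending=False)


def titles_added_per_year(df: pd.DataFrame, column: str = "date_added") -> pd.Series:
    """Number of titles added per calendar year, ordered by year."""
    added = pd.to_datetime(
        df[column].str.strip(),   # drop stray leading/trailing whitespace
        format="%B %d, %Y",       # assumed format, e.g. "September 25, 2021"
        errors="coerce",          # unparseable or missing dates become NaT
    )
    return added.dt.year.dropna().astype(int).value_counts().sort_index()


missing = missing_share(sample)
per_year = titles_added_per_year(sample)
rating_counts = sample["rating"].value_counts()
```

Each resulting Series can be passed to a plotting call, for example `per_year.plot(kind="bar")` for the year-wise bar graph.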
- Clone the repo: `git clone https://github.com/shruthika-tr/Netflix-Data-Analytics.git`, then `cd Netflix-Data-Analytics`
- Install required libraries: `pip install -r requirements.txt`
- Launch the Jupyter Notebook: `jupyter notebook netflix_data_analytics.ipynb`