Skip to content

shruthika-tr/Netflix-Data-Analytics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

📊 Netflix Data Analytics

This project performs exploratory data analysis (EDA) on the Netflix Titles dataset to uncover trends and insights about the platform's global content distribution, genre preferences, and content evolution over time.


📁 Dataset

  • File: netflix_titles.csv
  • Total Records: ~8,800+
  • Features: Title, Type, Director, Cast, Country, Release Year, Rating, Duration, Genres, and more.

🔍 Objectives

  • 📅 Analyze release trends over the years
  • 🌍 Identify top countries producing Netflix content
  • 🎬 Explore popular genres and content types
  • 🎭 Analyze most frequent actors and directors
  • 📌 Understand the distribution of Movies vs TV Shows
  • 📈 Visualize missing data and key patterns using Python

🧰 Tools & Libraries

  • pandas for data analysis
  • numpy for numerical operations
  • matplotlib and seaborn for data visualization
  • missingno for missing data handling
  • wordcloud for visualizing top actors and genres

📊 Key Visualizations

  • Movie vs TV Show distribution pie chart
  • Year-wise content addition bar graph
  • Top 10 countries by content volume
  • Wordclouds for cast and genre
  • Heatmaps of missing data
  • Rating-wise content analysis

▶️ How to Run

  1. Clone the repo

    git clone https://github.com/shruthika-tr/Netflix-Data-Analytics.git cd Netflix-Data-Analytics

  2. Install required libraries

    pip install -r requirements.txt

  3. Launch the Jupyter Notebook

    jupyter notebook netflix_data_analytics.ipynb

About

Exploratory data analysis and visualization of Netflix titles using Python and pandas.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors