This project is part of my internship at Prodigy Infotech (June 2025) under the Data Science domain.
Perform data cleaning and exploratory data analysis on the Titanic dataset to uncover meaningful patterns, trends, and relationships between variables.
- Python
- Pandas, NumPy
- Matplotlib, Seaborn
- Loaded the Titanic dataset (
train.csv) - Handled missing values (
Age,Embarked, droppedCabin) - Performed EDA with visualizations:
- Survival distribution
- Survival by gender and class
- Age distribution
- Correlation heatmap
- Females had a higher survival rate than males
- Passengers in 1st class had better chances of survival
- Most passengers were aged between 20β40
π©βπ» Anika
BTech CSE (AI & DS) | Intern at Prodigy Infotech