Project Overview
This project performs Exploratory Data Analysis (EDA) on the Iris dataset, which describes three species of iris flowers (Setosa, Versicolor, and Virginica) using four numerical features.
150 samples, 4 features, 3 species
Dataset Information
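The Iris dataset is a well-known dataset in machine learning and statistics. Seaborn makes it available through `sns.load_dataset("iris")`, which fetches the data from Seaborn's online data repository on first use and caches it locally.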
Feature | Description |
---|---|
Sepal Length | Length of sepal in cm |
Sepal Width | Width of sepal in cm |
Petal Length | Length of petal in cm |
Petal Width | Width of petal in cm |
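As a minimal sketch of how the dataset described above might be loaded, the snippet below wraps Seaborn's `load_dataset` call and checks that the expected columns are present. The column names (`sepal_length`, `sepal_width`, `petal_length`, `petal_width`, `species`) follow Seaborn's copy of the data; the helper names `load_iris` and `has_expected_columns` are illustrative and not part of this project.

```python
import pandas as pd
import seaborn as sns

# Column names as they appear in Seaborn's copy of the Iris data.
FEATURES = ["sepal_length", "sepal_width", "petal_length", "petal_width"]
TARGET = "species"


def load_iris() -> pd.DataFrame:
    """Return the Iris data as a DataFrame (hypothetical helper).

    sns.load_dataset downloads the CSV on first use and caches it,
    so the first call needs network access.
    """
    return sns.load_dataset("iris")


def has_expected_columns(df: pd.DataFrame) -> bool:
    """Check that the four feature columns and the species column exist."""
    return set(FEATURES + [TARGET]).issubset(df.columns)
```

Calling `load_iris()` should give a DataFrame with 150 rows, and `has_expected_columns(load_iris())` is a quick sanity check before any analysis.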
Exploratory Data Analysis
1. Data Preprocessing
- Checking for missing values
- Summary statistics
- Understanding distributions
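The preprocessing checks listed above could be sketched roughly as follows. This is one possible approach using pandas, not necessarily the exact code used in this project; the function name `summarize` and the default `target="species"` column name are assumptions.

```python
import pandas as pd


def summarize(df: pd.DataFrame, target: str = "species") -> dict:
    """Collect basic EDA checks for a DataFrame (hypothetical helper)."""
    numeric = df.select_dtypes(include="number")
    return {
        # Number of missing values in each column
        "missing": df.isna().sum(),
        # count, mean, std, min, quartiles, and max for each numeric feature
        "summary": numeric.describe(),
        # Skewness as a rough indicator of each distribution's shape
        "skew": numeric.skew(),
        # Number of samples per species, to check class balance
        "class_counts": df[target].value_counts(),
    }
```

Each entry in the returned dictionary is a pandas object, so it can be printed directly or inspected further, for example `summarize(iris)["missing"]`.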
2. Data Visualization
- Pairplot - Feature relationships
- Boxplot - Outlier detection
- Heatmap - Correlation analysis