Iris Dataset Exploratory Data Analysis

Understanding flower species through data visualization and statistical analysis

View Code

Project Overview

This project performs Exploratory Data Analysis (EDA) on the Iris dataset, containing information about three species of iris flowers (Setosa, Versicolor, and Virginica) based on four numerical features.

150

Samples

4

Features

3

Species

Dataset Information

The Iris dataset is a well-known dataset in machine learning and statistics, built into Seaborn.

Feature Description
Sepal Length Length of sepal in cm
Sepal Width Width of sepal in cm
Petal Length Length of petal in cm
Petal Width Width of petal in cm

Exploratory Data Analysis

1. Data Preprocessing

  • Checking for missing values
  • Summary statistics
  • Understanding distributions

2. Data Visualization

  • Pairplot - Feature relationships
  • Boxplot - Outlier detection
  • Heatmap - Correlation analysis

Data Visualizations

Pairplot

Boxplot

Heatmap

Implementation Code