Skip to content

shaikhadibbb/data-preprocessing-numpy-pandas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Data Preprocessing using NumPy & Pandas

This project covers data cleaning, preprocessing and analysis on a student dataset using NumPy and Pandas.

What it does

  • Part 1: NumPy operations - mean, median, max, min, normalization
  • Part 2: Pandas exploration - data types, missing values, filtering
  • Part 3: Data preprocessing - handling missing values, datetime conversion, outlier detection, removing duplicates
  • Part 4: Data analysis - average scores, top students, correlation, groupby

Dataset

The dataset (student_data.csv) contains student records with fields like name, gender, math/science/english scores, attendance and exam date.

How to run

pip install numpy pandas
python assignment2_solution.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages