Skip to content

subodhss23/raw_real_world_data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Raw Real World Data

This repository contains raw datasets collected from real-world sources that can be used for practicing data analysis, exploratory data analysis (EDA), and building data projects. The goal is to work with messy, imperfect, and realistic data, similar to what analysts encounter in real business environments.

Many tutorials and courses provide clean and well-structured datasets, but real-world data is rarely that simple. These datasets are intentionally kept raw or minimally processed so analysts can practice the full workflow: cleaning data, transforming it, exploring patterns, and generating insights.


Purpose

The objective of this repository is to:

  • Practice real-world data analysis workflows
  • Work with imperfect and unstructured datasets
  • Improve skills in SQL, Python (Pandas), and spreadsheets
  • Develop strong exploratory data analysis (EDA) habits
  • Generate insights that can support decision-making, case studies, and portfolio projects

Possible Analyses

These datasets can be used to explore questions such as:

  • What patterns or trends exist in the data?
  • What anomalies or outliers can be identified?
  • How can the data be cleaned and transformed?
  • What insights could influence decision-making?
  • What visualizations best communicate the findings?

Tools You Can Use

You can analyze these datasets using tools such as:

  • SQL
  • Python (Pandas, NumPy)
  • Jupyter Notebook
  • Excel or Google Sheets
  • Data visualization tools such as Tableau or Power BI

Who This Repository Is For

This repository may be useful for:

  • Data analysts practicing real-world datasets
  • Students learning data analysis and exploratory data analysis
  • Developers experimenting with data processing
  • Anyone interested in exploring and generating insights from raw data

Contribution

If you have interesting real-world datasets or improvements, feel free to contribute by submitting a pull request.

About

A collection of real-world raw datasets used to practice data analysis, explore patterns, and generate actionable insights.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors