📊 CSV Profiler Tool
A comprehensive Python-based tool designed to analyze, profile, and summarize CSV data. This project was developed as part of the AI Professionals Bootcamp (SDAIA) to demonstrate core Python mastery, OOP principles, and modern application development.
🌟 Overview
The CSV Profiler helps users quickly understand the structure and quality of their datasets. It infers column data types, calculates key statistics for numerical columns, and reports missing values. The tool offers two interfaces: a Command Line Interface (CLI) and an interactive Web GUI.
✨ Features
- Smart Type Inference: Automatically detects if a column is numeric, text, or boolean.
- Data Statistics: Calculates Min, Max, Mean, and Median for numerical data.
- Missing Value Analysis: Reports the count and percentage of missing values per column.
- Dual Interface:
  - CLI: Fast, terminal-based profiling using Typer.
  - Web GUI: User-friendly dashboard built with Streamlit.
- Exportable Reports: Generates professional summaries in both JSON and Markdown formats.
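The type inference, statistics, and missing-value features listed above could be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the names `is_missing`, `infer_type`, and `profile_column`, the set of tokens treated as missing, and the set of tokens treated as boolean are all assumptions made for the example.

```python
import statistics

# Assumed conventions for this sketch; the real tool may use different tokens.
MISSING_TOKENS = {"", "na", "n/a", "null", "none", "nan"}
BOOLEAN_TOKENS = {"true", "false", "yes", "no"}


def is_missing(value):
    """Treat None and a few common placeholder strings as missing."""
    return value is None or value.strip().lower() in MISSING_TOKENS


def infer_type(values):
    """Return 'boolean', 'number', or 'text' based on the non-missing values."""
    present = [v.strip() for v in values if not is_missing(v)]
    if not present:
        return "text"
    if all(v.lower() in BOOLEAN_TOKENS for v in present):
        return "boolean"
    try:
        for v in present:
            float(v)
    except ValueError:
        return "text"
    return "number"


def profile_column(values):
    """Summarize one column given its raw string values."""
    total = len(values)
    present = [v.strip() for v in values if not is_missing(v)]
    missing = total - len(present)
    profile = {
        "type": infer_type(values),
        "missing": missing,
        "missing_pct": round(100 * missing / total, 2) if total else 0.0,
    }
    if profile["type"] == "number":
        numbers = [float(v) for v in present]
        profile.update(
            min=min(numbers),
            max=max(numbers),
            mean=statistics.mean(numbers),
            median=statistics.median(numbers),
        )
    return profile


example = profile_column(["10", "20", "", "40"])
print(example["type"], example["missing"], example["missing_pct"])
```

In this sketch a column counts as numeric only if every non-missing value parses with `float`, so a single stray label makes the whole column text. A real profiler might prefer a tolerance threshold instead.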
🛠️ Tech Stack
- Language: Python 3.12+
- Environment Management: uv (fast Python package installer)
- CLI Framework: Typer
🚀 Getting Started
To run this project on your local machine, follow these steps in PowerShell:
- Clone the Repository
git clone https://github.com/iAbdullah/Bootcamp.git
cd Bootcamp/csv-profiler
- Install Requirements
Make sure you have Python installed, then install the necessary library:
pip install streamlit
- Run the App
Start the application using the following command:
streamlit run oop.py
🛠 How to Use
Upload: Use the sidebar to upload your CSV file.
Preview: View your raw data in the "Data Preview" section.
Analyze: Click the "Generate Report" button.
Export: Download your final report as a Markdown or JSON file.
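To illustrate the export step, here is a hedged sketch of how a finished profile might be rendered as JSON and as Markdown. The report layout (a `rows` count plus a `columns` mapping) and the function names `render_json` and `render_markdown` are hypothetical; the actual report produced by the app may be structured differently.

```python
import json


def render_json(report):
    """Serialize the report dictionary as indented JSON text."""
    return json.dumps(report, indent=2, ensure_ascii=False)


def render_markdown(report):
    """Render the report as a small Markdown document with one table row per column."""
    lines = [
        "# CSV Profile Report",
        "",
        f"Rows: {report['rows']}",
        "",
        "| Column | Type | Missing | Missing % |",
        "| --- | --- | --- | --- |",
    ]
    for name, col in report["columns"].items():
        lines.append(
            f"| {name} | {col['type']} | {col['missing']} | {col['missing_pct']} |"
        )
    return "\n".join(lines) + "\n"


# Hypothetical report used only to show the output shape.
sample_report = {
    "rows": 3,
    "columns": {
        "age": {"type": "number", "missing": 1, "missing_pct": 33.33},
        "name": {"type": "text", "missing": 0, "missing_pct": 0.0},
    },
}
print(render_markdown(sample_report))
```

Both functions return plain strings, so the same output can be written to disk by a CLI or offered as a download in a web interface.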
Data Handling: Python's built-in csv and json modules, with pathlib for file system management.
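As a rough sketch of how these standard-library modules fit together, the example below reads a CSV file into dictionaries and writes a JSON file. The helper names `read_rows` and `write_json` and the UTF-8 encoding are assumptions for illustration, not the project's confirmed implementation.

```python
import csv
import json
from pathlib import Path


def read_rows(path):
    """Read a CSV file into a list of dicts keyed by the header row."""
    # newline="" lets the csv module handle line endings itself.
    with Path(path).open(newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def write_json(data, path):
    """Write data as indented JSON, creating parent folders if needed."""
    out = Path(path)
    out.parent.mkdir(parents=True, exist_ok=True)
    out.write_text(json.dumps(data, indent=2), encoding="utf-8")
    return out
```

For example, `rows = read_rows("data.csv")` followed by `write_json(rows, "outputs/rows.json")` would copy the file's contents into JSON; both file names here are placeholders.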
