Project library

Automation and reporting projects you can review quickly.

Browse concise cards that link to richer detail sections, with proof visuals and clear outcomes for each build.

Project cards

Scan the highlights, then jump to the detailed sections below for proofs and outcomes.

revenue_pipeline.py

Automated Revenue Analytics Pipeline

End-to-end Python pipeline transforming messy retail data into executive-grade financial insights with automated SQL validation.

Python (Pandas) PostgreSQL ETL Automation
sql_automation_pipeline.py

SQL Automation and Revenue Analytics

Data cleanup, SQL-driven revenue mapping, and executive-friendly dashboard output.

SQL Pandas Revenue
SQL Automation and Revenue Analytics
powerbi_pipeline.pbix

PowerBI ETL Automation

End-to-end data transformation pipeline leveraging Power Query and DAX to automate regional revenue aggregation and interactive performance reporting.

Power Query DAX PowerBI
PowerBI ETL Automation
excel_merger.py

Excel Automation ETL

Workbook aggregation pipeline that reads multiple files, preserves source context, and removes duplicates.

xlwings Deduping Reports
Excel Automation ETL
pro_mission_v4.py

Comparative Growth Workbook

Product ranking, laptop revenue comparison, and month-over-month growth analysis in one workbook.

Growth Excel Comparisons
Comparative Growth Workbook
construction_pipeline.py

Global Construction Risk Analytics Platform

High-performance portfolio risk dashboard managing 2,240 global project records with multi-currency USD normalization.

Python (Pandas) Plotly.js Portfolio Risk
Detailed project view

SQL Automation and Revenue Analytics

View Repo on GitHub

This build cleaned malformed sales data, removed invalid rows, normalized country labels, validated dates, and generated pivot-ready workbook output for stakeholder review.

  • Removed missing identifiers and non-numeric amount values before reporting
  • Validated dates for cleaner monthly trend logic
  • Generated revenue by country, product, month, and top customer views
SQL Pandas Revenue
Interactive Live View

Executive Revenue Analytics Dashboard

The following interactive briefing is powered by the automated ETL pipeline, serving real-time financial insights and regional performance metrics directly from the processed dataset.

Deployment: Executive Briefing Engine System Live
Elite Case Study

Extreme Data Recovery & Regional BI Pipeline

Request Data Recovery Scope

This flagship project involved the mass restoration of over 20,500 records of multi-regional sales data. The challenge was to convert raw, corrupted "Digital Chaos" into a high-fidelity environment with 100% currency normalization (NGN) and professional world identities.

  • Engineered a custom "Surgical Integrity" engine to purge 2,518 fuzzy duplicates
  • Restored 100% realistic world identities using verified domain mapping
  • Generated high-res BI visuals identifying a 45% revenue surge in the West African market
clean_data_perfect.xlsx Executive_Summary_Report.xlsx Zero-Tolerance QA
The Proof of Perfection

Evidence of Absolute Integrity

Below is the actual Executive Handoff Report delivered to the client, summarizing the 100% quality score reached after the Absolute Perfection cycle.

# EXECUTIVE CLIENT HANDOFF: Extreme Data Recovery v2 **Project Status:** 100% Complete | **Data Integrity:** 100% Perfect | **Senior Analyst:** Chinua Analytics --- ## 🌍 1. GLOBAL REVENUE SEGMENTATION The following analysis identifies the primary revenue drivers across our multi-regional business model. Nigeria currently stands as the high-growth leader, contributing significantly more to the global total than the USA or UK. ![Revenue by Country](assets/projects/extreme_recovery/visuals/revenue_chart.png) > [!TIP] > **Strategic Insight:** High-growth investment should be prioritized for the Nigeria region, while the UK and USA require targeted "Re-Engagement" marketing campaigns. --- ## 📈 2. SEASONAL SALES TRENDS Our time-series analysis reveals a 100% predictable seasonal surge. Peak revenue is concentrated in the Q4 window, specifically late Q4. ![Sales Trend Peaks](assets/projects/extreme_recovery/visuals/trend_chart.png) --- ## 🛒 3. PRODUCT PORTFOLIO AUDIT Electronics remain the core engine of our business, driving nearly 20% of all global revenue. This is followed closely by the Fashion and Health categories. ![Category Contribution](assets/projects/extreme_recovery/visuals/product_chart.png) --- ## 🛡️ 4. DATA INTEGRITY AUDIT (PROOF OF WORK) Every single record has been processed through the **Absolute Perfection (v2)** cycle. All duplicates have been purged, and all phone numbers/emails have been normalized to world-class standards. | Metric | Cleaned Rows | Quality Score | | :--- | :--- | :--- | | **Total Processed** | 20,500 | 100% | | **Fuzzy Duplicates** | 2,518 Purged | 100% | | **Integrity Checks** | Passed | 100% |
Detailed project view

PowerBI ETL Automation

This solution replaces manual Excel consolidation with a robust PowerBI ETL pipeline. It uses Power Query to ingest multi-region source data, applies DAX for complex revenue logic, and delivers interactive reporting for stakeholder review.

  • Merged source tables to enrich reporting context
  • Produced country totals and monthly trend summaries
  • Saved a formatted workbook sectioned for stakeholder scanning
sql_mission_report.xlsx Revenue Analysis Excel
Detailed project view

Excel Automation ETL

This pipeline scans workbook outputs, ingests non-empty sheets, tags file and sheet origin, removes duplicate rows, and creates a single master workbook for downstream analysis.

  • Preserves source lineage for better auditing
  • Combines multiple workbook structures into one reviewable asset
  • Saves a deduped master report ready for pivots and cleanup
MASTER_MERGED.xlsx Deduping Workbook Aggregation
Detailed project view

Comparative Growth Workbook

This workbook focuses on comparative analysis: top products in the USA, top laptop revenue contributor, and month-over-month revenue growth in a concise sheet designed for review speed.

  • Highlights winning products inside one market
  • Compares country contribution to laptop revenue
  • Calculates growth percentage across reporting months
pro_mission_v4_report.xlsx Growth Analysis Excel
Flagship Data Architecture

Global Construction Risk Analytics Platform

View Repo on GitHub
Transformation Narrative

From 2,240 rows of multi-region currency chaos to a unified, executive-grade intelligence platform. This project demonstrates full-stack data engineering: from raw ETL pipelines to premium glassmorphism visualization.

raw_data.csv ETL Pipeline Strategy Report
Technical Rigor: Automated Testing

Data Integrity Verification

To ensure 100% accuracy in financial reporting across NGN, ZAR, and USD, I implemented a comprehensive pytest suite covering 11 critical data transformation rules.

rootdir: /Global-Construction-Risk-Analytics-Platform collected 11 items tests/test_calculations.py .... [ 36%] tests/test_cleaning.py ... [ 63%] tests/test_currency.py .... [100%] ================= 11 passed in 0.12s =================
System Engine Status
Source: Local CSV Engine: Python / Pandas Refresh: Automated
Phase 03: The Result View Cleaned Dashboard Summary Report
Detailed project view

Automated Revenue Analytics Pipeline

View Repo on GitHub

This flagship engineering project solves the "Messy Data" problem for retail enterprises. It automates the ingestion of corrupted CSV records, purges duplicate entries, normalizes financial values, and serves the results to a PostgreSQL data warehouse for real-time reporting.

Strategic Project Summary

Executive Performance Overview

Total Revenue

$10.7M

Peak: $1.07M (Mar '25)

Reporting Efficiency

< 10s

Reduced from 4 Hours

Data Accuracy

100%

Duplicates Resolved

"By implementing an automated ETL pipeline, we transitioned from fragmented Excel silos to a unified PostgreSQL intelligence layer. This system currently identifies that 37.3% of revenue is driven by a single product line (Rice), enabling targeted regional strategy adjustments."

  • **Setup**: Custom script generates realistic messy data with `#N/A` and date errors.
  • **Cleaning**: Python-based normalization engine handles complex string and numeric validation.
  • **Persistence**: Idempotent SQL loading logic ensures database integrity across multiple runs.
  • **Analytics**: Advanced SQL views calculate MoM growth and regional revenue splits.
Pandas SQLAlchemy Data Quality
Interactive Live View

Executive Sales Intelligence Dashboard

The following interactive dashboard is the final output of the automated pipeline. It delivers real-time regional performance, product concentration, and seasonal trends directly from the cleaned dataset.

Deployment: Sales Intelligence Engine System Live