Projects

Browse Projects

Browse projects tagged with "data-pipeline".

Android App Store Analysis cover image

Android App Store Analysis

Data

Analyses 10,000+ Google Play Store apps with pandas and Plotly to answer real market questions: which categories are most competitive, how much a paid app earns, and how many downloads you give up by charging.

CNN Food Classifier cover image

CNN Food Classifier

Data

I built a convolutional neural network that classifies food images into 101 categories using two-stage transfer learning with MobileNetV2. Upload any food photo and get instant top-3 predictions with confidence scores — live on Hugging Face Spaces.

Data Preprocessing Pipeline — NYC Airbnb cover image

Data Preprocessing Pipeline — NYC Airbnb

Data

A production-grade, class-based data preprocessing pipeline built in Python on the NYC Airbnb Open Dataset (48,895 listings). Handles missing values, outliers, duplicates, encoding, and scaling — then generates five before/after diagnostic visualisations.

LEGO Dataset Analysis cover image

LEGO Dataset Analysis

Data

LEGO nearly went bankrupt in 1998. This project joins six relational tables across 15,710 sets to measure exactly what changed — licensed share grew from 0% to 31.5%, average complexity 8×, and minifigure density 57% from 1970 to 2010.

Programming Language Workforce Strategy — Data Analysis cover image

Programming Language Workforce Strategy — Data Analysis

Data

Stack Overflow lost 97.7% of its post volume since 2016 — and its momentum now anti-correlates with hiring demand. This project proves the signal is broken, then builds a four-source replacement index to answer which languages to hire for.

Zillow Web Scraper & Form Bot cover image

Zillow Web Scraper & Form Bot

Software

Automated data-entry pipeline: scrapes rental listings from a Zillow clone with BeautifulSoup, then drives Chrome via undetected-chromedriver to auto-fill and submit a Google Form for every address, price, and link found.

No projects found

No projects match the current filter.