Browse Projects
Browse projects tagged with "data-pipeline".
Android App Store Analysis
DataAnalyses 10,000+ Google Play Store apps with pandas and Plotly to answer real market questions: which categories are most competitive, how much a paid app earns, and how many downloads you give up by charging.
CNN Food Classifier
DataI built a convolutional neural network that classifies food images into 101 categories using two-stage transfer learning with MobileNetV2. Upload any food photo and get instant top-3 predictions with confidence scores — live on Hugging Face Spaces.
Data Preprocessing Pipeline — NYC Airbnb
DataA production-grade, class-based data preprocessing pipeline built in Python on the NYC Airbnb Open Dataset (48,895 listings). Handles missing values, outliers, duplicates, encoding, and scaling — then generates five before/after diagnostic visualisations.
LEGO Dataset Analysis
DataLEGO nearly went bankrupt in 1998. This project joins six relational tables across 15,710 sets to measure exactly what changed — licensed share grew from 0% to 31.5%, average complexity 8×, and minifigure density 57% from 1970 to 2010.
Programming Language Workforce Strategy — Data Analysis
DataStack Overflow lost 97.7% of its post volume since 2016 — and its momentum now anti-correlates with hiring demand. This project proves the signal is broken, then builds a four-source replacement index to answer which languages to hire for.
Zillow Web Scraper & Form Bot
SoftwareAutomated data-entry pipeline: scrapes rental listings from a Zillow clone with BeautifulSoup, then drives Chrome via undetected-chromedriver to auto-fill and submit a Google Form for every address, price, and link found.
No projects found
No projects match the current filter.