Data Projects
Browse my data work tagged with "CI-CD".
Google Trends Time Series Analysis
Three economic signals (Tesla, Bitcoin, and US unemployment), each paired with Google Trends search data across mismatched time frequencies and aligned by resampling. The 2020 COVID shock compressed months of unemployment pattern into weeks.
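Aligning mismatched frequencies by resampling can be sketched as follows. This is a minimal illustration with made-up numbers, not the project's actual code: the series names, dates, and values are all hypothetical, and pandas is assumed as the tool.

```python
import pandas as pd

# Hypothetical daily series (stand-in for a price history).
days = pd.date_range("2020-01-01", "2020-03-31", freq="D")
price = pd.Series(range(len(days)), index=days, name="price", dtype="float64")

# Hypothetical monthly series (stand-in for search-interest scores).
months = pd.date_range("2020-01-01", periods=3, freq="MS")
search = pd.Series([40.0, 55.0, 100.0], index=months, name="search")

# Downsample the daily data to month-start labels, keeping each month's mean,
# so both series share the same index before they are compared.
price_monthly = price.resample("MS").mean()

aligned = pd.concat([price_monthly, search], axis=1)
print(aligned)
```

Taking the monthly mean is only one choice; the last observation of each month would be an equally reasonable way to downsample.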
LEGO Dataset Analysis
LEGO nearly went bankrupt in 1998. This project joins six relational tables across 15,710 sets to measure exactly what changed: from 1970 to 2010, licensed share grew from 0% to 31.5%, average complexity grew 8×, and minifigure density grew 57%.
Movie Budget Linear Regression Analysis
Across 5,384 films from 1915–2018, a linear regression on budget vs. worldwide gross gives a slope of 3.12 (R²=55.77%): every dollar of budget is associated with $3.12 in revenue, yet 37.28% of films still failed to recoup their costs.
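How a slope, an R², and a share of loss-making films come out of such data can be sketched on a toy sample. The five budget and gross values below are invented for illustration, and `np.polyfit` is an assumed stand-in for whatever regression tool the project uses.

```python
import numpy as np

# Hypothetical budgets and worldwide grosses, in $ millions.
budget = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
gross = np.array([0.5, 7.0, 8.0, 13.0, 15.0])

# Ordinary least-squares line: gross ≈ slope * budget + intercept.
slope, intercept = np.polyfit(budget, gross, deg=1)

# R² = 1 - (residual sum of squares / total sum of squares).
predicted = slope * budget + intercept
ss_res = np.sum((gross - predicted) ** 2)
ss_tot = np.sum((gross - gross.mean()) ** 2)
r_squared = 1.0 - ss_res / ss_tot

# Fraction of films whose gross fell short of their budget.
share_unprofitable = np.mean(gross < budget)

print(f"slope={slope:.2f}, R²={r_squared:.2%}, unprofitable={share_unprofitable:.0%}")
```

The slope describes the average association across all films, which is why a positive slope and a sizeable share of unprofitable films can coexist.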
NumPy NDArray Computation
NumPy treats images as numbers: a 768 × 1024 photograph is a 3D array of integers. This project works through ndarray operations from first principles: slicing, broadcasting, matrix multiplication, and pixel-level transforms like greyscale conversion and colour inversion.
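The two pixel-level transforms can be sketched on a random array standing in for a photograph. The image data here is synthetic, and the greyscale weights are the standard sRGB luminance coefficients, which is an assumption about the method rather than something the project description confirms.

```python
import numpy as np

# Hypothetical stand-in for a photo: 768 rows x 1024 columns x 3 colour channels.
rng = np.random.default_rng(0)
img = rng.integers(0, 256, size=(768, 1024, 3), dtype=np.uint8)

# Greyscale: scale to [0, 1], then take a weighted sum over the channel axis.
# The matrix product contracts the last axis (R, G, B) against the weights.
weights = np.array([0.2126, 0.7152, 0.0722])  # assumed sRGB luminance weights
grey = (img / 255.0) @ weights                # shape (768, 1024)

# Colour inversion: broadcasting subtracts every pixel value from 255.
inverted = 255 - img

print(img.shape, grey.shape, inverted.dtype)
```

Inverting twice returns the original array, which makes a quick sanity check for the broadcasting step.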
Semmelweis Handwashing Data Analysis
Re-examination of Dr Semmelweis's hospital records from Vienna General Hospital (1841–1849), published in 1861. Mandatory handwashing in June 1846 cut the average monthly death rate from 10.5% to 5.0%, a difference confirmed statistically at p ≈ 0.00025.