Project 06 · Medical AI
💊 Diabetes Risk Predictor
Predict whether a patient is at risk of diabetes using clinical measurements such as glucose, BMI, and age.
- Handle missing values coded as zeros (physiologically impossible readings)
- Perform feature importance analysis to identify top clinical predictors
- Train and compare Logistic Regression, Decision Tree, and Random Forest
- Justify why Recall matters more than Precision in medical diagnosis
Healthcare AIClinical DataFeature Importance
Project 07 · Time Series
📈 Sales Forecasting
Predict future weekly sales for a retail chain using historical transaction data and seasonal patterns.
- Engineer time features: day of week, month, is_holiday, rolling averages
- Train Random Forest Regressor on the engineered time features
- Evaluate with MAE and MAPE (Mean Absolute Percentage Error)
- Visualise predicted vs actual sales on a time-series line chart
Time SeriesFeature Eng.RetailForecasting
Project 08 · Recommender
🎬 Movie Recommendation Engine
Build a content-based recommendation system that suggests similar movies based on genre, cast, and plot keywords.
- Combine genre, cast, director and keyword text into one feature string
- Compute TF-IDF vectors and cosine similarity between all movie pairs
- Build a function: input any movie title, return top 10 similar movies
- Extend with collaborative filtering using user rating matrix
RecommenderCosine SimilarityTF-IDFContent-Based
Project 09 · Computer Vision
📷 Handwritten Digit Classifier
Classify handwritten digits 0-9 from 28x28 pixel images using the MNIST dataset — the Hello World of deep learning.
- Flatten images from (28,28) to (784,) feature vectors, normalise 0-1
- Train Random Forest and compare to a simple Neural Network with sklearn
- Display a confusion matrix heatmap to see which digits are confused
- Visualise misclassified samples and explain why the model failed
Computer VisionMNISTImage DataNeural Net
Project 10 · Regression
⚡ Energy Consumption Predictor
Predict hourly electricity demand for a city using weather, time, and economic activity data.
- Engineer cyclical time features: encode hour and month as sin/cos pairs
- Handle weather outliers and missing sensor readings
- Compare Gradient Boosting vs Linear Regression for energy forecasting
- Report feature importances and explain which factors drive consumption peaks
Energy AICyclical FeaturesGradient BoostingRegression