LexiMind — MTL for literary & academic text
Nov 2025 – present
A 272M-parameter encoder-decoder transformer I implemented from scratch (Pre-LN, RMSNorm, T5 relative position bias, FlashAttention, gated-GELU FFN, KV-cache) with explicit FLAN-T5-base weight mapping, jointly trained on abstractive summarization, topic classification, and multi-label emotion detection. I diagnosed negative transfer in naive multi-task learning (MTL) and designed two targeted interventions: learned attention pooling for the classification heads and temperature-based task sampling (α=0.5). Emotion-detection sample-averaged F1 rose from 0.199 to 0.352 (+77%), beating the single-task baseline; per-class threshold tuning lifted macro F1 to 0.290. The shared-encoder MTL model also beats a BERT-base topic-classification specialist (85.7% vs. 83.1%). Supporting work: 5-seed validation, bootstrap CIs, cross-task MD5 dedup, gradient-conflict diagnostics, and MLflow tracking. Targeting arXiv + EMNLP 2026.
PyTorch
Transformers
Multi-Task Learning
FlashAttention
MLflow
NLP
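For readers unfamiliar with temperature-based task sampling, here is a minimal sketch. It assumes the common form in which each task is drawn with probability proportional to its dataset size raised to α; the dataset sizes, task names, and function names below are hypothetical and chosen only for illustration, not taken from LexiMind itself.

```python
import random

def task_sampling_probs(sizes, alpha=0.5):
    """Return per-task probabilities p_i proportional to n_i ** alpha.

    alpha=1.0 samples proportionally to dataset size; alpha=0.0 samples
    uniformly; values in between flatten the distribution so small tasks
    are visited more often than their raw size would allow.
    """
    if not sizes:
        raise ValueError("sizes must not be empty")
    weights = {task: n ** alpha for task, n in sizes.items()}
    total = sum(weights.values())
    return {task: w / total for task, w in weights.items()}

def sample_task(probs, rng):
    """Draw one task name according to the given probabilities."""
    tasks = list(probs)
    return rng.choices(tasks, weights=[probs[t] for t in tasks], k=1)[0]

# Hypothetical dataset sizes (assumption, for illustration only).
sizes = {"summarization": 90_000, "topic": 10_000, "emotion": 2_500}

proportional = task_sampling_probs(sizes, alpha=1.0)
tempered = task_sampling_probs(sizes, alpha=0.5)

rng = random.Random(0)  # fixed seed so the draw sequence is repeatable
schedule = [sample_task(tempered, rng) for _ in range(8)]

for task in sizes:
    print(f"{task:14s} proportional={proportional[task]:.3f} tempered={tempered[task]:.3f}")
print(schedule)
```

With these made-up sizes, the smallest task's share of training steps rises from roughly 2% under proportional sampling to roughly 11% at α=0.5, which is the kind of rebalancing the intervention is meant to provide.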
LiminalML — structured interview prep for ML & SWE
2025 – present
A product I'm building: a structured, non-skippable path through 157+ ML and SWE interview topics, grounded in the candidate's own resume. Two tracks (ML/Research, 71 topics; SWE, 86 topics) run through a six-stage flow: big picture → intuition → math/internals → implementation → interview questions → retrieval check. Every session injects the user's real projects and STAR stories into explanations and comprehension checks, so prep is calibrated to engineers who've already shipped code, not beginners.
Product
LLM Orchestration
Curriculum Design
Full Stack
F1 race outcome predictor
2025
End-to-end ML pipeline over FastF1 and Ergast data across multiple seasons, with engineered features covering lap times, session deltas, and driver/team context. Random Forest regressor (MAE 1.78, RMSE 2.28 grid positions) plus two Logistic Regression classifiers for Q3 and top-ten qualification (93.8% and 92.6% accuracy). Interactive Streamlit dashboard for scenario planning and model diagnostics.
Python
Scikit-learn
Pandas
Streamlit
FastF1
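As a reading aid for the regression metrics above, this small sketch shows how MAE and RMSE are computed over predicted finishing positions. The positions are hypothetical, not project data, and the helper names are my own.

```python
import math

def mae(y_true, y_pred):
    """Mean absolute error: average size of the miss, in grid positions."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root mean squared error: like MAE, but large misses weigh more."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    return math.sqrt(mse)

# Hypothetical finishing positions (assumption, for illustration only).
actual = [1, 2, 3, 4, 5]
predicted = [2, 2, 5, 4, 1]

print(f"MAE:  {mae(actual, predicted):.2f}")
print(f"RMSE: {rmse(actual, predicted):.2f}")
```

RMSE is never smaller than MAE, and the gap widens when a few predictions miss by a lot, so reporting both gives a rough sense of how evenly the errors are spread.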
PlayAxis — full-stack sports & events platform
2025
Aggregates real-time schedules, scores, and event data from 3+ external APIs into a FastAPI backend with Pydantic validation and a PostgreSQL store. Dockerized with CI/CD through Netlify and Koyeb. A good exercise in real-world schema reconciliation and deploy ergonomics.
React
FastAPI
PostgreSQL
Docker
Python
IoT monitoring for fossil fuel power plants
2023 – 2024 · Triaxis Power
A Raspberry Pi 4 system with thermocouples and potentiometers to track pipe temperature and thermal expansion for predictive maintenance. I scoped, specified, and deployed it end-to-end — hardware selection, pipeline, and visualization — across plant sites.
Raspberry Pi
IoT
Python
Formula 1 statistics dashboard
2024
Tableau analysis of Formula 1 data from 1950–2024, cutting across drivers, constructors, and circuits to surface long-range performance trends.
Tableau
Data Viz
This site
2024 – present
Built by hand — HTML / CSS / JS with Lenis for smooth scrolling and GSAP for soft fade-ins. Hosted on Netlify.
HTML
CSS
JavaScript