⚡ Designing scalable pipelines • Automating data workflows • Powering AI & Analytics
I am a Data Engineer building scalable data pipelines, analytics, and AI solutions across renewable energy, SMB, and public health.
I develop production-ready platforms using Spark, dbt, and Dagster, model infectious disease transmission for policy impact, and design analytics pipelines for EV products and battery energy storage systems.
My work bridges complex data with real-world decisions, accelerating sustainable adoption and enabling actionable insights.
Languages |
|
Data Engineering |
|
Cloud Platform |
|
Databases / Storage |
|
BI / Dashboards |
|
MLOps / ML |
|
Development Tools |
|
- 🚀 Building production-ready data platforms with Spark, dbt, Dagster, Superset, and Streamlit, integrating analytics and Generative AI.
- 🌏 Simulating infectious disease transmission across Pacific Island travel networks to inform public health decisions.
- 📦 Developing modular data pipelines for real-time vector search, RAG (Retrieval-Augmented Generation), and automated reporting.
- 🧠 Experimenting with lightweight LLMs (Mistral, Ollama) to create domain-specific assistants with secure local inference.
- 📊 Designing dashboards and analytics pipelines to analyze EV product data, sales conversions, and model configuration preferences.
Thanks for stopping by! ⭐ Feel free to check out my projects below.