vim89

Follow

💭

I may be slow to respond.

Vitthal Mirji vim89

💭

I may be slow to respond.

Follow

Software & Data Engineer Lives in Mumbai, India Mumbai-Pune-Goa-Mumbai Love cars, I enjoy driving my @Ford 1.5 TDCi Watch collector:Own over 3 dozen

30 followers · 78 following

Achievements

Achievements

vim89/README.md

Hi 👋, I'm Vitthal

Software/Data Engineer, Data Architect, teacher from India

🌱 I’m currently learning Functional programming in Scala/Kotlin, Designing Data Engineering Applications, Google cloud & Databricks
📝 I regularly write articles on Vitthal Mirji
👀 Check out my libraries published on Maven Central Repository:
- Datapipelines Essentials 🎸, a Best practices APIs/libraries for data engineering & data quality
- DATA ENGINEERING & WAREHOUSING FUNCTIONAL PROGRAMMING EXTENSIONS 🌩️, Apply Engineering methods on data, taking influence of functional programming & chain your algorithm steps with power of Scala
- SHC - BigTable-HBase Connector 🐛, a Google BigTable with namespaces & name descriptors - bridging the gap
- Nested Complex Data typed Data Parsing in Spark 🎭, a prototype library implementing to Derive new attributes from XML when you have XPATH transformations. Accelerate boring stuff in #Pyspark & Python Also check out how to handle multi line XML
- Vitthal Mirji Blog 🧪, My blog
💬 Ask me about Data Engineering, Building Frameworks, Low-level Design Patterns, Data Warehousing, Java, Scala, Cats (still learning), Akka, Python, SQL and Kotlin
📄 Know about my experiences https://www.linkedin.com/in/vitthal10/

Connect with me:

Languages and Tools:

Pinned Loading

flowforge flowforge Public

Let's be honest - most data pipeline frameworks treat types as suggestions. Config files are strings. Schemas are "validated" at runtime. Data quality is an afterthought. So, let's do differently

Scala
datapipelines-essentials-python datapipelines-essentials-python Public

Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transform…

Python 55 40
llm4s/llm4s llm4s/llm4s Public

Agentic and LLM Programming in Scala

Scala 158 33
toon4s toon4s Public

toon4s: Token-Oriented Object Notation for JVM

Scala 21 2
dataengineering-savvy dataengineering-savvy Public

Scala 1
cask cask Public

Forked from com-lihaoyi/cask

Cask: a Scala HTTP micro-framework. Cask makes it easy to set up a website, backend server, or REST API using Scala

Scala