- 🌱 I’m currently learning functional programming in Scala/Kotlin, designing data engineering applications, Google Cloud, and Databricks
- 📝 I regularly write articles on Vitthal Mirji
- 👀 Check out my libraries published on Maven Central Repository:
  - Datapipelines Essentials 🎸: best-practice APIs and libraries for data engineering and data quality
  - DATA ENGINEERING & WAREHOUSING FUNCTIONAL PROGRAMMING EXTENSIONS 🌩️: apply engineering methods to data, drawing on functional programming, and chain your algorithm steps with the power of Scala
  - SHC - BigTable-HBase Connector 🐛: Google Bigtable with namespaces and name descriptors, bridging the gap
  - Nested Complex Data typed Data Parsing in Spark 🎭: a prototype library for deriving new attributes from XML when you have XPath transformations. Accelerate the boring stuff in #PySpark and Python. Also check out how to handle multi-line XML.
  - Vitthal Mirji Blog 🧪: my blog
- 💬 Ask me about Data Engineering, Machine Learning, Building Frameworks, Low-level Design Patterns, Data Warehousing, Java, Scala, Cats (still learning), Akka, Python, SQL, and Kotlin
- 📄 Learn about my experience: https://www.linkedin.com/in/vitthal10/
- Senior Data Engineer @walmart
- Mumbai, India
- (UTC +05:30)
- https://www.vitthalmirji.com
- @whoami_vim
- in/vitthal10
Pinned
- datapipelines-essentials-python: Simplified ETL process in Hadoop using Apache Spark. Has a complete ETL pipeline for a data lake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transform…
- Design-Patterns: Design patterns provide reusable solutions to commonly occurring software problems.
Java 1
- gcp-datalake: Reads various types of files from Google Cloud Storage, maps the data to Google Bigtable, and performs a bulk load into Google Bigtable.
Scala 1
- MapReduceExamples: Various MapReduce examples and algorithms, plus Hadoop batch processing using MapReduce.
Java 1
- hortonworks-spark/shc: The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase tables as an external data source or sink.