Skip to content

data engineering class projects for log analytics using kafka and spark

Notifications You must be signed in to change notification settings

ZCai25/de_kafka_spark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🧐 About

This is a data engineering project for log analytics using kafka and spark

🏁 Getting Started

To get started and runing this program

Run kafka producer in terminal

  • python producer.py

Run spark streaming script to consume the log files

  • spark-submit --packages org.apache.spark:spark-sql-kafka-0-10_2.12:3.4.1 consumer_2gb.py

About

data engineering class projects for log analytics using kafka and spark

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages