PySpark & Kafka - Streaming Fire Data from NASA's Satellite data - TERRA & AQUA

anil-gurbuz/Streaming-Application

Streaming-Application

Streaming Fire Data

This project implements multiple Apache Kafka producers that simulate real-time streaming of fire data. An Apache Spark Streaming client processes the stream and inserts the results into MongoDB. A real-time data visualisation is also provided.
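As a rough illustration of the producer side, here is a minimal sketch of how one producer might replay records as a simulated live feed. It assumes the kafka-python client; the topic name `fire-events`, the `producer_id` and `created_at` fields, and all function names are hypothetical and not taken from the notebooks.

```python
import json
import time
from datetime import datetime, timezone


def encode_event(record, producer_id):
    """Serialise one record as UTF-8 JSON, tagged with its producer and send time."""
    event = dict(record)
    event["producer_id"] = producer_id  # hypothetical field, for illustration only
    event["created_at"] = datetime.now(timezone.utc).isoformat()
    return json.dumps(event).encode("utf-8")


def simulate_stream(records, producer_id, send, delay_seconds=1.0):
    """Send records one at a time, pausing between them to mimic real-time arrival."""
    sent = 0
    for record in records:
        send(encode_event(record, producer_id))
        sent += 1
        time.sleep(delay_seconds)
    return sent


def run_kafka_producer(records, topic="fire-events", server="localhost:9092"):
    """Wire the simulation to Kafka. Requires the kafka-python package."""
    # Imported lazily so the helpers above work without Kafka installed.
    from kafka import KafkaProducer

    producer = KafkaProducer(bootstrap_servers=server)
    try:
        return simulate_stream(
            records, "producer_1", lambda payload: producer.send(topic, payload)
        )
    finally:
        producer.flush()
        producer.close()
```

Passing `send` as a callable keeps the pacing and serialisation logic independent of Kafka, so it can be tried with a plain list (`simulate_stream(rows, "p1", out.append, delay_seconds=0)`) before pointing it at a broker.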

System Architecture

[Figure: system architecture diagram]

User Guide

  1. Run the Kafka_Producer1,2,3.ipynb notebooks first to produce events on localhost:9092.
  2. Run Streaming_Application.ipynb to collect the real-time data, process and merge it, and save it to the local MongoDB server.
  3. Real_Time_Data_Visualisation.ipynb can then be run to visualise fire locations on a map and create an air temperature vs. time dashboard.
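
Step 2 ends by saving processed events to MongoDB. The sketch below shows one way a per-batch handler could decode Kafka message values and write them out. It is not the notebook's code: it assumes messages are UTF-8 JSON objects and that pymongo is used, the database and collection names (`fire_db`, `events`) are hypothetical, and the Spark Streaming wiring and merge logic are omitted.

```python
import json


def decode_events(raw_messages):
    """Turn raw message values (UTF-8 JSON bytes) into dicts, skipping malformed ones."""
    documents = []
    for raw in raw_messages:
        try:
            doc = json.loads(raw.decode("utf-8"))
        except (UnicodeDecodeError, json.JSONDecodeError):
            continue  # drop payloads that are not valid UTF-8 JSON
        if isinstance(doc, dict):  # MongoDB documents must be mappings
            documents.append(doc)
    return documents


def store_batch(raw_messages, collection):
    """Insert one batch into a collection; returns the number of documents written."""
    documents = decode_events(raw_messages)
    if documents:  # pymongo's insert_many rejects an empty list
        collection.insert_many(documents)
    return len(documents)


def local_collection(db_name="fire_db", collection_name="events"):
    """Connect to a local MongoDB server. Requires pymongo; names are hypothetical."""
    from pymongo import MongoClient

    return MongoClient("localhost", 27017)[db_name][collection_name]
```

A call such as `store_batch(batch, local_collection())` would write one batch. Any object with an `insert_many` method can stand in for the collection when trying the decoding logic without a database.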
