Flight-interruption-prediction

author: Jesus Antonanzas Acero, Alex Carrillo Alza
version: "1.0"
email: "[email protected], [email protected]"
info: BDA, GCED, Big Data Analytics project
date: 16/12/2019

The project contains five '.py' scripts:

  • 'config.py': configures the Spark environment when necessary (a minimal sketch is shown below)
  • 'load_into_hdfs.py': loads the sensor CSVs into HDFS in Avro format
  • 'data_management.py': creates the training data matrix
  • 'data_analysis.py': trains a decision tree classifier on the training data
  • 'data_classifier.py': predicts labels for new flight observations

These scripts must be placed in the same directory as the 'resources' directory included in the code skeleton, so that the local reading of the sensor data (CSVs) works.
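For reference, a minimal sketch of what 'config.py' could look like follows. It only creates a SparkSession; the app name, the local master and the 'get_spark' helper name are assumptions, not necessarily the project's exact code.

    # config.py -- minimal sketch; the real script may set additional options.
    from pyspark.sql import SparkSession

    def get_spark(app_name="flight-interruption-prediction"):
        """Create (or reuse) a SparkSession. The local[*] master is an assumption."""
        return (SparkSession.builder
                .appName(app_name)
                .master("local[*]")
                .getOrCreate())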

'load_into_hdfs.py' must be executed first if you want to use HDFS. Note that the HDFS path into which the files are loaded has to be changed explicitly, as do the reading paths in 'data_management.py' and 'data_classifier.py', when using this option.
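As an illustration of this step, here is a hedged sketch of loading the sensor CSVs into HDFS as Avro. The HDFS URI and the CSV location are placeholders to adapt, and it assumes Spark 2.4+ with the external spark-avro package available (e.g. via spark-submit --packages org.apache.spark:spark-avro_2.11:2.4.4).

    # load_into_hdfs.py -- minimal sketch, not the project's exact code.
    from pyspark.sql import SparkSession

    HDFS_PATH = "hdfs://localhost:9000/user/bda/sensor_data"   # change to your HDFS path
    LOCAL_CSV = "resources/*.csv"                              # location of the sensor CSVs (assumed)

    spark = SparkSession.builder.getOrCreate()
    sensors = spark.read.option("header", "true").csv(LOCAL_CSV)
    sensors.write.mode("overwrite").format("avro").save(HDFS_PATH)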

If nothing is specified in 'data_management.py' or 'data_classifier.py', however, the CSVs are read from the local filesystem.
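The read-side switch could look roughly like the sketch below. This is a sketch only: the flag name, the paths and the stored formats are assumptions.

    # Reading the sensor data in 'data_management.py' / 'data_classifier.py' -- sketch only.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.getOrCreate()
    USE_HDFS = False   # default: read the CSVs from the local 'resources' directory

    if USE_HDFS:
        # Avro files previously written by 'load_into_hdfs.py'; the path must match it.
        sensors = spark.read.format("avro").load("hdfs://localhost:9000/user/bda/sensor_data")
    else:
        sensors = spark.read.option("header", "true").csv("resources/*.csv")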

The order of execution is then (optionally preceded by 'load_into_hdfs.py'):

  1. 'data_management.py'
  2. 'data_analysis.py' (a training sketch is shown after this list)
  3. 'data_classifier.py'
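As an illustration of step 2, a minimal sketch of training a decision tree classifier with pyspark.ml is shown below. The feature and label column names, the on-disk format of 'data_matrix' and the split ratio are assumptions, not the project's exact schema.

    # data_analysis.py -- minimal sketch of the training step.
    from pyspark.sql import SparkSession
    from pyspark.ml import Pipeline
    from pyspark.ml.feature import VectorAssembler
    from pyspark.ml.classification import DecisionTreeClassifier

    spark = SparkSession.builder.getOrCreate()
    data_matrix = spark.read.parquet("data_matrix")      # written by 'data_management.py' (format assumed)

    assembler = VectorAssembler(
        inputCols=["sensor_avg", "flight_hours", "flight_cycles", "delayed_minutes"],  # assumed features
        outputCol="features")
    tree = DecisionTreeClassifier(labelCol="label", featuresCol="features")

    train, validation = data_matrix.randomSplit([0.7, 0.3], seed=42)
    model = Pipeline(stages=[assembler, tree]).fit(train)
    model.write().overwrite().save("model")              # the 'model' object listed below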

Note that these scripts will write three objects:

  1. 'data_matrix'
  2. 'model'
  3. 'test_matrix'
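'data_classifier.py' then loads the saved 'model' and applies it to 'test_matrix'. A minimal sketch is shown below; the paths and the stored format of 'test_matrix' are assumptions.

    # data_classifier.py -- minimal sketch of the prediction step.
    from pyspark.sql import SparkSession
    from pyspark.ml import PipelineModel

    spark = SparkSession.builder.getOrCreate()
    model = PipelineModel.load("model")                  # pipeline trained by 'data_analysis.py'
    test_matrix = spark.read.parquet("test_matrix")      # new flight observations (format assumed)

    predictions = model.transform(test_matrix)
    predictions.select("prediction").show()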

About

Predictive analysis with PySpark ⚡️
