Distributed-Geospatial-computation

We run spatial operations on a dataset distributed over HDFS, using a Spark cluster that consists of one master and two worker nodes. We use the New York City taxi dataset to compute spatial statistics and use them to identify the statistically significant cells.

For the Phase 1 description, see CSE512-PhaseI.pdf.

YouTube demo for Phase 1: https://www.youtube.com/watch?v=oe5NLMuJWAI

For the Phase 2 description, see phase2-requirement.pdf.

We added a new function to the GeoSpark library that performs a naive Cartesian product, and compared its performance with the library's optimized API calls. The detailed analysis is in Phase2.pdf.
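As a rough illustration of what a naive Cartesian product means for a spatial query, the sketch below tests every rectangle against every point in plain Scala. It is not the function added to GeoSpark: `Point`, `Rect`, and `naiveJoin` are hypothetical names chosen for this example, and it assumes the operation is a rectangle-contains-point join. On Spark, the same all-pairs step could be expressed with `RDD.cartesian` followed by a filter.

```scala
// Plain-Scala sketch of a naive Cartesian-product spatial join.
// Point, Rect, and naiveJoin are hypothetical names, not GeoSpark API.
final case class Point(x: Double, y: Double)

final case class Rect(minX: Double, minY: Double, maxX: Double, maxY: Double) {
  // Boundary-inclusive containment test.
  def contains(p: Point): Boolean =
    p.x >= minX && p.x <= maxX && p.y >= minY && p.y <= maxY
}

object NaiveJoin {
  // Compare every rectangle with every point: |rects| * |points| checks.
  // Rectangles that contain no point are absent from the result.
  def naiveJoin(rects: Seq[Rect], points: Seq[Point]): Map[Rect, Seq[Point]] =
    (for {
      r <- rects
      p <- points
      if r.contains(p)
    } yield (r, p))
      .groupBy(_._1)
      .map { case (r, pairs) => r -> pairs.map(_._2) }
}
```

The quadratic number of containment checks is what makes this a useful baseline: an indexed or partitioned join skips most of the pairs that this version evaluates.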

For the Phase 3 description, see phase3-requirement.pdf.

We use the New York City taxi dataset to select the relevant points within the envelope, build the neighborhood of each cell in a space-time cube indexed by latitude, longitude, and date (each cell has 26 neighbors), and compute the statistically significant cells with the Getis-Ord statistic (hot spot analysis).
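The neighborhood and the statistic described above can be sketched in plain Scala as follows. This is a minimal illustration, not the project's Spark code: `HotSpot`, `neighborhood`, and `giStar` are hypothetical names, the cube is addressed by integer cell indices, and the weights are assumed to be binary (1 for the cell itself and each of its up to 26 neighbors, 0 elsewhere). Under that assumption the Getis-Ord Gi* z-score of a cell is the neighborhood sum minus its expected value, divided by the standard deviation term.

```scala
object HotSpot {
  // A cell of the space-time cube: (x index, y index, time step).
  type Cell = (Int, Int, Int)

  // The cell itself plus its up to 26 neighbors, clipped to the
  // cube bounds [0, nx) x [0, ny) x [0, nt).
  def neighborhood(c: Cell, nx: Int, ny: Int, nt: Int): Seq[Cell] =
    for {
      dx <- -1 to 1
      dy <- -1 to 1
      dt <- -1 to 1
      x = c._1 + dx
      y = c._2 + dy
      t = c._3 + dt
      if x >= 0 && x < nx && y >= 0 && y < ny && t >= 0 && t < nt
    } yield (x, y, t)

  // Gi* z-score of one cell under binary weights.
  // counts holds only non-empty cells; missing cells count as 0.
  // The result is NaN or infinite when all cells are equal (std == 0)
  // or when the neighborhood covers the whole cube.
  def giStar(c: Cell, counts: Map[Cell, Long], nx: Int, ny: Int, nt: Int): Double = {
    val n = nx.toDouble * ny * nt
    val mean = counts.values.sum / n
    val std = math.sqrt(counts.values.map(v => v.toDouble * v).sum / n - mean * mean)
    val nbrs = neighborhood(c, nx, ny, nt)
    val w = nbrs.size.toDouble // sum of weights; equals sum of squared weights
    val weightedSum = nbrs.map(counts.getOrElse(_, 0L)).sum.toDouble
    (weightedSum - mean * w) / (std * math.sqrt((n * w - w * w) / (n - 1)))
  }
}
```

Cells with a large positive score have more pickups in and around them than the cube-wide average would predict, so ranking cells by this score is one way to pick out hot spots.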

Full report: Final Report.pdf

Repository contents:

Phase 1
Phase 2
Phase 3
Phase 4
1.scala
4b.scala
4c.scala
CSE512-PhaseI.pdf
Final Report.pdf
JoinQuerya.scala
KNN3b.scala
Phase1_requirement.pdf
Phase2.pdf
Phase2_Report.pdf
Phase4_requirement.pdf
QueryPointRDD.scala
README.md
RTreePointRDD.scala
SpatialKNNa.scala
arealm.csv
geospark_dns.jar
phase2-requirement.pdf
phase3-requirement.pdf
zcta510.csv

narendrakumar92/Distributed-Geospatial-computation
