Skip to content

d9w/Spark.jl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spark.jl

Not under current development

This repository was part of a project for MIT's Distributed Systems class. It is not under current development. There is another implementation of Spark in Julia which seems to have active attention:

https://github.com/dfdx/Spark.jl

Usage

A basic implementation of Apache Spark for Julia. Examples of use can be found in the test directory. A user will specify workers in a JSON file like default_workers.json and call julia start_worker.jl on them, specifying the correct port.

Locally, the user can then start a Spark.Master instance and use it to create and manipulate resilient distributed datasets:

using Spark
master = Spark.Master("127.0.0.1", 3333)
# fill in master.workers
Spark.load(master, "default_workers.json")
# start master listener
Spark.initserver(master)

# Create and manipulate RDDs
rdd = Spark.input(master, "RDDA.txt", "int_reader")
filtered_rdd = Spark.filter(master, rdd, "number_filter")
results = Spark.collect(master, filtered_rdd)

About

Spark in Julia for MIT 6.824

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages