This project is a proof of concept (POC) of a Spark Scala application that uses S3-compatible storage (Minio) instead of HDFS or YARN.
Preparation
- Start a Spark standalone cluster.
- Start a Minio cluster for S3 storage.
- Create an S3 access key and secret key on Minio.
- Create an S3 bucket on Minio.
- Start a simple HTTP server with Python to serve the target JAR files, for example: `python3 -m http.server --bind 0.0.0.0 3000`.
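For the cluster to reach Minio, Spark must be pointed at it through the Hadoop S3A connector. A minimal sketch of the relevant settings, assuming Minio listens at `http://minio:9000` over plain HTTP and using the access/secret key created above (the endpoint and key placeholders are assumptions, not values from this repo):

```
# spark-defaults.conf (or pass each line via --conf to spark-submit)
spark.hadoop.fs.s3a.endpoint               http://minio:9000
spark.hadoop.fs.s3a.access.key             <S3_ACCESS_KEY>
spark.hadoop.fs.s3a.secret.key             <S3_SECRET_KEY>
# Minio serves buckets at the path level, not as virtual-host subdomains
spark.hadoop.fs.s3a.path.style.access      true
# Only for a plain-HTTP Minio endpoint; keep SSL on if Minio uses TLS
spark.hadoop.fs.s3a.connection.ssl.enabled false
```

With these in place, the application can address the bucket with `s3a://` URIs (e.g. `s3a://my-bucket/path`).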
Build and deploy
Use build.sh, passing the main class with -C and, optionally, the deploy mode with -D:
./build.sh -C "sample.DataProcessExample" -D cluster
./build.sh -C "sample.SparkPi"
./build.sh -C "sample.WordCount"
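The internals of build.sh are not shown here, but judging from its flags, the first invocation above corresponds roughly to a spark-submit call like the following (the master URL, host, and JAR name are placeholders, not values from this repo):

```
spark-submit \
  --master spark://<spark-master>:7077 \
  --deploy-mode cluster \
  --class sample.DataProcessExample \
  http://<host>:3000/<project>.jar
```

In cluster deploy mode on a standalone cluster, the driver runs on a worker node and fetches the application JAR itself, which is why the preparation step starts an HTTP server to share the JAR.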
[spark standalone cluster] --read/write--> [minio-cluster]
                           --read/write--> [kafka-cluster]
                           --read/write--> [jdbc]
                           --read/write--> [redis]
                           --read/write--> [elasticsearch]