PySpark is an interface for Apache Spark in Python. It not only allows you to write Spark applications using Python APIs, but also provides the PySpark shell for interactively analyzing your data in a distributed environment. PySpark supports most of Sparkโs features such as Spark SQL, DataFrame, Streaming, MLlib (Machine Learning) and Spark Core.
-
Notifications
You must be signed in to change notification settings - Fork 0
phricardorj/pyspark-study
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
ย | ย | |||
ย | ย | |||
ย | ย | |||
ย | ย | |||
ย | ย | |||
ย | ย | |||
ย | ย | |||
ย | ย | |||
ย | ย | |||
Repository files navigation
About
๐ | My PYSPARK studies. PySpark is an interface for Apache Spark in Python.
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published