Loading different types of dataset files using Flume and pyspark
Updated Jul 4, 2019 · Python
Capstone project in the Udacity Data Scientist Nanodegree program. We manipulate large, realistic datasets with Spark to engineer features for predicting churn, and use Spark MLlib to build machine learning models at a scale far beyond what non-distributed tools like scikit-learn can handle.
Tweet sentiment analysis
BDAS with PySpark on AWS
PySpark fundamentals
This project creates and examines different metrics about Home Sales data.
Analysis of a dataset of World Chess Championship games using PySpark
Tracking Tweet sentiment at scale using a pretrained transformer (classifier)
Implementing Apriori algorithm in PySpark
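The entry above refers to the classic Apriori frequent-itemset algorithm. As a point of reference, here is a minimal plain-Python sketch of the level-wise candidate generation and support counting that such a repo would distribute with PySpark; the function name and sample transactions are illustrative, not taken from the repository itself:

```python
from collections import Counter
from itertools import combinations

def frequent_itemsets(transactions, min_support):
    """Apriori sketch: grow candidate itemsets level by level, keeping
    only those whose support (fraction of transactions containing the
    itemset) is at least min_support."""
    n = len(transactions)
    tx = [set(t) for t in transactions]

    # Level 1: count single items and keep the frequent ones.
    counts = Counter(frozenset([item]) for t in tx for item in t)
    current = {s: c for s, c in counts.items() if c / n >= min_support}
    all_frequent = dict(current)

    k = 2
    while current:
        # Candidate generation: unions of frequent (k-1)-itemsets of size k.
        prev = list(current)
        candidates = {a | b for a, b in combinations(prev, 2) if len(a | b) == k}
        # Support counting over the transaction list.
        counts = Counter()
        for t in tx:
            for cand in candidates:
                if cand <= t:
                    counts[cand] += 1
        current = {s: c for s, c in counts.items() if c / n >= min_support}
        all_frequent.update(current)
        k += 1
    return all_frequent

# Example (hypothetical data): with 50% minimum support, {bread, milk}
# and {bread, butter} are frequent pairs, but {milk, butter} is not.
baskets = [["bread", "milk"], ["bread", "butter"],
           ["bread", "milk", "butter"], ["milk"]]
result = frequent_itemsets(baskets, min_support=0.5)
```

In a PySpark implementation, the support-counting loop would typically become a distributed aggregation (for example a `flatMap` over transactions followed by `reduceByKey`), while the candidate-generation step stays the same.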
Udacity Data Engineering Nanodegree. Capstone Project.
Solutions for Advent of Code 2021 in (Py)Spark
Spark DE&ML assignments from the "Data Engineering and Machine Learning with Spark" course (offered by IBM Skills Network)
Cardiovascular Disease Detection using PySpark
Code for the book Learning Jupyter
PySpark RDD, DataFrame, and Dataset examples in Python
Loading Yelp reviews data from Kaggle into a Spark cluster provisioned on AWS EMR and running analyses
Analysis of Clinical Trial Dataset using Dataframes on PySpark
Research and development on distributed Keras with Spark