spark-sql

Star

Here are 20 public repositories matching this topic...

syedhassaanahmed / databricks-notebooks

Star

Collection of Databricks and Jupyter Notebooks

Updated Mar 11, 2024
Jupyter Notebook

Dirkster99 / PyNotes

Star

My notebook on using Python with Jupyter Notebook, PySpark etc

python spark pandas-dataframe jupyter-notebook pyspark parquet panda dataframe spark-sql sparknlp

Updated Aug 25, 2021
Jupyter Notebook

roshankoirala / pySpark_tutorial

Star

Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning

visualization machine-learning sql apache-spark exploratory-data-analysis regression pyspark classification dataframe spark-sql pyspark-tutorial spark-ml rdds

Updated Aug 26, 2020
Jupyter Notebook

amanjeetsahu / Apache-Spark-Tutorials

Star

This repo contains my learnings and practice notebooks on Spark using PySpark (Python Language API on Spark). All the notebooks in the repo can be used as template code for most of the ML algorithms and can be built upon it for more complex problems.

machine-learning big-data spark bigdata pyspark spark-streaming spark-sql spark-mllib

Updated Dec 9, 2021
Jupyter Notebook

kayvansol / PySparkJupyterOnKubernetes

Star

PySpark & Jupyter Notebooks Deployed On Kubernetes

apache-spark jupyter docker-compose jupyter-notebook kubernetes-cluster python3 pyspark spark-sql bitnami-charts

Updated Apr 16, 2024

abulbasar / zeppelin-notebooks

Star

scala spark spark-sql spark-mllib

Updated Oct 21, 2017

flaviostutz / spark-scala-jupyter

Star

Jupyter notebook server prepared for running Spark with Scala kernels on a remote Spark master

scala spark jupyter jupyter-notebook hdfs spark-sql hdfs-docker scala-spark hdfs-cluster

Updated Apr 25, 2020
Jupyter Notebook

matheus-conrado / notebook_databricks_cloud_guardians

Star

Repositório contendo todo o projeto de engenharia de dados realizado na Databricks conectando com o redshift na aws

python spark databricks spark-sql databricks-notebooks

Updated Mar 28, 2022
Jupyter Notebook

shrikantnaidu / Apache-Spark

Star

Spark and Spark ML notebooks

udacity spark spark-sql spark-ml

Updated Apr 27, 2020
Jupyter Notebook

sunujh6 / spark_practice

Star

spark notebook jupyter-notebook pyspark spark-sql spark-mllib spark-structured-streaming

Updated Aug 23, 2022
Jupyter Notebook

venkatakamaiah46 / Pyspark-Programs_OLD

Star

Spark Python Programs on Jupyter notebook

python apache-spark spark-sql

Updated Jun 14, 2023
Jupyter Notebook

zemuldo / spark-tutorial

Sponsor

Star

Jupiter notebooks for my spark tutorials.

spark spark-sql

Updated Nov 7, 2020
Jupyter Notebook

rhl-gupta / pyspark---Basics

Star

Explains the implementation of spark concepts using pyspark API from jupyter notebook

spark-sql spark-dataframes pyspark-api sprakcontext pyspark-in-jupter

Updated Jun 28, 2018

JonathanPollyn / Spark

Star

This notebook contains detailed code for spark and machine learning and databricks

python spark pyspark spark-sql pyspark-python

Updated Mar 15, 2023
Jupyter Notebook

jkanclerz / data-science-workshop-2022

Star

The repository contains notebook templates for the purposes of the data science course at the Cracow University of Economics.

python data-science spark spark-sql

Updated Jan 21, 2023
Jupyter Notebook

DeevanshiSharma / Bundesliga-Big-Data-Analysis-using-PySpark

Star

Performed Big Data Analysis on Bundesliga Football League Dataset using tools PySpark, spark-SQL, and numpy and done in Jupyter Notebook.

python big-data apache-spark numpy jupyter-notebook pyspark data-analysis spark-sql big-data-analytics ad-hoc-analysis

Updated Feb 16, 2023
Jupyter Notebook

RahulParajuli / PySpark-Queries

Star

Pyspark tutorial with different query that you can use on notebook using pyspark. It is very useful tool to analyze large amount of data.

database bigdata pyspark spark-sql dataanalysis

Updated Sep 15, 2022
Jupyter Notebook

cavallon / Home_Sales

Star

This SparkSQL project analyzes home sales data, optimizing queries and calculating average prices. Results are saved in a Jupyter Notebook and uploaded to a GitHub repository named "Home_Sales."

data-science jupyter-notebook spark-sql

Updated Apr 23, 2024
Jupyter Notebook

irenacosta / monkeypoxPySpark

Star

Análise de dataset gerado em projeto de 100 dias da Organização "Our World in Data" que acompanhou e tabulou dados mundiais da epidemia de "varíola dos macacos" no ano de 2022. Desenvolvido em Pyspark no ambiente notebook do Google Colab.

python data sql pandas pyspark spark-sql colab-notebook

Updated Feb 16, 2023
Jupyter Notebook

NijatZeynalov / Apache-SparkML-Pipelines

Star

In this notebook I’ll use the HMP dataset and perform some basic operations using Apache SparkML Pipeline component. This dataset is a public collection of labelled accelerometer data recordings to be used for the creation and validation of acceleration models of human motion primitives.

spark-sql sparkml-pipelines

Updated Apr 20, 2020
Jupyter Notebook

Improve this page

Add a description, image, and links to the spark-sql topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the spark-sql topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

spark-sql

Here are 20 public repositories matching this topic...

syedhassaanahmed / databricks-notebooks

Dirkster99 / PyNotes

roshankoirala / pySpark_tutorial

amanjeetsahu / Apache-Spark-Tutorials

kayvansol / PySparkJupyterOnKubernetes

abulbasar / zeppelin-notebooks

flaviostutz / spark-scala-jupyter

matheus-conrado / notebook_databricks_cloud_guardians

shrikantnaidu / Apache-Spark

sunujh6 / spark_practice

venkatakamaiah46 / Pyspark-Programs_OLD

zemuldo / spark-tutorial

rhl-gupta / pyspark---Basics

JonathanPollyn / Spark

jkanclerz / data-science-workshop-2022

DeevanshiSharma / Bundesliga-Big-Data-Analysis-using-PySpark

RahulParajuli / PySpark-Queries

cavallon / Home_Sales

irenacosta / monkeypoxPySpark

NijatZeynalov / Apache-SparkML-Pipelines

Improve this page

Add this topic to your repo