DevOps pipeline for Real Time Social/Web Mining
-
Updated
Jul 14, 2024 - HTML
DevOps pipeline for Real Time Social/Web Mining
The repo contains a Hadoop cluster configuration and a client-server app. The goal is to predict smartphone's price range using a machine learning model generated over Apache Spark, and visualize charts about smarphone statistics using data originated by Apache Hive.
Taxi versus Uber in NYC
NYC Subway Data Analysis, project on Hadoop-MapReduce
My solution to Introduction to Big Data with Apache Spark MOOC at Edx
Taller SparkR para las Jornadas de Usuarios de R
Add a description, image, and links to the hdfs topic page so that developers can more easily learn about it.
To associate your repository with the hdfs topic, visit your repo's landing page and select "manage topics."