Skip to content

aholowko/spark-stackoverflow-trends

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

StackOverflow Trends with Apache Spark

Build Status

The purpose of the thesis is trends analysis of programming languages according to dataset from Stack Overflow. It aims to recognize the most frequently used programming language used by software developers.

The analysis was conducted with use of Big Data techniques. The main part of data processing was accomplished with Apache Spark engine for large-scale data processing. The analyzed statistics are for example: number of questions concerning a programming language per month, number of developers answering questions about a programming language.

The obtained results were graphically presented with tools such as Python Pandas and Altair. According to the presented analysis, the most popular programming languages among Stack Overflow users are

  1. “JavaScript” (growing popularity),

  2. “Java” (stable popularity),

  3. “C#” (shrinking popularity).

Releases

No releases published

Packages

No packages published

Languages