[EN/TH]
Data Analysis project for analyzing questions, answers, comments and overall user's behavior on the Stack Overflow site.
This project is a part of Problem Solving in Information Technology (06016314) - King Mongkut's Institute of Technology Ladkrabang
- Analyze topics popularity by tag(s) defined in asked questions.
- Analyze user behavior based on the positive and negative context of the comments.
- Analyze how time in a year affect user's activity on the site.
badges
- Acquired badges - 1.19 GBcomments
- Posted Comments - 12.01 GBpost_questions
- Submitted Question - 25.10 GBpost_answers
- Submitted Answer - 20.17 GBtags
- Used tags in questions - 2.08 MBusers
- User's info - 1.4 GB
Data Range - 2008 - 2018
Total Size - 59.87 GB (Estimated)
- Python
3.7.0
- pygal
2.4.0
- pygal
- Google Cloud Platform
- BigQuery
Install the required library
pip install pygal
dataset
data
- Raw and converted dataquery
- BigQuery query method
convert
- Python files for converting raw data into visualization ready formatvisualize
- Python files for data visualizationdocs
- Project's site
Notes - All the path is set to relative to the project's root directory. (./StackBehavior/...
)
- Naphat Pornbunruang - 61070044 - 61070044
- Phuwathid Summaviwat - 61070173 - phwt
- Veerapong Tanjantuk - 61070213 - veerapong76
- Sahatsawat Hiranpetch - 61070239 - maizerocom