This project is for educational purposes only. It is by no means suitable for more serious scenarios that demand high accuracy without further modification of the methods used and the existing codebase.
- Project name: Fake News Detection in Big Data Scenario
- Group: Allin
- Instructor: PhD. Nguyễn Ngọc Thảo
- Department: Computer Science
This repository contains the proposal, tasks, descriptions, source code, and other related resources for this course.
Use the following code snippet to connect to the database:
import pymongo
# Replace <PRIMARY_PASSWORD> with the account key; do not commit the real value.
uri = "mongodb://dreamywanderer:<PRIMARY_PASSWORD>@dreamywanderer.mongo.cosmos.azure.com:10255/?ssl=true&retrywrites=false&replicaSet=globaldb&maxIdleTimeMS=120000&appName=@dreamywanderer@"
client = pymongo.MongoClient(uri)
NewsDataset = client['NewsDataset']
- NewsDataset: Contains the collections of the original (raw) datasets. Each dataset belongs to one collection:
  - VNFD: VNFD dataset.
  - VnExpress: VnExpress dataset.
  - FakeVN: FakeVN dataset.
  - VNFDPreprocessed: The preprocessed VNFD dataset.
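One way to keep the account key out of the source is to read it from the environment and percent-escape it before building the URI. The sketch below is only an illustration: the environment variable names (`MONGO_USER`, `MONGO_PASSWORD`, `MONGO_HOST`) and the helper names `build_uri` and `open_collection` are assumptions, not part of this repository. The query options are copied from the connection snippet.

```python
import os
from urllib.parse import quote_plus

# Collection names taken from the NewsDataset description.
COLLECTIONS = ("VNFD", "VnExpress", "FakeVN", "VNFDPreprocessed")


def build_uri(user: str, password: str, host: str, port: int = 10255) -> str:
    """Assemble a MongoDB URI, percent-escaping the credentials."""
    options = (
        "ssl=true&retrywrites=false&replicaSet=globaldb"
        f"&maxIdleTimeMS=120000&appName=@{user}@"
    )
    # Keys often contain '/', '+' or '=', which must be escaped inside a URI.
    return (
        f"mongodb://{quote_plus(user)}:{quote_plus(password)}"
        f"@{host}:{port}/?{options}"
    )


def open_collection(name: str):
    """Return one collection of the NewsDataset database (needs pymongo)."""
    if name not in COLLECTIONS:
        raise ValueError(f"unknown collection: {name}")
    import pymongo  # imported lazily so build_uri works without the driver

    uri = build_uri(
        os.environ["MONGO_USER"],      # hypothetical variable names
        os.environ["MONGO_PASSWORD"],
        os.environ["MONGO_HOST"],
    )
    return pymongo.MongoClient(uri)["NewsDataset"][name]
```

With the three variables set, `open_collection("VNFD").find_one()` would fetch a single raw VNFD document.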
Before it is pushed to the database, every crawled dataset must be converted to JSON with the features described in Document/denineScheme.md.
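The conversion step could look roughly like the sketch below. The field names in `REQUIRED_FIELDS` are placeholders chosen for illustration; the real feature list is the one in Document/denineScheme.md, and `to_json` is a hypothetical helper, not existing project code.

```python
import json

# Hypothetical field names; the real ones are listed in Document/denineScheme.md.
REQUIRED_FIELDS = ("title", "content", "label")


def to_json(records, required=REQUIRED_FIELDS):
    """Validate crawled records and serialise them as a JSON array."""
    records = list(records)
    for i, record in enumerate(records):
        missing = [field for field in required if field not in record]
        if missing:
            raise ValueError(f"record {i} is missing fields: {missing}")
    # ensure_ascii=False keeps Vietnamese text readable in the output.
    return json.dumps(records, ensure_ascii=False, indent=2)
```

Failing early on a missing field keeps incomplete documents out of the shared collections.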
There are three datasets:
- VNFD
- VnExpress
- FakeVN
The specification of each dataset above is described in the file Document/datasetDetail.
Try to follow Conventional Commits 1.0.0 and choose the commit type from here when you commit. This helps establish a standard and makes the repository easier to follow.
I suggest installing GitHub Copilot (Microsoft) to speed up cumbersome, tedious, or boring coding tasks.
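As a rough illustration of the header format `type(scope)!: description`, the sketch below checks the first line of a commit message. Conventional Commits 1.0.0 itself only defines `feat` and `fix`; the other entries in `ALLOWED_TYPES` are commonly used types assumed here, and `check_header` is a hypothetical helper, not a tool shipped with this repository.

```python
import re

# 'feat' and 'fix' come from the specification; the rest are an assumed,
# commonly used set and should be replaced by the list the team agrees on.
ALLOWED_TYPES = {
    "feat", "fix", "build", "chore", "ci",
    "docs", "style", "refactor", "perf", "test",
}

HEADER = re.compile(
    r"^(?P<type>[a-z]+)"               # commit type
    r"(?:\((?P<scope>[^()\s]+)\))?"    # optional scope in parentheses
    r"(?P<breaking>!)?"                # optional breaking-change marker
    r": (?P<description>\S.*)$"        # colon, space, description
)


def check_header(line: str) -> bool:
    """Return True if the line looks like a Conventional Commits header."""
    match = HEADER.match(line)
    return bool(match) and match.group("type") in ALLOWED_TYPES
```

For example, `check_header("feat(crawler): add VnExpress spider")` is accepted, while `check_header("Update readme")` is rejected.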