steps:
- Download the dump from archive.org: https://archive.org/details/stackexchange
- extract the community you want in
extracted/<name of the community>/
withPosts.xml
& co. in the dir- you can
dtrx
on linux
- you can
- you need to
sudo pip3 install xmltodict
python3 to_csv.py extracted/<name of the community>
to get the csvs incsvs/
sh import.sh
to import the csvs in neo4j- assuming that neo4j is in the
../neo/
directory - the script assume that you want to remove you old database (at the end)
- assuming that neo4j is in the
Look at the scripts before using them to understand what they do :)
Have fun!