Import Stackoverflow into Neo4j

Download the Stackoverflow Archives from https://archive.org/details/stackexchange:

stackoverflow.com-Badges.7z
stackoverflow.com-Comments.7z
stackoverflow.com-PostHistory.7z
stackoverflow.com-PostLinks.7z
stackoverflow.com-Posts.7z
stackoverflow.com-Tags.7z
stackoverflow.com-Users.7z
stackoverflow.com-Votes.7z

Install p7zip:

brew install p7zip

Unzip the Posts, Users and Tags:

7za -y -oextracted x *Users.7z
7za -y -oextracted x *Tags.7z
7za -y -oextracted x *Posts.7z

Review the Extracted XML files:

Users.xml - 3.53GB
Tags.xml - 5MB
Posts.xml - 74GB

Clone stackoverflow-neo4j:

git clone https://github.com/mdamien/stackoverflow-neo4j

Install Python:

brew install python3
sudo pip3 install xmltodict

Extract...
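With Posts.xml at 74GB, the extracted files are far too large to load into memory in one go, so whatever processes them has to stream. Below is a minimal sketch of that idea using only Python's standard library. It is not the stackoverflow-neo4j repository's code and does not use xmltodict. It assumes each record in the dump is a `<row>` element whose data lives in attributes (for example `Id`, `TagName`), and the function names `iter_rows` and `rows_to_csv` are made up for illustration.

```python
import csv
import xml.etree.ElementTree as ET


def iter_rows(source):
    """Yield the attributes of each <row> element as a dict, one at a time.

    `source` is a filename or a binary file object. Assumes the dump layout
    is <root><row .../><row .../>...</root> (an assumption, not verified
    against every archive).
    """
    context = ET.iterparse(source, events=("start", "end"))
    _, root = next(context)  # first event is the start of the root element
    for event, elem in context:
        if event == "end" and elem.tag == "row":
            yield dict(elem.attrib)
            root.clear()  # drop rows already processed to keep memory flat


def rows_to_csv(source, out, fields):
    """Write the chosen attributes of every row to `out` as CSV.

    Missing attributes are written as empty strings. Returns the row count.
    """
    writer = csv.DictWriter(out, fieldnames=fields)
    writer.writeheader()
    count = 0
    for row in iter_rows(source):
        writer.writerow({name: row.get(name, "") for name in fields})
        count += 1
    return count
```

A call such as `rows_to_csv("extracted/Tags.xml", open("tags.csv", "w", newline=""), ["Id", "TagName"])` would then produce a CSV file of the kind Neo4j's import tooling can read. The path and the column list here are examples only and depend on which attributes the dump actually contains.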
