Skip to content

A graph-based recreation of the "Six Degrees of Wikipedia" using the Simple English Wikipedia dataset.

Notifications You must be signed in to change notification settings

bhavyaKhatri2703/sixDegrees

Repository files navigation

Six Degrees

This project recreates the famous Six Degrees of Wikipedia concept, demonstrating the theory of six degrees of separation in the context of Wikipedia’s interconnected articles. It processes and cleans Wikipedia SQL dumps, converts them into CSV files, and stores the resulting graph data in a Neo4j database with over 18 million nodes and 600+ million relationships. A simple web interface built with React and Node.js allows users to explore connections between any two Wikipedia articles.

Features

  • Large-scale graph of Wikipedia articles and their links.
  • Neo4j-backed database with millions of nodes and relationships.
  • Interactive React + Node.js frontend for exploring connections.
  • Demonstrates real-world application of graph theory.

Tech Stack

  • Database: Neo4j Graph Database

  • Backend: Node.js

  • Frontend: React.js

  • Data Processing: Custom parsing and cleaning of Wikipedia SQL dumps

    Screenshot from 2025-05-18 13-20-07 Screenshot from 2025-05-18 13-20-55

About

A graph-based recreation of the "Six Degrees of Wikipedia" using the Simple English Wikipedia dataset.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published