Skip to content

ChrisEssomba/Premier-League-Data-Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

4 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Premier League Data Scraper

โšฝ Project Overview

A web scraping project that extracts Premier League statistics (rankings, player data, match results) using Scrapy and Selenium for dynamic content. The collected data is stored in CSV and JSON formats for analysis.

๐Ÿš€ Key Features

  • Comprehensive Data Collection: Scrapes rankings, player stats, and match results
  • Dynamic Content Handling: Uses Selenium for JavaScript-rendered content
  • Structured Output: Stores data in both CSV and JSON formats
  • Production-Ready: Configured with proper Scrapy middlewares and pipelines

๐Ÿ’ป Technologies Used

  • Web Scraping: Scrapy, Selenium
  • Browser Automation: ChromeDriver
  • Data Processing: Pandas, NumPy
  • Data Formats: JSON, CSV
  • Analysis: Jupyter Notebooks

๐Ÿ› ๏ธ Installation & Usage

  1. Clone the repository:
    git clone [repository-url]
    cd premier-league-scraper
    
    
  2. Install dependencies:
    pip install -r requirements.txt
    
    
  3. Install ChromeDriver (for Selenium):
     brew install chromedriver  # MacOS

or download from https://chromedriver.chromium.org/

  1. Run a spider:
     cd scraper
     scrapy crawl rankings -O ../data/raw/rank.json
    

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages