Premier League Data Scraper

⚽ Project Overview

A web scraping project that extracts Premier League statistics (rankings, player data, and match results) using Scrapy, with Selenium handling JavaScript-rendered pages. The collected data is stored in CSV and JSON formats for analysis.
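
To illustrate the Selenium step, here is a minimal sketch that loads a page in headless Chrome and returns the HTML after JavaScript has run. The names `make_headless_chrome` and `fetch_rendered_html` are hypothetical and not taken from this repository. The sketch assumes Selenium 4 and a working ChromeDriver.

```python
def make_headless_chrome():
    """Build a headless Chrome driver (assumes Selenium 4 and ChromeDriver)."""
    # Imported lazily so the helper below can be exercised without a browser.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options

    options = Options()
    options.add_argument("--headless=new")  # run Chrome without a window
    return webdriver.Chrome(options=options)


def fetch_rendered_html(url, driver_factory=make_headless_chrome):
    """Return the page source of `url` after the browser has rendered it."""
    driver = driver_factory()
    try:
        driver.get(url)
        return driver.page_source
    finally:
        # Always release the browser, even if loading the page fails.
        driver.quit()
```

One way to use the returned HTML inside a spider would be to wrap it in a `scrapy.http.HtmlResponse`, so the usual CSS and XPath selectors still apply.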

🚀 Key Features

Comprehensive Data Collection: Scrapes rankings, player stats, and match results
Dynamic Content Handling: Uses Selenium for JavaScript-rendered content
Structured Output: Stores data in both CSV and JSON formats
Production-Ready: Configured with Scrapy middlewares and item pipelines
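
As a rough sketch of how a pipeline could produce both output formats, the class below buffers scraped items and writes them to CSV and JSON when the spider closes. The class name `CsvJsonExportPipeline`, the default file names, and the buffering approach are assumptions for illustration. The repository's actual pipelines may differ.

```python
import csv
import json


class CsvJsonExportPipeline:
    """Hypothetical item pipeline: buffer items, then write CSV and JSON."""

    def __init__(self, csv_path="items.csv", json_path="items.json"):
        self.csv_path = csv_path
        self.json_path = json_path
        self.items = []

    def open_spider(self, spider):
        self.items = []  # start each crawl with an empty buffer

    def process_item(self, item, spider):
        self.items.append(dict(item))
        return item  # pass the item on to any later pipeline

    def close_spider(self, spider):
        with open(self.json_path, "w", encoding="utf-8") as f:
            json.dump(self.items, f, ensure_ascii=False, indent=2)

        # Collect column names in first-seen order across all items.
        fieldnames = []
        for row in self.items:
            for key in row:
                if key not in fieldnames:
                    fieldnames.append(key)

        with open(self.csv_path, "w", newline="", encoding="utf-8") as f:
            writer = csv.DictWriter(f, fieldnames=fieldnames)
            writer.writeheader()
            writer.writerows(self.items)
```

Scrapy's built-in feed exports (the `-O` flag used in the usage section) can write either format without a custom pipeline, so a class like this is only needed when both files should come from a single crawl.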

💻 Technologies Used

Web Scraping: Scrapy, Selenium
Browser Automation: ChromeDriver
Data Processing: Pandas, NumPy
Data Formats: JSON, CSV
Analysis: Jupyter Notebooks

🛠️ Installation & Usage

Clone the repository:

```
git clone [repository-url]
cd premier-league-scraper
```

Install dependencies:
```
pip install -r requirements.txt
```
Install ChromeDriver (required by Selenium):
```
brew install chromedriver  # macOS
```

Alternatively, download it from https://chromedriver.chromium.org/
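
A quick way to confirm the install worked is to check that the `chromedriver` executable is on `PATH`. The helper name `find_chromedriver` is hypothetical. This is only a convenience sketch, not part of the project.

```python
import shutil


def find_chromedriver(which=shutil.which):
    """Return the path of the chromedriver executable on PATH, or None."""
    return which("chromedriver")


if __name__ == "__main__":
    path = find_chromedriver()
    if path:
        print(f"chromedriver found at {path}")
    else:
        print("chromedriver not found on PATH")
```

Selenium 4.6 and later include Selenium Manager, which can fetch a matching driver automatically, so the manual install may be unnecessary depending on the version pinned in requirements.txt.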

Run a spider:

```
cd scraper
scrapy crawl rankings -O ../data/raw/rank.json
```
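
Once the crawl finishes, the JSON feed can be inspected from Python. The sketch below assumes the file holds a JSON array of objects, which is what Scrapy's JSON exporter writes. The `position` field and the helper names are hypothetical, since the item schema is not documented here.

```python
import json


def load_items(path):
    """Read a Scrapy JSON feed (a JSON array of objects) into a list of dicts."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def top_n(items, key, n=5):
    """Return the n items with the smallest integer value for `key`."""
    # Scraped values are often strings, so convert before comparing.
    return sorted(items, key=lambda item: int(item[key]))[:n]


if __name__ == "__main__":
    rankings = load_items("../data/raw/rank.json")
    for row in top_n(rankings, "position"):  # "position" is an assumed field
        print(row)
```

For tabular analysis in a notebook, `pandas.read_json` on the same file gives a DataFrame directly.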
