Ultra-Compress PDF

A web application to compress PDF files while maintaining readability for both humans and LLMs.

Features

Drag and drop interface for PDF files
Adjustable compression settings:
- Image quality (affects file size)
- Resolution (DPI)
Real-time compression progress
Download compressed PDF files
Size comparison between original and compressed files
Side-by-side comparison viewer for original and compressed PDFs
Download all compressed files at once
Reorder PDFs via drag-and-drop
Combine multiple PDFs into a single document
OCR analysis for combined PDFs via Mistral AI
AWS S3 integration for temporary file storage

How It Works

Ultra-Compress PDF works by:

Reading the PDF file using PDF.js
Rendering each page to a canvas
Converting each page to a JPEG with adjustable quality
Creating a new PDF with pdf-lib containing the compressed images
Preserving the original page dimensions

For OCR functionality:

Combined PDF is securely uploaded to AWS S3 using pre-signed URLs
Mistral AI's OCR service processes the document
Text is extracted from all pages, including images and scanned content
The file is automatically deleted from S3 after processing

Setup

Prerequisites

Node.js 16+
AWS account with S3 bucket
Mistral AI API key (for OCR functionality)

Installation

Clone the repository

git clone https://github.com/bchewy/compress.git
cd compress

Install dependencies
```
npm install
```

Create a .env file with your AWS credentials:

AWS_REGION=your-region
AWS_ACCESS_KEY_ID=your-access-key
AWS_SECRET_ACCESS_KEY=your-secret-key
AWS_BUCKET_NAME=your-bucket-name

Start the server
```
npm start
```
Open http://localhost:3000 in your browser

Usage

Open the application in a web browser
Drag and drop PDF files onto the drop area or click "Select Files" to choose files
Arrange the files in your desired order using drag and drop
Adjust compression settings as needed:
- Lower image quality for smaller file size
- Lower DPI for further size reduction
Optionally check "Combine all PDFs into a single file" if you want a merged document
For OCR analysis of combined PDFs, enable the OCR option and enter your Mistral API key
Click "Compress Files" to start the compression process
Once compression is complete, you can:
- Download individual compressed files
- Compare the original and compressed versions side by side
- Download all compressed files at once with the "Download All" button
- Download the combined PDF (if you selected that option)
- View OCR results and download extracted text (for combined PDFs with OCR)

Security Note

All PDF processing is done client-side
AWS credentials are securely handled server-side
S3 uploads use pre-signed URLs for secure, temporary access
Files uploaded to S3 are automatically deleted after OCR processing
Your Mistral API key is stored locally in your browser if you choose to save it

Dependencies

PDF.js - For rendering PDF pages
pdf-lib - For creating new PDF files
SortableJS - For drag and drop reordering
Express - For server-side routing and API
dotenv - For environment variable management

Comparison Viewer

The comparison viewer allows you to:

See the original and compressed PDFs side by side
Navigate through all pages using the page controls
Visually check the quality difference between versions

File Ordering

You can easily reorder your PDF files before compression:

Drag the handle (⋮⋮) on the left side of each file to reorder
The order number is displayed next to each file
This order determines:
- The processing sequence
- The page order when combining PDFs into a single document

Notes

All processing is done client-side; no files are uploaded to a server
Large PDFs may take more time to process
Text quality is dependent on the selected DPI and image quality settings

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
README.md		README.md
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
script.js		script.js
server.js		server.js
styles.css		styles.css
vercel.json		vercel.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Ultra-Compress PDF

Features

How It Works

Setup

Prerequisites

Installation

Usage

Security Note

Dependencies

Comparison Viewer

File Ordering

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Languages

bchewy/compress

Folders and files

Latest commit

History

Repository files navigation

Ultra-Compress PDF

Features

How It Works

Setup

Prerequisites

Installation

Usage

Security Note

Dependencies

Comparison Viewer

File Ordering

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages