Source code for the paper "RAID: A Dataset for Testing the Adversarial Robustness of AI-Generated Image Detectors"
```shell
conda create -n myenv python=3.10
conda activate myenv
pip install -r requirements.txt
```
If you want to use the available pretrained models, you can skip directly to the next step: the checkpoints, stored in this HuggingFace repository, will be downloaded automatically.
First, create a `data` directory in the project root folder and download the ELSA D3 dataset (or a subset):

```shell
python src/raid/scripts/elsa_downloader.py --split train --amount 200_000
python src/raid/scripts/elsa_downloader.py --split val
```
The structure of the ELSA D3 dataset is as follows:
```
data
└── ELSA_D3
    ├── train
    │   ├── real
    │   ├── gen_0
    │   └── ...
    └── val
        ├── real
        ├── gen_0
        └── ...
```
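Assuming the layout above, a small helper can enumerate the image-source subfolders of a split. The function name and default path below are illustrative only, not part of the codebase:

```python
from pathlib import Path

def list_sources(root="data/ELSA_D3", split="train"):
    """Return the image-source subfolders (real, gen_0, ...) of a split,
    assuming the directory layout shown above. Illustrative helper only."""
    split_dir = Path(root) / split
    return sorted(p.name for p in split_dir.iterdir() if p.is_dir())
```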
The `train_detectors.sh` file contains all the training scripts to be launched. Uncomment the lines corresponding to the detectors you want to train before launching it. Additionally, to use the newly generated checkpoints, modify the `constants.py` file so that each model inside the `MODELS` dictionary has its respective checkpoint path.
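As a sketch of that step (the keys, entries, and helper below are hypothetical; check the real structure of `MODELS` in `constants.py`), the update might look like:

```python
# Hypothetical sketch of MODELS entries in constants.py — the real
# dictionary keys and value layout in the repository may differ.
MODELS = {
    "wang2020": {"checkpoint": "checkpoints/wang2020/best.pth"},
    "corvi23": {"checkpoint": "checkpoints/corvi23/best.pth"},
}

def checkpoint_for(name: str) -> str:
    # Look up the checkpoint path registered for a detector name.
    return MODELS[name]["checkpoint"]
```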
Download the RAID dataset from the following link and check that the structure matches the one below:
```
data
└── RAID
    ├── original
    │   ├── real
    │   ├── gen_0
    │   └── ...
    └── epsilon32
        ├── real
        ├── gen_0
        └── ...
```
`run_attack.sh`: runs the ensemble attack and evaluates the provided model on adversarial examples. Optionally saves the adversarial examples for dataset creation.

- `--generate`: if set, generates and saves the adversarial dataset at the provided `output_dir`
- `--eval_model`: model to be evaluated
- `--eval_output_dir`: directory where the evaluation results are saved
- `--models`: model(s) to be attacked in the ensemble attack
- `--device`: device(s) on which to run the models; if a list is passed, its length must equal the number of models
- `--path_to_dataset`: path to the dataset
- `--dataset_type`: `subfolders` for datasets with the same structure as ELSA D3, `wang2020` for the structure used by detector datasets such as ForenSynths
- `--output_dir`: directory where the adversarial dataset is saved
- `--epsilon`: perturbation budget for the attack
- `--num_steps`: number of attack steps
- `--step_size`: attack step size
- `--ensembling_strategy`: ensembling strategy used in the attack
- `--ensemble_loss`: loss used for the ensemble attack
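The `epsilon`, `num_steps`, and `step_size` parameters map onto a standard L∞ PGD-style attack. The sketch below is a minimal, self-contained illustration that averages detector gradients (one possible ensembling strategy); it is not the repository's actual implementation:

```python
import numpy as np

def ensemble_pgd_attack(x, grad_fns, epsilon, num_steps, step_size):
    """L-infinity PGD sketch: average the loss gradients of several
    detectors (a "mean" ensembling strategy), take signed ascent steps,
    and project back into the epsilon-ball around the original image x."""
    x_adv = x.copy()
    for _ in range(num_steps):
        g = np.mean([fn(x_adv) for fn in grad_fns], axis=0)  # ensemble gradient
        x_adv = x_adv + step_size * np.sign(g)               # signed step
        x_adv = np.clip(x_adv, x - epsilon, x + epsilon)     # project to eps-ball
        x_adv = np.clip(x_adv, 0.0, 1.0)                     # keep valid pixel range
    return x_adv
```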
`run_experiments.sh`: evaluates the provided model(s) on a saved adversarial-example dataset.

- `--model`: model to be evaluated
- `--path_to_checkpoint`: checkpoint of the model to be loaded
- `--dataset_type`: structure of the generated adversarial dataset; `subfolders` for datasets with the same structure as ELSA D3, `wang2020` for the structure used by detector datasets such as ForenSynths
- `--output_dir`: directory where the evaluation results are saved
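Conceptually, the evaluation boils down to scoring each image and comparing against its label. A toy sketch (the `predict_fn` interface and the 0.5 decision threshold are assumptions, not the repository's API):

```python
def evaluate_detector(predict_fn, samples):
    """Toy accuracy computation: samples is a list of (image, label)
    pairs with label 1 = AI-generated and 0 = real; predict_fn returns
    the probability that an image is AI-generated. Illustrative only."""
    correct = 0
    for image, label in samples:
        pred = 1 if predict_fn(image) >= 0.5 else 0  # threshold at 0.5
        correct += int(pred == label)
    return correct / len(samples)
```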
Evaluating and attacking a new detector

- `raid/attacks/`: directory containing the code for the ensemble attack and the trackers and losses it uses
- `raid/datasets.py`: Python script containing the dataloaders for the datasets
- `external/`: directory containing essential files for loading and training the detectors
- `raid/models/`: directory containing the wrapped detectors
- `raid/plots/plot_adv_example.py`: Python script used for plotting adversarial examples
- `raid/scripts/`: directory containing scripts to download the ELSA D3 dataset from Hugging Face
- `raid/attack_generate.py`: Python script to run the adversarial attack on an ensemble of detectors, evaluate it on a detector, and generate the adversarial dataset
- `raid/evaluate_detector.py`: Python script to evaluate detector(s) on a given dataset (adversarial or otherwise)
- `run_attack.sh`: script used for running the attack and evaluation
- `run_experiments.sh`: script for evaluation on a dataset
- `train_detectors.sh`: training script for a list of detectors
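To plug in a new detector, it typically needs to be wrapped so the attack and evaluation code can call it uniformly. The interface below is hypothetical (check the wrapper classes in `raid/models/` for the actual one); it only illustrates the idea of exposing preprocessing plus a score:

```python
# Hypothetical wrapper — the real interface expected by raid/models/
# may differ; this sketch exposes preprocessing and a real/fake score.
class MyDetectorWrapper:
    def __init__(self, model):
        self.model = model

    def preprocess(self, image):
        # Detector-specific normalization / cropping would go here.
        return image

    def __call__(self, image):
        # Return the detector's score for a (preprocessed) image.
        return self.model(self.preprocess(image))
```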
| Detector | Detection Method | Architecture | Dataset | Preprocessing | Performance |
|---|---|---|---|---|---|
| Ojha2023 (Universal) | CLIP feature space (not trained for AI-generated image detection) + trainable binary classifier | Pretrained CLIP:ViT-L/14 network + trainable linear layer | ForenSynths (ProGAN, LSUN), DMs | Normalize (CLIP) + CenterCrop (224x224) | x |
| Corvi23 | Two methods: (1) a single CNN, (2) an ensemble of two CNNs trained on different datasets | Modified ResNet50 with no downsampling | Custom (ProGAN, Latent Diffusion) | Normalize (ImageNet) | x |
| Cavia2024 | CNN patch-level scoring + global average pooling | Modified ResNet50 with custom convolutions | ForenSynths (ProGAN, LSUN) | Normalize (ImageNet) + Resize (256x256) | x |
| Chen2024 (convnext) | Diffusion Reconstruction Contrastive Training (DRCT) framework: contrastive training loss with reconstructed images included during training | ConvNeXt | DRCT-2M | Normalize (ImageNet) + CenterCrop (224) | x |
| Chen2024 (clip) | Diffusion Reconstruction Contrastive Training (DRCT) framework: contrastive training loss with reconstructed images included during training | CLIP:ViT-L/14 | DRCT-2M | Normalize (ImageNet) + CenterCrop (224) | x |
| Koutlis2024 | CLIP's intermediate encoder-block representations | Pretrained CLIP:ViT-B/16 model + trainable linear layers | ForenSynths (ProGAN, LSUN), Ojha, Tan | Normalize (CLIP) + CenterCrop (224) | x |
| Wang2020 | ResNet50 pretrained on ImageNet, trained for binary classification | ResNet50 | ForenSynths (ProGAN, LSUN) | Normalize (ImageNet) | x |
The provided MIT License applies only to the `raid` directory. The code contained in the `external` folder, provided by third parties and modified in some parts, has its own licenses, which are included in each subfolder.