
CUDA out of memory for larger datasets during attribution #191

@lsickert

Description

πŸ› Bug Report

When running inseq on a CUDA device with a larger dataset, an out-of-memory error occurs regardless of the defined batch_size. I believe this is caused by the calls to self.encode in attribution_model.py (lines 345 and 347), which operate on the full inputs instead of a single batch and move all inputs to the CUDA device after encoding.

🔬 How To Reproduce

Steps to reproduce the behavior:

  1. Load any model without pre-generated targets
  2. Load a larger dataset with at least 1000 samples
  3. Call the .attribute() method with any batch_size parameter

Code sample
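
The original report did not include a sample; here is a minimal sketch that should trigger the issue. The model name ("gpt2") and attribution method ("saliency") are placeholder choices, not taken from the report:

```python
import inseq

# Placeholder model/method pair -- any combination should reproduce the
# issue, as long as no pre-generated targets are passed.
model = inseq.load_model("gpt2", "saliency")

# A larger dataset: at least 1000 samples.
input_texts = [f"This is example input number {i}." for i in range(1000)]

# On a CUDA device this raises an out-of-memory error regardless of
# batch_size, since all inputs are encoded and moved to the GPU up front.
out = model.attribute(input_texts, batch_size=8)
```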

Environment

  • OS: macOS

  • Python version: 3.10

  • Inseq version: 0.4.0

Expected behavior

The input texts should only be encoded and moved to the GPU once their batch is actually processed.
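
For reference, a minimal sketch of the expected lazy behavior. The per-batch flow and the attribute_batch helper are assumptions for illustration, not the actual inseq internals:

```python
def attribute_lazily(model, input_texts, batch_size):
    """Sketch: encode and move each batch to the GPU only when it is
    processed, instead of encoding the full dataset up front."""
    outputs = []
    for start in range(0, len(input_texts), batch_size):
        batch = input_texts[start : start + batch_size]
        # Hypothetical per-batch encoding, standing in for the
        # self.encode calls at attribution_model.py lines 345/347.
        encoded = model.encode(batch)
        encoded = {k: v.to(model.device) for k, v in encoded.items()}
        outputs.append(model.attribute_batch(encoded))  # hypothetical
        del encoded  # release this batch's GPU memory before the next
    return outputs
```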
