Questions about the gradual increase of dist_entropy in the mpe environment

Thank you very much for the author's work. I had a question in the process of reproducing.
The environment I used was the petting zoo mpe environment in the example, and the reward curve I got was consistent with that in the paper, but I don't understand why the dist_entropy in the agent is gradually increasing.

<img width="1168" height="528" alt="Image" src="https://github.com/user-attachments/assets/3bb0caca-c644-48ca-845f-d9dd59f75068" />

<img width="1163" height="505" alt="Image" src="https://github.com/user-attachments/assets/e6409a52-2ed4-44ea-ae56-7fc23d4b9a3f" />

<img width="565" height="411" alt="Image" src="https://github.com/user-attachments/assets/1838fb86-eeef-4be3-9cf6-29e463d7f7f4" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Questions about the gradual increase of dist_entropy in the mpe environment #73

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Questions about the gradual increase of dist_entropy in the mpe environment #73

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions