Skip to content

define final tasks in tool calling benchmark #576

Open
@jmatejcz

Description

@jmatejcz

Is your feature request related to a problem? Please describe.
For now we have 30 tasks in Tool calling agent benchmark from categories:

  • basic (get image from camera topic etc.)
  • navigation ( navigate somewhere type tasks )
  • manipulation ( grab / drop something )
  • spatial reasoning ( answer question about an image )
  • custom interfaces ( forming and sending the messages of custom interface )

Describe the solution you'd like
We need to establish what tasks we want and how many.
Please check out current tasks.
If you have any suggestion, leave a comment

Describe alternatives you've considered

Additional context

Metadata

Metadata

Assignees

Labels

enhancementNew feature or requestquestionFurther information is requested

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions