| We advise you to use the system prompt provided in the paper so that your agents answer in the correct, expected format. In practice, GPT-4-level models follow it easily. | |
| We expect submissions to be JSON Lines files in the following format. The first two fields are mandatory; reasoning_trace is optional: | |
| {"task_id": "task_id_1", "model_answer": "Answer 1 from your model", "reasoning_trace": "The different steps by which your model reached answer 1"} | |
| {"task_id": "task_id_2", "model_answer": "Answer 2 from your model", "reasoning_trace": "The different steps by which your model reached answer 2"} | |
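A minimal sketch of producing a file in the format above, assuming Python and only the standard library. The helper names `to_jsonl` and `write_submission` are hypothetical and not part of any official tooling; only the field names `task_id`, `model_answer`, and `reasoning_trace` come from the format description.

```python
import json

# Field names taken from the submission format; the first two are mandatory.
REQUIRED_FIELDS = ("task_id", "model_answer")


def to_jsonl(records):
    """Serialize records to JSON Lines text: one JSON object per line.

    Hypothetical helper. Raises ValueError if a mandatory field is missing.
    """
    lines = []
    for i, rec in enumerate(records, start=1):
        missing = [f for f in REQUIRED_FIELDS if f not in rec]
        if missing:
            raise ValueError(f"record {i} is missing required field(s): {missing}")
        # json.dumps escapes newlines inside strings, so each record stays on one line.
        lines.append(json.dumps(rec))
    return "".join(line + "\n" for line in lines)


def write_submission(records, path):
    """Write the records to `path` as a UTF-8 JSON Lines file (hypothetical helper)."""
    with open(path, "w", encoding="utf-8") as f:
        f.write(to_jsonl(records))
```

For example, `write_submission([{"task_id": "task_id_1", "model_answer": "Answer 1 from your model"}], "submission.jsonl")` would write a one-line file; `reasoning_trace` can be added to any record when available.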