Details
- Improvement
- Resolution: Done
- Major
- 0.4
- None
- Unknown
Description
The benchmark should evaluate the LLM's output language and verify that it matches the language of the question (as opposed to the language of the provided context). We should also try to improve model performance by explicitly prompting the models to answer in the question's language.
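A minimal sketch of the proposed check, under the assumption that a lightweight heuristic is acceptable: guess the language of both question and answer (a real benchmark would use a proper language detector such as a fastText or langdetect model) and flag answers whose language differs from the question's. All names below are hypothetical.

```python
# Tiny stopword-based language guesser; illustrative only, not the
# benchmark's actual detector.
STOPWORDS = {
    "en": {"the", "is", "and", "of", "to", "in", "a"},
    "fr": {"le", "la", "est", "et", "de", "un", "une"},
    "de": {"der", "die", "das", "ist", "und", "ein", "zu"},
}

def guess_language(text: str) -> str:
    """Return the language whose stopwords overlap the text the most."""
    tokens = set(text.lower().split())
    return max(STOPWORDS, key=lambda lang: len(tokens & STOPWORDS[lang]))

def answer_matches_question_language(question: str, answer: str) -> bool:
    """The proposed benchmark check: answer language == question language."""
    return guess_language(question) == guess_language(answer)
```

For the second part of the ticket, the fix on the prompting side could be as simple as appending an instruction like "Answer in the same language as the question, not the language of the context." to the system prompt.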