Uploaded image for project: 'LLM AI Integration'
  1. LLM AI Integration
  2. LLMAI-80

Evaluate the output language of the LLM in the benchmark

    XMLWordPrintable

Details

    • Improvement
    • Resolution: Done
    • Major
    • 0.6
    • 0.4
    • None
    • Unknown

    Description

      The benchmark should evaluate the output language of the LLM and ensure it corresponds to the language of the question (as opposed to the language of the provided context). We should also try to improve the performance of the models by explicitly prompting them to use the question's language.

      Attachments

        Activity

          People

            ppantiru Paul Pantiru
            MichaelHamann Michael Hamann
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: