  1. LLM AI Integration
  2. LLMAI-87

Execute the benchmark and document results

Details

    • Type: Task
    • Resolution: Fixed
    • Priority: Major
    • 0.6
    • 0.3.1
    • None
    • Unknown

    Description

      The benchmark we created needs to be executed and the results compiled. This involves the following steps:

      • Select a list of LLMs to evaluate
      • Execute the benchmark
      • Check the results, verifying that the automated evaluation is reliable
      • Document the results
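
      One way to approach the "check the results" step is to review a sample of answers by hand and compare those verdicts with the automated ones. The sketch below is only an illustration of that idea, not the procedure used for this issue: the `Verdict` type, the function names, and the sample data are all hypothetical and do not come from the benchmark itself.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Verdict:
    """One benchmark answer, judged automatically and (for a sample) by hand."""
    task_id: str
    automated_pass: bool
    manual_pass: bool


def agreement_rate(verdicts: list[Verdict]) -> float:
    """Fraction of sampled answers where automated and manual verdicts match."""
    if not verdicts:
        raise ValueError("need at least one manually reviewed answer")
    matches = sum(v.automated_pass == v.manual_pass for v in verdicts)
    return matches / len(verdicts)


def disagreements(verdicts: list[Verdict]) -> list[str]:
    """Task ids to re-inspect because the two verdicts differ."""
    return [v.task_id for v in verdicts if v.automated_pass != v.manual_pass]


# Made-up sample, purely to show the call shape (not real benchmark data).
sample = [
    Verdict("task-1", True, True),
    Verdict("task-2", True, False),
    Verdict("task-3", False, False),
    Verdict("task-4", True, True),
]
```

      Calling `agreement_rate(sample)` and `disagreements(sample)` on manually reviewed answers gives a rough signal of how far the automated evaluation can be trusted, and which tasks to look at first when it cannot.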

          People

            ppantiru Paul Pantiru
            MichaelHamann Michael Hamann
