llm-eval.github.io

Original research on the evaluation of LLMs, conducted by Microsoft Research and collaborating institutes. (Updated: 2023/10) (Contact: Jindong Wang; see also our projects on LLM enhancement) DyVal: graph-informed dynamic evaluation of large language mod…
