LLMJudge Logo LLM-as-a-Rel: Benchmarking Automatic Relevance Judgments

1University College London, 2Microsoft, 3University of Waterloo, 4University of Amsterdam, 5University of Padua

LLMJudge Benchamrk TBA

An image to be added.

Summary

To be added.

LLMJudge Challenge Dataset

intro

LLMJudge Benchamrk

intro

Results and Analysis


Main Results

results

Analysis

analysis results

(A) Ablations

BibTeX

@article{rahmani2024llmhudgebench,
      author    = {Rahmani, Hossein A. and Yilmaz, Emine and Craswell, Nick and Mitra, Bhaskar and Thomas, Paul and Clarke, Charles L. A. and Aliannejadi, Mohammad and Siro, Clemencia and Faggioli, Guglielmo},
      title     = {LLMJudge: Automatic Relevance Judgments for Search and Retrieval Systems},
      year      = {2024},
     journal    = {#},
     url        = {#}
    }