HITS

Analyzing AI Evaluation Benchmarks Through Information Retrieval and Network Science

Poster Presentation - The 48th European Conference on Information Retrieval (ECIR 2026). Delft, The Netherlands.

Many analyses have been performed on Information Retrieval (IR) evaluation benchmarks. Benchmarking also plays a central role in evaluating the capabilities of Large Language Models (LLMs). In this paper, we apply an IR approach to LLM evaluation. …

HITS Hits Readersourcing: Validating Peer Review Alternatives Using Network Analysis

Peer review is a well-known mechanism used within the scholarly publishing process to ensure the quality of scientific literature. Despite being well established and reasonable, it is not free from problems, and alternative …