ecosyste.ms
All services
Data
Packages
Repositories
Advisories
Tools
Dependency Parser
Dependency Resolver
SBOM Parser
License Parser
Digest
Archives
Diff
Summary
Indexes
Timeline
Commits
Issues
Sponsors
Docker
Open Collective
Dependabot
Applications
Funds
Dashboards
Experiments
OST
Papers
Awesome
Ruby
Open Source Science
Fund name
Search
Fields
Support
GitHub
API
Projects
Sort
Recently synced
Ranking
Science Score
Updated 6 months ago
xfinder
• Rank 6.6 • Science 51%
[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation
benchmark
cc-by-nc-nd-4
chatglm
dataset
evaluation
gpt
judge-model
key-answer-extraction
large-language-models
llm
llm-as-a-judge
llm-as-evaluator
lm-evaluation
open-compass
phi
qwen
regex
reliability
reliable-evaluation
xfinder