LLMEval

LLMEval

免费学术研究编程助手数据分析开发者工具

LLMEval is a research initiative from Fudan NLP Lab, providing rigorous and fair evaluation frameworks for large language models across multiple domains.

访问官网
LLMEval

我们的评价

AI 正在分析...

像谁在用

personas.forAnalystspersonas.forDeveloperspersonas.forResearchers🎓学生友好

核心功能

Comprehensive evaluation across 13+ academic disciplines
Adversarial hardening for robustness
Physician-validated medical benchmark
Contamination-resistant data curation
Automated LLM-as-a-judge process

适用场景

Evaluating the performance of large language modelsResearching logical reasoning capabilitiesAssessing fairness and robustness in AI models

适合人群

ResearchersAI developersMedical professionals

定价

免费

相似工具

相关推荐