linmj-judy / sciassess Goto Github PK
View Code? Open in Web Editor NEWThis project forked from sci-assess/sciassess
SciAssess is a comprehensive benchmark for evaluating Large Language Models' proficiency in scientific literature analysis across various fields, focusing on memorization, comprehension, and analysis.
Home Page: https://arxiv.org/abs/2403.01976