#1 Is Mathematical Problem-Solving Expertise in Large Language Models Associated with Assessment Performance?
Score: 25.2
Matched keywords: agent, ai, benchmark, large language models, llm, reasoning
Categories: cs.AI
Score: 25.2
Matched keywords: agent, ai, benchmark, large language models, llm, reasoning
Categories: cs.AI
Score: 15.6
Matched keywords: agent, alignment, large language model, llm
Categories: cs.CL
Score: 15.4
Matched keywords: large language models, llm
Categories: cs.CL, cs.AI, cs.CY
Score: 15.4
Matched keywords: large language models, llm
Categories: cs.CL, cs.AI, cs.CY, cs.DL, cs.LG
Score: 15.0
Matched keywords: fine-tuning, multimodal, reasoning, transformer
Categories: cs.CV, cs.LG
Score: 14.0
Matched keywords: benchmark, large language models, multimodal, reasoning
Categories: cs.CV, cs.AI
Score: 13.7
Matched keywords: deep learning, machine learning
Categories: cs.LG, cs.AI
Score: 13.6
Matched keywords: llm, rag, retrieval-augmented
Categories: cs.AI, cs.CL, cs.IR
Score: 13.2
Matched keywords: ai, benchmark, large language models, multimodal
Categories: eess.IV, cs.CV, cs.HC
Score: 13.0
Matched keywords: alignment, benchmark, diffusion, foundation model, transformer
Categories: cs.CV