arXiv daily keyword digest · 2026-03-27

#1 Is Mathematical Problem-Solving Expertise in Large Language Models Associated with Assessment Performance?

Score: 25.2

Matched keywords: agent, ai, benchmark, large language models, llm, reasoning

Categories: cs.AI

Open summary page · arXiv · PDF

#2 PICon: A Multi-Turn Interrogation Framework for Evaluating Persona Agent Consistency

Score: 15.6

Matched keywords: agent, alignment, large language model, llm

Categories: cs.CL

Open summary page · arXiv · PDF

#3 Measuring What Matters -- or What's Convenient?: Robustness of LLM-Based Scoring Systems to Construct-Irrelevant Factors

Score: 15.4

Matched keywords: large language models, llm

Categories: cs.CL, cs.AI, cs.CY

Open summary page · arXiv · PDF

#4 Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers

Score: 15.4

Matched keywords: large language models, llm

Categories: cs.CL, cs.AI, cs.CY, cs.DL, cs.LG

Open summary page · arXiv · PDF

#5 LanteRn: Latent Visual Structured Reasoning

Score: 15.0

Matched keywords: fine-tuning, multimodal, reasoning, transformer

Categories: cs.CV, cs.LG

Open summary page · arXiv · PDF

#6 Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verification

Score: 14.0

Matched keywords: benchmark, large language models, multimodal, reasoning

Categories: cs.CV, cs.AI

Open summary page · arXiv · PDF

#7 Neural Network Conversion of Machine Learning Pipelines

Score: 13.7

Matched keywords: deep learning, machine learning

Categories: cs.LG, cs.AI

Open summary page · arXiv · PDF

#8 Training the Knowledge Base through Evidence Distillation and Write-Back Enrichment

Score: 13.6

Matched keywords: llm, rag, retrieval-augmented

Categories: cs.AI, cs.CL, cs.IR

Open summary page · arXiv · PDF

#9 Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos

Score: 13.2

Matched keywords: ai, benchmark, large language models, multimodal

Categories: eess.IV, cs.CV, cs.HC

Open summary page · arXiv · PDF

#10 RefAlign: Representation Alignment for Reference-to-Video Generation

Score: 13.0

Matched keywords: alignment, benchmark, diffusion, foundation model, transformer

Categories: cs.CV

Open summary page · arXiv · PDF

2026-03-27 · arXiv Daily Keyword Digest (Top 10 of 89)

#1 Is Mathematical Problem-Solving Expertise in Large Language Models Associated with Assessment Performance?

#2 PICon: A Multi-Turn Interrogation Framework for Evaluating Persona Agent Consistency

#3 Measuring What Matters -- or What's Convenient?: Robustness of LLM-Based Scoring Systems to Construct-Irrelevant Factors

#4 Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers

#5 LanteRn: Latent Visual Structured Reasoning

#6 Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verification

#7 Neural Network Conversion of Machine Learning Pipelines

#8 Training the Knowledge Base through Evidence Distillation and Write-Back Enrichment

#9 Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos

#10 RefAlign: Representation Alignment for Reference-to-Video Generation