arXiv cs/recent · Keyword Digest (Top 10)

Generated: 2026-03-23T11:30:37.717914+00:00

Source: https://arxiv.org/list/cs/recent

Selection logic: keyword-weight score + subject boost

#1 Can Large Multimodal Models Inspect Buildings? A Hierarchical Benchmark for Structural Pathology Reasoning

Score: 20.6

Matched keywords: ai, ai agents, benchmark, foundation models, multimodal, reasoning

Categories: cs.CV

Open summary page · arXiv · PDF

#2 Reasoning Gets Harder for LLMs Inside A Dialogue

Score: 20.0

Matched keywords: benchmark, large language models, llm, reasoning

Categories: cs.CL

Open summary page · arXiv · PDF

#3 Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models

Score: 17.0

Matched keywords: alignment, large language models, llm, prompt

Categories: cs.CR, cs.AI

Open summary page · arXiv · PDF

#4 Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning

Score: 16.4

Matched keywords: alignment, fine-tuning, multimodal, reasoning

Categories: cs.CV, cs.AI

Open summary page · arXiv · PDF

#5 Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation

Score: 14.8

Matched keywords: llm, reasoning

Categories: cs.CL, cs.AI, cs.LG

Open summary page · arXiv · PDF

#6 IndoorR2 X: Indoor Robot-to-Everything Coordination with LLM-Driven Planning

Score: 14.0

Matched keywords: benchmark, large language model, llm

Categories: cs.RO, cs.MA

Open summary page · arXiv · PDF

#7 Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models

Score: 13.8

Matched keywords: large language models, token

Categories: cs.CL, cs.AI, cs.LG

Open summary page · arXiv · PDF

#8 LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation

Score: 13.4

Matched keywords: alignment, benchmark, diffusion, large language models, multimodal

Categories: cs.CV, cs.AI

Open summary page · arXiv · PDF

#9 VideoSeek: Long-Horizon Video Agent with Tool-Guided Seeking

Score: 11.6

Matched keywords: agent, reasoning

Categories: cs.CV, cs.AI, cs.CL

Open summary page · arXiv · PDF

#10 CoVR-R:Reason-Aware Composed Video Retrieval

Score: 11.2

Matched keywords: benchmark, multimodal, reasoning

Categories: cs.CV

Open summary page · arXiv · PDF