arXiv daily keyword digest · 2026-04-06

#1 Do Agent Societies Develop Intellectual Elites? The Hidden Power Laws of Collective Cognition in LLM Multi-Agent Systems

Score: 38.2

Matched keywords: agent, large language model, llm, multi-agent, reasoning

Categories: cs.MA, cs.AI

Compressed abstract: Large Language Model (LLM) multi-agent systems are increasingly deployed as interacting agent societies, yet scaling these systems often yields diminishing or unstable returns, the causes of which remain poorly understood. We present the first large-scale empirical study of coordination dynamics in LLM-based multi-agent systems, introducing an atomic event-level formulation that reconstructs reasoning as cascades of…

Open summary page · arXiv · PDF

#2 Single-Agent LLMs Outperform Multi-Agent Systems on Multi-Hop Reasoning Under Equal Thinking Token Budgets

Score: 30.6

Matched keywords: agent, llm, multi-agent, reasoning, token

Categories: cs.CL, cs.MA

Compressed abstract: Recent work reports strong performance from multi-agent LLM systems (MAS), but these gains are often confounded by increased test-time computation. When computation is normalized, single-agent systems (SAS) can match or outperform MAS, yet the theoretical basis and evaluation methodology behind this comparison remain unclear.

Open summary page · arXiv · PDF

#3 Fighting AI with AI: AI-Agent Augmented DNS Blocking of LLM Services during Student Evaluations

Score: 28.0

Matched keywords: agent, ai, large language models, llm

Categories: cs.NI, cs.LG

Compressed abstract: The transformative potential of large language models (LLMs) in education, such as improving accessibility and personalized learning, is being eclipsed by significant challenges. These challenges stem from concerns that LLMs undermine academic assessment by enabling bypassing of critical thinking, leading to increased cognitive offloading.

Open summary page · arXiv · PDF

#4 Train Yourself as an LLM: Exploring Effects of AI Literacy on Persuasion via Role-playing LLM Training

Score: 22.4

Matched keywords: ai, large language models, llm, rlhf

Categories: cs.CL

Compressed abstract: As large language models (LLMs) become increasingly persuasive, there is concern that people's opinions and decisions may be influenced across various contexts at scale. Prior mitigation (e.g., AI detectors and disclaimers) largely treats people as passive recipients of AI-generated information.

Open summary page · arXiv · PDF

#5 TokenDance: Scaling Multi-Agent LLM Serving via Collective KV Cache Sharing

Score: 27.8

Matched keywords: agent, llm, multi-agent, prompt

Categories: cs.DC

Compressed abstract: Multi-agent LLM applications organize execution in synchronized rounds where a central scheduler gathers outputs from all agents and redistributes the combined context. This All-Gather communication pattern creates massive KV Cache redundancy, because every agent's prompt contains the same shared output blocks, yet existing reuse methods fail to exploit it efficiently.

Open summary page · arXiv · PDF

#6 V2 X-QA: A Comprehensive Reasoning Dataset and Benchmark for Multimodal Large Language Models in Autonomous Driving Across Ego, Infrastructure, and Cooperative Views

Score: 21.6

Matched keywords: alignment, benchmark, large language models, multimodal, reasoning

Categories: cs.RO, cs.AI, cs.CV

Compressed abstract: Multimodal large language models (MLLMs) have shown strong potential for autonomous driving, yet existing benchmarks remain largely ego-centric and therefore cannot systematically assess model performance in infrastructure-centric and cooperative driving conditions. In this work, we introduce V2 X-QA, a real-world dataset and benchmark for evaluating MLLMs across vehicle-side, infrastructure-side, and cooperative vi…

Open summary page · arXiv · PDF

#7 Trivial Vocabulary Bans Improve LLM Reasoning More Than Deep Linguistic Constraints

Score: 17.2

Matched keywords: llm, prompt, reasoning

Categories: cs.CL, cs.AI

Compressed abstract: A previous study reported that E-Prime (English without the verb "to be") selectively altered reasoning in language models, with cross-model correlations suggesting a structural signature tied to which vocabulary was removed. I designed a replication with active controls to test the proposed mechanism: cognitive restructuring through specific vocabulary-cognition mappings.

Open summary page · arXiv · PDF

#8 LLM Reasoning with Process Rewards for Outcome-Guided Steps

Score: 21.0

Matched keywords: large language models, llm, prompt, reasoning

Categories: cs.LG, cs.AI

Compressed abstract: Mathematical reasoning in large language models has improved substantially with reinforcement learning using verifiable rewards, where final answers can be checked automatically and converted into reliable training signals. Most such pipelines optimize outcome correctness only, which yields sparse feedback for long, multi-step solutions and offers limited guidance on intermediate reasoning errors.

Open summary page · arXiv · PDF

#9 PolyJarvis: LLM Agent for Autonomous Polymer MD Simulations

Score: 24.2

Matched keywords: agent, large language model, llm

Categories: cs.CL, cond-mat.mtrl-sci

Compressed abstract: All-atom molecular dynamics (MD) simulations can predict polymer properties from molecular structure, yet their execution requires specialized expertise in force field selection, system construction, equilibration, and property extraction. We present PolyJarvis, an agent that couples a large language model (LLM) with the RadonPy simulation platform through Model Context Protocol (MCP) servers, enabling end-to-end po…

Open summary page · arXiv · PDF

#10 AIVV: Neuro-Symbolic LLM Agent-Integrated Verification and Validation for Trustworthy Autonomous Systems

Score: 26.2

Matched keywords: agent, deep learning, large language models, llm

Categories: cs.AI

Compressed abstract: Deep learning models excel at detecting anomaly patterns in normal data. However, they do not provide a direct solution for anomaly classification and scalability across diverse control systems, frequently failing to distinguish genuine faults from nuisance faults caused by noise or the control system's large transient response.

Open summary page · arXiv · PDF

2026-04-06 · arXiv Daily Keyword Digest (Top 10 of 540)

#1 Do Agent Societies Develop Intellectual Elites? The Hidden Power Laws of Collective Cognition in LLM Multi-Agent Systems

#2 Single-Agent LLMs Outperform Multi-Agent Systems on Multi-Hop Reasoning Under Equal Thinking Token Budgets

#3 Fighting AI with AI: AI-Agent Augmented DNS Blocking of LLM Services during Student Evaluations

#4 Train Yourself as an LLM: Exploring Effects of AI Literacy on Persuasion via Role-playing LLM Training

#5 TokenDance: Scaling Multi-Agent LLM Serving via Collective KV Cache Sharing

#6 V2 X-QA: A Comprehensive Reasoning Dataset and Benchmark for Multimodal Large Language Models in Autonomous Driving Across Ego, Infrastructure, and Cooperative Views

#7 Trivial Vocabulary Bans Improve LLM Reasoning More Than Deep Linguistic Constraints

#8 LLM Reasoning with Process Rewards for Outcome-Guided Steps

#9 PolyJarvis: LLM Agent for Autonomous Polymer MD Simulations

#10 AIVV: Neuro-Symbolic LLM Agent-Integrated Verification and Validation for Trustworthy Autonomous Systems