2026-05-29 · arXiv Daily Keyword Digest (Top 10 of 902)

Generated: 2026-05-30T08:02:22.908628+09:00

Target date (KST): 2026-05-29

Selection: picked 10 from 902 papers published on the target date

Source: https://export.arxiv.org/api/query (`cat:cs.*`, sorted by submittedDate desc)

Selection logic: keyword-weight score + subject boost

#1 LLM-ALSO: LLM-Driven Adaptive Learning-Signal Optimization for Multi-Agent Reinforcement Learning

Score: 24.0

Matched keywords: agent, large language models, llm, multi-agent

Categories: cs.MA

Compressed abstract: Effective training-time guidance is central to multi-agent reinforcement learning (MARL), yet remains difficult in sparse-reward settings where weak supervision limits coordination and policy improvement, and existing methods often require substantial domain expertise or manual design effort. Large language models (LLMs) provide a promising alternative for flexible learning-signal design, yet existing LLM-based meth…

Open summary page · arXiv · PDF

#2 Improving Collaborative Storytelling with a Multi-Agent Framework Based on Large Language Models

Score: 44.7

Matched keywords: agent, agent framework, ai, ai agents, large language models, llm, multi-agent

Categories: cs.AI

Compressed abstract: The topic of Co-creation, i.e., AI agents interacting with humans to generate outputs (e.g., art), has gained significant attention recently. However, most studies focus on adult-human interactions in a digital setting.

Open summary page · arXiv · PDF

#3 AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

Score: 24.4

Matched keywords: agent, ai, ai agent, alignment

Categories: cs.AI, cs.CL, cs.CR, cs.CV, cs.LG

Compressed abstract: Modern open-world agents such as OpenClaw exhibit powerful cross-environment execution capabilities yet introduce broad new safety risk sources. Meanwhile, advanced frontier AI models drastically lower attack barriers, rendering current agent alignment frameworks inadequate for real-world deployment.

Open summary page · arXiv · PDF

#4 Unifying Temporal and Structural Credit Assignment in LLM-Based Multi-Agent Prompt Optimization

Score: 40.0

Matched keywords: agent, large language models, llm, multi-agent, prompt, reasoning

Categories: cs.MA, cs.AI

Compressed abstract: While Multi-Agent Systems (MAS) empower Large Language Models to tackle complex reasoning tasks through collaborative interaction, optimizing their dynamics remains a formidable challenge due to the discrete, non-differentiable nature of the computation graph and the sparsity of global supervisory signals. Existing black-box optimizers struggle to attribute trajectory-level failure to specific local components, resu…

Open summary page · arXiv · PDF

#5 Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation

Score: 37.8

Matched keywords: agent, harness, large language models, multi-agent, multimodal, tool use

Categories: cs.CL, cs.AI

Compressed abstract: Large Language Models (LLMs) have advanced autonomous agents from deep search, which retrieves concise factual answers, to deep research, which synthesizes scattered evidence into long-form reports. However, verifiable multimodal deep research remains challenging due to open-ended synthesis without deterministic ground truth and the need to interleave textual arguments with visual evidence.

Open summary page · arXiv · PDF

#6 Battery-Sim-Agent: Leveraging LLM-Agent for Inverse Battery Parameter Estimation

Score: 32.2

Matched keywords: agent, benchmark, large language model, llm, reasoning

Categories: cs.AI

Compressed abstract: Parameterizing high-fidelity "digital twins" of batteries is a critical yet challenging inverse problem that hinders the pace of battery innovation. Prevailing methods formulate this as a black-box optimization (BBO) task, employing algorithms that are sample-inefficient and blind to the underlying physics.

Open summary page · arXiv · PDF

#7 AgentSchool: An LLM-Powered Multi-Agent Simulation for Education

Score: 32.2

Matched keywords: agent, ai, llm, multi-agent, reasoning

Categories: cs.AI, cs.MA

Compressed abstract: Despite the rapid deployment of LLMs into classrooms, validating educational AI remains uniquely intractable: interventions act on developing learners whose cognitive and social trajectories are irreversibly shaped, while real-world trials are slow, ethically constrained, and institutionally locked. LLM-based educational simulators have emerged as a potential remedy, but many still collapse learning into persona-con…

Open summary page · arXiv · PDF

#8 Evolve as a Team: Collaborative Self-Evolution for LLM-based Multi-Agent Systems

Score: 27.2

Matched keywords: agent, llm, multi-agent

Categories: cs.MA, cs.AI

Compressed abstract: LLM-based multi-agent systems (MAS) have emerged as an effective paradigm for complex and long-horizon tasks. However, in real-world tasks, MAS often exhibit various failures during execution and such failures are difficult to eliminate during design.

Open summary page · arXiv · PDF

#9 CONCAT: Consensus- and Confidence-Driven Ad Hoc Teaming for Efficient LLM-Based Multi-Agent Systems

Score: 31.2

Matched keywords: agent, large language model, llm, multi-agent

Categories: cs.MA, cs.CL

Compressed abstract: Although large language model (LLM) based multi-agent systems (MAS) show their capability to solve complex tasks and achieve higher performance over single agent systems, they lead to huge computational overheads because of heavy communication between agents. Previous research has made efforts to train a sparse multi-agent graph or fine-tune a planner to orchestrate the workflow better.

Open summary page · arXiv · PDF

#10 SafeRx-Agent: A Knowledge-Grounded Multi-Agent Framework for Safe and Explainable Medication Recommendation

Score: 14.9

Matched keywords: agent, agent framework, code generation, llm, multi-agent

Categories: cs.CL, cs.AI

Compressed abstract: Medication recommendation predicts medications for patient visits, but existing methods still face two key challenges. At the model level, traditional drug recommendation methods only predict structured drug codes with limited evidence grounding, while LLM agents can use richer clinical context but may lack safety verification and traceability.

Open summary page · arXiv · PDF