2026-04-14 · arXiv Daily Keyword Digest (Top 10 of 1000)

Generated: 2026-04-15T08:02:20.642168+09:00

Target date (KST): 2026-04-14

Selection: picked 10 from 1000 papers published on the target date

Source: https://export.arxiv.org/api/query (`cat:cs.*`, sorted by submittedDate desc)

Selection logic: keyword-weight score + subject boost

#1 OOM-RL: Out-of-Money Reinforcement Learning Market-Driven Alignment for LLM-Based Multi-Agent Systems

Score: 29.9

Matched keywords: agent, ai, alignment, llm, multi-agent, rlhf

Categories: cs.AI, cs.SE, q-fin.TR

Compressed abstract: The alignment of Multi-Agent Systems (MAS) for autonomous software engineering is constrained by evaluator epistemic uncertainty. Current paradigms, such as Reinforcement Learning from Human Feedback (RLHF) and AI Feedback (RLAIF), frequently induce model sycophancy, while execution-based environments suffer from adversarial "Test Evasion" by unconstrained agents.

Open summary page · arXiv · PDF

#2 TrajOnco: a multi-agent framework for temporal reasoning over longitudinal EHR for multi-cancer early detection

Score: 24.7

Matched keywords: agent, agent framework, benchmark, large language model, llm, machine learning, multi-agent, reasoning

Categories: cs.AI, cs.MA

Compressed abstract: Accurate estimation of cancer risk from longitudinal electronic health records (EHRs) could support earlier detection and improved care, but modeling such complex patient trajectories remains challenging. We present TrajOnco, a training-free, multi-agent large language model (LLM) framework designed for scalable multi-cancer early detection.

Open summary page · arXiv · PDF

#3 Multi-ORFT: Stable Online Reinforcement Fine-Tuning for Multi-Agent Diffusion Planning in Cooperative Driving

Score: 29.6

Matched keywords: agent, benchmark, diffusion, fine-tuning, multi-agent, multimodal

Categories: cs.RO, cs.AI

Compressed abstract: Closed-loop cooperative driving requires planners that generate realistic multimodal multi-agent trajectories while improving safety and traffic efficiency. Existing diffusion planners can model multimodal behaviors from demonstrations, but they often exhibit weak scene consistency and remain poorly aligned with closed-loop objectives; meanwhile, stable online post-training in reactive multi-agent environments remai…

Open summary page · arXiv · PDF

#4 FM-Agent: Scaling Formal Methods to Large Systems via LLM-Based Hoare-Style Reasoning

Score: 27.2

Matched keywords: agent, llm, reasoning

Categories: cs.SE, cs.AI

Compressed abstract: LLM-assisted software development has become increasingly prevalent, and can generate large-scale systems, such as compilers. It becomes crucial to strengthen the correctness of the generated code.

Open summary page · arXiv · PDF

#5 From Translation to Superset: Benchmark-Driven Evolution of a Production AI Agent from Rust to Python

Score: 43.7

Matched keywords: agent, ai, ai agent, benchmark, coding agent, large language model, llm, multi-agent

Categories: cs.SE, cs.AI

Compressed abstract: Cross-language migration of large software systems is a persistent engineering challenge, particularly when the source codebase evolves rapidly. We present a methodology for LLM-assisted continuous code translation in which a large language model translates a production Rust codebase (648 K LOC, 65 crates) into Python (41 K LOC, 28 modules), with public agent benchmarks as the objective function driving iterative re…

Open summary page · arXiv · PDF

#6 EA-Agent: A Structured Multi-Step Reasoning Agent for Entity Alignment

Score: 34.0

Matched keywords: agent, alignment, benchmark, large language models, llm, reasoning

Categories: cs.IR

Compressed abstract: Entity alignment (EA) aims to identify entities across different knowledge graphs (KGs) that refer to the same real-world object and plays a critical role in knowledge fusion and integration. Traditional EA methods mainly rely on knowledge representation learning, but their performance is often limited under noisy or sparsely supervised scenarios.

Open summary page · arXiv · PDF

#7 A Simulation-Based Method for Testing Collaborative Learning Scaffolds Using LLM-Based Multi-Agent Systems

Score: 28.0

Matched keywords: agent, alignment, llm, multi-agent

Categories: cs.HC, cs.MA

Compressed abstract: Background: Traditional research on collaborative learning scaffolding is often time-consuming and resource-heavy, which hinders the rapid iteration and optimization of instructional strategies. LLM-based multi-agent systems have recently emerged as a powerful tool to simulate complex social interactions and provide a novel paradigm for educational research.

Open summary page · arXiv · PDF

#8 RCBSF: A Multi-Agent Framework for Automated Contract Revision via Stackelberg Game

Score: 24.9

Matched keywords: agent, agent framework, ai, benchmark, large language models, multi-agent, token

Categories: cs.CL

Compressed abstract: Despite the widespread adoption of Large Language Models (LLMs) in Legal AI, their utility for automated contract revision remains impeded by hallucinated safety and a lack of rigorous behavioral constraints. To address these limitations, we propose the Risk-Constrained Bilevel Stackelberg Framework (RCBSF), which formulates revision as a non-cooperative Stackelberg game.

Open summary page · arXiv · PDF

#9 Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs

Score: 35.2

Matched keywords: agent, agent framework, benchmark, large language models, llm, multi-agent

Categories: cs.AI

Compressed abstract: Post-training data plays a pivotal role in shaping the capabilities of Large Language Models (LLMs), yet datasets are often treated as isolated artifacts, overlooking the systemic connections that underlie their evolution. To disentangle these complex relationships, we introduce the concept of data lineage to the LLM ecosystem and propose an automated multi-agent framework to reconstruct the evolutionary graph of da…

Open summary page · arXiv · PDF

#10 Exploring Knowledge Conflicts for Faithful LLM Reasoning: Benchmark and Method

Score: 29.8

Matched keywords: benchmark, large language models, llm, rag, reasoning, retrieval-augmented

Categories: cs.CL, cs.AI

Compressed abstract: Large language models (LLMs) have achieved remarkable success across a wide range of applications especially when augmented by external knowledge through retrieval-augmented generation (RAG). Despite their widespread adoption, recent studies have shown that LLMs often struggle to perform faithful reasoning when conflicting knowledge is retrieved.

Open summary page · arXiv · PDF