2026-04-22 · arXiv Daily Keyword Digest (Top 10 of 639)

Generated: 2026-04-23T08:02:20.839396+09:00

Target date (KST): 2026-04-22

Selection: picked 10 from 639 papers published on the target date

Source: https://export.arxiv.org/api/query (`cat:cs.*`, sorted by submittedDate desc)

Selection logic: keyword-weight score + subject boost

#1 A Multi-Agent Framework with Structured Reasoning and Reflective Refinement for Multimodal Empathetic Response Generation

Score: 35.4

Matched keywords: agent, agent framework, multi-agent, multimodal, reasoning

Categories: cs.CV

Compressed abstract: Multimodal empathetic response generation (MERG) aims to generate emotionally engaging and empathetic responses based on users' multimodal contexts. Existing approaches usually rely on an implicit one-pass generation paradigm from multimodal context to the final response, which overlooks two intrinsic characteristics of MERG: (1) Human perception of emotional cues is inherently structured rather than a direct mappin…

Open summary page · arXiv · PDF

#2 Owner-Harm: A Missing Threat Model for AI Agent Safety

Score: 34.2

Matched keywords: agent, ai, ai agent, alignment, benchmark, llm, prompt

Categories: cs.CR, cs.AI, cs.CL

Compressed abstract: Existing AI agent safety benchmarks focus on generic criminal harm (cybercrime, harassment, weapon synthesis), leaving a systematic blind spot for a distinct and commercially consequential threat category: agents harming their own deployers. Real-world incidents illustrate the gap: Slack AI credential exfiltration (Aug 2024), Microsoft 365 Copilot calendar-injection leaks (Jan 2024), and a Meta agent unauthorized fo…

Open summary page · arXiv · PDF

#3 Less Is More: Cognitive Load and the Single-Prompt Ceiling in LLM Mathematical Reasoning

Score: 18.8

Matched keywords: llm, prompt, reasoning

Categories: cs.CL, cs.LG

Compressed abstract: We present a systematic empirical study of prompt engineering for formal mathematical reasoning in the context of the SAIR Equational Theories Stage 1 competition. The task requires deciding whether one equational law implies another over all magmas -- a problem that is undecidable in general but decidable for FALSE via finite model search.

Open summary page · arXiv · PDF

#4 What Makes an LLM a Good Optimizer? A Trajectory Analysis of LLM-Guided Evolutionary Search

Score: 17.0

Matched keywords: large language models, llm

Categories: cs.CL, cs.NE

Compressed abstract: Recent work has demonstrated the promise of orchestrating large language models (LLMs) within evolutionary and agentic optimization systems. However, the mechanisms driving these optimization gains remain poorly understood.

Open summary page · arXiv · PDF

#5 Agent-GWO: Collaborative Agents for Dynamic Prompt Optimization in Large Language Models

Score: 34.6

Matched keywords: agent, large language models, llm, prompt, reasoning

Categories: cs.NE, cs.AI, cs.LG

Compressed abstract: Large Language Models (LLMs) have demonstrated strong capabilities in complex reasoning tasks, while recent prompting strategies such as Chain-of-Thought (CoT) have further elevated their performance in handling complex logical problems. Despite these advances, high-quality reasoning remains heavily reliant on manual static prompts and is sensitive to decoding configurations and task distributions, leading to perfor…

Open summary page · arXiv · PDF

#6 Discovering a Shared Logical Subspace: Steering LLM Logical Reasoning via Alignment of Natural-Language and Symbolic Views

Score: 20.2

Matched keywords: alignment, large language models, llm, reasoning

Categories: cs.CL

Compressed abstract: Large Language Models (LLMs) still struggle with multi-step logical reasoning. Existing approaches either purely refine the reasoning chain in natural language form or attach a symbolic solver as an external module.

Open summary page · arXiv · PDF

#7 An AI Agent Execution Environment to Safeguard User Data

Score: 36.8

Matched keywords: agent, ai, ai agent, ai agents, prompt

Categories: cs.CR, cs.AI, cs.OS

Compressed abstract: AI agents promise to serve as general-purpose personal assistants for their users, which requires them to have access to private user data (e.g., personal and financial information). This poses a serious risk to security and privacy.

Open summary page · arXiv · PDF

#8 Debating the Unspoken: Role-Anchored Multi-Agent Reasoning for Half-Truth Detection

Score: 23.2

Matched keywords: agent, multi-agent, reasoning

Categories: cs.CL

Compressed abstract: Half-truths, claims that are factually correct yet misleading due to omitted context, remain a blind spot for fact verification systems focused on explicit falsehoods. Addressing such omission-based manipulation requires reasoning not only about what is said, but also about what is left unsaid.

Open summary page · arXiv · PDF

#9 Four-Axis Decision Alignment for Long-Horizon Enterprise AI Agents

Score: 25.3

Matched keywords: agent, ai, ai agents, alignment, benchmark, prompt, reasoning

Categories: cs.AI

Compressed abstract: Long-horizon enterprise agents make high-stakes decisions (loan underwriting, claims adjudication, clinical review, prior authorization) under lossy memory, multi-step reasoning, and binding regulatory constraints. Current evaluation reports a single task-success scalar that conflates distinct failure modes and hides whether an agent is aligned with the standards its deployment environment requires.

Open summary page · arXiv · PDF

#10 Reasoning Structure Matters for Safety Alignment of Reasoning Models

Score: 12.2

Matched keywords: alignment, reasoning

Categories: cs.AI

Compressed abstract: Large reasoning models (LRMs) achieve strong performance on complex reasoning tasks but often generate harmful responses to malicious user queries. This paper investigates the underlying cause of these safety risks and shows that the issue lies in the reasoning structure itself.

Open summary page · arXiv · PDF