2026-05-04 · arXiv Daily Keyword Digest (Top 10 of 461)

Generated: 2026-05-05T08:02:24.457674+09:00

Target date (KST): 2026-05-04

Selection: picked 10 from 461 papers published on the target date

Source: https://export.arxiv.org/api/query (`cat:cs.*`, sorted by submittedDate desc)
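As a rough illustration of the source line above, the request could be assembled as follows. This is a minimal sketch, not the digest's actual fetch code: the function name `build_query_url` and the `start` / `max_results` paging values are assumptions, while `search_query`, `sortBy`, `sortOrder`, `start`, and `max_results` are query parameters of the arXiv API.

```python
from urllib.parse import urlencode, urlparse, parse_qs

BASE_URL = "https://export.arxiv.org/api/query"


def build_query_url(start=0, max_results=100):
    """Build an arXiv API URL for all cs.* papers, newest submissions first.

    The paging defaults are assumptions made for this sketch.
    """
    params = {
        "search_query": "cat:cs.*",   # category filter quoted in the digest
        "sortBy": "submittedDate",    # sort key named in the digest
        "sortOrder": "descending",    # "desc" in the digest
        "start": start,
        "max_results": max_results,
    }
    # urlencode percent-escapes ":" and "*" in the category filter.
    return f"{BASE_URL}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_query_url())
```

The sketch only builds the URL; fetching and parsing the Atom response, and filtering entries to the target date in KST, are left out.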

Selection logic: keyword-weight score + subject boost
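One way to read "keyword-weight score + subject boost" is sketched below. The weights, the boost table, the substring matching rule, and the name `score_paper` are all hypothetical, since the digest does not publish its actual values or matching rules.

```python
# Hypothetical weights: the digest does not state its real values.
KEYWORD_WEIGHTS = {"llm": 5.0, "agent": 4.0, "reasoning": 3.0}
# Hypothetical per-category boost.
SUBJECT_BOOST = {"cs.AI": 1.5, "cs.LG": 1.0}


def score_paper(text, categories):
    """Return (score, matched_keywords) for one paper.

    score = sum of weights of keywords found in the text
            + sum of boosts for the paper's categories.
    """
    lowered = text.lower()
    # Simple substring matching; a real matcher might use word boundaries.
    matched = sorted(k for k in KEYWORD_WEIGHTS if k in lowered)
    keyword_score = sum(KEYWORD_WEIGHTS[k] for k in matched)
    boost = sum(SUBJECT_BOOST.get(c, 0.0) for c in categories)
    return keyword_score + boost, matched


if __name__ == "__main__":
    print(score_paper("LLM agent for reasoning", ["cs.AI"]))
```

With these made-up tables, a text matching all three keywords in category cs.AI scores 12.0 from keywords plus 1.5 from the boost. How the digest then orders the selected papers is not stated, so the sketch stops at scoring.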

#1 SAGA: Workflow-Atomic Scheduling for AI Agent Inference on GPU Clusters

Score: 33.7

Matched keywords: agent, agent workflow, ai, ai agent, ai agents, llm

Categories: cs.DC, cs.AI, cs.LG, cs.OS

Compressed abstract: AI agents execute tens to hundreds of chained LLM calls per task, yet GPU schedulers treat each call as independent, discarding gigabytes of intermediate state between steps and inflating end-to-end latency by 3-8x. We argue that this request-level abstraction is fundamentally mismatched to compound AI workloads, and propose a shift to program-level scheduling: treating the entire agent workflow (not individual inf…

Open summary page · arXiv · PDF

#2 Self-Adaptive Multi-Agent LLM-Based Security Pattern Selection for IoT Systems

Score: 33.5

Matched keywords: agent, large language model, llm, multi-agent, reasoning

Categories: cs.CR

Compressed abstract: The adoption of Internet of Things (IoT) systems at the network edge of smart architectures is increasing rapidly, intensifying the need for security mechanisms that are both adaptive and resource-efficient. In such environments, runtime defence mechanisms are no longer limited to detection alone but become a resource-constrained task of selecting mitigation actions.

Open summary page · arXiv · PDF

#3 AdaMeZO: Adam-style Zeroth-Order Optimizer for LLM Fine-tuning Without Maintaining the Moments

Score: 16.2

Matched keywords: fine-tuning, llm

Categories: cs.LG, cs.AI

Compressed abstract: Fine-tuning LLMs is necessary for various dedicated downstream tasks, but classic backpropagation-based fine-tuning methods require substantial GPU memory. To this end, a recent work, MeZO, which relies solely on forward passes to fine-tune LLMs, significantly reduces GPU requirements at the cost of slower convergence due to its indifference to loss landscapes.

Open summary page · arXiv · PDF
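The abstract for #3 mentions fine-tuning "solely on forward passes". The general idea behind that family of methods is a two-point zeroth-order (SPSA-style) gradient estimate, sketched below on a toy quadratic. This is not the AdaMeZO algorithm and not MeZO's implementation; the function names, learning rate, step count, and toy loss are assumptions for illustration.

```python
import random


def zo_gradient_estimate(loss, theta, eps=1e-3, seed=0):
    """Estimate the gradient of `loss` at `theta` using two loss evaluations.

    g = (L(theta + eps*z) - L(theta - eps*z)) / (2*eps) * z, with z ~ N(0, I).
    No backpropagation is needed, only forward evaluations of the loss.
    """
    rng = random.Random(seed)
    z = [rng.gauss(0.0, 1.0) for _ in theta]  # random perturbation direction
    plus = loss([t + eps * zi for t, zi in zip(theta, z)])
    minus = loss([t - eps * zi for t, zi in zip(theta, z)])
    scale = (plus - minus) / (2.0 * eps)      # directional derivative along z
    return [scale * zi for zi in z]


def zo_sgd(loss, theta, lr=0.02, steps=500, eps=1e-3):
    """Plain SGD driven by the zeroth-order estimate (hypothetical settings)."""
    for step in range(steps):
        grad = zo_gradient_estimate(loss, theta, eps=eps, seed=step)
        theta = [t - lr * g for t, g in zip(theta, grad)]
    return theta


def toy_loss(theta):
    """Toy quadratic with its minimum at (1, 1); stands in for a model loss."""
    return sum((t - 1.0) ** 2 for t in theta)


if __name__ == "__main__":
    final = zo_sgd(toy_loss, [4.0, -2.0])
    print(final, toy_loss(final))
```

The sketch keeps the perturbation `z` in a list for readability. A memory-saving variant could regenerate `z` from the seed instead of storing it.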

#4 LIMSSR: LLM-Driven Sequence-to-Score Reasoning under Training-Time Incomplete Multimodal Observations

Score: 28.2

Matched keywords: large language models, llm, multimodal, prompt, reasoning

Categories: cs.CV

Compressed abstract: Real-world multimodal learning is often hindered by missing modalities. While Incomplete Multimodal Learning (IML) has gained traction, existing methods typically rely on the unrealistic assumption of full-modal availability during training to provide reconstruction supervision or cross-modal priors.

Open summary page · arXiv · PDF

#5 ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning

Score: 26.4

Matched keywords: agent, alignment, large language models, llm, reasoning, token

Categories: cs.LG, cs.CL

Compressed abstract: Reinforcement Learning with Verifiable Rewards (RLVR) enhances reasoning of Large Language Models (LLMs) but usually exhibits limited generation diversity due to the over-incentivization of positive rewards. Although methods like Negative Sample Reinforcement (NSR) mitigate this issue by upweighting penalty from negative samples, they may suppress the semantic distributions shared between positive and negative respo…

Open summary page · arXiv · PDF

#6 SiriusHelper: An LLM Agent-Based Operations Assistant for Big Data Platforms

Score: 16.0

Matched keywords: agent, llm, rag

Categories: cs.DB, cs.AI, cs.MA

Compressed abstract: Big data platforms are widely used in modern enterprises, and an in-production intelligent assistant is increasingly important to help users quickly find actionable guidance and reduce operational burden. While recent LLM+RAG assistants provide a natural interface, they face practical challenges in real deployments: limited scenario coverage across both general consultation and domain-specific troubleshooting workfl…

Open summary page · arXiv · PDF

#7 LLM-Emu: Native Runtime Emulation of LLM Inference via Profile-Driven Sampling

Score: 11.0

Matched keywords: llm

Categories: cs.DC

Compressed abstract: Realistic evaluation of LLM serving systems requires online workloads, dynamic arrivals, queueing, and the serving engine's local scheduling for execution batching, but running such experiments on GPUs is expensive. Existing simulators reduce this cost, but often operate offline or in time-warped mode, re-implement serving-engine schedulers, or require accurate operator/kernel-level latency models.

Open summary page · arXiv · PDF

#8 Exploring LLM biases to manipulate AI search overview

Score: 19.9

Matched keywords: ai, large language models, llm

Categories: cs.IR, cs.AI, cs.CL

Compressed abstract: Modern large language models (LLMs) are used in many business applications, in particular in web search systems and applications that generate overviews of search results (LLM Overview systems). Such systems use an LLM to select the most relevant sources from search results and generate an answer to the user's query.

Open summary page · arXiv · PDF

#9 Improving LLM Code Generation via Requirement-Aware Curriculum Reinforcement Learning

Score: 23.2

Matched keywords: code generation, large language models, llm

Categories: cs.SE, cs.AI

Compressed abstract: Code generation, which aims to automatically generate source code from given programming requirements, has the potential to substantially improve software development efficiency. With the rapid advancement of large language models (LLMs), LLM-based code generation has attracted widespread attention from both academia and industry.

Open summary page · arXiv · PDF

#10 Evaluating the Architectural Reasoning Capabilities of LLM Provers via the Obfuscated Natural Number Game

Score: 20.5

Matched keywords: ai, benchmark, large language models, llm, reasoning

Categories: cs.LG

Compressed abstract: While Large Language Models have achieved notable success on formal mathematics benchmarks such as MiniF2F, it remains unclear whether these results stem from genuine logical reasoning or semantic pattern matching against pre-training data. This paper identifies Architectural Reasoning: the ability to synthesize formal proofs using exclusively local axioms and definitions within an alien math domain, as the necessa…

Open summary page · arXiv · PDF