arXiv daily keyword digest · 2026-03-30

#1 Doctorina MedBench: End-to-End Evaluation of Agent-Based Medical AI

Score: 17.2

Matched keywords: agent, ai, reasoning

Categories: cs.CL, cs.AI, cs.LG, cs.MA

#2 AMALIA Technical Report: A Fully Open Source Large Language Model for European Portuguese

Score: 16.4

Matched keywords: large language model, large language models, llm

Categories: cs.CL, cs.AI, cs.LG

#3 Rocks, Pebbles and Sand: Modality-aware Scheduling for Multimodal Large Language Model Inference

Score: 23.8

Matched keywords: large language model, large language models, llm, multimodal, token

Categories: cs.DC, cs.AI

#4 Selective Deficits in LLM Mental Self-Modeling in a Behavior-Based Test of Theory of Mind

Score: 13.2

Matched keywords: llm, reasoning

Categories: cs.LG, cs.AI, cs.CL

#5 ETA-VLA: Efficient Token Adaptation via Temporal Fusion and Intra-LLM Sparsification for Vision-Language-Action Models

Score: 20.0

Matched keywords: benchmark, large language models, llm, reasoning, token

Categories: cs.RO, cs.AI

#6 LLM Benchmark-User Need Misalignment for Climate Change

Score: 21.6

Matched keywords: ai, benchmark, large language models, llm, rag

Categories: cs.CL

#7 A Judge Agent Closes the Reliability Gap in AI-Generated Scientific Simulation

Score: 15.4

Matched keywords: agent, ai, benchmark, large language models

Categories: cs.SE, cs.LG

#8 A-SelecT: Automatic Timestep Selection for Diffusion Transformer Representation Learning

Score: 14.8

Matched keywords: artificial intelligence, diffusion, transformer

Categories: cs.CV, cs.AI, cs.LG, eess.IV

#9 PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning

Score: 10.2

Matched keywords: benchmark, reasoning

Categories: cs.CV, cs.AI, cs.CL, cs.LG

#10 Machine Learning Transferability for Malware Detection

Score: 10.2

Matched keywords: machine learning

Categories: cs.CR, cs.AI, cs.LG

2026-03-30 · arXiv Daily Keyword Digest (Top 10 of 487)
