June 25, 2026 Research Brief

Agent control gets explicit.

Today’s strongest papers replace prompt-only agent design with governed memory, formal verification, and system-level security evaluation, while more realistic benchmarks expose where long-horizon agents still break.

Why it matters Memory as the new agent attack and failure surface

Persistent memory is no longer just a convenience layer; it is a control plane for future actions. Several papers show that failures arise at write-time authority assignment, retrieval-time surfacing, cross-agent propagation, and experience consolidation.

What changed Verification, diagnosis, and control over long-horizon reasoning

As traces get longer and tasks more consequential, post hoc answer checking is too coarse. The strongest systems now verify steps, localize decisive faults, or maintain explicit beliefs over correctness before acting.

What to watch Security evaluation is becoming pipeline-level and system-level

Security failures increasingly emerge from end-to-end pipelines rather than isolated prompts. Today’s papers show that evaluation must include retrieval, memory, tool execution, multimodal judges, and sandbox boundaries.

Read the brief 中文版本

Recent Briefs

Each issue should tell you why the day is worth reopening, not only when it was published.

Jun 25

Agent control gets explicit. Today’s strongest papers replace prompt-only agent design with governed memory, formal verification, and system-level security evaluation, while more realistic benchmarks expose where long-horizon agents still break.

Jun 24

Agent safety gets operational. Today’s strongest papers replace answer-only evaluation and static guardrails with verifiable agent checks, runtime authorization, and privacy-aware controls built for real enterprise environments.

Jun 23

Evaluation becomes infrastructure. Today’s papers argue that progress claims increasingly hinge on benchmark repair, process-level verification, and deployment-interface audits, while agent gains come more from structured scaffolds than larger models alone.

Jun 22

Evaluation goes process-first. Today’s strongest papers replace outcome-only scoring with verifiable process checks, while agent training and inference methods add finer-grained feedback for safer, more reliable systems.

Jun 21

Evaluation turns lifecycle-aware. Today’s papers push AI assessment into realistic workflows while exposing brittle safety, grounding, and training assumptions that cleaner benchmarks often miss.

Jun 20

Agent safety gets operational. Today’s strongest papers replace static agent scores with deployment-predictive evaluation and runtime control, while exposing safety failures rooted in tool privilege, orchestration, and execution boundaries.

Jun 19

Agent safety moves structural. Today’s strongest papers argue that prompt-only defenses are brittle: safer agents come from typed interfaces, privacy-aware benchmarks, and finer-grained training signals that constrain what models can access or emit.

Jun 18

Agent evaluation grows teeth. Today’s papers push agent research away from single-score demos toward process-aware evaluation, transactional runtimes, and realistic security tests that expose cross-step failures.

Jun 17

Agent security moves down-stack. Today’s strongest papers show agent failures increasingly come from infrastructure, process, and reward channels, pushing evaluation and defenses beyond prompt-level alignment alone.

Jun 16

Auditable agents take over. Today’s strongest papers favor process-aware verification, black-box auditing, and protocol-level agent design over monolithic accuracy claims, while multiple papers warn that current evaluation practice is too brittle to trust at face value.

Jun 15

Agent reliability gets audited. Today’s strongest papers favor evidence-bearing, executable agent workflows over answer-only performance, while puncturing default multi-agent assumptions and exposing new modular security risks.

Jun 14

Evaluation gets operational. Today’s papers push AI assessment and safety toward deployment-shaped tests, explicit control layers, and operational security for agents, RAG, and long-form oversight.

Jun 13

Agent safety moves upstream. Today’s papers argue that reliable agents depend less on bigger models than on containment, memory control, harder evaluation, and failure-targeted training loops.

Jun 12

Agent safety moves runtime. Today’s strongest papers argue that safer AI depends less on static alignment alone and more on process-aware evaluation, runtime controls, and finer-grained supervision for agents.

Jun 11

Agent security turns stateful. Today’s strongest papers show agent risk moving into memory, execution state, and post-training drift, while executable benchmarks and internal monitors expose failures that output-only checks miss.

Jun 10

Agent safety gets systemic. Today’s strongest papers argue that reliable agents need infrastructure-level controls, calibrated oversight, and harder long-horizon evaluation because weak judges, brittle verifiers, and prompt-only defenses fail predictably.

Jun 9

Reliability shifts to control. Today’s strongest papers treat reliability as a controllable systems property: richer evaluation, explicit verification layers, and security defenses that break attacker feedback loops rather than only filtering outputs.

Jun 8

Agent control gets concrete. Today’s strongest papers push agents toward governed memory, consequence-aware control, and more realistic evaluation, while exposing new attack surfaces in steering, context, and workflow artifacts.

Jun 7

Agent evaluation turns adversarial. Today’s strongest papers show that agent progress depends less on raw task wins and more on cheating-resistant evaluation, runtime defenses, and structured process signals for tool use and evidence.

Jun 6

Agent safety moves outward. Today’s strongest papers argue that agent safety now lives in interfaces and workflows: tool surfaces, memory gates, offline evaluation, and human oversight all expose failures hidden by clean benchmarks.

Jun 5

Agent safety turns stateful. Today’s strongest papers show agent risk and evaluation moving from single prompts and final answers toward persistent state, process tracing, and structured control surfaces.

Jun 4

Agent safety moves runtime. Today’s strongest papers shift AI safety from model-only alignment to runtime governance, realistic auditing, and trajectory-aware defenses as agent attack surfaces widen across the lifecycle.

Jun 3

Agent safety moves runtime. Today’s strongest papers argue that agent safety is now a systems problem: execution-boundary controls, process-aware evaluation, and supply-chain defenses matter more than prompt-only safeguards.

Jun 2

Agent control gets explicit. Today’s strongest papers replace monolithic agents with governed pipelines, adaptive context handling, and harsher evaluation that rewards traceability, calibration, and deployable safeguards over raw scores.

Jun 1

Agent reliability gets operational. Today’s papers push agents and safety systems toward deployment reality: process-aware evaluation, verifier-first scaffolds, and localized multimodal safety tests expose failures static benchmarks miss.

May 31

Agent benchmarks meet reality. Today’s strongest papers show agent capability claims are highly scaffold-dependent, while security and reliability increasingly hinge on pre-execution controls at routing, retrieval, and tool boundaries.

May 30

Agent safety moves runtime. Today’s strongest papers shift safety from end-score evaluation to runtime auditing and enforcement, while showing retrieval, memory, and judging pipelines create new structural failure modes.

May 29

Safety moves into systems. Today’s strongest papers show AI safety failures increasingly emerge from state, tools, memory, and evaluation design, pushing defenses toward structural controls and process-aware diagnostics.

May 28

Agent safety moves inline. Today’s strongest papers argue that agent safety now depends on runtime control, provenance, and long-horizon evaluation, because models often detect risk without changing unsafe behavior.

May 27

Agent safety turns runtime. Today’s strongest papers argue that deployment-grade agent safety comes from runtime control, long-horizon evaluation, and structure-aware training rather than prompt filters or static benchmarks alone.

May 26

Agent safety moves runtime. Today’s strongest papers argue that agent security and reliability depend less on detecting bad inputs than on controlling provenance, authority, and action at execution time.

May 25

Agent reliability gets structured. Today’s strongest papers improve agents and high-stakes AI systems by adding explicit control, state tracking, and evidence checks, while new benchmarks and attacks expose hidden deployment failures.

May 24

Evaluation turns adaptive. Today’s strongest papers push AI evaluation and control beyond static scores toward adaptive audits, explicit intermediate state, and deployment-minded hardening for agents, retrieval, and model supply chains.

May 23

Agent safety gets stateful. Today’s strongest papers show agent reliability now depends less on bigger models than on realistic security evaluation, runtime scaffolds, and explicit control of state, logs, and interfaces.

May 22

Agent safety moves runtime. Today’s strongest papers shift safety from prompt-level behavior to runtime audits, long-horizon reward-hacking evaluation, and system-level controls around tools, deployment, and optimization.

May 21

Evaluation gets executable. Today’s strongest papers replace heuristic scores with verifiable environments, uncertainty-aware auditing, and system-level safeguards, while new security results show agent risk is spreading across retrieval, multimodality, and reasoning workflows.

May 18

Agent safety shifts outward. Today’s papers argue that reliable AI depends less on bigger models than on external verification, auditable control layers, and broader threat models that include hidden attack channels and workflow failures.

May 17

Agent evaluation gets harsher. Today’s papers show a shift from static benchmark wins to adaptive attacks, process-aware reliability metrics, and realistic tool environments that expose large autonomy and safety gaps.

May 16

Agent safety moves downstream. Today’s strongest papers shift safety from output filtering to runtime structure, trace-level auditing, and post-deployment checks, with quantization and memory emerging as major failure surfaces.

May 15

Agent safety moves outward. Today’s strongest papers argue that reliable agents need external control layers, process-aware evaluation, and multi-turn threat models because prompt-level alignment breaks under history, peers, and persistent state.

May 13

Agent safety turns operational. Today’s strongest papers push safety from model claims to runtime evidence: real-environment jailbreak tests, formal guardrail guarantees, and benchmark audits that expose unsupported scores.

May 12

AI reliability gets real. Today’s strongest papers move beyond benchmark wins toward deployment evidence: harsher evaluation, validated agent workflows, and targeted robustness.

May 11

Daily AI Paper Report (2026-05-11) Chinese version: [中文]

May 10

Daily AI Paper Report (2026-05-10) Chinese version: [中文]

May 9

Daily AI Paper Report (2026-05-09) Chinese version: [中文]

May 8

Daily AI Paper Report (2026-05-08) Chinese version: [中文]

May 6

Daily AI Paper Report (2026-05-06) Chinese version: [中文]

May 5

Daily AI Paper Report (2026-05-05) Chinese version: [中文]

May 4

Daily AI Paper Report (2026-05-04) Chinese version: [中文]

May 3

Daily AI Paper Report (2026-05-03) Chinese version: [中文]

May 1

Daily AI Paper Report (2026-05-01) Chinese version: [中文]

Apr 30

Daily AI Paper Report (2026-04-30) Chinese version: [中文]

Apr 29

Daily AI Paper Report (2026-04-29) Chinese version: [中文]

Apr 28

Daily AI Paper Report (2026-04-28) Chinese version: [中文]

Apr 27

Daily AI Paper Report (2026-04-27) Chinese version: [中文]

Apr 26

Daily AI Paper Report (2026-04-26) Chinese version: [中文]

Apr 25

Daily AI Paper Report (2026-04-25) Chinese version: [中文]

Apr 24

Daily AI Paper Report (2026-04-24) Chinese version: [中文]

Apr 23

Daily AI Paper Report (2026-04-23) Chinese version: [中文]

Apr 22

Daily AI Paper Report (2026-04-22) Chinese version: [中文]

Apr 21

Daily AI Paper Report (2026-04-21) Chinese version: [中文]

Apr 20

Daily AI Paper Report (2026-04-20) Chinese version: [中文]

Apr 19

Daily AI Paper Report (2026-04-19) Chinese version: [中文]

Apr 18

Daily AI Paper Report (2026-04-18) Chinese version: [中文]

Apr 17

Daily AI Paper Report (2026-04-17) Chinese version: [中文]

Apr 16

Daily AI Paper Report (2026-04-16) Chinese version: [中文]

Apr 15

Daily AI Paper Report (2026-04-15) Chinese version: [中文]

Apr 14

Daily AI Paper Report (2026-04-14) Chinese version: [中文]

Apr 13

Daily AI Paper Report (2026-04-13) Chinese version: [中文]

Apr 12

Daily AI Paper Report (2026-04-12) Chinese version: [中文]

Apr 11

Daily AI Paper Report (2026-04-11) Chinese version: [中文]

Apr 10

Daily AI Paper Report (2026-04-10) Chinese version: [中文]

Apr 9

Daily AI Paper Report (2026-04-09) Chinese version: [中文]

Apr 8

Daily AI Paper Report (2026-04-08) Chinese version: [中文]

Apr 7

Daily AI Paper Report (2026-04-07) Chinese version: [中文]

Apr 6

Daily AI Paper Report (2026-04-06) Chinese version: [中文]

Apr 5

Daily AI Paper Report (2026-04-05) Chinese version: [中文]

Apr 4

Daily AI Paper Report (2026-04-04) Chinese version: [中文]

Apr 3

Daily AI Paper Report (2026-04-03) Chinese version: [中文]

Apr 2

Daily AI Paper Report (2026-04-02) Chinese version: [中文]

Apr 1

Daily AI Paper Report (2026-04-01) Chinese version: [中文]

Mar 31

Daily AI Paper Report (2026-03-31) Chinese version: [中文]

Mar 30

Daily AI Paper Report (2026-03-30) Chinese version: [中文]

Mar 29

Daily AI Paper Report (2026-03-29) Chinese version: [中文]

Mar 27

Daily AI Paper Report (2026-03-27) Chinese version: [中文]

Mar 26

Daily AI Paper Report (2026-03-26) Chinese version: [中文]

Mar 25

Daily AI Paper Report (2026-03-25) Chinese version: [中文]

Mar 24

Daily AI Paper Report (2026-03-24) Chinese version: [中文]

Mar 23

Daily AI Paper Report (2026-03-23) Chinese version: [中文]

Mar 22

Daily AI Paper Report (2026-03-22) Chinese version: [中文]

Mar 21

Daily AI Paper Report (2026-03-21) Chinese version: [中文]

Mar 20

Daily AI Paper Report (2026-03-20) Chinese version: [中文]

Mar 19

Daily AI Paper Report (2026-03-19) Chinese version: [中文]

Mar 18

Daily AI Paper Report (2026-03-18) Chinese version: [中文]

Mar 17

Daily AI Paper Report (2026-03-17) Chinese version: [中文]

Mar 16

Daily AI Paper Report (2026-03-16) Chinese version: [中文]

Mar 15

Daily AI Paper Report (2026-03-15) Chinese version: [中文]

Mar 14

Daily AI Paper Report (2026-03-14) Chinese version: [中文]

Mar 13

Daily AI Paper Report (2026-03-13) Chinese version: [中文]

Mar 12

Daily AI Paper Report (2026-03-12) Chinese version: [中文]

Mar 11

Daily AI Paper Report (2026-03-11) Chinese version: [中文]

Mar 10

Daily AI Paper Report (2026-03-10) Chinese version: [中文]

Mar 9

Daily AI Paper Report (2026-03-09) Chinese version: [中文]

Mar 8

Daily AI Paper Report (2026-03-08) Chinese version: [中文]

Mar 7

Daily AI Paper Report (2026-03-07) Chinese version: [中文]

Mar 6

Daily AI Paper Report (2026-03-06) Chinese version: [中文]

Mar 5

Daily AI Paper Report (2026-03-05) Chinese version: [中文]

Mar 4

Daily AI Paper Report (2026-03-04) Chinese version: [中文]

Mar 3

Daily AI Paper Report (2026-03-03) Chinese version: [中文]

Mar 2

Daily AI Paper Report (2026-03-02) Chinese version: [中文]

Mar 1

Daily AI Paper Report (2026-03-01) Chinese version: [中文]

Feb 28

Daily AI Paper Report (2026-02-28) Chinese version: [中文]

Feb 27

Daily AI Paper Report (2026-02-27) Chinese version: [中文]

Feb 26

Daily AI Paper Report (2026-02-26) Chinese version: [中文]

Feb 25

Daily AI Paper Report (2026-02-25) Chinese version: /paper-news/2026-02-25/zh/

Feb 10

Daily AI Paper Report (2026-02-10) Chinese version: /paper-news/2026-02-10/zh/

Feb 9

Daily AI Paper Report (2026-02-09) Daily AI & AI Safety Paper Report 2026-02-09