On June 25, 2026, a preprint by Haoliang Han introduced EVAF, a gated LoRA-based consolidation mechanism that writes long-term goals into a small parametric store so agents retain behavior even after context is cleared. On June 28, 24 AI’s "Today in AI" digest spotlighted the work as a key advance in persistent memory for long-running language agents.
This article aggregates reporting from 2 news sources. The TL;DR is AI-generated from original reporting. Race to AGI's analysis provides editorial context on implications for AGI development.
Most agent work today treats memory as retrieval: vector databases, RAG, and notes the model can look up when it forgets. EVAF flips the lens to memory depth—writing stable, goal‑conditioned tendencies into parameters so the agent continues to behave consistently even when the working context is wiped. In controlled tests, EVAF agents needed only a handful of LoRA “writes” per 200 events yet maintained far stronger goal persistence than pure retrieval systems. That’s an important clue about how to build agents that don’t flail or drift when asked to run for hours or days. ([arxiv.org](https://arxiv.org/abs/2606.26806?utm_source=openai))
From a Race to AGI perspective, mechanisms like EVAF are the scaffolding that will let future systems act more like autonomous colleagues than chatbots. If an agent can internalize abstract goals (“keep this service secure”, “grow this product line”) into its weights, rather than just juggling tokens, it becomes easier to scale it across sessions, environments and tools without constant human babysitting. That kind of durable, parametric memory also sharpens both the promise and the risk: useful for persistent helpers, but worrying if misaligned goals also get baked in. The work’s careful dissection of selection versus actuation suggests we can tune how much an agent “internalizes” vs. simply recalls, which will be central to safe agent design.
For now, EVAF is a research prototype. But expect its core ideas—goal‑conditioned consolidation, loop‑drift stress tests, and parametric memory depth—to show up in production agent frameworks over the next year or two.

