Ramp Labs Introduces Multi-Agent Memory Sharing Solution, Token Consumption Reduced by Up to 65%

By: theblockbeats.news|2026/04/11 14:02:26
0
Share
copy

BlockBeats News, April 11th, AI infrastructure company Ramp Labs released research results on "Latent Briefing", achieving efficient memory sharing among multi-agent systems through direct compression of large-scale model KV cache, significantly reducing Token consumption without sacrificing accuracy.


In mainstream multi-agent architectures, the Orchestrator decomposes tasks and repeatedly calls Worker models. As the inference chain extends, Token usage exponentially inflates. The core idea of Latent Briefing is to leverage the attention mechanism to identify the truly critical parts in the context, directly discard redundant information at the representation layer, rather than relying on the slow-speed LLM summary or the unstable RAG retrieval.


In the LongBench v2 benchmark test, this method performed remarkably: Worker model Token consumption decreased by 65%, the median Token savings for medium-length documents (32k to 100k) reached 49%, the overall accuracy improved by approximately 3 percentage points compared to the baseline, and the additional time for each compression was only about 1.7 seconds, achieving a speedup of about 20 times compared to the original algorithm.


The experiment used Claude Sonnet 4 as the Orchestrator, and Qwen3-14B as the Worker model, covering various document scenarios such as academic papers, legal documents, novels, and government reports. The research also found that the optimal compression threshold varies depending on task difficulty and document length—difficult tasks are suitable for aggressive compression to filter out speculative reasoning noise, while long documents are more suitable for mild compression to retain scattered key information.

-- Price

--

You may also like

Uniswap is trapped in an innovation dilemma

The various iterations of Uniswap are one of the sources of vitality in the DeFi market, but since 2023, Uniswap has not proposed any substantial innovations, instead adhering to traditional business explorations in application chains, Launchpads, etc., leading to a slump in token prices and market ...

What is the key to competition in crypto banking?

Digital banks, crypto cards, wallets, super apps, and DeFi protocols are all converging towards the same goal: to become the primary gateway for your savings, spending, earning, and transferring in the new era.

The flow of stablecoins and the spillover effects in the foreign exchange market

Research has found that an exogenous increase in net inflows of stablecoins significantly widens the price deviation between stablecoins and traditional foreign exchange, leads to depreciation of the local currency, and worsens the financing conditions for synthetic dollars (i.e., increases the doll...

After two years, Hong Kong's first batch of stablecoin licenses finally issued: HSBC, Standard Chartered make the cut

The regulated entity is set to launch a stablecoin in the first half of this year.

The person who helped TAO rise by 90% has now single-handedly crashed the price again today

As long as people are around, the story continues. But once they're gone, you may not even find a worthy opponent to play against.

3-Minute Guide to Participating in the SpaceX IPO on Bitget

Bitget IPO Prime brings a rare opportunity for global users to participate in world-class unicorn IPOs, allowing ordinary users to equally access the potential economic benefits of top-tier IPOs.

Popular coins

Latest Crypto News

Read more