Pooling Engram Conditional Memory in Large Language Models using CXL

Ma, Ruiyang; Ma, Teng; Su, Zhiyuan; Zha, Hantian; Zhao, Xinpeng; Shang, Xuchun; Yi, Xingrui; Liu, Zheng; Cao, Zhu; Wu, An; Dou, Zhichong; Liu, Ziqian; Kuang, Daikang; Luo, Guojie

doi:10.5281/zenodo.18883519

Published March 6, 2026 | Version v1

Working paper Open

Pooling Engram Conditional Memory in Large Language Models using CXL

Engram conditional memory has emerged as a promising component for LLMs by decoupling static knowledge lookup from dynamic computation. Since Engram exhibits sparse access patterns and supports prefetching, its massive embedding tables are well-suited for offloading to lower-tier memory. In this paper, we propose using Compute Express Link (CXL) memory pool for Engram storage. Compared to RDMA, CXL provides fine-grained and low-latency access required by minimal and discrete retrieval patterns of Engram. We integrate the CXL-based Engram pool into SGLang, achieving near-DRAM end-to-end performance. This provides a scalable and cost-efficient storage solution for future Engram-integrated LLMs without compromising inference performance.

Files

Pooling Engram Conditional Memory in LLM using CXL.pdf

Files (943.1 kB)

Name	Size	Download all
Pooling Engram Conditional Memory in LLM using CXL.pdf md5:0b2dd38ca966aff3cb5e92d89d34ca0a	943.1 kB	Preview Download

Additional details

Submitted: 2026-03-06

116

Views

Downloads

Show more details

	All versions	This version
Views	116	116
Downloads	41	41
Data volume	46.2 MB	46.2 MB

More info on how stats are collected....

DOI

Resource type

Working paper

Publisher

Zenodo

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: March 6, 2026
Modified: March 6, 2026

Pooling Engram Conditional Memory in Large Language Models using CXL

Authors/Creators

Description

Files

Pooling Engram Conditional Memory in LLM using CXL.pdf

Files (943.1 kB)

Additional details

Dates