- DeepSeek’s Engram separates static memory from computation, increasing efficiency in large AI models
- The method reduces the need for high-bandwidth memory by letting models fetch static knowledge through simple lookups instead of recomputing it
- Engram supports asynchronous prefetching across multiple GPUs with minimal performance overhead
DeepSeek, in collaboration with Peking University, introduced a new method called Engram, designed to decouple static memory storage from computation.
Traditional large language models require high-bandwidth memory for knowledge retrieval and basic computation, creating a bottleneck in both performance and cost.
This HBM bottleneck is widely cited as a key reason DRAM prices rose roughly fivefold in just ten weeks, as demand for hardware to support large AI models surged.
Validation and technical approach
The researchers say existing models waste sequential depth on trivial operations, which could otherwise support higher-level reasoning.
Engram allows models to efficiently “look up” essential information without overloading GPU memory, freeing up capacity for more complex reasoning tasks.
The system was tested on a 27-billion-parameter model and showed measurable improvements over standard baselines on industry benchmarks.
By performing knowledge retrieval via hashed N-grams, Engram provides static memory access independent of the current context.
The retrieved information is then adjusted using a context-aware gating mechanism to align with the hidden state of the model.
This design allows models to process long context inputs more efficiently and supports system-level prefetching with minimal performance overhead.
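To make the lookup-and-gate idea concrete, here is a minimal sketch. The table sizes, the CRC32 hash, and the sigmoid gate are illustrative assumptions, not DeepSeek's published implementation; the sketch only shows the shape of the mechanism: a deterministic hash maps an N-gram to a slot in a static embedding table, and a gate conditioned on the hidden state decides how much of the retrieved vector to blend in.

```python
import zlib

import numpy as np

D_MODEL = 8        # hidden size (toy value, not from the paper)
NUM_SLOTS = 1024   # size of the static memory table (toy value)

rng = np.random.default_rng(0)
memory_table = rng.standard_normal((NUM_SLOTS, D_MODEL))   # static store; trained offline
gate_weights = rng.standard_normal((D_MODEL, D_MODEL)) * 0.1

def ngram_slot(tokens: tuple) -> int:
    """Deterministically map an N-gram to a memory slot via a stable hash."""
    return zlib.crc32(" ".join(tokens).encode()) % NUM_SLOTS

def engram_lookup(tokens: tuple, hidden: np.ndarray) -> np.ndarray:
    """Fetch a static embedding and gate it against the current hidden state."""
    retrieved = memory_table[ngram_slot(tokens)]
    gate = 1.0 / (1.0 + np.exp(-(hidden @ gate_weights)))   # sigmoid gate in [0, 1]
    return hidden + gate * retrieved                        # gated residual update

hidden = rng.standard_normal(D_MODEL)
out = engram_lookup(("large", "language", "model"), hidden)
```

Because `ngram_slot` depends only on the input tokens, not on model state, the address of every lookup is known as soon as the text is tokenized, which is what makes the prefetching described below possible.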
The Engram method complements other hardware-efficient approaches, including solutions like Phison’s AI inference accelerators.
Engram minimizes the amount of fast memory required by using lookups for static information, making memory usage more efficient.
Phison provides a cost-effective way to expand overall memory using SSDs, and supports memory-hungry architectures such as Engram-style lookup tables or Mixture-of-Experts systems.
Combined, these approaches allow AI systems to optimize high-speed memory usage while affordably increasing overall memory capacity.
It also works with the emerging CXL (Compute Express Link) standards, which aim to overcome GPU memory bottlenecks in large-scale AI workloads.
The method separates static pattern storage from dynamic computation, improving the Transformer backbone without increasing the number of FLOPs or parameter counts.
DeepSeek formalized a U-shaped expansion rule to optimize the allocation of parameters between the MoE conditional calculation module and the Engram memory module.
Tests show that reallocating about 20-25% of the scarce parameter budget to Engram yields better performance than pure MoE models, maintaining stable gains across different scales.
Expanding memory slots provides predictable improvements without additional computing costs.
This confirms the scalability of conditional memory as an independent axis for sparse models.
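A toy illustration of the reported allocation rule follows. The 20-25% range comes from the article; the exact U-shaped formula is not described there, and the function below is a hypothetical helper that simply splits a sparse parameter budget at a chosen fraction.

```python
def split_budget(total_params: int, engram_frac: float = 0.225) -> dict:
    """Split a sparse parameter budget between MoE experts and Engram memory.

    engram_frac defaults to the midpoint of the 20-25% range the article
    reports as optimal; the true U-shaped rule is not public.
    """
    assert 0.20 <= engram_frac <= 0.25, "article reports a 20-25% sweet spot"
    engram = int(total_params * engram_frac)
    return {"engram_memory": engram, "moe_experts": total_params - engram}

# Example: the 27B-parameter test model mentioned above
print(split_budget(27_000_000_000))
```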
Engram’s deterministic fetch mechanism allows memory capacity to scale linearly across multiple GPUs, while supporting asynchronous prefetching during inference.
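The value of a deterministic fetch is that slow-memory reads can be issued ahead of time and overlapped with computation. The sketch below is an assumed design, not DeepSeek's code: a dictionary stands in for host RAM or SSD, and a background thread fetches embeddings while the main thread would be running compute.

```python
import queue
import threading
import time

# Stand-in for a large embedding store held in host RAM or on SSD.
SLOW_STORE = {i: f"embedding-{i}" for i in range(100)}

def prefetch(slots, out_q):
    """Fetch embeddings for known slot indices on a background thread."""
    for s in slots:
        time.sleep(0.001)             # simulated slow-memory latency
        out_q.put((s, SLOW_STORE[s]))

def run_layer(slots):
    """Overlap slow-memory fetches with (simulated) layer computation."""
    q = queue.Queue()
    t = threading.Thread(target=prefetch, args=(slots, q))
    t.start()                         # fetches proceed while compute runs
    results = []
    for _ in slots:
        _, emb = q.get()              # ideally ready by the time compute needs it
        results.append(emb)
    t.join()
    return results

print(run_layer([3, 17, 42]))
```

Since slot indices are a pure function of the input, the prefetch list can be built for an entire sequence before the forward pass begins, which is what allows memory capacity to scale across devices without stalling inference.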
It offloads the static knowledge reconstruction from lower layers, freeing up attention mechanisms to focus on the global context.
Hierarchical caching of frequently used embeddings improves efficiency, and the module works with existing GPU and system memory architectures, potentially avoiding costly HBM upgrades.
This technique can ease pressure on expensive memory hardware, especially in regions like China, where access to HBM from suppliers such as Samsung, SK Hynix and Micron is constrained.
Early validation of Engram suggests that models can increase parameter scale and reasoning power while managing memory needs more efficiently.
This approach can help alleviate memory constraints within AI infrastructure, potentially reducing sharp price fluctuations for DDR5 DRAM.
Via South China Morning Post