Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Today’s NYT Connections: Sports Edition Hints and Answers for Feb. 14, #509

    Google Revives Android 17 Beta 1 Just Days After Halting Launch

    Marvel’s Spider-Man 2 Leaps Onto PS Plus in February

    Facebook X (Twitter) Instagram
    • Artificial Intelligence
    • Business Technology
    • Cryptocurrency
    • Gadgets
    • Gaming
    • Health
    • Software and Apps
    • Technology
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Tech AI Verse
    • Home
    • Artificial Intelligence

      Read the extended transcript: President Donald Trump interviewed by ‘NBC Nightly News’ anchor Tom Llamas

      February 6, 2026

      Stocks and bitcoin sink as investors dump software company shares

      February 4, 2026

      AI, crypto and Trump super PACs stash millions to spend on the midterms

      February 2, 2026

      To avoid accusations of AI cheating, college students are turning to AI

      January 29, 2026

      ChatGPT can embrace authoritarian ideas after just one prompt, researchers say

      January 24, 2026
    • Business

      The HDD brand that brought you the 1.8-inch, 2.5-inch, and 3.5-inch hard drives is now back with a $19 pocket-sized personal cloud for your smartphones

      February 12, 2026

      New VoidLink malware framework targets Linux cloud servers

      January 14, 2026

      Nvidia Rubin’s rack-scale encryption signals a turning point for enterprise AI security

      January 13, 2026

      How KPMG is redefining the future of SAP consulting on a global scale

      January 10, 2026

      Top 10 cloud computing stories of 2025

      December 22, 2025
    • Crypto

      US Investors Might Be Leaving Bitcoin and Ethereum ETFs for International Markets

      February 14, 2026

      Binance France President Targeted in Armed Kidnapping Attempt

      February 14, 2026

      Binance Fires Investigators as $1 Billion Iran-Linked USDT Flows Surface

      February 14, 2026

      Aave Proposes 100% DAO Revenue Model, Yet Price Remains Under Pressure

      February 14, 2026

      A $3 Billion Credit Giant Is Testing Bitcoin in the Mortgage System — Here’s How

      February 14, 2026
    • Technology

      Today’s NYT Connections: Sports Edition Hints and Answers for Feb. 14, #509

      February 14, 2026

      Google Revives Android 17 Beta 1 Just Days After Halting Launch

      February 14, 2026

      Marvel’s Spider-Man 2 Leaps Onto PS Plus in February

      February 14, 2026

      What Are the Best Wireless Earbuds Right Now?

      February 14, 2026

      YouTube TV, DirecTV, Sling and Others: Which Service Carries the Top 100 Live TV Channels

      February 14, 2026
    • Others
      • Gadgets
      • Gaming
      • Health
      • Software and Apps
    Check BMI
    Tech AI Verse
    You are at:Home»Technology»Deepseek may have found a way to solve the RAM crisis by eliminating the need for expensive HBM for AI inference and training — yes, the very reason why DRAM prices went up by 5X in 10 weeks
    Technology

    Deepseek may have found a way to solve the RAM crisis by eliminating the need for expensive HBM for AI inference and training — yes, the very reason why DRAM prices went up by 5X in 10 weeks

    TechAiVerseBy TechAiVerseJanuary 18, 2026No Comments4 Mins Read2 Views
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Reddit
    Deepseek may have found a way to solve the RAM crisis by eliminating the need for expensive HBM for AI inference and training — yes, the very reason why DRAM prices went up by 5X in 10 weeks
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp Email

    Deepseek may have found a way to solve the RAM crisis by eliminating the need for expensive HBM for AI inference and training — yes, the very reason why DRAM prices went up by 5X in 10 weeks

    (Image credit: Adobe Stock)

    • DeepSeek’s Engram separates static memory from computation, increasing efficiency in large AI models
    • The method reduces high-speed memory needs by enabling DeepSeek models to use lookups
    • Engram supports asynchronous prefetching across multiple GPUs with minimal performance overhead

    DeepSeek, in collaboration with Peking University, introduced a new training method called Engram, designed to decouple memory storage from computational processes.

    Traditional large language models require high-bandwidth memory for knowledge retrieval and basic computation, creating a bottleneck in both performance and cost.

    This HBM bottleneck is widely recognized as a key reason DRAM prices rose by 5X in just 10 weeks, as hardware demand spiked to support large AI models.

    Validation and technical approach

    The researchers said existing models waste sequential depth on trivial operations, which could otherwise support higher-level reasoning.

    Engram allows models to efficiently “look up” essential information without overloading GPU memory, freeing capacity for more complex reasoning tasks.

    The system was tested on a 27-billion-parameter model and showed measurable improvements across standard industry benchmarks.

    By performing knowledge retrieval through hashed N-grams, Engram provides static memory access independent of the current context.

    Sign up to the TechRadar Pro newsletter to get all the top news, opinion, features and guidance your business needs to succeed!

    The retrieved information is then adjusted using a context-aware gating mechanism to align with the model’s hidden state.

    This design allows models to handle long context inputs more efficiently and supports system-level prefetching with minimal performance overhead.

    The Engram method complements other hardware-efficient approaches, including solutions such as Phison’s AI inference accelerators.

    Engram minimizes the amount of high-speed memory required by using lookups for static information, making memory usage more efficient.

    Phison offers a cost-effective way to expand total memory using SSDs, supporting large AI models such as Engram or Mixture-of-Experts systems.

    Combined, these approaches allow AI systems to optimize fast-memory usage while affordably increasing overall memory capacity.

    It also works alongside emerging CXL (Compute Express Link) standards, which aim to overcome GPU memory bottlenecks in large-scale AI workloads.

    The method separates static pattern storage from dynamic computation, enhancing the Transformer backbone without increasing FLOPs or parameter counts.

    DeepSeek formalized a U-shaped expansion rule to optimize the allocation of parameters between the MoE conditional computation module and the Engram memory module.

    Tests show that reallocating around 20–25% of the sparse parameter budget to Engram yields better performance than pure MoE models, maintaining stable gains across different scales.

    Memory slot expansion provides predictable improvements without additional computational cost.

    This confirms the scalability of conditional memory as an independent axis for sparse models.

    Engram’s deterministic retrieval mechanism allows memory capacity to scale linearly across multiple GPUs while supporting asynchronous prefetching during inference.

    It offloads static knowledge reconstruction from lower layers, freeing attention mechanisms to focus on global context.

    Hierarchical caching of frequently used embeddings enhances efficiency, and the module works with existing GPU and system memory architectures, potentially avoiding costly HBM upgrades.

    This technique may relieve pressure on expensive memory hardware, particularly in regions such as China, where HBM access lags behind competitors such as Samsung, SK Hynix, and Micron.

    Early validation of Engram suggests models can expand parameter scale and reasoning capacity while managing memory demands more efficiently.

    This approach may help ease memory constraints across AI infrastructure, potentially reducing sharp DDR5 DRAM price swings.

    Via SCMP


    Follow TechRadar on Google News and add us as a preferred source to get our expert news, reviews, and opinion in your feeds. Make sure to click the Follow button!

    And of course you can also follow TechRadar on TikTok for news, reviews, unboxings in video form, and get regular updates from us on WhatsApp too.

    Efosa has been writing about technology for over 7 years, initially driven by curiosity but now fueled by a strong passion for the field. He holds both a Master’s and a PhD in sciences, which provided him with a solid foundation in analytical thinking.

    community guidelines.

    “>

    You must confirm your public display name before commenting

    Please logout and then login again, you will then be prompted to enter your display name.

    Share. Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Telegram Email
    Previous ArticleNYT Connections hints and answers for Sunday, January 18 (game #952)
    Next Article This retro-style Bluetooth speaker is trying to outdo Marshall at its own game, but is it up to the task?
    TechAiVerse
    • Website

    Jonathan is a tech enthusiast and the mind behind Tech AI Verse. With a passion for artificial intelligence, consumer tech, and emerging innovations, he deliver clear, insightful content to keep readers informed. From cutting-edge gadgets to AI advancements and cryptocurrency trends, Jonathan breaks down complex topics to make technology accessible to all.

    Related Posts

    Today’s NYT Connections: Sports Edition Hints and Answers for Feb. 14, #509

    February 14, 2026

    Google Revives Android 17 Beta 1 Just Days After Halting Launch

    February 14, 2026

    Marvel’s Spider-Man 2 Leaps Onto PS Plus in February

    February 14, 2026
    Leave A Reply Cancel Reply

    Top Posts

    Ping, You’ve Got Whale: AI detection system alerts ships of whales in their path

    April 22, 2025671 Views

    Lumo vs. Duck AI: Which AI is Better for Your Privacy?

    July 31, 2025259 Views

    6.7 Cummins Lifter Failure: What Years Are Affected (And Possible Fixes)

    April 14, 2025153 Views

    6 Best MagSafe Phone Grips (2025), Tested and Reviewed

    April 6, 2025112 Views
    Don't Miss
    Technology February 14, 2026

    Today’s NYT Connections: Sports Edition Hints and Answers for Feb. 14, #509

    Today’s NYT Connections: Sports Edition Hints and Answers for Feb. 14, #509Looking for the most…

    Google Revives Android 17 Beta 1 Just Days After Halting Launch

    Marvel’s Spider-Man 2 Leaps Onto PS Plus in February

    What Are the Best Wireless Earbuds Right Now?

    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    About Us
    About Us

    Welcome to Tech AI Verse, your go-to destination for everything technology! We bring you the latest news, trends, and insights from the ever-evolving world of tech. Our coverage spans across global technology industry updates, artificial intelligence advancements, machine learning ethics, and automation innovations. Stay connected with us as we explore the limitless possibilities of technology!

    Facebook X (Twitter) Pinterest YouTube WhatsApp
    Our Picks

    Today’s NYT Connections: Sports Edition Hints and Answers for Feb. 14, #509

    February 14, 20262 Views

    Google Revives Android 17 Beta 1 Just Days After Halting Launch

    February 14, 20262 Views

    Marvel’s Spider-Man 2 Leaps Onto PS Plus in February

    February 14, 20262 Views
    Most Popular

    7 Best Kids Bikes (2025): Mountain, Balance, Pedal, Coaster

    March 13, 20250 Views

    VTOMAN FlashSpeed 1500: Plenty Of Power For All Your Gear

    March 13, 20250 Views

    This new Roomba finally solves the big problem I have with robot vacuums

    March 13, 20250 Views
    © 2026 TechAiVerse. Designed by Divya Tech.
    • Home
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms & Conditions

    Type above and press Enter to search. Press Esc to cancel.