Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    A perpetual license for this PDF editor used to be $129, but now it’s only $30

    Get Lenovo’s decked-out Ryzen ThinkPad laptop for $400 off right now

    This HP OLED laptop with 24 hours of battery life is on sale for just $550

    Facebook X (Twitter) Instagram
    • Artificial Intelligence
    • Business Technology
    • Cryptocurrency
    • Gadgets
    • Gaming
    • Health
    • Software and Apps
    • Technology
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Tech AI Verse
    • Home
    • Artificial Intelligence

      Amazon to lay off 14,000 corporate employees

      October 29, 2025

      Elon Musk launches Grokipedia as an alternative to ‘woke’ Wikipedia

      October 29, 2025

      Fears of an AI bubble are growing, but some on Wall Street aren’t worried just yet

      October 18, 2025

      The sleeper issue that could play a huge role in Virginia and New Jersey — and the midterms

      October 16, 2025

      California bill regulating top AI companies signed into law

      September 30, 2025
    • Business

      Government faces questions about why US AWS outage disrupted UK tax office and banking firms

      October 23, 2025

      Amazon’s AWS outage knocked services like Alexa, Snapchat, Fortnite, Venmo and more offline

      October 21, 2025

      SAP ECC customers bet on composable ERP to avoid upgrading

      October 18, 2025

      Revenue generated by neoclouds expected to exceed $23bn in 2025, predicts Synergy

      October 15, 2025

      You can now try Fortnite directly in Discord

      October 8, 2025
    • Crypto

      Pi Coin Price Recovery Appears Difficult Despite Investor Support

      November 8, 2025

      Bitcoin Treasuries Face Capital Shock as Falling Prices Erase Gains

      November 8, 2025

      Will Crypto Markets Rebound When the US Government Shutdown Ends?

      November 8, 2025

      Two Altcoins are Defying Market Odds With a Sustained Rally

      November 8, 2025

      Caffeine AI Lisbon: A Full-Day Event Exploring the Self-Writing Internet and the Future of AI-Built Applications

      November 8, 2025
    • Technology

      A perpetual license for this PDF editor used to be $129, but now it’s only $30

      November 9, 2025

      Get Lenovo’s decked-out Ryzen ThinkPad laptop for $400 off right now

      November 9, 2025

      This HP OLED laptop with 24 hours of battery life is on sale for just $550

      November 9, 2025

      Rockstar postpones GTA 6 release date again by several months

      November 9, 2025

      This Intel mini PC at $189 is the smartest buy you’ll make this week

      November 9, 2025
    • Others
      • Gadgets
      • Gaming
      • Health
      • Software and Apps
    Check BMI
    Tech AI Verse
    You are at:Home»Technology»Huawei releases an open weight model trained on Huawei Ascend GPUs
    Technology

    Huawei releases an open weight model trained on Huawei Ascend GPUs

    TechAiVerseBy TechAiVerseJuly 2, 2025No Comments2 Mins Read2 Views
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr Email Reddit
    Huawei releases an open weight model trained on Huawei Ascend GPUs
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp Email

    Huawei releases an open weight model trained on Huawei Ascend GPUs

    [Submitted on 27 May 2025 (v1), last revised 28 May 2025 (this version, v2)]

    Authors:Yehui Tang, Xiaosong Li, Fangcheng Liu, Wei Guo, Hang Zhou, Yaoyuan Wang, Kai Han, Xianzhi Yu, Jinpeng Li, Hui Zang, Fei Mi, Xiaojun Meng, Zhicheng Liu, Hanting Chen, Binfan Zheng, Can Chen, Youliang Yan, Ruiming Tang, Peifeng Qin, Xinghao Chen, Dacheng Tao, Yunhe Wang (and Other Contributors)

    View PDF
    HTML (experimental)

    Abstract:The surgence of Mixture of Experts (MoE) in Large Language Models promises a small price of execution cost for a much larger model parameter count and learning capacity, because only a small fraction of parameters are activated for each input token. However, it is commonly observed that some experts are activated far more often than others, leading to system inefficiency when running the experts on different devices in parallel. Therefore, we introduce Mixture of Grouped Experts (MoGE), which groups the experts during selection and balances the expert workload better than MoE in nature. It constrains tokens to activate an equal number of experts within each predefined expert group. When a model execution is distributed on multiple devices, this architectural design ensures a balanced computational load across devices, significantly enhancing throughput, particularly for the inference phase. Further, we build Pangu Pro MoE on Ascend NPUs, a sparse model based on MoGE with 72 billion total parameters, 16 billion of which are activated for each token. The configuration of Pangu Pro MoE is optimized for Ascend 300I Duo and 800I A2 through extensive system simulation studies. Our experiments indicate that MoGE indeed leads to better expert load balancing and more efficient execution for both model training and inference on Ascend NPUs. The inference performance of Pangu Pro MoE achieves 1148 tokens/s per card and can be further improved to 1528 tokens/s per card by speculative acceleration, outperforming comparable 32B and 72B Dense models. Furthermore, we achieve an excellent cost-to-performance ratio for model inference on Ascend 300I Duo. Our studies show that Ascend NPUs are capable of training Pangu Pro MoE with massive parallelization to make it a leading model within the sub-100B total parameter class, outperforming prominent open-source models like GLM-Z1-32B and Qwen3-32B.

    Submission history

    From: Hang Zhou [view email]
    [v1]
    Tue, 27 May 2025 16:40:21 UTC (710 KB)
    [v2]
    Wed, 28 May 2025 10:42:15 UTC (710 KB)

    Share. Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Telegram Email
    Previous ArticleMandelbrot in x86 Assembly by Claude
    Next Article NetApp: Not just NAS filers, and a comprehensive cloud strategy
    TechAiVerse
    • Website

    Jonathan is a tech enthusiast and the mind behind Tech AI Verse. With a passion for artificial intelligence, consumer tech, and emerging innovations, he deliver clear, insightful content to keep readers informed. From cutting-edge gadgets to AI advancements and cryptocurrency trends, Jonathan breaks down complex topics to make technology accessible to all.

    Related Posts

    A perpetual license for this PDF editor used to be $129, but now it’s only $30

    November 9, 2025

    Get Lenovo’s decked-out Ryzen ThinkPad laptop for $400 off right now

    November 9, 2025

    This HP OLED laptop with 24 hours of battery life is on sale for just $550

    November 9, 2025
    Leave A Reply Cancel Reply

    Top Posts

    Ping, You’ve Got Whale: AI detection system alerts ships of whales in their path

    April 22, 2025357 Views

    Lumo vs. Duck AI: Which AI is Better for Your Privacy?

    July 31, 202592 Views

    6.7 Cummins Lifter Failure: What Years Are Affected (And Possible Fixes)

    April 14, 202569 Views

    Is Libby Compatible With Kobo E-Readers?

    March 31, 202555 Views
    Don't Miss
    Technology November 9, 2025

    A perpetual license for this PDF editor used to be $129, but now it’s only $30

    A perpetual license for this PDF editor used to be $129, but now it’s only…

    Get Lenovo’s decked-out Ryzen ThinkPad laptop for $400 off right now

    This HP OLED laptop with 24 hours of battery life is on sale for just $550

    Rockstar postpones GTA 6 release date again by several months

    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    About Us
    About Us

    Welcome to Tech AI Verse, your go-to destination for everything technology! We bring you the latest news, trends, and insights from the ever-evolving world of tech. Our coverage spans across global technology industry updates, artificial intelligence advancements, machine learning ethics, and automation innovations. Stay connected with us as we explore the limitless possibilities of technology!

    Facebook X (Twitter) Pinterest YouTube WhatsApp
    Our Picks

    A perpetual license for this PDF editor used to be $129, but now it’s only $30

    November 9, 20252 Views

    Get Lenovo’s decked-out Ryzen ThinkPad laptop for $400 off right now

    November 9, 20253 Views

    This HP OLED laptop with 24 hours of battery life is on sale for just $550

    November 9, 20252 Views
    Most Popular

    Xiaomi 15 Ultra Officially Launched in China, Malaysia launch to follow after global event

    March 12, 20250 Views

    Apple thinks people won’t use MagSafe on iPhone 16e

    March 12, 20250 Views

    French Apex Legends voice cast refuses contracts over “unacceptable” AI clause

    March 12, 20250 Views
    © 2025 TechAiVerse. Designed by Divya Tech.
    • Home
    • About Us
    • Contact Us
    • Privacy Policy
    • Terms & Conditions

    Type above and press Enter to search. Press Esc to cancel.