    Now it’s TikTok parent ByteDance’s turn for a reasoning AI: enter Seed-Thinking-v1.5!

    By TechAiVerse, April 12, 2025

    April 11, 2025 12:08 PM

    Credit: VentureBeat made with ChatGPT

    It started with the announcement of OpenAI’s o1 model in Sept. 2024, but really took off with the DeepSeek R1 release in Jan. 2025.

    Now, it seems that most major AI model providers and trainers are in a new race to deliver better, faster, and cheaper “reasoning” AI language models: ones that may take a little longer to respond to a human user, but ideally do so with better, more comprehensive, better “reasoned” answers. This class of models gets there by performing “chain-of-thought,” reflecting on its own conclusions and interrogating them for veracity before responding.

    ByteDance, the Chinese web media giant parent of TikTok, is the latest to join the party with the announcement and publication of the technical paper behind Seed-Thinking-v1.5, an upcoming large language model (LLM) designed to advance reasoning performance across both science, tech, math, and engineering (STEM) fields and general-purpose domains.

    The model is not yet available for download or use, and it’s unclear what the licensing terms will be: whether it will be proprietary/closed source, open source/free for all to use and modify at will, or somewhere in between. However, the technical paper provides some noteworthy details that are worth going over now, in advance of whenever the model is made available.

    Built atop the increasingly popular Mixture-of-Experts (MoE) architecture

    Like Meta’s new Llama 4 and Mistral’s Mixtral before it, Seed-Thinking-v1.5 is built using a Mixture-of-Experts (MoE) architecture.

    This architecture is designed to make models more efficient. It essentially combines the capabilities of multiple “expert” sub-models into one, each specializing in a different domain, and only some of them are activated for a given input.

    In this case, the MoE architecture means that Seed-Thinking-v1.5 uses only 20 billion of its 200 billion parameters at a time.
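
    To make the idea of running only a fraction of the parameters concrete, here is a minimal sketch of top-k expert routing in plain Python. It is an illustration only: the expert count, the routing rule, the toy "experts" (simple per-dimension scales) and the name `top_k_route` are all assumptions for this example, since the article does not describe how Seed-Thinking-v1.5 routes tokens internally.

```python
import math
import random

def top_k_route(x, gate_w, experts, k=1):
    """Send one token vector through only the k highest-scoring experts."""
    # Gate: one score per expert (dot product of the token with a gate vector).
    scores = [sum(xi * wi for xi, wi in zip(x, w)) for w in gate_w]
    top = sorted(range(len(scores)), key=scores.__getitem__, reverse=True)[:k]
    # Softmax over the selected experts only.
    m = max(scores[i] for i in top)
    exps = [math.exp(scores[i] - m) for i in top]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Each toy expert is a per-dimension scale; only the chosen ones are evaluated.
    out = [0.0] * len(x)
    for w, i in zip(weights, top):
        for j, xj in enumerate(x):
            out[j] += w * experts[i][j] * xj
    return out, top

random.seed(0)
dim, n_experts = 4, 10  # hypothetical sizes, chosen to mirror a 1-in-10 ratio
x = [random.gauss(0, 1) for _ in range(dim)]
gate_w = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_experts)]
experts = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_experts)]

out, used = top_k_route(x, gate_w, experts, k=1)
active_fraction = len(used) / n_experts  # share of experts that actually ran
```

    With one of ten toy experts active, only a tenth of the expert weights are touched per token, the same ratio as 20 billion active parameters out of 200 billion.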

    ByteDance says in its technical paper published to GitHub that Seed-Thinking-v1.5 prioritizes structured reasoning and thoughtful response generation.

    The results nearly speak for themselves, with Seed-Thinking-v1.5 outperforming DeepSeek R1 and approaching Google’s newly released Gemini 2.5 Pro and OpenAI’s o3-mini-high reasoner on many third-party benchmark evaluations. It even exceeds those two on the ARC-AGI benchmark, which is meant to measure progress toward artificial general intelligence (AGI), seen as the goal or “Holy Grail” of AI. By OpenAI’s definition, AGI is a system that outperforms humans at most economically valuable work.

    Positioned as a compact yet capable alternative to larger state-of-the-art models, Seed-Thinking-v1.5 achieves competitive benchmark results. It introduces innovations in reinforcement learning (RL), training data curation and AI infrastructure.

    Performance benchmarks and model focus

    Seed-Thinking-v1.5 shows strong performance on a suite of challenging tasks, scoring 86.7% on AIME 2024, 55.0% pass@8 on Codeforces and 77.3% on the GPQA science benchmark. These results place it close to or matching models like OpenAI’s o3-mini-high and Google’s Gemini 2.5 Pro on specific reasoning metrics.
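
    For readers unfamiliar with the pass@8 figure: pass@k is the probability that at least one of k sampled solutions is correct. The article does not say exactly how ByteDance computed it, so the sketch below simply shows the unbiased estimator widely used in code-generation evaluations, where n solutions are sampled per problem and c of them pass. The function name `pass_at_k` is chosen for this example.

```python
from math import comb

def pass_at_k(n, c, k):
    """Unbiased pass@k estimate from n samples, c of which are correct."""
    if n - c < k:
        return 1.0  # every size-k subset must contain a correct sample
    # 1 minus the chance that a random size-k subset contains no correct sample.
    return 1.0 - comb(n - c, k) / comb(n, k)

# Example: 10 samples drawn for one problem, 2 of them correct.
single_try = pass_at_k(10, 2, 1)
two_tries = pass_at_k(10, 2, 2)
```

    A benchmark score is then the average of this value across all problems.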

    On non-reasoning tasks, the model was evaluated through human preference comparisons and achieved an 8.0% higher win rate over DeepSeek R1, suggesting that its strengths generalize beyond logic or math-heavy challenges.

    To address saturation in standard benchmarks like AIME, ByteDance introduced BeyondAIME, a new, harder math benchmark with curated problems designed to resist memorization and better discriminate model performance. This and the Codeforces evaluation set are expected to be publicly released to support future research.

    Data strategy

    Training data played a central role in the model’s development. For supervised fine-tuning (SFT), the team curated 400,000 samples: 300,000 verifiable problems (STEM, logic and coding tasks) and 100,000 non-verifiable ones, such as creative writing and role-playing.

    For RL training, data was segmented into:

    • Verifiable problems: 100,000 rigorously filtered STEM questions and logic puzzles with known answers, sourced from elite competitions and expert review.
    • Non-verifiable tasks: Human-preference datasets focused on open-ended prompts, evaluated using pairwise reward models.

    The STEM data leaned heavily on advanced mathematics, accounting for over 80% of the problem set. Additional logic data included tasks like Sudoku and 24-point puzzles, with adjustable difficulty to match model progress.
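
    What makes a task like the 24-point puzzle “verifiable” is that a program can check an answer exactly, with no human judgment involved. As an illustration only (this is not ByteDance’s checker, and the name `check_24` is invented here), the sketch below accepts an arithmetic expression, confirms that it uses each given number exactly once, and evaluates it with exact fractions.

```python
import ast
from fractions import Fraction

_OPS = {
    ast.Add: lambda a, b: a + b,
    ast.Sub: lambda a, b: a - b,
    ast.Mult: lambda a, b: a * b,
    ast.Div: lambda a, b: a / b,
}

def _eval(node, used):
    """Evaluate +, -, *, / over integer literals exactly, recording each literal."""
    if isinstance(node, ast.Constant) and type(node.value) is int:
        used.append(node.value)
        return Fraction(node.value)
    if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
        left = _eval(node.left, used)
        right = _eval(node.right, used)
        return _OPS[type(node.op)](left, right)
    raise ValueError("unsupported expression")

def check_24(expr, numbers, target=24):
    """Return True if expr uses each given number exactly once and equals target."""
    used = []
    try:
        value = _eval(ast.parse(expr, mode="eval").body, used)
    except (ValueError, SyntaxError, ZeroDivisionError):
        return False
    return sorted(used) == sorted(numbers) and value == target
```

    Because the expression is parsed rather than passed to `eval`, anything other than the four arithmetic operators on integer literals is rejected, and exact fractions avoid rounding errors in solutions such as `8 / (3 - 8 / 3)`.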

    Reinforcement learning approach

    Reinforcement learning in Seed-Thinking-v1.5 is powered by custom actor-critic (VAPO) and policy-gradient (DAPO) frameworks, developed to address known instabilities in RL training. These techniques reduce reward signal sparsity and enhance training stability, especially in long chain-of-thought (CoT) settings.

    Reward models play a critical role in supervising RL outputs. ByteDance introduced two key tools:

    • Seed-Verifier: A rule-based LLM that checks if generated and reference answers are mathematically equivalent.
    • Seed-Thinking-Verifier: A step-by-step reasoning-based judge that improves judgment consistency and resists reward hacking.

    This two-tiered reward system enables nuanced evaluation for both straightforward and complex tasks.
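
    The general shape of a two-tier check can be sketched in a few lines. This is a simplified stand-in, not the Seed-Verifier design: the first tier here is a plain exact-number comparison rather than an LLM, and `fallback_judge` is a hypothetical callback representing whatever slower, reasoning-based judge handles the cases the cheap check cannot decide.

```python
from fractions import Fraction

def numeric_equivalent(answer, reference):
    """True/False if both strings parse as exact numbers, None if undecidable here."""
    try:
        return Fraction(answer.strip()) == Fraction(reference.strip())
    except (ValueError, ZeroDivisionError):
        return None

def reward(answer, reference, fallback_judge):
    """Use the cheap check first; defer to the slower judge only when it cannot decide."""
    verdict = numeric_equivalent(answer, reference)
    if verdict is None:
        verdict = fallback_judge(answer, reference)
    return 1.0 if verdict else 0.0
```

    For example, `reward("0.5", "1/2", judge)` is settled by the first tier and never calls `judge`, while a symbolic answer such as `"x+1"` falls through to it.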

    Infrastructure and scaling

    To support efficient large-scale training, ByteDance built a system atop its HybridFlow framework. Execution is handled by Ray clusters, and training and inference processes are co-located to reduce GPU idle time.

    The Streaming Rollout System (SRS) is a notable innovation that separates model evolution from runtime execution. It accelerates iteration speed by asynchronously managing partially completed generations across model versions. This architecture reportedly delivers up to 3× faster RL cycles.

    Additional infrastructure techniques include:

    • Mixed precision (FP8) for memory savings
    • Expert parallelism and kernel auto-tuning for MoE efficiency
    • ByteCheckpoint for resilient and flexible checkpointing
    • AutoTuner for optimizing parallelism and memory configurations
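
    A quick back-of-the-envelope calculation shows why lower precision matters at this scale. The figures below count weight storage only, and they assume every weight is held in the named format, with a 16-bit format as the comparison point. Both are assumptions made for illustration, since the article does not say which tensors are kept in FP8.

```python
def weight_memory_gb(n_params, bytes_per_param):
    """Memory needed to hold the weights alone, in decimal gigabytes."""
    return n_params * bytes_per_param / 1e9

total_params = 200e9  # total parameter count reported in the article

fp8_gb = weight_memory_gb(total_params, 1)    # FP8 stores one byte per value
bf16_gb = weight_memory_gb(total_params, 2)   # 16-bit baseline (assumed)
saving_gb = bf16_gb - fp8_gb
```

    Under those assumptions the weights shrink from 400 GB to 200 GB, before counting activations, optimizer state or any tensors kept at higher precision.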

    Human evaluation and real-world impact

    To evaluate alignment with human-centric preferences, ByteDance conducted human testing across a range of domains, including creative writing, humanities knowledge and general conversation.

    Seed-Thinking-v1.5 consistently outperformed DeepSeek R1 across sessions, reinforcing its applicability to real-world user needs.

    The development team notes that reasoning models trained primarily on verifiable tasks demonstrated strong generalization to creative domains—an outcome attributed to the structure and rigor embedded in mathematical training workflows.

    What it means for technical leaders, data engineers and enterprise decision-makers

    For technical leads managing the lifecycle of large language models—from data curation to deployment—Seed-Thinking-v1.5 presents an opportunity to rethink how reasoning capabilities are integrated into enterprise AI stacks.

    Its modular training process, which includes verifiable reasoning datasets and multi-phase reinforcement learning, particularly appeals to teams looking to scale LLM development while retaining fine-grained control.

    ByteDance’s moves to introduce Seed-Verifier and Seed-Thinking-Verifier offer mechanisms for more trustworthy reward modeling, which can be critical when deploying models into customer-facing or regulated environments.

    For teams operating under tight deadlines and limited bandwidth, the model’s stability under reinforcement learning, enabled by innovations like VAPO and dynamic sampling, could reduce iteration cycles and streamline fine-tuning for specific tasks.

    From an orchestration and deployment perspective, the model’s hybrid infrastructure approach—including the Streaming Rollout System (SRS) and support for FP8 optimization—suggests significant gains in training throughput and hardware utilization.

    These features would be valuable for engineers responsible for scaling LLM operations across cloud and on-prem systems. The fact that Seed-Thinking-v1.5 was trained with mechanisms to adapt reward feedback based on runtime dynamics speaks directly to the challenges of managing heterogeneous data pipelines and maintaining consistency across domains.

    For teams tasked with ensuring reliability, reproducibility, and continuous integration of new tools, Seed-Thinking-v1.5’s system-level design could serve as a blueprint for building robust, multi-modal orchestration systems.

    For data engineering professionals, the structured approach to training data—including rigorous filtering, augmentation and expert verification—reinforces the importance of data quality as a multiplier of model performance. This could inspire more deliberate approaches to dataset development and validation pipelines.

    Future outlook

    Seed-Thinking-v1.5 is the result of a collaboration within ByteDance’s Seed LLM Systems team, led by Yonghui Wu and publicly represented by Haibin Lin, a long-time AI contributor.

    The project also draws on previous efforts, such as Doubao 1.5 Pro, and incorporates shared techniques in RLHF and data curation.

    The team plans to continue refining reinforcement learning techniques, focusing on training efficiency and reward modeling for non-verifiable tasks. The public release of internal benchmarks such as BeyondAIME is intended to foster broader advancement in reasoning-focused AI research.
