DeFi Daily News
Monday, May 25, 2026
Advertisement
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos
No Result
View All Result
DeFi Daily News
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos
No Result
View All Result
DeFi Daily News
No Result
View All Result
Home DeFi Web 3

rewrite this title Remember DeepSeek? Two New AI Models Say They’re Even Better – Decrypt

Jose Antonio Lanz by Jose Antonio Lanz
January 30, 2025
in Web 3
0 0
0
rewrite this title Remember DeepSeek? Two New AI Models Say They’re Even Better – Decrypt
0
SHARES
0
VIEWS
Share on FacebookShare on TwitterShare on Telegram
Listen to this article


rewrite this content using a minimum of 1000 words and keep HTML tags

AI companies used to measure themselves against industry leader OpenAI. No more. Now that China’s DeepSeek has emerged as the frontrunner, it’s become the one to beat.

On Monday, DeepSeek turned the AI industry on its head, causing billions of dollars in losses on Wall Street while raising questions about how efficient some U.S. startups—and venture capital— actually are.

Now, two new AI powerhouses have entered the ring: The Allen Institute for AI in Seattle and Alibaba in China; both claim their models are on a par with or better than DeepSeek V3.

The Allen Institute for AI, a U.S.-based research organization known for the release of a more modest vision model named Molmo, today unveiled a new version of Tülu 3, a free, open-source 405-billion parameter large language model.

“We are thrilled to announce the launch of Tülu 3 405B—the first application of fully open post-training recipes to the largest open-weight models,” the Paul Allen-funded non-profit said in a blog post. “With this release, we demonstrate the scalability and effectiveness of our post-training recipe applied at 405B parameter scale.”

For those who like comparing sizes, Meta’s latest LLM, Llama-3.3, has 70 billion parameters, and its largest model to date is Llama-3.1 405b—the same size as Tülu 3.

The model was so big that it demanded extraordinary computational resources, requiring 32 nodes with 256 GPUs running in parallel for training.

The Allen Institute hit several roadblocks while building its model. The sheer size of Tülu 3 meant the team had to split the workload across hundreds of specialized computer chips, with 240 chips handling the training process while 16 others managed real-time operations.

Even with this massive computing power, the system frequently crashed and required round-the-clock supervision to keep it running.

Tülu 3’s breakthrough centered on its novel Reinforcement Learning with Verifiable Rewards (RLVR) framework, which showed particular strength in mathematical reasoning tasks.

Each RLVR iteration took approximately 35 minutes, with inference requiring 550 seconds, weight transfer 25 seconds, and training 1,500 seconds, with the AI getting better at problem-solving with each round.

Image: Ai2

Reinforcement Learning with Verifiable Rewards (RLVR) is a training approach that seems like a sophisticated tutoring system.

The AI received specific tasks, like solving math problems, and got instant feedback on whether its answers were correct.

However, unlike traditional AI training (like the one used by openAI to train ChatGPT), where human feedback can be subjective, RLVR only rewarded the AI when it produced verifiably correct answers, similar to how a math teacher knows exactly when a student’s solution is right or wrong.

This is why the model is so good at math and logic problems but not the best at other tasks like creative writing, roleplay, or factual analysis.

The model is available at Allen AI’s playground, a free site with a UI similar to ChatGPT and other AI chatbots.

Our tests confirmed what could be expected from a model this big.

It is very good at solving problems and applying logic. We provided different random problems from a number of math and science benchmarks and it was able to output good answers, even easier to understand when compared to the sample answers that benchmarks provided.

However, it failed in other logical language-related tasks that didn’t involve math, such as writing sentences that end in a specific word.

Also, Tülu 3 isn’t multimodal. Instead, it stuck to what it knew best—churning out text. No fancy image generation or embedded Chain-of-Thought tricks here.

On the upside, the interface is free to use, requiring a simple login, either via Allen AI’s playground or by downloading the weights to run locally.

The model is available for download via Hugging Face, with alternatives going from 8 billion parameters to the gigantic 405 billion parameters version.

Chinese Tech Giant Enters the Fray

Meanwhile, China isn’t resting on DeepSeek’s laurels.

Amid all the hubbub, Alibaba dropped Qwen 2.5-Max, a massive language model trained on over 20 trillion tokens.

The Chinese tech giant released the model during the Lunar New Year, just days after DeepSeek R1 disrupted the market.

Benchmark tests showed Qwen 2.5-Max outperformed DeepSeek V3 in several key areas, including coding, math, reasoning, and general knowledge, as evaluated using benchmarks like Arena-Hard, LiveBench, LiveCodeBench, and GPQA-Diamond.

The model demonstrated competitive results against industry leaders like GPT-4o and Claude 3.5-Sonne,t according to the model’s card.

Qwen3.5 Max results in AI benchmarks
Image: Alibaba

Alibaba made the model available through its cloud platform with an OpenAI-compatible API, allowing developers to integrate it using familiar tools and methods.

The company’s documentation showed detailed examples of implementation, suggesting a push for widespread adoption.

But Alibaba’s Qwen Chat web portal is the best option for general users and seems pretty impressive—for those who are okay with creating an account there. It is probably the most versatile AI chatbot interface currently available.

Qwen Chat allows users to generate text, code, and images flawlessly. It also supports web search functionality, artifacts, and even a very good video generator, all in the same UI—for free.

It also has a unique function in which users can choose two different models to “battle” against each other to provide the best response.

Overall, Qwen’s UI is more versatile than Allen AI’s.

In text responses, Qwen2.5-Max proved to be better than Tülu 3 at creative writing and reasoning tasks that involved language analysis. For example, it was capable of generating phrases ending in a specific word.

Its video generator is a nice addition and is arguably on par with offers like Kling or Luma Labs—definitely better than what Sora can make.

Also, its image generator provides realistic and pleasant images, showing a clear advantage over OpenAI’s DALL-E 3, but clearly behind top models like Flux or MidJourney.

The triple release of DeepSeek, Qwen2.5-Max, and Tülu 3 just gave the open-source AI world its most significant boost in a while.

DeepSeek had already turned heads by building its R1 reasoning model using earlier Qwen technology for distillation, proving open-source AI could match billion-dollar tech giants at a fraction of the cost.

And now Qwen2.5-Max has upped the ante. If DeepSeek follows its established playbook—leveraging Qwen’s architecture—its next reasoning model could pack an even bigger punch.

Still, this could be a good opportunity for the Allen Institute. OpenAI is racing to launch its o3 reasoning model, which some industry analysts estimated could cost users up to $1,000 per query.

If so, Tülu 3’s arrival could be a great open-source alternative—especially for developers wary of building on Chinese technology due to security concerns or regulatory requirements.

Edited by Josh Quittner and Sebastian Sinclair

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.

and include conclusion section that’s entertaining to read. do not include the title. Add a hyperlink to this website http://defi-daily.com and label it “DeFi Daily News” for more trending news articles like this



Source link

Tags: DecryptDeepSeekModelsRememberrewriteTheyretitle
ShareTweetShare
Previous Post

Breaking Down Apple’s Surprise iPhone, China Sales Declines

Next Post

Is India Facing an Economic Slowdown?

Next Post
Is India Facing an Economic Slowdown?

Is India Facing an Economic Slowdown?

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
  • Trending
  • Comments
  • Latest
rewrite this title Central Bank of Brazil: Stablecoins Dominate Over .9 Billion Crypto Purchases Registered in Q1

rewrite this title Central Bank of Brazil: Stablecoins Dominate Over $6.9 Billion Crypto Purchases Registered in Q1

April 26, 2026
Kā Kļūt par Miljonāru: Mēmu Monētu Tirgotāja Veiksmes Stāsts ar Tikai 96$ Investīciju

Kā Kļūt par Miljonāru: Mēmu Monētu Tirgotāja Veiksmes Stāsts ar Tikai 96$ Investīciju

October 21, 2024
rewrite this title Gumshoe Gives Back — Join Now, and We Give to Charity!

rewrite this title Gumshoe Gives Back — Join Now, and We Give to Charity!

December 9, 2025
rewrite this title Arteta refuses to rule out further additions amid Eze links – Soccer News

rewrite this title Arteta refuses to rule out further additions amid Eze links – Soccer News

July 27, 2025
[gpt3]rewrite this title and make it good for SEOIsrael chooses Kiryat Tivon for Nvidias new campus[/gpt3]

[gpt3]rewrite this title and make it good for SEOIsrael chooses Kiryat Tivon for Nvidias new campus[/gpt3]

November 12, 2025
rewrite this title Magic provide major Paolo Banchero update ahead of clash with Heat

rewrite this title Magic provide major Paolo Banchero update ahead of clash with Heat

December 5, 2025
rewrite this title Week 21: A Peek Into This Past Week + What I’m Reading, Listening to, and Watching!

rewrite this title Week 21: A Peek Into This Past Week + What I’m Reading, Listening to, and Watching!

May 25, 2026
rewrite this title Teen phenom Rafael Jodar becomes player to watch at French Open after brilliant first-round performance

rewrite this title Teen phenom Rafael Jodar becomes player to watch at French Open after brilliant first-round performance

May 25, 2026
rewrite this title Aave DAO Faces Vote on Native BTC Collateral as Babylon Labs Files Temp Check

rewrite this title Aave DAO Faces Vote on Native BTC Collateral as Babylon Labs Files Temp Check

May 25, 2026
rewrite this title Tether’s Georgia stablecoin plan moves early on national payment rails

rewrite this title Tether’s Georgia stablecoin plan moves early on national payment rails

May 25, 2026
rewrite this title Tom Lee Outlines Liquidity Catalyst for Ethereum Firm BitMine Following Russell Index Update – Decrypt

rewrite this title Tom Lee Outlines Liquidity Catalyst for Ethereum Firm BitMine Following Russell Index Update – Decrypt

May 25, 2026
rewrite this title Want to feel extremely jealous of a World Cup-ready home theater setup? This award-winning 9.4.4-channel Dolby Atmos home theater room was designed specifically with football in mind, including automatically pausing if someone interrupts — and with equipment from Sony, Artcoustic and more, you’d probably feel the crowd noise even more than in the actual stadium

rewrite this title Want to feel extremely jealous of a World Cup-ready home theater setup? This award-winning 9.4.4-channel Dolby Atmos home theater room was designed specifically with football in mind, including automatically pausing if someone interrupts — and with equipment from Sony, Artcoustic and more, you’d probably feel the crowd noise even more than in the actual stadium

May 25, 2026
DeFi Daily

Stay updated with DeFi Daily, your trusted source for the latest news, insights, and analysis in finance and cryptocurrency. Explore breaking news, expert analysis, market data, and educational resources to navigate the world of decentralized finance.

  • About Us
  • Blogs
  • DeFi-IRA | Learn More.
  • Advertise with Us
  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2024 Defi Daily.
Defi Daily is not responsible for the content of external sites.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos

Copyright © 2024 Defi Daily.
Defi Daily is not responsible for the content of external sites.