DeFi Daily News
Saturday, May 30, 2026

‘Catastrophic overtraining’ could harm large language AI models that are trained on more data for the sake of training

by Wayne Williams
April 13, 2025
in Tech

  • Researchers from top US universities warn extending pre-training can be detrimental to performance
  • Too much pre-training can deliver worse performance due to something akin to the butterfly effect
  • The more models are pre-trained, the more sensitive they become to small changes that could disrupt the end result

Researchers from Carnegie Mellon, Stanford, Harvard, and Princeton are challenging one of AI development’s accepted core beliefs: that the more pre-training data a model sees, the better it performs.

As reported by HPCwire, a new paper discusses the concept of “catastrophic overtraining,” whereby extended pre-training can harm a model’s performance after fine-tuning.

The researchers compared two versions of the OLMo-1B model, one trained on 2.3 trillion tokens and another on 3 trillion. Despite the larger training set, the more extensively trained model reportedly performed up to 3% worse on benchmarks like AlpacaEval and ARC.


Reaching the inflection point

This performance drop, the study claims, is linked to a phenomenon called “progressive sensitivity.”

As the token count increases, the model becomes more fragile. Even small tweaks, like adjustments during fine-tuning, or the introduction of noise, can reverse earlier gains.

The authors demonstrated this by injecting Gaussian noise into pre-trained models, noting that performance degraded more sharply the longer the model was trained.
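
The mechanics of a noise-injection measurement like the one described above can be illustrated with a toy sketch. This is not the authors' code and it does not involve OLMo: the linear model, the made-up dataset, and the function names `mse`, `perturb`, and `mean_degradation` are all assumptions chosen for illustration. It only shows the basic idea of adding zero-mean Gaussian noise to a model's weights and recording how much the loss rises.

```python
import random


def mse(weights, data):
    """Mean squared error of a linear model y = w . x on (x, y) pairs."""
    total = 0.0
    for x, y in data:
        pred = sum(w * xi for w, xi in zip(weights, x))
        total += (pred - y) ** 2
    return total / len(data)


def perturb(weights, sigma, rng):
    """Return a copy of the weights with N(0, sigma^2) noise added to each."""
    return [w + rng.gauss(0.0, sigma) for w in weights]


def mean_degradation(weights, data, sigma, trials=200, seed=0):
    """Average rise in loss after Gaussian perturbation of the weights."""
    rng = random.Random(seed)
    base = mse(weights, data)
    rises = [mse(perturb(weights, sigma, rng), data) - base for _ in range(trials)]
    return sum(rises) / trials


# Hypothetical toy data generated by the "true" weights [2.0, -1.0].
true_w = [2.0, -1.0]
data_rng = random.Random(1)
data = []
for _ in range(50):
    x = [data_rng.uniform(-1, 1), data_rng.uniform(-1, 1)]
    data.append((x, sum(w * xi for w, xi in zip(true_w, x))))

# Larger noise scales should produce a larger average rise in loss.
for sigma in (0.01, 0.1, 0.5):
    print(sigma, mean_degradation(true_w, data, sigma))
```

According to the article, the study applied this kind of perturbation to models with different amounts of pre-training and compared how sharply each degraded. The toy above only varies the noise scale on a single model, so it shows the measurement, not the reported result.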

The point where this additional training starts to degrade performance is called the “inflection point.”

Once that point is reached, the benefits of further training start to be outweighed by the risk of internal instability. The study found that this tipping point often occurs beyond 2.5 trillion tokens in smaller models, like OLMo-1B.

“Catastrophic overtraining may be inevitable… especially when the pre-training and fine-tuning tasks are misaligned,” the authors warn in their paper, which you can access through the arXiv pre-print server.

While the researchers are not suggesting an end to pre-training, they do feel that developers should consider just how much pre-training is enough. As the paper concludes, “Our findings call for a renewed focus on model scaling that considers the entire training pipeline.”

For AI developers chasing scale, the message seems clear: sometimes, less really is more.

Copyright © 2024 Defi Daily.
Defi Daily is not responsible for the content of external sites.
