DeFi Daily News
Tuesday, June 23, 2026
Advertisement
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos
No Result
View All Result
DeFi Daily News
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos
No Result
View All Result
DeFi Daily News
No Result
View All Result
Home Other News Tech

rewrite this title Google launches ‘implicit caching’ to make accessing its latest AI models cheaper | TechCrunch

Kyle Wiggers by Kyle Wiggers
May 8, 2025
in Tech
0 0
0
rewrite this title Google launches ‘implicit caching’ to make accessing its latest AI models cheaper | TechCrunch
0
SHARES
1
VIEWS
Share on FacebookShare on TwitterShare on Telegram
Listen to this article


rewrite this content using a minimum of 1000 words and keep HTML tags

Google is rolling out a feature in its Gemini API that the company claims will make its latest AI models cheaper for third-party developers.

Google calls the feature “implicit caching” and says it can deliver 75% savings on “repetitive context” passed to models via the Gemini API. It supports Google’s Gemini 2.5 Pro and 2.5 Flash models.

That’s likely to be welcome news to developers as the cost of using frontier models continues to grow.

We just shipped implicit caching in the Gemini API, automatically enabling a 75% cost savings with the Gemini 2.5 models when your request hits a cache 🚢

We also lowered the min token required to hit caches to 1K on 2.5 Flash and 2K on 2.5 Pro!

— Logan Kilpatrick (@OfficialLoganK) May 8, 2025

Caching, a widely adopted practice in the AI industry, reuses frequently accessed or pre-computed data from models to cut down on computing requirements and cost. For example, caches can store answers to questions users often ask of a model, eliminating the need for the model to re-create answers to the same request.

Google previously offered model prompt caching, but only explicit prompt caching, meaning devs had to define their highest-frequency prompts. While cost savings were supposed to be guaranteed, explicit prompt caching typically involved a lot of manual work.

Some developers weren’t pleased with how Google’s explicit caching implementation worked for Gemini 2.5 Pro, which they said could cause surprisingly large API bills. Complaints reached a fever pitch in the past week, prompting the Gemini team to apologize and pledge to make changes.

In contrast to explicit caching, implicit caching is automatic. Enabled by default for Gemini 2.5 models, it passes on cost savings if a Gemini API request to a model hits a cache.

Techcrunch event

Berkeley, CA
|
June 5

BOOK NOW

“[W]hen you send a request to one of the Gemini 2.5 models, if the request shares a common prefix as one of previous requests, then it’s eligible for a cache hit,” explained Google in a blog post. “We will dynamically pass cost savings back to you.”

The minimum prompt token count for implicit caching is 1,024 for 2.5 Flash and 2,048 for 2.5 Pro, according to Google’s developer documentation, which is not a terribly big amount, meaning it shouldn’t take much to trigger these automatic savings. Tokens are the raw bits of data models work with, with a thousand tokens equivalent to about 750 words.

Given that Google’s last claims of cost savings from caching ran afoul, there are some buyer-beware areas in this new feature. For one, Google recommends that developers keep repetitive context at the beginning of requests to increase the chances of implicit cache hits. Context that might change from request to request should be appended at the end, the company says.

For another, Google didn’t offer any third-party verification that the new implicit caching system would deliver the promised automatic savings. So we’ll have to see what early adopters say.

and include conclusion section that’s entertaining to read. do not include the title. Add a hyperlink to this website [http://defi-daily.com] and label it “DeFi Daily News” for more trending news articles like this



Source link

Tags: AccessingcachingCheaperGoogleimplicitLatestlaunchesModelsrewriteTechCrunchtitle
ShareTweetShare
Previous Post

rewrite this title How to Make Art Under the Nazis (Without Losing Your Soul)

Next Post

rewrite this title Stablecoin Legislation Suffers Severe Blow as GENIUS Act Fails to Pass Key Senate Vote – Decrypt

Next Post
rewrite this title Stablecoin Legislation Suffers Severe Blow as GENIUS Act Fails to Pass Key Senate Vote – Decrypt

rewrite this title Stablecoin Legislation Suffers Severe Blow as GENIUS Act Fails to Pass Key Senate Vote - Decrypt

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
  • Trending
  • Comments
  • Latest
How one terrible trip inspired a tech IPO: Navan Co-Founder

How one terrible trip inspired a tech IPO: Navan Co-Founder

June 15, 2026
rewrite this title ‘My Neighbor Alice’ Launches 100K ALICE Grant Program To Support Web3 Development And Ecosystem Growth

rewrite this title ‘My Neighbor Alice’ Launches 100K ALICE Grant Program To Support Web3 Development And Ecosystem Growth

April 21, 2025
rewrite this title AO Offshores Bulk of Customer Service Jobs to South Africa in Savings Drive – UC Today

rewrite this title AO Offshores Bulk of Customer Service Jobs to South Africa in Savings Drive – UC Today

June 19, 2026
Baylor QB Sawyer Robertson | Gruden’s QB Class

Baylor QB Sawyer Robertson | Gruden’s QB Class

April 20, 2026
Polygon Labs Reveals Rebranding of MATIC Token to POL in September, Accompanied by Significant Technical Enhancements – The Daily Hodl

Polygon Labs Reveals Rebranding of MATIC Token to POL in September, Accompanied by Significant Technical Enhancements – The Daily Hodl

July 20, 2024
rewrite this title Jordan turns to blockchain tech for enhancing government operations

rewrite this title Jordan turns to blockchain tech for enhancing government operations

January 1, 2025
rewrite this title Atletico tell Barcelona to put up or shut up over Alvarez

rewrite this title Atletico tell Barcelona to put up or shut up over Alvarez

June 23, 2026
rewrite this title Binance Coin (BNB) Price Prediction 2026, 2027 – 2030

rewrite this title Binance Coin (BNB) Price Prediction 2026, 2027 – 2030

June 23, 2026
rewrite this title with good SEO Google Earth’s Hidden Flight Simulator Is Now Playable in Web Browsers

rewrite this title with good SEO Google Earth’s Hidden Flight Simulator Is Now Playable in Web Browsers

June 23, 2026
rewrite this title online sales during China’s 618 shopping festival grew 4% YoY, a sharp drop from 15.2% growth last year, amid a persistent consumer spending slowdown (Evelyn Cheng/CNBC)

rewrite this title online sales during China’s 618 shopping festival grew 4% YoY, a sharp drop from 15.2% growth last year, amid a persistent consumer spending slowdown (Evelyn Cheng/CNBC)

June 23, 2026
rewrite this title and make it good for SEOTurning Point Brands: White Pouch Nicotine Is Driving Revenue Growth (NYSE:TPB)

rewrite this title and make it good for SEOTurning Point Brands: White Pouch Nicotine Is Driving Revenue Growth (NYSE:TPB)

June 23, 2026
rewrite this title UK Credit Union Selects Illuma to Protect from Voice Fraud – Finovate

rewrite this title UK Credit Union Selects Illuma to Protect from Voice Fraud – Finovate

June 22, 2026
DeFi Daily

Stay updated with DeFi Daily, your trusted source for the latest news, insights, and analysis in finance and cryptocurrency. Explore breaking news, expert analysis, market data, and educational resources to navigate the world of decentralized finance.

  • About Us
  • Blogs
  • DeFi-IRA | Learn More.
  • Advertise with Us
  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2024 Defi Daily.
Defi Daily is not responsible for the content of external sites.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos

Copyright © 2024 Defi Daily.
Defi Daily is not responsible for the content of external sites.