DeFi Daily News
Friday, June 19, 2026
Advertisement
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos
No Result
View All Result
DeFi Daily News
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos
No Result
View All Result
DeFi Daily News
No Result
View All Result
Home Markets Stock Market

rewrite this title Chart of the Week: AI Is a Black Box

Ian King by Ian King
June 18, 2026
in Stock Market
0 0
0
rewrite this title Chart of the Week: AI Is a Black Box
0
SHARES
0
VIEWS
Share on FacebookShare on TwitterShare on Telegram
Listen to this article


rewrite this content using a minimum of 1000 words and keep HTML tags

A strange thing happened last week.

Anthropic was forced to take its newest AI models offline only days after releasing them.

The company’s new Fable 5 and Mythos 5 systems were designed to be some of the most powerful AI models ever released. But shortly after launch, researchers discovered ways to get around some of the models’ built-in safety measures.

Government officials soon got involved as fears spread that these systems could become powerful cybersecurity weapons in the wrong hands.

Maybe those concerns were justified, and maybe they weren’t.

But to me, they raise an obvious question that not enough people are asking.

How would anyone know?

What’s Inside the Box?

Modern AI systems aren’t like traditional software.

Engineers don’t sit down and write lines of code telling them exactly how to reason through a problem.

Instead, researchers train these systems and then observe their behavior.

The result is what many researchers call a black box.

We can see what goes in, and we can see what comes out.

But what happens in between is often much harder to explain.

That’s why companies like Anthropic spend so much time studying AI interpretability, or the science of understanding how these systems arrive at their conclusions.

And that brings us to this week’s chart.

Because a group of researchers recently performed a strange experiment.

They secretly modified an AI model’s internal state. Then they asked whether the model could detect that something had changed.

Image: Uzay Macar and Li Yang

This chart might look complicated, but the basic idea is simple.

Researchers injected information directly into an AI model’s internal processing, then tested whether it could tell the difference between those injections and its normal thought process.

The chart compares three versions of the same model.

The first is the Base model, the raw AI system before it receives additional training.

The second is the Instruct model, which was trained to behave more like the helpful AI assistants most people interact with today.

The third is an Abliterated version of the model, where some of the refusal and safety behaviors were removed.

The blue line shows how often the model correctly detected a real change, while the orange line shows how often it falsely claimed that something changed when nothing had actually happened.

And the results are surprising.

The Base model performed poorly. When researchers secretly altered its internal processing, it often couldn’t tell the difference between a real change and a false alarm.

But the Instruct model performed much better.

Somewhere during the additional training process, the model appears to have developed an ability to recognize when something unusual had happened inside its own processing.

And in several cases, the Abliterated model performed even better still.

In other words, removing some of the AI’s safety and refusal behaviors actually improved the model’s ability to detect what was going on inside it.

That doesn’t mean the model became conscious or self-aware.

You can compare it to a computer server that detects when someone has tampered with its memory. The server isn’t aware of anything, but it can still recognize when something unusual has happened.

Researchers believe something similar happened here.

More importantly, they think capabilities like this could eventually help us better understand what’s happening inside advanced AI systems.

After all, these models have access to information that remains largely hidden from the people studying them.

Which means one way researchers could eventually learn more about advanced AI systems is by asking the systems themselves.

That might seem counterintuitive.

But it would give researchers something they’ve never really had before.

A window into what’s happening inside the model itself.

Here’s My Take

The primary goal of the AI industry has been to build more capable models.

But another challenge is gaining urgency.

Understanding them.

The controversy surrounding Anthropic’s latest models shows why we need to get a handle on this issue sooner than later.

Because it’s one thing to build a powerful AI system. It’s something else entirely to create a new form of intelligence yet only partially understand how it works.

So here’s my question to you:

If future AI systems become too complex for humans to fully understand on their own, would you trust AI to help explain what’s happening inside other AI models?

Or does that sound like asking the fox to guard the henhouse?

I’d love to hear what you think.

Let me know at dailydisruptor@banyanhill.com.

We won’t reveal your full name in the event we publish a response, so feel free to share your honest opinion.

Regards,

Ian King's SignatureIan KingChief Strategist, Banyan Hill Publishing

and include conclusion section that’s entertaining to read. do not include the title. Add a hyperlink to this website http://defi-daily.com and label it “DeFi Daily News” for more trending news articles like this



Source link

Tags: BlackBoxChartrewritetitleWeek
ShareTweetShare
Previous Post

LIVE: White House Press Briefing with Vice President JD Vance

Next Post

rewrite this title with good SEO Bitcoin Price Falls To $62,000 As Hawkish Fed Shift Raises Risk Of Deeper Pullback

Next Post
rewrite this title with good SEO Bitcoin Price Falls To ,000 As Hawkish Fed Shift Raises Risk Of Deeper Pullback

rewrite this title with good SEO Bitcoin Price Falls To $62,000 As Hawkish Fed Shift Raises Risk Of Deeper Pullback

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
  • Trending
  • Comments
  • Latest
Baylor QB Sawyer Robertson | Gruden’s QB Class

Baylor QB Sawyer Robertson | Gruden’s QB Class

April 20, 2026
You don’t fix the Fed. You opt out of needing it.

You don’t fix the Fed. You opt out of needing it.

May 22, 2026
Exclusive Shopkick Deal: Get a FREE Gift Card Worth - for Every User!

Exclusive Shopkick Deal: Get a FREE Gift Card Worth $3-$5 for Every User!

October 24, 2024
How one terrible trip inspired a tech IPO: Navan Co-Founder

How one terrible trip inspired a tech IPO: Navan Co-Founder

June 15, 2026
Samsung’s Galaxy Buds Series 3 Have a New Look You May or May Not Like

Samsung’s Galaxy Buds Series 3 Have a New Look You May or May Not Like

July 10, 2024
EigenLayer’s Price Drops 15% Following Justin Sun Sell-Off; Investors Shift to High-Yield 6,173% APY Vote-to-Earn Meme Coin

EigenLayer’s Price Drops 15% Following Justin Sun Sell-Off; Investors Shift to High-Yield 6,173% APY Vote-to-Earn Meme Coin

October 3, 2024
Get a HELOC to Pay Off My AMEX and buy a new car?

Get a HELOC to Pay Off My AMEX and buy a new car?

June 19, 2026
rewrite this title Wise Acquires International Living Guidance Expert Expatica – Finovate

rewrite this title Wise Acquires International Living Guidance Expert Expatica – Finovate

June 19, 2026
rewrite this title with good SEO US Department of War Seeks  Billion for Iran War as Deficit Fears Boost Bitcoin’s Case

rewrite this title with good SEO US Department of War Seeks $80 Billion for Iran War as Deficit Fears Boost Bitcoin’s Case

June 19, 2026
rewrite this title Exercise Tiger, 'Disastrous' D-Day Rehearsal in World War 2, Cost 749 Lives – Yet Many Have Never Heard of It

rewrite this title Exercise Tiger, 'Disastrous' D-Day Rehearsal in World War 2, Cost 749 Lives – Yet Many Have Never Heard of It

June 19, 2026
rewrite this title Amazon MGM Studios drops Luca Guadagnino’s mostly finished movie on Sam Altman; Amazon struck a major deal with OpenAI in February, including a B investment (Variety)

rewrite this title Amazon MGM Studios drops Luca Guadagnino’s mostly finished movie on Sam Altman; Amazon struck a major deal with OpenAI in February, including a $50B investment (Variety)

June 19, 2026
rewrite this title Hyperliquid Price is Approaching a Make-or-Break Zone After 250% Rally—Can HYPE Push to ?

rewrite this title Hyperliquid Price is Approaching a Make-or-Break Zone After 250% Rally—Can HYPE Push to $90?

June 19, 2026
DeFi Daily

Stay updated with DeFi Daily, your trusted source for the latest news, insights, and analysis in finance and cryptocurrency. Explore breaking news, expert analysis, market data, and educational resources to navigate the world of decentralized finance.

  • About Us
  • Blogs
  • DeFi-IRA | Learn More.
  • Advertise with Us
  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2024 Defi Daily.
Defi Daily is not responsible for the content of external sites.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos

Copyright © 2024 Defi Daily.
Defi Daily is not responsible for the content of external sites.