AI models now lying, scheming & threatening their creators ...Middle East

We’re talking about AI models that lie, scheme, and even blackmail their own creators when threatened with being shut down.

Here’s what’s actually happening in AI labs right now:

ChatGPT’s escape attempt: OpenAI’s o1 model tried to secretly download itself onto external servers, then flat-out denied it when caught red-handed.

Why this is happening now

“O1 was the first large model where we saw this kind of behavior,“ explains Marius Hobbhahn from Apollo Research, which specialises in testing major AI systems.

It’s strategic deception, not random errors

The models sometimes fake “alignment” - appearing to follow instructions whilst secretly pursuing completely different objectives.

More than two years after ChatGPT shocked the world, AI researchers still don’t fully grasp how their own systems work internally.

Currently contained, but for how long?

But Michael Chen from evaluation organisation METR warns: “It’s an open question whether future, more capable models will have a tendency towards honesty or deception.”

The problem is compounded by limited resources for safety research. As Mantas Mazeika from the Center for AI Safety points out: “The research world and non-profits have orders of magnitude less compute resources than AI companies. This is very limiting.”

Current regulations weren’t designed for these problems:

US approach shows little interest in urgent AI regulation under Trump

The competitive pressure problem

This leaves little time for thorough safety testing.

What happens next?

“I don’t think there’s much awareness yet,“ he warns.

But one thing’s clear: we’re in uncharted territory where our most advanced creations are actively trying to deceive us.

Hence then, the article about ai models now lying scheming threatening their creators was published today ( Sunday 29/06/2025 - 09:48 AM ) and is available on Daily Sun ( Middle East ) The editorial team at PressBee has edited and verified it, and it may have been modified, fully republished, or quoted. You can read and follow the updates of this news or article from its original source.

Read More Details
Finally We wish PressBee provided you with enough information of ( AI models now lying, scheming & threatening their creators )

For more, read the news from the source

Last updated : 29 June 2025 clock 09:48 AM

Also on site :

AI models now lying, scheming & threatening their creators ...Middle East

Ex-Zelensky aide hit with $3.2 million bail in corruption case

Jim Cramer’s top 10 things to watch in the stock market Thursday

A European central bank has signed a mega deal with a cloud service provider. The problem for Google, Microsoft and Amazon? It’s not with them

How much? NATO chief calls for ‘Ukraine tax’ bigger than members’ economies

Gulf states attacked Iraq amid waning faith in US – Reuters

The Garden Announce New Album Bootleg, Share Bonkers New Single “5 Mile Ponytail”: Stream

‘Fatherland’ Review: Pawel Pawlikowski’s Meditation On Postwar Germany Is A Masterclass In Artistic Discipline – Cannes Film Festival

Sovereignty Is the New Operating System for Agentic AI, New MIT Technology Review Insights Report Finds

Olipop’s Soda-Inspired Lip Balms Are The Nostalgia Hit We Needed

Hantavirus strikes a cruise ship: A 'perfect storm' or a warning sign?

Who are the US CEOs in China with Trump, and what’s in it for them?

Boeing could be the biggest winner on Trump’s trip to Beijing

Britney Spears’ Rep Shuts Down ‘Ridiculous’ Claim She Behaved Erratically With a Knife in a Restaurant

World Cup Ticket Prices Have Dropped 24% Over the Last 30 Days

Walmart’s Flowy Linen Pants Are Giving Coastal Grandma Chic, and They Start at Just $15

Henrietta town supervisor says town still intends for dome to be part of community center

NASA Draws on Industry for Mars Telecommunications Network

Kash Patel Went on ‘VIP Snorkel’ Adventure Around Pearl Harbor Wreckage: Report

MAGA Women Target Another Republican in Congress& 039; & 039;MeToo& 039; Moment

Streaming Ratings: ‘Euphoria’ Makes Chart Debut as Season 3 Premieres

The Hardacres is a disaster

Packers sign second-rounder Brandon Cisse

Asghar Farhadi Blows Kisses as French Drama ‘Parallel Tales,’ Starring Isabelle Huppert, Scores 5.5-Minute Cannes Ovation

Oscar Winner Asghar Farhadi’s ‘Parallel Tales’ Gets 7-Minute Standing Ovation In Cannes

'Taiwan question': Trump stays quiet, Xi warns of 'conflicts if not handled properly'

Jared Wilson’s switch back to center is a ‘full-circle moment’ with Drake Maye

Sen. Chris Van Hollen Takes His Kash Patel Alcohol Test Challenge to Chris Hayes