Trying to break OpenAI's new o1 models? You might get banned

2 months ago 33

BOOK THIS SPACE FOR AD

ARTICLE AD

John Lund/Getty Images

Even the smartest AI models are prone to hallucinations, which can be amusing when provoked. May I remind you of glue pizza? However, if you try to induce hallucinations in OpenAI's advanced o1 reasoning models, you may lose access to the model altogether.

OpenAI unveiled its o1 models last week, which were trained to "think before they speak" and, as a result, are capable of solving complex math, science, and coding problems using advanced reasoning. With a model touting such impressive capabilities, naturally, people set out to break its string of reasoning.

Also: How well can OpenAI's o1-preview code? It aced my 4 tests - and showed its work in surprising detail

However, as first spotted by Wired, users who tried to do so got warnings within the chatbot interface, informing them that their actions violated OpenAI's terms of use and usage policies. The user actions included mentioning terms such as "reasoning trace" or "reasoning."

Furthermore, a user shared the OpenAI ChatGPT Policy Violation email via X, which informed them the system detected a policy violation for "attempting to circumvent safeguards or safety mitigations in our [OpenAI's] services." The email also requested that the user "halt" that activity. Although the email screenshot did not specify the consequences, OpenAI delineates the consequences of such violations in its Terms of Use documentation.

Per OpenAI's Terms of Use, last updated on January 31, 2024, the company reserves the right to "suspend or terminate your access to our Services or delete your account" if they determine that a user breached the Terms or Usage Policies, could cause risk or harm to OpenAI and other users, or do not comply with the law.

Reactions to these policies have been a mixed bag, with some people complaining that these limitations hinder proper red-teaming, while others are glad that active precautions are being taken to protect against loopholes in newer models.

If you want to try the o1 models for yourself, you can create a free ChatGPT account, sign in, toggle "alpha modes" from the model picker, and choose o1-mini. If you want to try o1-preview, you'll have to subscribe to a ChatGPT Plus account for $20 per month.

Editorial standards

Read Entire Article

LEFT SIDEBAR AD

Trying to break OpenAI's new o1 models? You might get banned

BOOK THIS SPACE FOR AD

Related

This retractable USB-C charger is my new favorite travel accessory (and it's on sale for Black Friday)

Skip the iPad: This tablet is redefining what a kids tablet can do, and it's 42% off for Black Friday

Why the iPad Mini 7 is the ultraportable tablet to beat this holiday travel season - and it's $50 off

This monster 240W charger has features I've never seen on other accessories (and get $60 off this Black Friday)

I tested the world's fastest SSD and the results will make power users cry (and now you can save over $50)

This power bank is thinner than your iPhone and this Black Friday deal slashes 27% off the price

Trending

Popular

Install waybackurls on Kali Linux

1-click RCE in Electron Applications

Microsoft Office Professional Plus 2019 (x64 & x86) Multilingual + Pre-Activated

Over 40 Apps With More Than 100 Million Installs Found Leaking AWS Keys

Install DalFox on Kali Linux

Adobe Master Collection CC 2022 v25.08.2022 (x64) Multilingual Pre-Activated

Maxon CINEMA 4D Studio S22.123 (x64) Multilingual + Crack

Autodesk Revit 2023 R1 Build 23.0.11.19 (x64) Multilingual + Crack

‘We are not motivated by profits’ – Open Bug Bounty maintainers on finding a niche in the crowdsourced AppSec market

Just Gopher It: Escalating a Blind SSRF to RCE for $15k

BOOK THIS SPACE FOR AD