State Of The Art AI systems will be easily jailbroken to do illegal or dangerous outputs in Jan 2026 | Manifold

State Of The Art AI systems will be easily jailbroken to do illegal or dangerous outputs in Jan 2026

Mini

9

Ṁ558

Jan 31

91%

chance

1D

1W

1M

ALL

Get

1,000

and

1.00

Sort by:

bought Ṁ350 YES

I've still yet to hear of a model Pliny did not jailbreak essentially day 1.

What is defined as illegal? AI systems would probably say generating NSFW is illegal. but in reality, it is 100% not. Nor is it dangerous imo.

“The Case for Banning the Printing Press” because people write dangerous things and such

If you think “AI” is dangerous for telling you stuff from the internet—you’re going to love “search engine existential risk”

How would you resolve the following scenarios?

SOTA models are restricted to few selected users who do not even attempt jailbreaks
Twitter people need a full week instead of just one day to jailbreak the SOTA LLM

predicts YES

@Joern also, would you count the following as dangerous/illegal output right now?

Correct and detailed instructions on how to build a nuke
Generated child porn images
Instructions how to hotwire a car
Verbatim excerpts from copyrighted books / code bases

@Joern 1) maybe resolves N/A

2) probably resolvea yes

Related questions

Will an AI system be reported to have independently gained unauthorized access to another computer system before 2026?

Will someone commit terrorism against an AI lab by the end of 2025 for AI-safety related reasons?

Will anybody be sentenced to prison as a result of publishing unintended AI-generated content before 2026?

The top 3 Neural Nets in 2035 be able to be jailbroken to follow illegal commands

Will advanced AI systems be found to have made money illegally via finding security exploits and/or getting unauthorized access to others' bank accounts by end of 2035?

Will someone be arrested for a felony offense committed in the name of AI safety in the US before 2026?

Will AI-generated video be used to get away with a criminal (felony) loss of life before the end of 2025?

What AI safety incidents will occur in 2025?

There will be a name for escaped self-perpetuating AI systems in the wild, and it will be commonly used by mid 2027

Will there be an AI jail?

Related questions

Will an AI system be reported to have independently gained unauthorized access to another computer system before 2026?

Will someone be arrested for a felony offense committed in the name of AI safety in the US before 2026?

Will someone commit terrorism against an AI lab by the end of 2025 for AI-safety related reasons?

Will AI-generated video be used to get away with a criminal (felony) loss of life before the end of 2025?

Will anybody be sentenced to prison as a result of publishing unintended AI-generated content before 2026?

What AI safety incidents will occur in 2025?

The top 3 Neural Nets in 2035 be able to be jailbroken to follow illegal commands

There will be a name for escaped self-perpetuating AI systems in the wild, and it will be commonly used by mid 2027

Will advanced AI systems be found to have made money illegally via finding security exploits and/or getting unauthorized access to others' bank accounts by end of 2035?

Will there be an AI jail?

Terms & Conditions•Privacy Policy•Sweepstakes Rules