Will the Jan 2024 version of the LLM detector "Binoculars" be effective against OpenAI's best model at end 2024? | Manifold

Will the Jan 2024 version of the LLM detector "Binoculars" be effective against OpenAI's best model at end 2024?

Mini

10

Ṁ299

Jan 1

59%

chance

1D

1W

1M

ALL

https://huggingface.co/spaces/tomg-group-umd/Binoculars

Over a wide range of document types, Binoculars detects over 90% of generated samples from ChatGPT (and other LLMs) at a false positive rate of 0.01%, despite not being trained on any ChatGPT data
Is there a correlation between Binoculars score and sequence length? Such correlations may create a bias towards incorrect results for certain lengths. In Figure 12, we show the joint distribution of token sequence length and Binocular score. Sequence length offers little information about class membership

I ran my own test here and here and it was very effective. Will it last? I'll rerun a similar test and make a subjective judgement as to whether it's effective. The target would be roughly >=90% true negative, >=95% true positive.

#Technical AI Timelines

Get

1,000

and

1.00

Sort by:

opened a Ṁ5,000 YES at 70% order

take my order?

Related questions

Will openAI have the most accurate LLM across most benchmarks by EOY 2024?

Will OpenAI have the best LLM in 2024?

Will an LLM be able to match the ground truth >85% of the time when performing PII detection by 2024 end?

Will OpenAI's next major LLM (after GPT-4) achieve over 50% resolution rate on the SWE-bench benchmark?

Will the most interesting AI in 2027 be a LLM?

What will be true of OpenAI's best LLM by EOY 2025?

OpenAI to release model weights by EOY?

Will there exist an LLM which can beat the latest version of AlphaZero by EOY 2024?

There will be one LLM/AI that is at least 10x better than all others in 2027

Will there be a free, public way to generate LLM text that evades jan2024 llm detector 'binoculars' by the end of 2024?

Related questions

Will openAI have the most accurate LLM across most benchmarks by EOY 2024?

What will be true of OpenAI's best LLM by EOY 2025?

Will OpenAI have the best LLM in 2024?

OpenAI to release model weights by EOY?

Will an LLM be able to match the ground truth >85% of the time when performing PII detection by 2024 end?

Will there exist an LLM which can beat the latest version of AlphaZero by EOY 2024?

Will OpenAI's next major LLM (after GPT-4) achieve over 50% resolution rate on the SWE-bench benchmark?

There will be one LLM/AI that is at least 10x better than all others in 2027

Will the most interesting AI in 2027 be a LLM?

Will there be a free, public way to generate LLM text that evades jan2024 llm detector 'binoculars' by the end of 2024?

Terms & Conditions•Privacy Policy•Sweepstakes Rules