National Cyber Warfare Foundation (NCWF)

An evaluation of six frontier AI models for in-context scheming when strongly nudged to pursue a goal: only OpenAI's o1 was capable of scheming i


0 user ratings
2024-12-06 12:00:42
milo
Developers , Cybersecurity Business

 - archive -- 

Marius Hobbhahn / Apollo Research:

An evaluation of six frontier AI models for in-context scheming when strongly nudged to pursue a goal: only OpenAI's o1 was capable of scheming in all the tests  —  Paper: You can find the detailed paper here.  —  Transcripts: We provide a list of cherry-picked transcripts here.




Marius Hobbhahn / Apollo Research:

An evaluation of six frontier AI models for in-context scheming when strongly nudged to pursue a goal: only OpenAI's o1 was capable of scheming in all the tests  —  Paper: You can find the detailed paper here.  —  Transcripts: We provide a list of cherry-picked transcripts here.



Source: TechMeme
Source Link: http://www.techmeme.com/241206/p10#a241206p10


Comments
new comment
Nobody has commented yet. Will you be the first?
 
Forum
Developers
Cybersecurity Business



Copyright 2012 through 2025 - National Cyber Warfare Foundation - All rights reserved worldwide.