National Cyber Warfare Foundation (NCWF)

Scale AI and CAIS' Remote Labor Index benchmark, which tests AI agents on freelance tasks, finds the best AI could perform just <3% of the wor


0 user ratings
2025-10-30 10:33:03
milo
Developers

Will Knight / Wired:

Scale AI and CAIS' Remote Labor Index benchmark, which tests AI agents on freelance tasks, finds the best AI could perform just <3% of the work, earning $1,810  —  A new benchmark measures how well AI agents can automate economically valuable chores.  Human-level AI is still some ways off.




Will Knight / Wired:

Scale AI and CAIS' Remote Labor Index benchmark, which tests AI agents on freelance tasks, finds the best AI could perform just <3% of the work, earning $1,810  —  A new benchmark measures how well AI agents can automate economically valuable chores.  Human-level AI is still some ways off.



Source: TechMeme
Source Link: http://www.techmeme.com/251030/p13#a251030p13


Comments
new comment
Nobody has commented yet. Will you be the first?
 
Forum
Developers



Copyright 2012 through 2025 - National Cyber Warfare Foundation - All rights reserved worldwide.