
Boaz Barak / Windows On Theory:
Research: AI's ability to complete long and complex software engineering tasks doubles every 6-7 months, but there is a “messiness tax” for real-world tasks — METR has had a very influential work by Kwa and West et al on measuring AI's ability to complete long tasks.

Boaz Barak / Windows On Theory:
Research: AI's ability to complete long and complex software engineering tasks doubles every 6-7 months, but there is a “messiness tax” for real-world tasks — METR has had a very influential work by Kwa and West et al on measuring AI's ability to complete long tasks.
Source: TechMeme
Source Link: http://www.techmeme.com/251105/p20#a251105p20