Anthropic:
Anthropic claims that its upgraded Claude 3.5 Sonnet model scores 49% on SWE-bench Verified, up from 33.4% and “higher than all” public models, and debuts Claude 3.5 Haiku. From the announcement: "Today, we're announcing an upgraded Claude 3.5 Sonnet, and a new model, Claude 3.5 Haiku."
Source: TechMeme
Source Link: http://www.techmeme.com/241022/p14#a241022p14