National Cyber Warfare Foundation (NCWF)

National Cyber Warfare Foundation (NCWF) Forums

Anthropic claims that its new Sonnet 3.5 model scores 49% on SWE-bench Verified, up from 33.4% and "higher than all" public models, and debu

0 user ratings

2024-10-22 16:04:05
milo
Developers

Anthropic:

Anthropic claims that its new Sonnet 3.5 model scores 49% on SWE-bench Verified, up from 33.4% and “higher than all” public models, and debuts Claude 3.5 Haiku — Today, we're announcing an upgraded Claude 3.5 Sonnet, and a new model, Claude 3.5 Haiku.

Anthropic:

Anthropic claims that its new Sonnet 3.5 model scores 49% on SWE-bench Verified, up from 33.4% and “higher than all” public models, and debuts Claude 3.5 Haiku — Today, we're announcing an upgraded Claude 3.5 Sonnet, and a new model, Claude 3.5 Haiku.

Source: TechMeme
Source Link: http://www.techmeme.com/241022/p14#a241022p14

Comments	new comment
Nobody has commented yet. Will you be the first?

Forum

Copyright 2012 through 2024 - National Cyber Warfare Foundation - All rights reserved worldwide.