Anthropic news anthropic.com

via techmeme 5 hours ago
Anthropic unveils BioMysteryBench to test Claude's bioinformatics skills against human experts, and says Mythos solved ~30% of 23 questions that stumped experts. In the post, Brianna, a researcher on the discovery team, shares results from a recent bioinformatics benchmarking effort.
