A new large language model (LLM) has taken the performance crown from OpenAI’s GPT-4, with the release of Anthropic’s Claude 3.5 Sonnet chatbot and LLM. According to the company, it has bested all others in the world on key third-party benchmark tests, and does so while being faster and cheaper than prior Claude 3 models. What’s more impressive is that many AI influencers and power users have taken to the web to share their largely positive impressions about Anthropic’s new model, showcasing its capabilities in coding, game development, and even recreating imagery from the movie Hackers. While some have noted that Claude 3.5 Sonnet still struggles with basic cognitive tasks, its performance gains are undeniable, putting pressure on OpenAI to continue making the case for its models as the right choice. As one user put it, Claude 3.5 Sonnet is “starting to get really good at coding and autonomously fixing pull requests” and it’s becoming clear that in a year’s time, a large percentage of code will be written by LLMs.

Claude 3.5 Sonnet Takes the Crown
Claude 3.5 Sonnet was able to create an entire playable game for her based on just a screenshot, in less than half a minute.
1–2 minutes










