Published on: 2025-04-30 11:46:22
Earlier this week, Meta landed in hot water for using an experimental, unreleased version of its Llama 4 Maverick model to achieve a high score on a crowdsourced benchmark, LM Arena. The incident prompted the maintainers of LM Arena to apologize, change their policies, and score the unmodified, vanilla Maverick. Turns out, it’s not very competitive. The unmodified Maverick, “Llama-4-Maverick-17B-128E-Instruct,” was ranked below models including OpenAI’s GPT-4o, Anthropic’s Claude 3.5 Sonnet, a
Keywords: arena experimental llama maverick version
Published on: 2025-05-08 11:29:07
is a senior AI reporter working with The Verge’s policy and tech teams. She previously worked at Fortune Magazine and Business Insider. Over the weekend, Meta dropped two new Llama 4 models: a smaller model named Scout, and Maverick, a mid-size model that the company claims can beat GPT-4o and Gemini 2.0 Flash “across a broad range of widely reported benchmarks.” Maverick quickly secured the number-two spot on LMArena, the AI benchmark site where humans compare outputs from different systems a
Keywords: ai llama maverick meta model
Published on: 2025-05-09 01:35:04
The 2025 Ford Maverick remains the sole compact pickup truck on sale in North America—and it’s been a smash success since it launched, with Ford adding a third shift at its plant to meet the demand. That suggests not every American pickup buyer needs something the size of Ford’s F-Series, which remains the best-selling vehicle line in the US after four decades. In particular, the Maverick made headlines in 2022 by offering a hybrid version that returned a 37 mpg combined EPA rating—against 26 m
Keywords: ford hybrid maverick new vehicle
Published on: 2025-05-10 11:45:07
A Meta exec on Monday denied a rumor that the company trained its new AI models to present well on specific benchmarks while concealing the models’ weaknesses. The executive, Ahmad Al-Dahle, VP of generative AI at Meta, said in a post on X that it’s “simply not true” that Meta trained its Llama 4 Maverick and Llama 4 Scout models on “test sets.” In AI benchmarks, test sets are collections of data used to evaluate the performance of a model after it’s been trained. Training on a test set could m
Keywords: maverick meta model models rumor
Published on: 2025-05-11 16:16:03
One of the new flagship AI models Meta released on Saturday, Maverick, ranks second on LM Arena, a test that has human raters compare the outputs of models and choose which they prefer. But it seems the version of Maverick that Meta deployed to LM Arena differs from the version that’s widely available to developers. As several AI researchers pointed out on X, Meta noted in its announcement that the Maverick on LM Arena is an “experimental chat version.” A chart on the official Llama website, me
Keywords: arena lm maverick meta model
Published on: 2025-05-11 09:33:56
We’re sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences. Llama 4 Scout, a 17 billion active parameter model with 16 experts, is the best multimodal model in the world in its class and is more powerful than all previous generation Llama models, while fitting in a single NVIDIA H100 GPU. Additionally, Llama 4 Scout offers an industry-leading context window of 10M and delivers better results than Gemma 3, Gemini 2.0 Flash-Lite,
Keywords: llama maverick model models scout
Go K’awiil is a project by nerdhub.co that curates technology news from a variety of trusted sources. We built this site because, although news aggregation is incredibly useful, many platforms are cluttered with intrusive ads and heavy JavaScript that can make mobile browsing a hassle. By hand-selecting our favorite tech news outlets, we’ve created a cleaner, more mobile-friendly experience.
Your privacy is important to us. Go K’awiil does not use analytics tools such as Facebook Pixel or Google Analytics. The only tracking occurs through affiliate links to amazon.com, which are tagged with our Amazon affiliate code, helping us earn a small commission.
We are not currently offering ad space. However, if you’re interested in advertising with us, please get in touch at [email protected] and we’ll be happy to review your submission.