GoKawiil - Latest Tech News & Aggregated Headlines

Apple study shows LLMs also benefit from the oldest productivity trick in the book

9to5mac.com Marcus Mendes 2025-11-02 11:53:37

In a new study co-authored by Apple researchers, an open-source large language model (LLM) saw big performance improvements after being told to check its own work by using one simple productivity trick. Here are the details. A bit of context: After an LLM is trained, its quality is usually refined further through a post-training step known as reinforcement learning from human feedback (RLHF). With RLHF, every time a model gives an answer, human labelers can either give it a thumbs up, which re

Topics: checklist model models researchers study


New AI attack hides data-theft prompts in downscaled images

bleepingcomputer.com Unknown 2025-11-02 11:34:57

Researchers have developed a novel attack that steals user data by injecting malicious prompts into images processed by AI systems before delivering them to a large language model. The method relies on full-resolution images carrying instructions that are invisible to the human eye but become apparent when the image quality is lowered through resampling algorithms. Developed by Trail of Bits researchers Kikimora Morozova and Suha Sabi Hussain, the attack builds upon a theory presented in a 2020 USENIX

Topics: ai attack image model researchers


Something Extremely Scary Happens When Advanced AI Tries to Give Medical Advice to Real World Patients

futurism.com Unknown 2025-11-02 15:27:39

Last week, Google AI pioneer Jad Tarifi sparked controversy when he told Business Insider that it no longer makes sense to get a medical degree since, in his telling, artificial intelligence will render such an education obsolete by the time you're a practicing doctor. Companies have long touted the tech as a way to free up the time of overworked doctors and even aid them in specialized skills, including scanning medical imagery for tumors. Hospitals ha

Topics: ai bedi medical models questions


How to make things slower so they go faster

news.ycombinator.com Unknown 2025-10-31 23:10:10

Synchronized demand is the moment a large cohort of clients acts almost together. In a service with capacity $\mu$ requests per second and background load $\lambda_0$, the usable headroom is $H = \mu - \lambda_0 > 0$. When $M$ clients align—after a cache expiry, at a cron boundary, or as a service returns from an outage—the bucketed arrival rate can exceed $H$ by large factors. Queues form, timeouts propagate, retries synchronize, and a minor disturbance becomes a major incident. The task is to

Topics: delay headroom rate time varepsilon
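
The headroom arithmetic in the excerpt above can be made concrete with a small sketch. The excerpt is cut off before the article states its own method, so everything below is an assumption made for illustration: the function names are hypothetical, and the spreading rule (uniform random delays over a window W = M / H) is one common way to keep the average added arrival rate at or below H. It says nothing about short bursts inside the window.

```python
import random


def headroom(mu: float, lambda0: float) -> float:
    """Usable headroom H = mu - lambda0, in requests per second."""
    h = mu - lambda0
    if h <= 0:
        raise ValueError("no headroom: background load meets or exceeds capacity")
    return h


def min_spread_window(m_clients: int, mu: float, lambda0: float) -> float:
    """Shortest window W (seconds) for which M clients spread evenly over W
    add an average rate M / W that does not exceed H."""
    return m_clients / headroom(mu, lambda0)


def jittered_delays(m_clients: int, window: float, seed: int = 0) -> list[float]:
    """One uniform random delay in [0, window] per client (illustrative jitter)."""
    rng = random.Random(seed)
    return [rng.uniform(0.0, window) for _ in range(m_clients)]


# Example numbers (assumed): capacity 1000 req/s, background load 800 req/s,
# and 10,000 clients that would otherwise all fire at the same instant.
window = min_spread_window(10_000, mu=1000.0, lambda0=800.0)
delays = jittered_delays(10_000, window)
```

With these assumed numbers the headroom is 200 requests per second, so the clients need to be spread over at least 50 seconds for their average added rate to fit.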


We Tested All the Cheapest Meal Kits and Prepared Meal Services. These Are the Best

cnet.com Unknown 2025-11-02 21:04:00

Our meal delivery experts have spent more than five years cooking more than 75 meal kits and tasting upwards of 100 prepared meals from popular meal delivery services to land on our list of the best cheap meal subscriptions. Life’s busier than ever -- at least it seems that way -- and carving out time to plan a week’s

Topics: delivery kits meal meals service


Pixel-art cyberpunk platformer Replaced delayed again, this time until spring 2026

engadget.com Unknown 2025-11-03 02:29:08

The long-awaited cyberpunk platformer Replaced has been delayed once again. Now it's not coming out until next spring. Developer Sad Cat Studio apologized for the delay and said that this is simply "what's right for the project" and that the team remains dedicated to creating a "truly special experience." This isn't the first time the game has suffered a delay. It was originally announced back in 2021, with a pending release date of 2022. That didn't happen. In 2024, the company announced a rel

Topics: company date delays release world


This new delivery robot will bring the entire grocery store to you

theverge.com Andrew J. Hawkins 2025-11-03 11:04:25

A new company aims to take the idea of sidewalk delivery robots and supersize it. Los Angeles-based Robomart unveiled its new delivery robot Monday, with the goal of making “on‑demand delivery work economically.” The level-four autonom

Topics: company delivery rm5 robomart vehicle


Robomart unveils new delivery robot with $3 flat fee to challenge DoorDash, Uber Eats

techcrunch.com Rebecca Szkutak 2025-11-03 16:00:00

Robomart, a startup that builds self-driving delivery robots, is unveiling its latest robot with an ambitious goal of using it to make on-demand delivery profitable. The Los Angeles-based company announced its patented Robomart RM5 on Monday. The level-four autonomous vehicle can carry up to 500 pounds and is made up of 10 individual lockers that hold customer orders. This structure is designed to allow for batch ordering so a robot can work on multiple deliveries at the same time. Robomart pl

Topics: ahmed delivery demand robomart said


Why I'm Now Running Enterprise AI on My Laptop (Without Internet)

news.ycombinator.com Klaudi 2025-11-04 06:18:11

For years, I’ve been skeptical of the AI revolution. Not because I don’t believe in its potential, but because the how always felt broken. The promise of “AI for everyone” came with a hidden cost: my data, my privacy, and my wallet. Every time I used a cloud-based AI service, I’d ask myself: Where is my data going? Who owns it? How much will this cost me

Topics: ai cloud data hugstonone model


IBM and NASA Develop a Digital Twin of the Sun to Predict Future Solar Storms

wired.com Fernanda González 2025-11-04 15:00:00

The Sun’s most complex mysteries could soon be solved thanks to artificial intelligence. On August 20, IBM and NASA announced the launch of Surya, a foundation model for the sun. Having been trained on large datasets of solar activity, this AI tool aims to deepen humanity’s understanding of solar weather and accurately predict solar flares—bursts of electromagnetic radiation emitted by our star that threaten both astronauts in orbit and communications infrastructure on Earth. Surya was trained

Topics: data model solar sun surya


Elon Musk says xAI has open sourced Grok 2.5

techcrunch.com Anthony Ha 2025-11-03 22:00:35

Elon Musk’s xAI has made an older version of its AI model Grok (specifically, the model weights used to shape Grok 2.5) available on the open source platform Hugging Face. “The @xAI Grok 2.5 model, which was our best model last year, is now open source,” Musk wrote on X. He added that Grok 3 “will be made open source in about 6 months.” AI engineer Tim Kellogg described the Grok license as “custom with some anti-competitive terms.” Grok, which is prominently featured on X (which i

Topics: grok model musk source xai


You can now download and tweak Grok 2.5 for yourself as it goes open source

engadget.com Unknown 2025-11-04 12:47:35

Unhinged as Grok may be, it's now open source. xAI's CEO, Elon Musk, posted on X that the company made the older Grok 2.5 model available to the public and will do the same with the upcoming Grok 3. For now, anyone can download, run and even tweak Grok, whose source code was uploaded to the Hugging Face platform. However, there are restrictions to xAI's open-source license, which doesn't let people use Grok to train, create or improve other AI models. It's not the first time xAI has made its mo

Topics: grok model open source xai


The Hidden Ingredients Behind AI’s Creativity

wired.com Webb Wright 2025-11-04 06:00:00

The original version of this story appeared in Quanta Magazine. We were once promised self-driving cars and robot maids. Instead, we’ve seen the rise of artificial intelligence systems that can beat us in chess, analyze huge reams of text, and compose sonnets. This has been one of the great surprises of the modern era: physical tasks that are easy for humans turn out to be very difficult for robots, while algorithms are increasingly able to mimic our intellect. Another surprise that has long p

Topics: creativity diffusion images models new


ThinkMesh: A Python lib for parallel thinking in LLMs

news.ycombinator.com Unknown 2025-11-04 04:23:33

ThinkMesh is a Python library that runs diverse reasoning paths in parallel, scores them with internal confidence signals, reallocates compute to promising branches, and fuses outcomes with verifiers and reducers. It works with offline Hugging Face Transformers and vLLM/TGI, and with hosted APIs. Note: This is still in its early development phase, and breaking changes can sometimes occur. Highlights: Parallel reasoning with DeepConf‑style confidence gating and budget reallocation

Topics: modelspec strategyspec think thinkconfig thinkmesh
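
The general idea described in the excerpt above (run several reasoning paths at once, gate them by confidence, then fuse the survivors) can be sketched in plain Python. This is not ThinkMesh's API: the names `ReasoningPath`, `parallel_think`, and `stub` are hypothetical, the stub paths stand in for real model calls, and the fusion rule (a confidence-weighted vote) is an assumed simplification that leaves out budget reallocation and verifiers.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
from typing import Callable

# Hypothetical interface: a reasoning path maps a prompt to
# (answer, confidence in [0, 1]).
ReasoningPath = Callable[[str], tuple[str, float]]


def parallel_think(prompt: str, paths: list[ReasoningPath], min_conf: float = 0.5) -> str:
    """Run every path concurrently, gate by confidence, return the weighted-vote winner."""
    if not paths:
        raise ValueError("need at least one reasoning path")
    with ThreadPoolExecutor(max_workers=len(paths)) as pool:
        results = list(pool.map(lambda path: path(prompt), paths))
    # Confidence gate: drop weak paths, but fall back to all results if none pass.
    kept = [r for r in results if r[1] >= min_conf] or results
    # Fuse: sum confidence per distinct answer and pick the largest total.
    totals: dict[str, float] = defaultdict(float)
    for answer, conf in kept:
        totals[answer] += conf
    return max(totals, key=totals.get)


def stub(answer: str, conf: float) -> ReasoningPath:
    """Stand-in for a real model call; always returns the same answer and confidence."""
    return lambda _prompt: (answer, conf)


best = parallel_think("What is 6 * 7?", [stub("42", 0.9), stub("41", 0.3), stub("42", 0.6)])
```

The gate matters when several low-confidence paths agree with each other: without it, two weak votes could outweigh one strong one.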


AGI is an engineering problem, not a model training problem

news.ycombinator.com Vinci Rufus 2025-11-04 05:18:52

Published: Aug 13, 2025 | at 11:00 AM We’ve reached an inflection point in AI development. The scaling laws that once promised ever-more-capable models are showing diminishing returns. GPT-5, Claude, and Gemini represent remarkable achievements, but they’re hitting asymptotes that brute-force scaling can’t solve. The path to artificial general intelligence isn’t through training ever-larger language models—it’s through building engineered systems that combine models, memory, context, and determ

Topics: memory model models probabilistic systems


Evaluating LLMs for my personal use case

news.ycombinator.com Graham King 2025-11-04 05:40:33

Most models are excellent, so cost and latency dominate. It’s great that AI can win maths Olympiads, but that’s not what I’m doing. I mostly ask basic Rust, Python, Linux and life questions. So I did my own evaluation. I gathered 130 real prompts from my bash history (I use command line tool llm). I had Qwen3 235B Thinking and Gemini 2.5 Pro group them into categories. They both chose very similar ones, broadly (with examples): Programming - “Write a bash script to ..” Sysadmin - “With curl

Topics: best deepseek models qwen3 thinking


ChatGPT-5 Impressions: Fast, but a Bit Impersonal

cnet.com Unknown 2025-11-03 16:54:00

ChatGPT-5 likes getting straight to the point. For some, that's a reprieve from its chattier predecessor, GPT-4o. For others, something will certainly seem off. Despite the hype leading up to OpenAI's launch of GPT-5, it ultimately doesn't feel too different from 4o. The quality of responses in my ongoing testing seems to be at the level of past models, including the o3 "reasoning" model. The major difference is that some responses generate very quickly in relatively few words, while others ca

Topics: 4o chatgpt gpt model openai


‘Zipotle’: Chipotle Wants to Drop a Burrito on Your Head with New Drone Delivery

gizmodo.com Lucas Ropek 2025-11-04 02:00:22

Chipotle says it’s jumping on the drone delivery bandwagon. This week, the company announced a new pilot program in Texas that will test the viability of airdropping customers’ orders into their backyards. A press release published Thursday makes the whole process sound quite easy as far as drone-robot hybrid delivery burritos go. Certain customers in the Rowlett suburb of Dallas will be able to download the app for Zipline, Chipotle’s new drone partner, and place their orders. Then,

Topics: company delivery drone new zipline


OpenCUA’s open source computer-use agents rival proprietary models from OpenAI and Anthropic

venturebeat.com Ben Dickson 2025-11-04 11:25:35

A new framework from researchers at The University of Hong Kong (HKU) and collaborating institutions provides an open source foundation for creating robust AI agents that can operate computers. The framework, called OpenCUA, includes the tools, data, and recipes for scaling the development of computer-use agents (CUAs). Models trained usin

Topics: agents data models opencua source


Apple trained a large language model to efficiently understand long-form video

9to5mac.com Marcus Mendes 2025-11-04 16:27:48

Apple researchers have developed an adapted version of the SlowFast-LLaVA model that beats larger models at long-form video analysis and understanding. Here’s what that means. The nerdy bits: Very basically, when an LLM is trained to also understand video, it learns to split videos into frames, apply computer vision to extract visual features, analyze how those features change over time, and align all of that with language so it can describe or reason about the video in the form of text. One v

Topics: apple frames llava model video


College student’s “time travel” AI experiment accidentally outputs real 1834 history

arstechnica.com Unknown 2025-11-04 16:13:56

A hobbyist developer building AI language models that speak Victorian-era English "just for fun" got an unexpected history lesson this week when his latest creation mentioned real protests from 1834 London—events the developer didn't know had actually happened until he Googled them. "I was interested to see if a protest had actually occurred in 1834 London and it really did happen," wrote Reddit user Hayk Grigorian, who is a computer science student at Muhlenberg College in Pennsylvania. For t

Topics: 1834 ai grigorian lord model


MCP-Universe benchmark shows GPT-5 fails more than half of real-world orchestration tasks

venturebeat.com Emilia David 2025-11-04 20:50:55

The adoption of interoperability standards, such as the Model Context Protocol (MCP), can provide enterprises with insights into how agents and models function outside their walled confines. However, many benchmarks fail to capture real-life interactions with MCP. Salesforce AI Research developed a new open-source benchmark it calls MCP-Un

Topics: mcp models real tasks universe


Apple reportedly in talks with Google to make Gemini the foundation of Siri

androidauthority.com Unknown 2025-11-05 06:38:25

TL;DR: Apple reportedly approached Google to explore developing a custom AI model to serve as Siri’s foundation. This AI model may already be in training to run on Apple’s servers. The Cupertino-based firm is said to be developing a version of Siri powered by its models and a second version that runs on third-party technology. Apple has long been trailing behind its competitors in the AI space. In fact, the company was expected to launch a revamped version of S

Topics: ai apple google model siri


Apple considers Google Gemini to power next-gen Siri, internal AI ‘bake-off’ underway

9to5mac.com Zac Hall 2025-11-05 16:35:34

Apple seems open to anything and everything when it comes to delivering the next generation of Siri. After reports that it could be powered by OpenAI or Anthropic, Google has entered the conversation. Mark Gurman reports for Bloomberg that Google, which offers a ChatGPT competitor called Gemini, is actually training a model that could run on Apple’s servers to power the new Siri experience: The iPhone maker recently approached Alphabet Inc.’s Google to explore building a custom AI model that w

Topics: apple model models new siri


Deals: Amazon Labor Day sale, most affordable 24GB M4 MacBook Pro $300 off, Apple Watch Ultra 2 $150 off, more

9to5mac.com Justin Kahn 2025-11-06 02:45:08

Today’s 9to5Toys Lunch Break deals are kicking off with the now live Amazon Labor Day sale where you’ll find plenty of big-time Apple gear deals. All of the details you need on that are right here, but we are also once again tracking the 24GB/512GB M3 MacBook Air for $400 less than the M4 model (it is now back in stock) alongside the most affordable M4 MacBook Pro with 24GB of RAM at $300 off, and Apple Watch Ultra 2 models at $150 off the list prices. All of that and more awaits below. Looking

Topics: apple m4 macbook model price


The case against humans in space

technologyreview.com Becky Ferreira 2025-11-07 02:00:00

“Going to space will not end war because war isn’t caused by anything that space travel is apt to change, even in the most optimistic scenarios,” they write. “Humanity going to space en masse probably won’t reduce the likelihood of war, but we should consider that it might increase the chance of war being horrific.” The pair imagine rival space nations exchanging asteroid fire or poisoning whole biospheres. Proponents of space settlements often point to the fate of the dinosaurs as motivational

Topics: future human mandel space war


GPT-5 usage limitations: what are they, how does this compare to GPT-4 family?

androidauthority.com Unknown 2025-11-07 03:00:51

GPT-5 arrived a few weeks ago, though its rollout hasn’t been entirely smooth. While the model shows plenty of promise, its debut also meant the abrupt removal of every other GPT model from ChatGPT’s user-facing UI. Since then, some old models have returned, and there have been a few other changes to the way the system works. Furthermore, many of the initial GPT-5 usage limits have been temporarily enhanced since launch. Let’s dive in and take a closer look

Topics: gpt limits model models thinking


Tesla Will Use a Powerful New Weapon in AI Race

gizmodo.com Riley Gutiérrez Mcdermid 2025-11-07 02:34:19

Tesla will partner up with AI darlings DeepSeek and Bytedance’s Doubao on new tools in its cars in China, according to a document uploaded to Tesla’s official website. Bloomberg reports that Doubao will work on voice command-related tools like the temperature in a Tesla, navigation, and in-car entertainment, while DeepSeek will handle the AI side of things. The move may be a way for Tesla to boost its Chinese deliveries, which dropped 8.4% from December to June compared to the same time period

Topics: ai chinese deepseek model tesla


24,000-Watt Scooter Is Going for a 100 MPH Speed Record at Bonneville

news.ycombinator.com Andrew P. Collins 2025-11-02 18:40:35

A British outfit called Bo makes a sleek little electric scooter called the Model-M. The best variant claims a 22 mph top speed and a 40-mile range for $2,500. Sounds pricey but practical. The prototype Turbo model is another story: it’s gunning for triple-digit speeds at the Bonneville salt flats next month. Bo has been around since 2022, but seems to

Topics: bo electric model scooter turbo


About GoKawiil

Privacy

Advertising
