Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more

Filtered by: model Clear Filter

AI Does Something Subtly Bizarre If You Make Typos While Talking to It

New research suggests that medical AI chatbots are woefully unreliable at understanding how people actually communicate their health problems. As detailed in yet-to-be-peer-reviewed study presented last month by MIT researchers, an AI chatbot is more likely to advise a patient not to seek medical care if their messages contained typos. The errors AI is susceptible to can be as seemingly inconsequential as an extra space between words, or if the patient used slang or colorful language. And strik

LLM-Ready Training Dataset for Apple's Foundation Models (iOS 26)

The only comprehensive training dataset for Apple's Foundation Models Framework (iOS 26) What You Get Three technical specification files for training LLMs on Apple's Foundation Models framework: 1. Core Framework Guide SystemLanguageModel availability, LanguageModelSession management, response generation, and context handling. 2. Advanced Implementation Guide @Generable/@Guide macros, constrained decoding, Tool protocol, and performance optimization. 3. Strategic Features Guide Adapter

Smollm3: Smol, multilingual, long-context reasoner LLM

SmolLM3: smol, multilingual, long-context reasoner Published July 8, 2025 Update on GitHub Base model: https://hf.co/HuggingFaceTB/SmolLM3-3B-Base Instruct and reasoning model: https://hf.co/HuggingFaceTB/SmolLM3-3B Small language models are becoming increasingly important as users seek capable models that can be deployed efficiently. The community has produced a fascinating range of capable small models, each pushing the boundaries of what's possible at this scale. With SmolLM3, we're excit

VITURE launches new series of XR glasses for gamers, enthusiasts, and professionals

TL;DR VITURE has announced the launch of its Luma series and The Beast XR glasses. The Luma series consists of three entries, including a base model, Pro, and Ultra. The Beast is the company’s flagship model, boasting the most advanced specs. The XR glasses market is getting increasingly crowded with major players like Google, Samsung, and Apple all looking to enter the space. However, those tech giants have their work cut out for them as competitors like Xreal and VITURE have their own produ

There’s now a cheaper Retroid Pocket Flip 2 available, but here’s why you shouldn’t buy it

Nick Fernandez / Android Authority TL;DR The Dimensity 1100 edition of the Retroid Pocket Flip 2 will go up for sale later today. With the 72-hour early bird discount, it costs $50 less than the Snapdragon 865 version. The D1100 is powerful enough for PS2, but the SD865 has better driver support. The release of the Retroid Pocket Flip 2 earlier this year was rocky to say the least, with the looming threat of tariffs and halted shipping marring what was otherwise one of the most anticipated A

The Kindle most people should buy is majorly discounted for Prime Day

ZDNET's key takeaways Amazon's base model Kindle promises quicker page-turning, a brighter display, and a fun matcha green colorway (alongside the classic black) It's on sale for $85 during Prime Day The e-reader is more reactive and vivid, and reading anything on the lightweight, portable device is convenient. This model has the shortest battery life out of the entire lineup, but it's still six weeks long. $109.99 at Amazon $109.99 at Target $109.99 at Best Buy more buying choices The base

Apple just released a weirdly interesting coding language model

Apple quietly dropped a new AI model on Hugging Face with an interesting twist. Instead of writing code like traditional LLMs generate text (left to right, top to bottom), it can also write out of order, and improve multiple chunks at once. The result is faster code generation, at a performance that rivals top open-source coding models. Here’s how it works. The nerdy bits Here are some (overly simplified, in the name of efficiency) concepts that are important to understand before we can move

Apple Reportedly Loses Key AI Mind

Apple has kept a low profile in the artificial intelligence arms race. But now, a major talent loss is raising fresh questions about whether the iPhone maker is falling behind. According to Bloomberg, Meta has hired Ruoming Pang, a high-level engineer who led Apple’s foundation models team. Pang, a former Google veteran and key architect behind the large language models (LLMs) powering Apple Intelligence, will now join Meta’s elite AI unit focused on building superintelligent systems. His exit

Topics: ai apple models pang team

New 1.5B router model achieves 93% accuracy without costly retraining

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Researchers at Katanemo Labs have introduced Arch-Router, a new routing model and framework designed to intelligently map user queries to the most suitable large language model (LLM). For enterprises building products that rely on multiple LLMs, Arch-Router aims to solve a key challenge: how to direct queries to the best model for the job

Meta reportedly recruits Apple’s head of AI models

In Brief Apple’s head of AI models, Ruoming Pang, is leaving the company to work at Meta, Bloomberg reported on Monday. This marks the latest high-ranking AI executive Meta CEO Mark Zuckerberg has scooped up to lead his new AI superintelligence unit. Pang previously ran Apple’s in-house team that trained the AI foundation models that underpin Apple Intelligence and other on-device AI features, according to the report. Apple’s AI models haven’t exactly been a huge success — they’re far less cap

Topics: ai apple meta models pang

Meta just hired Apple’s head of foundation models

Apple’s top executive overseeing its in-house AI models is leaving the company, and heading to Meta. As reported by Bloomberg, Ruoming Pang is bound to Menlo Park, and joining Mark Zuckerberg’s all-star Meta Superintelligence Labs group, announced last week. Apple’s AI setbacks just keep on coming Pang joined Apple from Google in 2021, and had been managing the roughly 100-person team behind the models that power Apple Intelligence features like Genmoji, Priority Notifications, and on-device t

Topics: ai apple meta models pang

LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping

Anamorphosis refers to a category of images that are intentionally distorted, making them unrecognizable when viewed directly. Their true form only reveals itself when seen from a specific viewpoint, which can be through some catadioptric device like a mirror or a lens. While the construction of these mathematical devices can be traced back to as early as the 17th century, they are only interpretable when viewed from a specific vantage point and tend to lose meaning when seen normally. In this p

Cursor apologizes for unclear pricing changes that upset users

The CEO of Anysphere, the company behind the popular AI-powered coding environment Cursor, apologized Friday for a poorly communicated pricing change to its $20-per-month Pro plan. The changes resulted in some users complaining that they unexpectedly faced additional costs. “We recognize that we didn’t handle this pricing rollout well and we’re sorry,” said Anysphere CEO Michael Truell in a blog post. “Our communication was not clear enough and came as a surprise to many of you.” Truell is ref

ChatGPT Glossary: 53 AI Terms Everyone Should Know

AI is everywhere. From the massive popularity of ChatGPT to Google cramming AI summaries at the top of its search results, AI is completely taking over the internet. With AI, you can get instant answers to pretty much any question. It can feel like talking to someone who has a Ph.D. in everything. But that aspect of AI chatbots is only one part of the AI landscape. Sure, having ChatGPT help do your homework or having Midjourney create fascinating images of mechs based on country of origin is co

Launch HN: Morph (YC S23) – Apply AI code edits at 4,500 tokens/sec

Hey HN, I’m Tejas at Morph. We’ve built a blazing-fast model for applying AI-generated code edits directly into your files at 4,500+ tokens/sec. No more slow full-file rewrites or brittle search-and-replace hacks. Why? AI spits out code that can’t reliably be inserted into existing code. Full file rewrites, brittle search-and-replace hacks are too slow, expensive, or error-prone. Morph's approach: - Your agent outputs edits “lazily”, referencing unmodified lines in the existing file (ex: // .

Leaker says iPhone 17 lineup getting two new screen changes

Two months from now, Apple is expected to unveil its new flagship iPhone lineup, including iPhone 17, iPhone 17 Air, iPhone 17 Pro, and iPhone 17 Pro Max. And today, a leaker has shared two screen changes that may be coming to various new models. Today in a new post on Weibo, leaker Digital Chat Station has shared a variety of expectations for the upcoming iPhone 17 line. Many of the details have been reported before, but there are two interesting new details that stand out most—both relating

Topics: 17 iphone models new pro

If You Love LEGO, the Star Wars Millennium Falcon Is at a Record Low Price for Prime Day

LEGO sets have become a true phenomenon during events like Prime Day and Black Friday, mainly because these sales are often the only times you’ll see real discounts on the most popular models. This year, Amazon is offering record low prices on LEGO’s two biggest franchises: Star Wars and Harry Potter. Among the top sellers, the LEGO Star Wars Millennium Falcon A New Hope 25th Anniversary model is one of the most demanded sets. It’s inexpensive, enjoyable to construct, readily recognizable and a

OpenAI says GPT-5 will unify breakthroughs from different models

OpenAI has again confirmed that it will unify multiple models into one and create GPT-5, which is expected to ship sometime in the summer. ChatGPT currently has too many capable models for different tasks. While the models are powerful, it can be confusing because all models have identical names. But another issue is that OpenAI maintains an "o" lineup for reasoning capabilities, while the 4o and other models have multi-modality. With GPT-5, OpenAI plans to unify the breakthrough in its lineu

I don't think AGI is right around the corner

“Things take longer to happen than you think they will, and then they happen faster than you thought they could.” - Rudiger Dornbusch I’ve had a lot of discussions on my podcast where we haggle out timelines to AGI. Some guests think it’s 20 years away - others 2 years. Here’s where my thoughts stand as of June 2025. Continual learning Sometimes people say that even if all AI progress totally stopped, the systems of today would still be far more economically transformative than the internet.

I extracted the safety filters from Apple Intelligence models

Decrypted Generative Model safety files for Apple Intelligence containing filters Structure decrypted_overrides/ : Contains decrypted overrides for various models. com.apple.*/ : Directory named using the Asset Specifier assosciated with the safety info Info.plist : Contains metadata for the override AssetData/ : Contains the decrypted JSON files : Contains decrypted overrides for various models. get_key_lldb.py : Script to get the encryption key (see usage info below) : Script to get the en

Reinforcement Learning from Human Feedback (RLHF) in Notebooks

Reinforcement Learning from Human Feedback (RLHF) in Notebooks This repository provides a reference implementation for Reinforcement Learning from Human Feedback (RLHF) [Paper] framework presented in the RLHF from scratch, step-by-step, in code YouTube video. Overview of RLHF RLHF is a method for aligning large language models (LLMs), like GPT-3 or GPT-2, to better meet users' intents. It is essentially a reinforcement learning approach, where rather than directly getting the reward or feedba

Just Ask for Generalization (2021)

Generalizing to what you want may be easier than optimizing directly for what you want. We might even ask for "consciousness". This blog post outlines a key engineering principle I’ve come to believe strongly in for building general AI systems with deep learning. This principle guides my present-day research tastes and day-to-day design choices in building large-scale, general-purpose ML systems. Discoveries around Neural Scaling Laws, unsupervised pretraining on Internet-scale datasets, and o

Optimizing Tool Selection for LLM Workflows with Differentiable Programming

Modern agentic architectures rely heavily on chaining LLM calls. A typical pattern looks like: Use an LLM to decide which tool to invoke Call the tool (e.g. search, calculator, API) Use another LLM call to interpret the result and generate a final response This structure is easy to reason about, simple to prototype, and generalizes well. But it scales poorly. Each LLM call incurs latency, cost, and token overhead. More subtly, it compounds context: every step includes not only the original q

A new, faster DeepSeek R1-0528 variant appears from German lab

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now It’s been a little more than a month since Chinese AI startup DeepSeek, an offshoot of Hong Kong-based High-Flyer Capital Management, released the latest version of its hit open source model DeepSeek, R1-0528. Like its predecessor, DeepSeek-R1 — which rocked the AI and global business communities with how cheaply it was trained and how wel

Prompting LLMs is not engineering

Prompting LLMs is not engineering published in: With the proliferation of AI models and tools, there's a new industry-wide fascination with snake oil remedies called "prompt engineering". As of July 2025 the term is now "context engineering" or "context prompting" or "context manipulation". To put it succinctly, prompt engineering is nothing but an attempt to reverse-engineer a non-deterministic black box for which any of the parameters below are unknown: training set weights constraints o

Apple just released a weirdly interesting coding language model

Apple quietly dropped a new AI model on Hugging Face with an interesting twist. Instead of writing code like traditional LLMs generate text (left to right, top to bottom), it can also write out of order, and improve multiple chunks at once. The result is faster code generation, at a performance that rivals top open-source coding models. Here’s how it works. The nerdy bits Here are some (overly simplified, in the name of efficiency) concepts that are important to understand before we can move

Inside India’s scramble for AI independence

Nonetheless, a small but determined group of Indian builders is starting to shape the country’s AI future. For example, Sarvam AI has created OpenHathi-Hi-v0.1, an open-source Hindi language model that shows the Indian AI field’s growing ability to address the country’s vast linguistic diversity. The model, built on Meta’s Llama 2 architecture, was trained on 40 billion tokens of Hindi and related Indian-language content, making it one of the largest open-source Hindi models available to date.

WASM Agents: AI agents running in the browser

One of the main barriers to a wider adoption of open-source agents is the dependency on extra tools and frameworks that need to be installed before the agents can be run. In this post, we show how to write agents as HTML files, which can just be opened and run in a browser. One of the main barriers to a wider adoption and experimentation with open-source agents is the dependency on extra tools and frameworks that need to be installed before the agents can be run. In this post, we introduce the

xAI prepares Grok 4 Code as it plans to take on Claude and Gemini

xAI is preparing the rollout of Grok 4, which replaces Grok 3 as the new state-of-the-art model. Ahead of the rollout, testers on X have again spotted references to a few new Grok 4 models. One of the models is called "grok-4-prod-mimic," which reportedly excels at enterprise use cases like data extraction, coding, and text summarisation. It also possesses deep domain knowledge in finance, healthcare, law, and science. But "grok-4-prod-mimic" isn't the only new model coming to Grok. Another

Topics: called grok model new xai

Sakana AI’s TreeQuest: Deploy multi-model teams that outperform individual LLMs by 30%

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Japanese AI lab Sakana AI has introduced a new technique that allows multiple large language models (LLMs) to cooperate on a single task, effectively creating a “dream team” of AI agents. The method, called Multi-LLM AB-MCTS, enables models to perform trial-and-error and combine their unique strengths to solve problems that are too complex

Topics: ab ai mcts model models