Latest Tech News

Stay updated with the latest in technology, AI, cybersecurity, and more

Filtered by: deepseek Clear Filter

DeepSeek may be about to shake up the AI world again - what we know

picture alliance / Getty Images Follow ZDNET: Add us as a preferred source on Google. ZDNET's key takeaways DeepSeek will reportedly launch an agent by the end of this year. Agents have become a focal point in the ongoing AI race. The company's debut was a turning point in the global AI race. DeepSeek, the Chinese AI startup that sent shockwaves throughout Silicon Valley earlier this year with its sudden ascent onto the global tech scene, is reportedly gearing up to launch its most powerfu

Topics: ai build deepseek r1 tech

DeepSeek Is Working on an AI Agent. Will It Be Better Than ChatGPT?

China-based DeepSeek is working on developing a new agentic generative AI model, Bloomberg reports, citing anonymous sources. Agentic AI is the latest wave of AI technology. AI agents are a kind of digital assistant; they can complete tasks without a lot of human oversight. AI agents can do anything from coding to ordering you a pizza, as my colleague Imad Khan recently tested. Details about the specifics of the DeepSeek agent model are still fuzzy. An August update to DeepSeek's V3 model was

DeepSeek Is Working on an AI Agent: Will It Be Better Than ChatGPT?

China-based DeepSeek is working on developing a new agentic generative AI model, Bloomberg reports, citing anonymous sources. Agentic AI is the latest wave of AI technology. AI agents are a kind of digital assistant; they can complete tasks without a lot of human oversight. AI agents can do anything from coding to ordering you a pizza, as my colleague Imad Khan recently tested. Details about the specifics of the DeepSeek agent model are still fuzzy. An August update to DeepSeek's V3 model was

Fake accounts drove the DeepSeek AI hype and distorted markets

From AI Darling to Disinformation Playbook: How Fake Accounts Managed a Market Frenzy and What DeepSeek’s Hype Teaches Leaders DeepSeek, a new Chinese AI model, stormed app charts and briefly sent shockwaves through global markets. However, our disinformation research team found that much of this excitement was artificial — driven by thousands of fake profiles working in tandem. Their behavior bore the hallmarks of state-linked bot networks, amplifying hype and distorting market perception.

Deploying DeepSeek on 96 H100 GPUs

by: The SGLang Team , May 05, 2025 DeepSeek is a popular open-source large language model (LLM) praised for its strong performance. However, its large size and unique architecture, which uses Multi-head Latent Attention (MLA) and Mixture of Experts (MoE), require an advanced system for efficient serving at scale. In this blog, we explain how we match DeepSeek's inference system performance with SGLang. Our implementation, shown in the figure above, runs on 12 nodes in the Atlas Cloud, each equ

Elon Musk Just Suffered a Humiliating Defeat in China

So far, 2025 hasn't exactly been a year of resounding success for centibillionaire Elon Musk's AI efforts. The richest man on earth has struggled to get xAI's Grok off the ground, with setbacks taking the form of privacy scandals, misinformation controversies, not to mention a highly-public white supremacy episode. And now, more than a month after Musk promised to roll Grok out to Teslas "next week," it turns out a Chinese AI model will be taking the chatbot's place. According to Bloomberg, T

Evaluating LLMs for my personal use case

Most models are excellent, so cost and latency dominate. It’s great that AI can win maths Olympiads, but that’s not what I’m doing. I mostly ask basic Rust, Python, Linux and life questions. So I did my own evaluation. I gathered 130 real prompts from my bash history (I use command line tool llm). I had Qwen3 235B Thinking and Gemini 2.5 Pro group them into categories. They both chose very similar ones, broadly (with examples): Programming - “Write a bash script to ..” Sysadmin - “With curl

DeepSeek hints latest model will be compatible with China’s ‘next generation’ homegrown AI chips

Chinese artificial intelligence startup DeepSeek has hinted that China will soon have homegrown "next generation" chips to support its AI models, while announcing an update to one of its large language models. In a comment under a post on its official WeChat account, DeepSeek said the "UE8M0 FP8" precision format of its newly released model V3.1 is tailored for the next-generation domestically built chips that will be launched soon. FP8, or 8-bit floating point, is a data processing format tha

DeepSeek-v3.1

DeepSeek-V3.1 Release Introducing DeepSeek-V3.1: our first step toward the agent era! 🚀 🧠 Hybrid inference: Think & Non-Think — one model, two modes ⚡️ Faster thinking: DeepSeek-V3.1-Think reaches answers in less time vs. DeepSeek-R1-0528 🛠️ Stronger agent skills: Post-training boosts tool use and multi-step agent tasks Try it now — toggle Think/Non-Think via the "DeepThink" button: https://chat.deepseek.com/ 🔹 deepseek-chat → non-thinking mode 🔹 deepseek-reasoner → thinking mode 🧵 128K

DeepSeek hints latest model will be supported by China’s ‘next generation’ homegrown AI chips

Chinese artificial intelligence startup DeepSeek has hinted that China will soon have homegrown "next generation" chips to support its AI models, while announcing an update to one of its large language models. In a comment under a post on its official WeChat account, DeepSeek said the "UE8M0 FP8" precision format of its newly released model V3.1 is tailored for the next-generation domestically built chips that will be launched soon. FP8, or 8-bit floating point, is a data processing format tha

Tesla Will Use a Powerful New Weapon in AI Race

Tesla will partner up with AI darlings DeepSeek and Bytedance’s Doubao on new tools in its cars in China, according to a document uploaded to Tesla’s official website. Bloomberg reports that Doubao will work on voice command-related tools like the temperature in a Tesla, navigation, and in-car entertainment, while DeepSeek will handle the AI side of things. The move may be a way for Tesla to boost its Chinese deliveries, which dropped 8.4% from December to June compared to the same time period

DeepSeek-v3.1 Release

DeepSeek-V3.1 Release Introducing DeepSeek-V3.1: our first step toward the agent era! 🚀 🧠 Hybrid inference: Think & Non-Think — one model, two modes ⚡️ Faster thinking: DeepSeek-V3.1-Think reaches answers in less time vs. DeepSeek-R1-0528 🛠️ Stronger agent skills: Post-training boosts tool use and multi-step agent tasks Try it now — toggle Think/Non-Think via the "DeepThink" button: https://chat.deepseek.com/ 🔹 deepseek-chat → non-thinking mode 🔹 deepseek-reasoner → thinking mode 🧵 128K

The AI Battle’s Newest Warrior Strikes a Major Blow to Big Tech

The ongoing slugfest between tech players racing to get the most intuitive and powerful AI may have just gotten a brief knockout punch. The slammer that landed? A new version of DeepSeek’s increasingly impressive V3.1, which has a whopping 685-billion-parameter system and can deliver about $1.01 per complete coding task, compared to a beginning price of $70 for traditional systems. 🚨 BREAKING: DeepSeek V3.1 is Here! 🚨 The AI giant drops its latest upgrade — and it’s BIG: ⚡685B parameters 🧠L

DeepSeek V3.1 just dropped — and it might be the most powerful open AI yet

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Chinese artificial intelligence startup DeepSeek made waves across the global AI community Tuesday with the quiet release of its most ambitious model yet — a 685-billion parameter system that challenges the dominance of American AI giants while reshaping the competitive landscape through open-source accessibility. The Hangzhou-based compan

Upcoming DeepSeek AI model failed to train using Huawei’s chips

Chinese artificial intelligence company DeepSeek delayed the release of its new model after failing to train it using Huawei’s chips, highlighting the limits of Beijing’s push to replace US technology. DeepSeek was encouraged by authorities to adopt Huawei’s Ascend processor rather than use Nvidia’s systems after releasing its R1 model in January, according to three people familiar with the matter. But the Chinese startup encountered persistent technical issues during its R2 training process u

It shocked the market but has China's DeepSeek changed AI?

It shocked the market but has China's DeepSeek changed AI? 1 day ago Share Save Lily Jamali • @lilyjamali North America Technology Correspondent Reporting from San Francisco Share Save Shutterstock US President Donald Trump had been in office scarcely a week when a new Chinese artificial intelligence (AI) app called DeepSeek jolted Silicon Valley. Overnight, DeepSeek-R1 shot to the top of the Apple charts as the most downloaded free app in the US. The firm said at the time its new chatbot riv

Claude Code Router

Claude Code Router 中文版 A powerful tool to route Claude Code requests to different models and customize any request. ✨ Features Model Routing : Route requests to different models based on your needs (e.g., background tasks, thinking, long context). : Route requests to different models based on your needs (e.g., background tasks, thinking, long context). Multi-Provider Support : Supports various model providers like OpenRouter, DeepSeek, Ollama, Gemini, Volcengine, and SiliconFlow. : Support

LLM architecture comparison

It has been seven years since the original GPT architecture was developed. At first glance, looking back at GPT-2 (2019) and forward to DeepSeek-V3 and Llama 4 (2024-2025), one might be surprised at how structurally similar these models still are. Sure, positional embeddings have evolved from absolute to rotational (RoPE), Multi-Head Attention has largely given way to Grouped-Query Attention, and the more efficient SwiGLU has replaced activation functions like GELU. But beneath these minor refi

The Big LLM Architecture Comparison

It has been seven years since the original GPT architecture was developed. At first glance, looking back at GPT-2 (2019) and forward to DeepSeek-V3 and Llama 4 (2024-2025), one might be surprised at how structurally similar these models still are. Sure, positional embeddings have evolved from absolute to rotational (RoPE), Multi-Head Attention has largely given way to Grouped-Query Attention, and the more efficient SwiGLU has replaced activation functions like GELU. But beneath these minor refi

OpenAI tightens the screws on security to keep away prying eyes

In Brief OpenAI has reportedly overhauled its security operations to protect against corporate espionage. According to the Financial Times, the company accelerated an existing security clampdown after Chinese startup DeepSeek released a competing model in January, with OpenAI alleging that DeepSeek improperly copied its models using “distillation” techniques. The beefed-up security includes “information tenting” policies that limit staff access to sensitive algorithms and new products. For exa

A new, faster DeepSeek R1-0528 variant appears from German lab

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now It’s been a little more than a month since Chinese AI startup DeepSeek, an offshoot of Hong Kong-based High-Flyer Capital Management, released the latest version of its hit open source model DeepSeek, R1-0528. Like its predecessor, DeepSeek-R1 — which rocked the AI and global business communities with how cheaply it was trained and how wel

HOLY SMOKES! A new, 200% faster DeepSeek R1-0528 variant appears from German lab TNG Technology Consulting GmbH

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now It’s been a little more than a month since Chinese AI startup DeepSeek, an offshoot of Hong Kong-based High-Flyer Capital Management, released the latest version of its hit open source model DeepSeek, R1-0528. Like its predecessor, DeepSeek-R1 — which rocked the AI and global business communities with how cheaply it was trained and how wel

Germany asks Google, Apple to remove DeepSeek AI from app stores

The Berlin Commissioner for Data Protection has formally requested Google and Apple to remove the DeepSeek AI application from the application stores due to GDPR violations. The commissioner, Meike Kamp, alleges that DeepSeek’s owner, Hangzhou DeepSeek Artificial Intelligence, based in Beijing, unlawfully collects data from German users and transfers them for processing in servers in China. As per the GDPR and Article 46 (1) specifically, any personal data collected from individuals in the Eur

Germany asks Apple and Google to pull DeepSeek AI app over data privacy concerns

Germany just became the latest country to move against DeepSeek over mounting data privacy concerns. Here’s why this keeps happening. As you probably guessed, it’s a China thing When DeepSeek took the world by storm earlier this year, it wasn’t long before it found itself in the crosshairs of governments in the West. First, because users quickly learned that its models were heavily moderated, skirting answering questions that could cast China and its government in a bad light. Second, and mo

German data protection official wants Apple, Google to remove DeepSeek from the country’s app stores

In Brief A German data protection official has reported Chinese AI app DeepSeek to Apple and Google, saying the app transfers users’ information to China illegally. Meike Kamp, Berlin’s Commissioner for data protection and freedom of information, told the companies that DeepSeek did not provide “convincing evidence” that users’ data was protected as required by EU laws. “Chinese authorities have far-reaching access rights to personal data within the sphere of influence of Chinese companies,”

Germany tells Apple, Google to block DeepSeek as the Chinese AI app faces rising pressure in Europe

In this photo illustration, the DeepSeek logo is seen displayed on a smartphone screen and in the background, the flag of the European Union. One of Germany's data protection watchdogs on Friday said DeepSeek's app illegally sends user data to China and asked Google and Apple to consider blocking the artificial intelligence service. Berlin's data protection commissioner Meike Kamp said in a statement that DeepSeek's transfer of German user data to China is "unlawful." There is not a readily a

Rethinking AI: DeepSeek’s playbook shakes up the high-spend, high-compute paradigm

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more When DeepSeek released its R1 model this January, it wasn’t just another AI announcement. It was a watershed moment that sent shockwaves through the tech industry, forcing industry leaders to reconsider their fundamental approaches to AI development. What makes DeepSeek’s accomplishment remarkable isn’t that the company developed novel ca

Nvidia CEO Jensen Huang says market got it wrong about DeepSeek’s impact

Nvidia founder and CEO Jensen Huang said the market got it wrong when it comes to DeepSeek’s technological advancements and its potential to negatively impact the chipmaker’s business. Instead, Huang called DeepSeek’s R1 open source reasoning model “incredibly exciting” while speaking with Alex Bouzari, CEO of DataDirect Networks, in a pre-recorded interview that was released on Thursday. “I think the market responded to R1, as in, ‘Oh my gosh. AI is finished,’” Huang told Bouzari. “You know,

What's Behind OpenAI's Recent Growth Spurt to 400M Weekly Users?

ChatGPT maker OpenAI says more than 400 million people a week are now actively using its artificial intelligence tools, a jump from the 300 million it reported this past December. OpenAI touted the figure this week, also telling media outlets that it's doubled its number of paid enterprise users to 2 million since September, and that its developer traffic has also doubled in the last six months. The news may surprise people who expected the rise of China's DeepSeek AI model to disrupt the grow

Beijing embraces DeepSeek to lead AI adoption as it looks for new growth drivers

HONG KONG, CHINA - JANUARY 28: In this photo illustration, the DeepSeek apps is seen on a phone in front of a flag of China on January 28, 2025 in Hong Kong, China. DeepSeek's sudden splash in the large language model space has given China a powerful tool to catalyze artificial-intelligence adoption in the country and boost economic growth. While Goldman Sachs pegs a 20-basis-point to 30-basis-point boost to China's GDP over the long term — by 2030 — it expects the country's economy to start r