Qwen3.6-35B-A3B: Agentic coding power, now open to all
(news.ycombinator.com)
31.
32.
Qwen3.6-35B-A3B: Agentic Coding Power, Now Open to All
(news.ycombinator.com)
33.
34.
Google Gemma 4 Runs Natively on iPhone with Full Offline AI Inference
(news.ycombinator.com)
35.
Show HN: Ghost Pepper – 100% local hold-to-talk speech-to-text for macOS
(news.ycombinator.com)
36.
Qwen-3.6-Plus is the first model to break 1T tokens processed in a day
(news.ycombinator.com)
37.
38.
Qwen3.6-Plus: Towards real world agents
(news.ycombinator.com)
39.
Qwen3.6-Plus: Towards Real World Agents
(news.ycombinator.com)
40.
Lemonade by AMD: a fast and open source local LLM server using GPU and NPU
(news.ycombinator.com)
41.
Running local models on Macs gets faster with Ollama's MLX support
(arstechnica.com)
42.
$500 GPU outperforms Claude Sonnet on coding benchmarks
(news.ycombinator.com)
43.
Quantization from the Ground Up
(news.ycombinator.com)
44.
LLM Neuroanatomy II: Modern LLM Hacking and Hints of a Universal Language?
(news.ycombinator.com)
45.
Flash-MoE: Running a 397B Parameter Model on a Laptop
(news.ycombinator.com)
46.
MacBook M5 Pro and Qwen3.5 = Local AI Security System
(news.ycombinator.com)
47.
Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training
(news.ycombinator.com)
48.
Qwen3.5-397B at 4.74 tok/s using 5.9GB RAM
(news.ycombinator.com)
49.
Tree Search Distillation for Language Models Using PPO
(news.ycombinator.com)
51.
Something is afoot in the land of Qwen
(news.ycombinator.com)
52.
Alibaba’s Qwen tech lead steps down after major AI push
(techcrunch.com)
53.
Qwen3.5 122B and 35B models offer Sonnet 4.5 performance on local computers
(news.ycombinator.com)
54.
55.
Qwen3.5: Towards Native Multimodal Agents
(news.ycombinator.com)
56.
A verification layer for browser agents: Amazon case study
(news.ycombinator.com)
57.
59.
60.
So Long, GPT-5. Hello, Qwen
(wired.com)