Skip to content
Tech News
clear
Topics: Today This Week This Month This Year
1.
ProgramBench: Can language models rebuild programs from scratch? (news.ycombinator.com)
2.
ProgramBench: Can Language Models Rebuild Programs from Scratch? (news.ycombinator.com)
3.
3DMark tests CPU and GPU performance with modern graphics workloads (techspot.com)
4.
AMD Ryzen AI Max+ PRO 495 APU could arrive with 192GB of unified memory — leaked PassMark benchmarks suggest modest update over Strix Halo (tomshardware.com)
5.
DeepClaude – Claude Code agent loop with DeepSeek V4 Pro (news.ycombinator.com)
6.
Asus Zenbook A16 (2026) Review: Savor the Power, Ignore the Beige (wired.com)
7.
Enthusiast creates Peltier thermoelectric cooler from scratch — impressive rig uses two 360mm AIOs, homemade DC controllers, and a custom loop (tomshardware.com)
8.
How fast is a macOS VM, and how small could it be? (news.ycombinator.com)
9.
Discovering hard disk physical geometry through microbenchmarking (2019) (news.ycombinator.com)
10.
Show HN: A new benchmark for testing LLMs for deterministic outputs (news.ycombinator.com)
11.
A Decade of AMD Ryzen: 10 Years of CPUs Tested (techspot.com)
12.
A Decade of AMD Ryzen: 10 Years of CPUs Tested (techspot.com)
13.
A good AGENTS.md is a model upgrade. A bad one is worse than no docs at all (news.ycombinator.com)
14.
Show HN: OSS Agent I built topped the TerminalBench on Gemini-3-flash-preview (news.ycombinator.com)
15.
Startup unveils benchtop metal 3D printer that brings industrial tech below $10,000 (techspot.com)
16.
The predictable failure of the QDay Prize (news.ycombinator.com)
17.
SWE-bench Verified no longer measures frontier coding capabilities (news.ycombinator.com)
18.
Why SWE-bench Verified no longer measures frontier coding capabilities (news.ycombinator.com)
19.
Mowing Down Simulated Elephants Could Help Self-Driving Cars Prepare For the Chaos of Real Life Streets (futurism.com)
20.
Lambda Calculus Benchmark for AI (news.ycombinator.com)
21.
Linux 7.1 Removes Drivers for Bus Mouse Support (news.ycombinator.com)
22.
OpenAI's GPT-5.5 is here, and it's no potato: narrowly beats Anthropic's Claude Mythos Preview on Terminal-Bench 2.0 (venturebeat.com)
23.
AMD Ryzen 9 9950X3D2 review: More cache, more cash (tomshardware.com)
24.
Kimi vendor verifier – verify accuracy of inference providers (news.ycombinator.com)
25.
Arc Prize Foundation (YC W26) Is Hiring a Platform Engineer for ARC-AGI-4 (news.ycombinator.com)
26.
Experience vs specs: Our readers have spoken, and benchmarks aren’t everything (androidauthority.com)
27.
Qwen3.6-35B-A3B on my laptop drew me a better pelican than Claude Opus 4.7 (news.ycombinator.com)
28.
Databricks tested a stronger model against its multi-step agent on hybrid queries. The stronger model still lost by 21%. (venturebeat.com)
29.
Databricks research shows multi-step agents consistently outperform single-turn RAG when answers span databases and documents (venturebeat.com)
30.
N-Day-Bench – Can LLMs find real vulnerabilities in real codebases? (news.ycombinator.com)
Today's top topics: google apple openai google health chatgpt anthropic samsung android authority nvidia spacex
View all today's topics →