GoKawiil - Tech News

1.

Alibaba's Qwen3.7-Plus supports text, video and imagery inputs at low cost of $0.4/$1.6 per 1M token — but it's proprietary (venturebeat.com)

2026-06-02 | get AI Text and Video Generator → | tags: alibaba, qwen3.7-plus, ai large language model

2.

People Flock to Privacy-Focused DuckDuckGo as Google Leans Heavily Into AI (cnet.com)

2026-05-28 | by See Full Bio | get DuckDuckGo Privacy Browser → | tags: duckduckgo, google i/o, ai initiatives

3.

People Are Flocking to DuckDuckGo as Google Leans Heavily Into AI (cnet.com)

2026-05-27 | by See Full Bio | get DuckDuckGo Privacy Browser → | tags: duckduckgo, google i/o, ai initiatives

4.

Google unveils Gemini Omni 'any-to-any' AI model: what enterprises should know (venturebeat.com)

2026-05-19 | get Google Gemini AI Developer Kit → | tags: google, gemini omni, ai plus

5.

Wirestock raises $23M to supply creative multimodal data to AI labs (techcrunch.com)

2026-05-14 | by Ivan Mehta | get AI Creative Data Subscription → | tags: wirestock, ai labs, stock photography

6.

Thinking Machines shows off preview of near-realtime AI voice and video conversation with new 'interaction models' (venturebeat.com)

2026-05-11 | get AI Conversation SDK → | tags: thinking machines, openai, interaction models

7.

Gemini API File Search is now multimodal (news.ycombinator.com)

2026-05-10 | by Ivan Solovyev | get OpenAI GPT-4 Multimodal Model → | tags: gemini, gemini embedding 2, retrieval-augmented generation

8.

Boosting multimodal inference performance by >10% with a single Python dict (news.ycombinator.com)

2026-05-06 | get Python Dictionary Optimization Kit → | tags: sglang, vllm, multimodal

9.

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents (news.ycombinator.com)

2026-05-05 | get AI Multimodal Development Kit → | tags: glm-5v-turbo, foundation model, multimodal agents

10.

OpenAI releases GPT-5.5 Instant, a new default model for ChatGPT (techcrunch.com)

2026-05-05 | by Ivan Mehta | get OpenAI ChatGPT Mug → | tags: openai, gpt-5.5 instant, chatgpt

11.

Alibaba's Metis agent cuts redundant AI tool calls from 98% to 2% — and gets more accurate doing it (venturebeat.com)

2026-04-30 | get AI Call Optimization Tool → | tags: alibaba, metis, hdpo

12.

Muse Spark: Scaling towards personal superintelligence (news.ycombinator.com)

2026-04-08 | get Muse Spark Brainwave Headset → | tags: muse spark, meta superintelligence labs, hyperion data center

13.

Muse Spark: Scaling Towards Personal Superintelligence (news.ycombinator.com)

2026-04-08 | get Muse Spark AI Kit → | tags: muse spark, meta superintelligence labs, hyperion data center

14.

Gemma 4 on iPhone (news.ycombinator.com)

2026-04-05 | get Gemma 4 iPhone Dock → | tags: gemma 4, ai edge gallery, large language models

15.

Mistral's Small 4 consolidates reasoning, vision and coding into one model — at a fraction of the inference cost (venturebeat.com)

2026-03-20 | get Mistral 4B AI Model → | tags: mistral, small 4, open-source

16.

Google's Gemini Embedding 2 arrives with native multimodal support to cut costs and speed up your enterprise data stack (venturebeat.com)

2026-03-11 | get Google Gemini Embedding Kit → | tags: google gemini embedding 2, google, ai models

17.

Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model (news.ycombinator.com)

2026-03-05 | by Brenda Potts | get Reasoning Vision → | tags: phi-4-reasoning-vision, multimodal reasoning, microsoft foundry

18.

Qwen3.5: Towards Native Multimodal Agents (news.ycombinator.com)

2026-02-16 | get Voice Recognition System → | tags: rust, qwen3.5, multimodal agents

19.

Most RAG systems don’t understand sophisticated documents — they shred them (venturebeat.com)

2026-01-31 | get Document Scanner → | tags: chunking, multimodal, semantic

20.

New Apple model combines vision understanding and image generation with impressive results (9to5mac.com)

2026-01-14 | by Marcus Mendes | get Augmented Reality Display → | tags: generation, image, manzano

21.

Z.ai debuts open source GLM-4.6V, a native tool-calling vision model for multimodal reasoning (venturebeat.com)

2025-12-09 | get GLM-4.6V → | tags: model, models, multimodal

22.

New training method boosts AI multimodal reasoning with smaller, smarter datasets (venturebeat.com)

2025-12-02 | get Neural Network → | tags: dataset, model, models

23.

OpenAI is ending API access to fan-favorite GPT-4o model in February 2026 (venturebeat.com)

2025-11-21 | get GPT-4 → | tags: model, models, multimodal

24.

AI Isn't Taking Your Job. It's Forcing You to Evolve. (feeds.feedburner.com)

2025-11-14 | get Artificial Intelligence → | tags: employees, human, multimodal

25.

Microsoft’s new AI agent can control software and robots (arstechnica.com)

2025-10-31 | get Meta Quest → | tags: ai, magma, microsoft