📊 Updated Weekly

Model Watch

Weekly LLM rankings, capability comparisons, and benchmarks. Track updates to GPT, Claude, Gemini, and Chinese domestic models.

📅 5 models

Updated 2026-06-25

⭐ Hot

Computer Use in Gemini 3.5 Flash

Google integrates Computer Use as a built-in tool into Gemini 3.5 Flash, enabling developers to build agents that operate across browsers, mobile, and desktop.

📰 Hacker News Trending · 6/25/2026

⭐ Hot

New Version of GPT-5.5 Instant

The new GPT-5.5 Instant is more fun to chat with, understands intent better, and handles complex constraints more reliably.

📰 X: OpenAI · 6/25/2026

⭐ Hot

Bidirectional AI Voice Model Bidi 1 in Testing

ChatGPT launches Bidi 1, a bidirectional AI voice model that listens while it speaks, so users can interrupt mid-conversation.

📰 IT之家 · 6/24/2026

5-Second Video in 1.8 Seconds on RTX 5090

Sky Computing Lab releases FastWan-QAD, which generates a 5-second 480p video in 1.8 seconds on a single RTX 5090; the models are open-sourced.

📰 X: Sky Computing Lab · 6/24/2026

Let Agents Learn to Predict First, Then Act

Alibaba releases the first native language world model, Qwen-AgentWorld, which surpasses GPT-5.4 and Claude Opus 4.8 on AgentWorldBench; the models are open-sourced.

📰 WeChat Official Account: Tongyi Lab · 6/24/2026