Industry News
- OpenAI commits to 30GW compute by 2030: After securing 8GW of the initial 10GW Stargate commitment by January, OpenAI now plans to reach 30GW of computing capacity by 2030 - roughly equivalent to New York City's peak power demand. link
- Anthropic's Mythos model predicts 40-hour autonomy window: Using internal ECI values from the Opus 4.7 model card, researchers predict Mythos has a 50% time horizon of 40 hours (vs 19 hours for Opus 4.7), suggesting significantly more autonomous capability before human intervention needed. link
- DeepSeek targeting $20B+ valuation despite minimal revenue: The Chinese AI lab is betting on investor demand for frontier AI capabilities, aiming for a valuation above $20 billion in upcoming fundraising despite having little current revenue. link
- Google Cloud processes 16B+ tokens/min via direct API: Up from 10B last quarter, Google announced at Cloud Next their 8th-gen TPUs and Gemini Enterprise Agent Platform for building production-scale agents. link
Tips & Techniques
- LLM over-editing is a real problem - here's how to measure it: Frontier models often rewrite entire functions when asked to fix simple bugs. New research introduces a modified Levenshtein Distance metric to quantify "minimal editing" and shows RL outperforms SFT/DPO for training models to make smaller, precise edits. link
- Inference-time optimization for LLMs via evolutionary selection: UT-Evolve introduces selection pressure at inference time rather than in model weights - shifting capability from pretraining to system design. Built end-to-end in ~3 months, showing how system architecture can compensate for model limitations. link
New Tools & Releases
- Qwen3.6-27B launched with flagship coding performance: Alibaba's latest dense 27B model rivals much larger models on coding benchmarks, runs locally on 18GB RAM via Unsloth dynamic GGUFs, and already has day-0 vLLM support. link
- Step 3.5 Flash open-source model drops: StepFun's most capable open-source foundation model promises "fast enough to think, reliable enough to act" with strong tool-calling capabilities. link
- Runable launches full agent sandbox on iOS: The mobile app lets users build websites, apps, decks, reports, and videos from one prompt on their phone - bringing production-grade agent execution to mobile with the same harness principles as desktop. link
- HyperFrames: Open-source video framework for AI agents: HeyGen open-sources the missing video infrastructure layer for AI agents, enabling programmatic video creation and editing workflows that integrate with agent systems. link
Research & Papers
- Diamond Maps: Accurate guidance for diffusion models: New paper unlocks efficient and accurate guidance for diffusion models by combining distilled flow maps with stochastic exploration, showing strong scaling properties across modalities. link
- Parallel Token Prediction achieves O(1) autoregressive sampling: ICLR 2026 paper shows how to amortize autoregressive generation across tokens, combining with techniques like speculative decoding for blazing fast LLM inference. link
- minAlphaFold2: Readable protein folding in pure PyTorch: Simple, readable AlphaFold2 implementation inspired by minGPT, designed for rapid experimentation with 130 tests, gradient checkpointing, and MIT license. Overfits single proteins to sub-Å accuracy in ≤1000 CPU steps. link
- ReasoningBank: Agent memory framework for continuous learning: New framework enables LLM agents to learn from both successes and failures, building a memory system that improves reasoning over time. link
- Do LLMs game formalization? Evaluating faithfulness in logical reasoning: Research examines whether language models truly formalize problems correctly or take shortcuts, testing the reliability of LLM-generated formal proofs. link
--- *Curated from 500+ tweets across engineering, AI research, and builder communities*
---
Emerging Trends
✨ Google Cloud Next 2026 (28 mentions) - NEW Google Cloud Next conference taking place in Las Vegas this week, with announcements about Gemini Enterprise Agent Platform, 8th-gen TPUs, and significant growth in cloud momentum (16B+ tokens/min API usage).
✨ GPT Image 2 Release (67 mentions) - NEW OpenAI released GPT Image 2 (gpt-image-2) with significantly improved text rendering, typography, and detailed image generation capabilities. Users praising its accuracy for posters, mockups, and design work.
✨ Qwen 3.6 Model Release (43 mentions) - NEW Alibaba released Qwen 3.6 models including 27B and 35B-A3B variants with advanced architecture features (DSA2 attention, MoE, Muon optimizer). Community discussing quality comparisons with Opus 4.7 and local deployment options.
🔥 Codex 4M Users Milestone (52 mentions) - RISING Codex reached 4 million weekly active users, adding over 1 million users in less than two weeks. Rate limits were reset to celebrate, and Codex Labs was launched for enterprise partnerships.
📊 ICLR 2026 Conference (89 mentions) - CONTINUING ICLR 2026 conference taking place in Rio de Janeiro, Brazil this week with numerous paper presentations, workshops on agents and AI safety, and networking events from major AI labs.