Industry News
- ElevenLabs hits $100M+ net new ARR in Q1: The voice AI company added over $100M in quarterly ARR driven by enterprise deployments of conversational agents across voice, chat, text, and email with clients like DT, Revolut, and Klarna.
- Anthropic Claude Mythos completes AISI cyber range: First model to complete AISI's 32-step corporate network attack simulation, demonstrating capability to execute sophisticated multi-stage cyber operations from reconnaissance to full network takeover.
- Compute demand defies bubble predictions: Six months after widespread predictions of a massive compute glut and AI recession, inference demand continues accelerating with persistent chip shortages and expanding data center capacity, contradicting the "compute bubble" thesis.
Tips & Techniques
- Anthropic secretly nerfed Claude Code thinking effort: Claude Code's default thinking effort was quietly downgraded from "high" to "medium," causing performance drops. Users can restore performance by manually setting thinking effort to "max" in settings.
- Telemetry opt-out reduces cache from 1hr to 5min: Disabling telemetry in Claude Code drops prompt cache duration from 1 hour to just 5 minutes, effectively punishing users who disable data collection with significantly degraded performance.
New Tools & Releases
- Cloudflare Sandboxes now GA with secure credential injection: Sandboxes provide agents with full computer environments (terminal, filesystem) while using Workers as a trusted proxy to inject credentials at the network layer, preventing agents from ever seeing secrets.
- ColGREP 1.2.0 combines hierarchical search with BM25: The code search tool now merges hierarchical multi-vector search with BM25 keyword scoring, dramatically improving local codebase navigation for AI agents and developers.
- DFlash brings speculative decoding to Apple Silicon: New speculative decoding implementation for MLX delivers significantly faster inference on Apple Silicon without requiring custom forks, achieving major speedups on M5 Max at 2048 tokens.
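For readers new to the technique behind the DFlash item, here is a minimal sketch of greedy speculative decoding in plain Python. It is not DFlash's or MLX's code: the function names, the toy "models" (simple next-token functions over integers), and the draft length `k` are all assumptions made for illustration. The point it shows is that a cheap draft model proposes several tokens and the target model only verifies them, so the output matches what the target would have produced on its own.

```python
from typing import Callable, List

Token = int
NextToken = Callable[[List[Token]], Token]  # hypothetical greedy next-token function


def speculative_decode(target: NextToken, draft: NextToken,
                       prompt: List[Token], max_new: int, k: int = 4) -> List[Token]:
    """Greedy speculative decoding; output equals target-only greedy decoding."""
    out = list(prompt)
    while len(out) - len(prompt) < max_new:
        # 1. The cheap draft model proposes k tokens autoregressively.
        proposal: List[Token] = []
        for _ in range(k):
            proposal.append(draft(out + proposal))
        # 2. The target checks each proposed position. A real implementation
        #    scores all k positions in one batched forward pass; this toy
        #    calls the target once per position to keep the logic visible.
        accepted: List[Token] = []
        for tok in proposal:
            expected = target(out + accepted)
            if tok == expected:
                accepted.append(tok)
            else:
                accepted.append(expected)  # fix the first mismatch, drop the rest
                break
        else:
            # Every draft token matched, so the target contributes a bonus token.
            accepted.append(target(out + accepted))
        out.extend(accepted)
    return out[:len(prompt) + max_new]


def greedy_decode(model: NextToken, prompt: List[Token], max_new: int) -> List[Token]:
    """Baseline: one model call per generated token."""
    out = list(prompt)
    for _ in range(max_new):
        out.append(model(out))
    return out


# Toy stand-ins (assumptions): the target counts upward mod 10,
# and the draft agrees except right after a 4.
def toy_target(ctx: List[Token]) -> Token:
    return (ctx[-1] + 1) % 10


def toy_draft(ctx: List[Token]) -> Token:
    return 0 if ctx[-1] == 4 else (ctx[-1] + 1) % 10
```

The speedup in practice comes from step 2 being a single batched pass on the large model, so the more often the draft agrees with the target, the more tokens each expensive pass yields.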
Research & Papers
- Behavioral diversity from path entropy maximization: Research suggests agents trained on entropy objectives (maximizing future action-state path occupancy) rather than reward exhibit variable behavior when safe and become surgically goal-oriented only when absorbing states threaten their future path space.
- LLMs encode harmful content in distinct weights: Weight pruning reveals harmful generation depends on a tiny shared subset of weights across harm types, separate from benign capabilities, suggesting safety mechanisms could be surgically modified.
- Adaptive action chunking improves VLA success by 15%: Dynamically selecting chunk size at inference time based on action entropy (small chunks for precision, large for coarse movement) improves real-world manipulation from 67% to 82% without retraining.
- On-policy distillation now 40x faster with TRL: New TRL implementation enables on-policy distillation with 100B+ teacher models at speeds 40x faster than naive approaches, making real-time distillation practical for production systems.
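To make the adaptive action chunking item above more concrete, here is a rough sketch of picking a chunk size from the entropy of a discrete action distribution. It is not the paper's rule: the function names, the linear mapping, the chunk bounds, and especially the direction of the mapping are assumptions. The summary does not say whether high entropy should shorten or lengthen the chunk, so the sketch exposes that choice as a flag rather than guessing.

```python
import math
from typing import Sequence


def entropy(probs: Sequence[float]) -> float:
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def choose_chunk_size(probs: Sequence[float], min_chunk: int = 1,
                      max_chunk: int = 8,
                      shrink_when_uncertain: bool = True) -> int:
    """Map action entropy to a chunk size by linear interpolation.

    Assumption: with shrink_when_uncertain=True, a high-entropy (uncertain)
    policy executes fewer actions before re-planning. Flip the flag for the
    opposite convention.
    """
    if len(probs) < 2:
        raise ValueError("need at least two actions to normalize entropy")
    # Normalize to [0, 1] by the maximum possible entropy, log(n).
    normalized = min(1.0, entropy(probs) / math.log(len(probs)))
    frac = 1.0 - normalized if shrink_when_uncertain else normalized
    return min_chunk + round(frac * (max_chunk - min_chunk))
```

Because the rule runs only at inference time on the policy's own output distribution, a scheme like this needs no retraining, which matches the claim in the summary.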
--- *Curated from 500+ tweets across multiple AI-focused lists*
---
Emerging Trends
✨ Cloudflare Agents Week and Sandboxes (78 mentions) - NEW: Cloudflare announced general availability of its Sandbox product for agents during "Agents Week," featuring secure compute environments with credential injection, programmable egress proxies, and tools for giving agents terminal and file system access.
🔥 Vibe Coding and AI Development (168 mentions) - RISING: Discussion of "vibe coding" with AI tools like Claude Code, Cursor, and Codex for rapid application development, with developers sharing workflows and productivity tips and debating the trade-off between craft and output.
🔥 Agent Harnesses and Workflows (142 mentions) - RISING: Growing conversation around agent harness frameworks and multi-agent workflows, including tools like OpenClaw and Hermes Agent and patterns for decomposing problems across multiple AI agents with orchestration layers.
📊 OpenAI Codex and ChatGPT Pro Usage (195 mentions) - CONTINUING: Ongoing discussion of OpenAI's Codex tool for development work, including comparisons with Claude Code and debate over the ChatGPT Pro pricing tiers ($100 vs $200) and their respective usage limits and bonuses.