- Noam Brown / OpenAI Research Scientist 1411. GPT-5.3-Codex's much better token efficiency *AND* faster inference is the biggest story of this release. › LLM score 85 · 4 months ago
- Boris Cherny / Creator of Claude Code 1412. Out now: Teams, aka Agent Swarms in Claude Code. Teams are experimental, and use a lot of tokens. › LLM score 75 · 4 months ago
- Boris Cherny / Creator of Claude Code 1413. I've been using Opus 4.6 for a bit -- it is our best model yet. › LLM score 92 · 4 months ago
- Igor Babuschkin / Cofounder of xAI 1414. Goodfire is one of the many amazing companies I met through Babuschkin Ventures. › LLM score 25 · 4 months ago
- alphaXiv
- Ben Burtenshaw / Hugging Face Researcher 1416. PSA: you define the leaderboards on the hub. › LLM score 92 · 4 months ago
- Kevin Murphy / DeepMind Research Scientist 1417. I just read the first two of these and like them a lot. › LLM score 92 · 4 months ago
- Kevin Murphy / DeepMind Research Scientist 1418.
- Richard Song / DeepMind Research Scientist 1419.
- Clive Chan / OpenAI Hardware
- Noam Brown / OpenAI Research Scientist 1421. GPT-5.2 evals are finally out for METR and it's state-of-the-art. › LLM score 92 · 4 months ago
- Jeff Dean / Chief Scientist at DeepMind 1422. Very proud to see the progress across so many areas. › LLM score 92 · 4 months ago
- Noam Brown / OpenAI Research Scientist 1423. It's fun watching Doug try to contain his exasperation with the bots' "logic" and actions in the replays. › LLM score 82 · 4 months ago
- Eric Jang / ex VP of AI at 1X Robotics 1424.
- Asher Spector / Cofounder of Flapping Airplanes
- Andrej Karpathy / AI researcher 1426. A lot of people quote tweeted this as 1 year anniversary of vibe coding. › LLM score 92 · 4 months ago
- alphaXiv
- Boris Cherny / Creator of Claude Code 1428. Did you know that Claude Code is more than a terminal CLI? › LLM score 75 · 4 months ago
- Shane Legg / Chief AGI Scientist at DeepMind
- Aidan McLaughlin / OpenAI Research Scientist 1430.
- Jim Fan / NVIDIA Director of Robotics
- Jason Weston / Meta Research Scientist 1432. Self-Improving Pretraining: We've updated our results given feedback: › LLM score 95 · 4 months ago
- Boris Cherny / Creator of Claude Code 1433. You can now use Slack in Cowork to have Claude read & send messages without leaving the app › LLM score 85 · 4 months ago
- Omar Khattab / MIT CSAIL Asst professor 1434. “Tools should exist to extend human capability. › LLM score 75 · 4 months ago
- Jerry Tworek / ex OpenAI VP of RL 1435. As I see it right now, future dystopia is a single company controlling artificial intelligence. › LLM score 82 · 4 months ago
- Jerry Tworek / ex OpenAI VP of RL 1436.
- Binyuan Hui / Alibaba Qwen Research Scientist 1437. How to scale training environments for coding agents? Let the agent build their own! 🙌 › LLM score 92 · 4 months ago
- alphaXiv 1438. Kimi K2.5 paper is now available! A huge push towards open-source multimodal agents. › LLM score 92 · 4 months ago
- Sayak Paul / Hugging Face Researcher 1439. New caching method dropped today in Diffusers -- MagCache 🔥 › LLM score 92 · 4 months ago
- Sayak Paul / Hugging Face Researcher