- Ben Burtenshaw / Hugging Face Researcher 1381. the @huggingface supports agentic environments that you can use for rl or evals. › LLM score 85 · 4 months ago
- John Carmack
- Cameron Wolfe / Researcher at Netflix 1383. The Elon / Dwarkesh podcast was cool. › LLM score 82 · 4 months ago
- alphaXiv 1384. New generative model paradigm: Drifting Models 1 step inference but at SoTA fidelity! › LLM score 92 · 4 months ago
- Jeff Dean / Chief Scientist at DeepMind 1385. Gemini+Genie 3 are helping @Waymo simulate long tail scenarios to make driving safer. › LLM score 75 · 4 months ago
- Sharon Zhou / AMD VP of AI 1386.
- Yingru Li / xAI Researcher 1387. We trained Dr. Kernel - a 14B model that matches or beats GPT-5 and Claude-4.5-Sonnet on KernelBench. › LLM score 92 · 4 months ago
- Ben Burtenshaw / Hugging Face Researcher 1388. Eval scores in 2026 are broken. › LLM score 92 · 4 months ago
- Merve Noyan / Hugging Face ML Engineer 1389. we released Community Evals to fix transparency in evals 🤝 › LLM score 85 · 4 months ago
- Thomas Wolf / Hugging Face Cofounder
- Cameron Wolfe / Researcher at Netflix 1391. This is a really nice framing of efficiency constraints in RL training. › LLM score 92 · 4 months ago
- Ted Sanders / OpenAI Researcher 1392. it's wild how much LLMs have transformed the profession of software engineering over the past year. › LLM score 85 · 4 months ago
- Kevin Murphy / DeepMind Research Scientist 1393. I'm pleased to share our paper on learning temporally abstract world models and policies (options). › LLM score 95 · 4 months ago
- Sholto Douglas / Researcher at Anthropic 1394.
- Richard Song / DeepMind Research Scientist 1395. “Am I disappointed when my students don’t go off and be mathematicians? › LLM score 85 · 4 months ago
- Sholto Douglas / Researcher at Anthropic 1396. Long weekends are good for GDP because people discover how good the models are https://t.co/yucnlrRos4 › LLM score 85 · 4 months ago
- Sherwin Wu / OpenAI API, Head of Engineering 1397. This article really resonated with me, especially in relation to GPT-5.3-Codex. › LLM score 85 · 4 months ago
- Geoffrey Hinton 1398.
- Andrew White / Edison Scientific Cofounder
- Andrew White / Edison Scientific Cofounder 1400. Opus 4.6 is out today - which we've been using for a while. › LLM score 82 · 4 months ago
- Behnam Neyshabur / Anthropic Researcher 1401. I've left Anthropic to start something new. 🧵 https://t.co/6VzY1T3ivN › LLM score 75 · 4 months ago
- Andrew White / Edison Scientific Cofounder
- Clive Chan / OpenAI Hardware 1403. hardware-software co-design, years in the making! https://t.co/y08aoW5tbR › LLM score 92 · 4 months ago
- Chi Jin / OpenAI Researcher 1404.
- Andrew Lampinen / Research Scientist at DeepMind
- Clive Chan / OpenAI Hardware 1406.
- Emmanuel Ameisen / Anthropic Interpretability Researcher
- Aidan McLaughlin / OpenAI Research Scientist 1408. we have an internal codex usage leaderboard and karel is like 10xing everyone else on our team › LLM score 85 · 4 months ago
- Noam Brown / OpenAI Research Scientist 1409. GPT-5.3-Codex's much better token efficiency *AND* faster inference is the biggest story of this release. › LLM score 85 · 4 months ago
- Boris Cherny / Creator of Claude Code 1410. Out now: Teams, aka. Agent Swarms in Claude Code. Teams are experimental, and use a lot of tokens. › LLM score 75 · 4 months ago