- Boris Cherny / Creator of Claude Code601.Love seeing how Spotify is shipping with Claude Code.›LLM score 92 · about 2 months ago
- Sharon Zhou / AMD VP of AI
- alphaXiv603.Something better than SAE just dropped "Learning a Generative Meta-Model of LLM Activations"›LLM score 92 · about 2 months ago
- Sayak Paul / Hugging Face Researcher
- Merve Noyan / Hugging Face ML Engineer605.Minimax M2.5 is out! 🔥 > leading in agentic tool calling (BFCL), search (BrowseComp), coding (SWE-Bench)›LLM score 75 · about 2 months ago
- Damek Davis / Assoc. Professor Wharton Stats
- Ben Burtenshaw / Hugging Face Researcher607.Custom Kernels for All from Codex and Claude(blog)LLM score 100 · about 2 months ago
- Skyler Miao / MiniMax Head of Engineering
- Shane Legg / Chief AGI Scientist at DeepMind609.I agree that AI testing is best thought of as a process and that what humans can typically do is a natural minimal bar for AGI.›LLM score 92 · about 2 months ago
- Omar Khattab / MIT CSAIL Asst professor610.BTW a lot of people think symbolic access to the prompt is essential in RLMs *only when* the prompt is extremely long.›LLM score 85 · about 2 months ago
- Jason Phang / OpenAI Researcher611.It is so nice to see codex tell me I’m wrong https://t.co/QgvBivmhqJLLM score 85 · about 2 months ago
- Skyler Miao / MiniMax Head of Engineering612.glad you love it! weights dropping in a few hours.›LLM score 92 · about 2 months ago
- Skyler Miao / MiniMax Head of Engineering613.M2.5 + OpenClaw still comes with a 7-day free trial via MiniMax OAuth.›LLM score 75 · about 2 months ago
- Aidan McLaughlin / OpenAI Research Scientist614.pro tip: 5.3-spark is basically my go-to research model now›LLM score 92 · about 2 months ago
- Jerry Tworek / ex OpenAI VP of RL615.It’s only AGI if it can improve itself continuously without us›LLM score 75 · about 2 months ago
- Noam Brown / OpenAI Research Scientist
- Radhakrishnan Venkataramani / ex MTS at xAI617.Farewell to @xai and friends — I left @xai this week.›LLM score 85 · about 2 months ago
- Boris Cherny / Creator of Claude Code618.A huge part of this raise is Claude Code. Weekly active users doubled since January.›LLM score 85 · about 2 months ago
- Alex Zhang / MIT CSAIL PhD619.Funnily enough I tried to dabble with ARC AGI before and with very little success…›LLM score 72 · about 2 months ago
- Eric Jang / ex VP of AI at 1X Robotics620.Really cool paper and blog post.›LLM score 85 · about 2 months ago
- Adams Wei Yu / DeepMind Research Scientist621.Proud to be part of the Deepthink team as we continue to push the frontiers of AI.›LLM score 92 · about 2 months ago
- Andrej Karpathy / AI researcher622.Congrats on the launch @simile_ai ! (and I am excited to be involved as a small angel.)›LLM score 82 · about 2 months ago
- Lucas Beyer / Meta Researcher623.Another earlier work along those same lines, thanks commenters! https://t.co/rpT8cS0HIY https://t.co/fER94WcnAkLLM score 85 · about 2 months ago
- Andy Jones / Anthropic Research Engineer624.i am glad this chart is public now because it is bananas.›LLM score 92 · about 2 months ago
- Leandro von Werra / Hugging Face Head of Research625.A 4B model… - at the level of Gemini 3 Pro - on IMO-ProofBench›LLM score 85 · about 2 months ago
- Asher Spector / Cofounder of Flapping Airplanes
- John Carmack627.The modern age has richly rewarded people with a combination of high intelligence and high agency.›LLM score 92 · about 2 months ago
- Lewis Tunstall / Hugging Face Researcher628.Very excited to share QED-Nano: the smallest theorem proving model to date 🤏At just 4B parameters, it matches the performance of much larger models on the challenging IMO-ProofBench benchmark and operates entirely in natural language, with no reliance on Lean or external tools.›LLM score 92 · about 2 months ago
- Yifei Zhou / Thinking Machines Researcher629.HLE 48% without using tools is next-level 🫡 would expected using tools to help much more tho https://t.co/0uaRHhKzOQLLM score 72 · about 2 months ago
- Omar Khattab / MIT CSAIL Asst professor630.Folks claim to set the state of the art on ARC-AGI-2 using an RLM, a deeply recursive one, to manage the long horizon.›LLM score 92 · about 2 months ago