AI300

Skyler Miao / MiniMax Head of Engineering
1501.
Great observation. We intentionally trained the model to be better at planning and at clarifying requirements with the user.›
LLM score 92 · 4 months ago
Alex Zhang / MIT CSAIL PhD
1502.
I really want to like this paper as an exploration of how to refine an RLM (which I am very much in favor of), but IMO the conclusion is too short-horizon and sort of misses the argument of the original paper.›
LLM score 85 · 4 months ago
Ronak Malde / DeepMind Researcher
1503.
This paper is almost too good that I didn't want to share it›
LLM score 92 · 4 months ago
Sebastien Bubeck / OpenAI MTS
1504.
My good friend Christian Coester solved a 50 years old open problem in self-organizing lists (was it really open? did people even care? yes and yes), and, you guessed it, ChatGPT-Pro made the key step in the proof*! (*a Christian Coester on the other side of the screen was needed›
LLM score 92 · 4 months ago
Shane Legg / Chief AGI Scientist at DeepMind
1505.
Check out this great work on measuring progress towards AGI and the associated global @Kaggle hackathon.›
LLM score 82 · 4 months ago
Sherwin Wu / OpenAI API, Head of Engineering
1506.
It’s never been easier to multitask with Codex! We’re very quickly moving towards a world where it feels like we as humans are now the bottleneck in coding.›
LLM score 75 · 4 months ago
Alex Zhang / MIT CSAIL PhD
1507.
Ran a small eval today on an LM using GPT-5.2 as a judge.›
LLM score 92 · 4 months ago
Richard Song / DeepMind Research Scientist
1508.
Absolutely beautiful. LLMs are both theoretically and practically Turing complete https://t.co/C66kEafCs3
LLM score 75 · 4 months ago
Lewis Tunstall / Hugging Face Researcher
1509.
Neat, reminds me of the reasoning cache by @ianwu97 that we used to train QED-Nano https://t.co/pJ99JjwMol https://t.co/wnI2APmpks https://t.co/14Pe93FiuB
LLM score 85 · 4 months ago
John Carmack
1510.
The corporate advisory boards that I have been a part of have almost exclusively been “vibe checks”, where presentations are made about work the company is doing, and the advisory panel chats about things for a while.›
LLM score 75 · 4 months ago
Sherwin Wu / OpenAI API, Head of Engineering
1511.
We're now in a world where you can get 54.4% SWE-Bench PRO performance, and 60% (!) T-Bench 2.0 performance:›
LLM score 92 · 4 months ago
Tri Dao / Chief Scientist at Together
1512.
The frontier has increasingly shifted to hybrid models - from Qwen to Kimi-Linear and now with NVIDIA's Nemotron-3 Super - that rely on a strong linear sequence model.›
LLM score 98 · 4 months ago
Albert Gu / Cartesia Chief Scientist
1513.
The newest model in the Mamba series is finally here 🐍›
LLM score 92 · 4 months ago
Jack Clark / Anthropic Cofounder
1514.
I'm scaling the economic research function here @AnthropicAI to meet the challenge of powerful AI.›
LLM score 85 · 4 months ago
Andrew Lampinen / Research Scientist at DeepMind
1515.
Pleased to share that our paper "Representation Biases: Variance is Not Always a Good Proxy for Importance" is now out as Theory/New Concepts paper in eNeuro! Thread: https://t.co/slZyrAPGG7
LLM score 92 · 4 months ago
Leandro von Werra / Hugging Face Head of Research
1516.
the greatest gifts of LLMs getting really good at computer tasks is that nobody will ever have to touch excel again.›
LLM score 72 · 4 months ago
Merve Noyan / Hugging Face ML Engineer
1517.
3d asset gen is solved, up next, worlds 🔥 we want a new 3D arena to compare 3d world models and we need you 🫵🏻›
LLM score 82 · 4 months ago
Sherwin Wu / OpenAI API, Head of Engineering
1518.
It's been honestly dizzying seeing the uptake of GPT-5.4 in our API since launch 😵‍💫 – it's a good model! https://t.co/KeioK10KNY
LLM score 75 · 5 months ago
Leandro von Werra / Hugging Face Head of Research
1519.
Some thoughts on why OSS libraries wont completely go away with coding agents:›
LLM score 92 · 5 months ago
Ben Burtenshaw / Hugging Face Researcher
1520.
PSA: hf skills are embedded into their tools.›
LLM score 92 · 5 months ago
Sander Dieleman / DeepMind Research Scientist
1521.
In October, I gave a talk at @MLinPL in Warsaw: a whirlwind tour of what goes into training image and video generation models at scale.›
LLM score 92 · 5 months ago
Sayak Paul / Hugging Face Researcher
1522.
Many things that seemed like eternities away feel possible with the advent of things like @claudeai! ›
LLM score 75 · 5 months ago
Merve Noyan / Hugging Face ML Engineer
1523.
ResNet connections but make it attention??? this could cut a lot of costs! https://t.co/VNw6fydMDa
LLM score 75 · 5 months ago
Jerry Tworek / ex OpenAI VP of RL
1524.
Rethink everything. deep leaning 2.0 is approaching https://t.co/niB6HZ18L3
LLM score 5 · 5 months ago
Chelsea Finn / Physical Intelligence Cofounder
1525.
Usually, we expect more diverse data >> less diverse data.›
LLM score 92 · 5 months ago
Eric Jang / ex VP of AI at 1X Robotics
1526.
I heard a claim today that the Korean govt buys about 10k H100/B200 class GPUs per year to be granted to businesses and universities.›
LLM score 85 · 5 months ago
Demis Hassabis / CEO of DeepMind
1527.
Cool use case of AlphaFold, this is just the beginning of digital biology! https://t.co/EdWEL2r33Z
LLM score 85 · 5 months ago
Damek Davis / Assoc. Professor Wharton Stats
1528.
Excited to launch the first stage of this competition with Terry and the @sairfoundation!›
LLM score 75 · 5 months ago
Sebastien Bubeck / OpenAI MTS
1529.
My 9 yo is now fully independent with codex and it's insane to watch, we built a few games together and then he went off to build his own tower defense, adding features by himself and testing them ...›
LLM score 75 · 5 months ago
Merve Noyan / Hugging Face ML Engineer
1530.
spot on summary of world models https://t.co/TiUm23RoPB
LLM score 85 · 5 months ago