- Jeff Dean / Chief Scientist at DeepMind631.We’ve updated Gemini 3 Deep Think to better tackle the complexity of real-world research, science, and engineering.›LLM score 25 · about 2 months ago
- Noam Shazeer632.An updated Gemini 3 Deep Think is out today: 📈 Achieves SOTA on ARC-AGI-2, MMMU-Pro, and HLE.›LLM score 85 · about 2 months ago
- Ben Burtenshaw / Hugging Face Researcher633.agentic RL envs need to go beyond games (wordle, sudoku) and into the real world tasks that people use them for.›LLM score 92 · about 2 months ago
- alphaXiv
- Sayak Paul / Hugging Face Researcher635.Time for another cool collaborative research story!›LLM score 85 · about 2 months ago
- Zixuan Li / Lead Z.ai636.Several AI reference glm5 .net (enter a space to prevent loading it) when summarizing GLM-5 information.›LLM score 92 · about 2 months ago
- Jonathan Lee / DeepMind Researcher637.Our latest versions of Deep Think are helping accelerate math research.›LLM score 92 · about 2 months ago
- Lucas Beyer / Meta Researcher638.I've been waiting forever for a video researcher to treat I-frames and P-frames differently.›LLM score 92 · about 2 months ago
- Jie Tang / Z.ai Cofounder639.pony alpha -> GLM-5 is coming with AA=50, scoring No.›LLM score 75 · about 2 months ago
- Damek Davis / Assoc. Professor Wharton Stats640.We've now reached 50+ constants and 20 contributors on the repo!›LLM score 75 · about 2 months ago
- Christian Szegedy / ex xAI Cofounder641.Happy to bet that this won't happen neither this year (2026) nor next year (2027) for 95% of software.›LLM score 92 · about 2 months ago
- Demis Hassabis / CEO of DeepMind
- Andrew Ma / ex MTS at xAI
- Aidan McLaughlin / OpenAI Research Scientist644.imo the linguistic question of whether ai can 'think' or 'reason' is not disinteresting›LLM score 72 · about 2 months ago
- Lucas Beyer / Meta Researcher645.Couple interesting things (new to me): - Macrohard is, roughly, computer-use agents.›LLM score 75 · about 2 months ago
- Andrej Karpathy / AI researcher646.New art project. Train and inference GPT in 243 lines of pure, dependency-free Python.›LLM score 98 · about 2 months ago
- Alex Zhang / MIT CSAIL PhD647.another related direction I’ll be paying attention to this year :) https://t.co/N9TmwamKWvLLM score 75 · about 2 months ago
- alphaXiv648."Expanding the Capabilities of RL via Text Feedback (RLTF)"›LLM score 92 · about 2 months ago
- Michael Elabd / DeepMind Researcher649.Love when papers introduce general frameworks for training-time continual learning!›LLM score 92 · about 2 months ago
- Leandro von Werra / Hugging Face Head of Research650.
- Sherwin Wu / OpenAI API, Head of Engineering651.One of my favorite experiments we've run internally: run a software team building 100% with Codex – i.e.›LLM score 92 · about 2 months ago
- Asher Spector / Cofounder of Flapping Airplanes
- Merve Noyan / Hugging Face ML Engineer653.GLM-5 is out on @huggingface 🔥 > A40B/744B, trained on more tokens (28.5T)›LLM score 75 · about 2 months ago
- Andrej Karpathy / AI researcher654.On DeepWiki and increasing malleability of software.›LLM score 92 · about 2 months ago
- Lucas Beyer / Meta Researcher655.Haha i didn't believe it at first, but i can reproduce this.›LLM score 72 · about 2 months ago
- Omar Khattab / MIT CSAIL Asst professor
- Ben Burtenshaw / Hugging Face Researcher657.tuning open weight models on colab is hands down biggest educational unlock out there.›LLM score 92 · about 2 months ago
- Jerry Tworek / ex OpenAI VP of RL658.Capital is necessary but not sufficient to define the future of machine intelligenceLLM score 75 · about 2 months ago
- Cameron Wolfe / Researcher at Netflix659.Some more really good papers on rubric rewards that I've been reading:›LLM score 85 · about 2 months ago
- Roland Gavrilescu / ex MTS at xAI660.I left xAI. Building something new with others that left xAI.›LLM score 35 · about 2 months ago