AI300

Shane Legg / Chief AGI Scientist at DeepMind
1951.
I agree that AI testing is best thought of as a process and that what humans can typically do is a natural minimal bar for AGI.
LLM score 92 · 6 months ago
Omar Khattab / MIT CSAIL Asst professor
1952.
BTW a lot of people think symbolic access to the prompt is essential in RLMs *only when* the prompt is extremely long.
LLM score 85 · 6 months ago
Jason Phang / OpenAI Researcher
1953.
It is so nice to see codex tell me I’m wrong https://t.co/QgvBivmhqJ
LLM score 85 · 6 months ago
Skyler Miao / MiniMax Head of Engineering
1954.
glad you love it! weights dropping in a few hours.
LLM score 92 · 6 months ago
Skyler Miao / MiniMax Head of Engineering
1955.
M2.5 + OpenClaw still comes with a 7-day free trial via MiniMax OAuth.
LLM score 75 · 6 months ago
Aidan McLaughlin / OpenAI Research Scientist
1956.
pro tip: 5.3-spark is basically my go-to research model now
LLM score 92 · 6 months ago
Jerry Tworek / ex OpenAI VP of RL
1957.
It’s only AGI if it can improve itself continuously without us
LLM score 75 · 6 months ago
Noam Brown / OpenAI Research Scientist
1958.
Francois Chollet: "AGI ~2030" Folks often point to @fchollet as an AGI skeptic, but he's said multiple times that he thinks it arrives within 5 years.
LLM score 92 · 6 months ago
Radhakrishnan Venkataramani / ex MTS at xAI
1959.
Farewell to @xai and friends: I left @xai this week.
LLM score 85 · 6 months ago
Boris Cherny / Creator of Claude Code
1960.
A huge part of this raise is Claude Code. Weekly active users doubled since January.
LLM score 85 · 6 months ago
Alex Zhang / MIT CSAIL PhD
1961.
Funnily enough I tried to dabble with ARC AGI before and with very little success…
LLM score 72 · 6 months ago
Eric Jang / ex VP of AI at 1X Robotics
1962.
Really cool paper and blog post.
LLM score 85 · 6 months ago
Adams Wei Yu / DeepMind Research Scientist
1963.
Proud to be part of the Deepthink team as we continue to push the frontiers of AI.
LLM score 92 · 6 months ago
Andrej Karpathy / AI researcher
1964.
Congrats on the launch @simile_ai ! (and I am excited to be involved as a small angel.)
LLM score 82 · 6 months ago
Lucas Beyer / Meta Researcher
1965.
Another earlier work along those same lines, thanks commenters! https://t.co/rpT8cS0HIY https://t.co/fER94WcnAk
LLM score 85 · 6 months ago
Andy Jones / Anthropic Research Engineer
1966.
i am glad this chart is public now because it is bananas.
LLM score 92 · 6 months ago
Leandro von Werra / Hugging Face Head of Research
1967.
A 4B model… - at the level of Gemini 3 Pro - on IMO-ProofBench
LLM score 85 · 6 months ago
Asher Spector / Cofounder of Flapping Airplanes
1968.
this is such a cool research question---how faithful can you make general-purpose simulations to reality? and an even cooler set of people @mihikapoor :) https://t.co/IdPrrlYeza
LLM score 92 · 6 months ago
John Carmack
1969.
The modern age has richly rewarded people with a combination of high intelligence and high agency.
LLM score 92 · 6 months ago
Lewis Tunstall / Hugging Face Researcher
1970.
Very excited to share QED-Nano: the smallest theorem proving model to date 🤏 At just 4B parameters, it matches the performance of much larger models on the challenging IMO-ProofBench benchmark and operates entirely in natural language, with no reliance on Lean or external tools.
LLM score 92 · 6 months ago
Yifei Zhou / Thinking Machines Researcher
1971.
HLE 48% without using tools is next-level 🫡 would have expected using tools to help much more tho https://t.co/0uaRHhKzOQ
LLM score 72 · 6 months ago
Omar Khattab / MIT CSAIL Asst professor
1972.
Folks claim to set the state of the art on ARC-AGI-2 using an RLM, a deeply recursive one, to manage the long horizon.
LLM score 92 · 6 months ago
Jeff Dean / Chief Scientist at DeepMind
1973.
We’ve updated Gemini 3 Deep Think to better tackle the complexity of real-world research, science, and engineering.
LLM score 25 · 6 months ago
Noam Shazeer
1974.
An updated Gemini 3 Deep Think is out today: 📈 Achieves SOTA on ARC-AGI-2, MMMU-Pro, and HLE.
LLM score 85 · 6 months ago
Ben Burtenshaw / Hugging Face Researcher
1975.
agentic RL envs need to go beyond games (wordle, sudoku) and into the real world tasks that people use them for.
LLM score 92 · 6 months ago
alphaXiv
1976.
"ViT-5: Vision Transformers for The Mid-2020s" This paper shows that plain Vision Transformers still have a lot of low-hanging fruit, with many under-optimized aspects.
LLM score 92 · 6 months ago
Sayak Paul / Hugging Face Researcher
1977.
Time for another cool collaborative research story!
LLM score 85 · 6 months ago
Zixuan Li / Lead Z.ai
1978.
Several AIs reference glm5 .net (I entered a space to prevent it from loading) when summarizing GLM-5 information.
LLM score 92 · 6 months ago
Jonathan Lee / DeepMind Researcher
1979.
Our latest versions of Deep Think are helping accelerate math research.
LLM score 92 · 6 months ago
Lucas Beyer / Meta Researcher
1980.
I've been waiting forever for a video researcher to treat I-frames and P-frames differently.
LLM score 92 · 6 months ago