- Jeffrey Wang / OpenAI Researcher451.a beautiful & simple & important GPT-5.5 proof :) https://t.co/VrDc4BHtPnLLM score 92 · about 1 month ago
- Aidan McLaughlin / OpenAI Research Scientist452.
- Damek Davis / Assoc. Professor Wharton Stats453.It's been fun looking through the stage 1 cheatsheets, while we finish up the evaluation.›LLM score 75 · about 1 month ago
- Yoshua Bengio454.AI is advancing faster than our ability to manage it.›LLM score 92 · about 1 month ago
- Jerry Tworek / ex OpenAI VP of RL455.If anyone ever asks what does "frontier" model means, show them this picture as a definition: https://t.co/BEfamwdffsLLM score 75 · about 1 month ago
- Jason Weston / Meta Research Scientist456.
- Alex Zhang / MIT CSAIL PhD457.Forgot to mention this but the wonderful folks at @AntimLabs are hosting and have eval'd the latest VLMs on VideoGameBench!›LLM score 82 · about 1 month ago
- Jack Clark / Anthropic Cofounder458.SNOWSUMMER, the Ultimate Insurance Policy(blog)LLM score 95 · about 1 month ago
- Sebastien Bubeck / OpenAI MTS459.Ramsey numbers are one the most basic objects in combinatorics, a beautiful illustration of structure within chaos.›LLM score 98 · about 1 month ago
- Ben Burtenshaw / Hugging Face Researcher460.DeepSeek-V4 dropped. 1M context.›LLM score 75 · about 1 month ago
- Merve Noyan / Hugging Face ML Engineer461.if you were in a cave, DeepSeek v4 is out, and it's groundbreaking, here's why:›LLM score 92 · about 1 month ago
- Ben Burtenshaw / Hugging Face Researcher462.deepseek-v4 is out and solves context rot at 1M tokens by taking on attention for the kv cache.›LLM score 92 · about 1 month ago
- Merve Noyan / Hugging Face ML Engineer463.DSv4 genuinely shines in 1M context window and peak efficiency to run many agents/users 😍›LLM score 85 · about 1 month ago
- Clive Chan / OpenAI Hardware
- Merve Noyan / Hugging Face ML Engineer465.DeepSeek v4 is out with 1M context window 🥵🔥 > Pro (13B/284B) & Flash (49B/1.6T) ›LLM score 92 · about 1 month ago
- Cameron Wolfe / Researcher at Netflix466.The idea of training LLMs to manage their own KV cache is super interesting to me.›LLM score 92 · about 1 month ago
- Sherwin Wu / OpenAI API, Head of Engineering467.Set Codex to this and never look back. Medium reasoning effort is good enough for me for ~anything I need to do now.›LLM score 85 · about 1 month ago
- Sherwin Wu / OpenAI API, Head of Engineering468.It's not every day you hear a frontend engineer of Tyler's caliber praise a model this much.›LLM score 75 · about 1 month ago
- Roland Gavrilescu / ex MTS at xAI469.Autoresearch will be the trend of the year https://t.co/xIMiSQi8ELLLM score 75 · about 1 month ago
- Roland Gavrilescu / ex MTS at xAI470.Fate loves irony: OpenAI has safer models than Anthropic https://t.co/pZnVVoQMI0LLM score 85 · about 1 month ago
- Clive Chan / OpenAI Hardware471.this 38h run was GPT5.5! managing a team of agents is mandatory for researchers now tbh›LLM score 92 · about 1 month ago
- Aidan McLaughlin / OpenAI Research Scientist472.over break i dictated to 5.5 for minutes describing a new ambitious rl run.›LLM score 85 · about 1 month ago
- Noam Brown / OpenAI Research Scientist473.A hill that I will die on: with today's AI models, intelligence is a function of inference compute.›LLM score 92 · about 1 month ago
- Sebastien Bubeck / OpenAI MTS474.GPT-5.5, not fully saturating the TikZ unicorn test yet but getting awfully close ...›LLM score 92 · about 1 month ago
- Jonathan Ross / TPU Creator
- Noam Brown / OpenAI Research Scientist476.I'm a manager at @OpenAI, but with GPT-5.5 I'm a more effective IC than I've ever been.›LLM score 92 · about 1 month ago
- Boris Cherny / Creator of Claude Code477.We’ve been looking into recent reports around Claude Code quality issues, and just published a post-mortem on what we found.›LLM score 95 · about 1 month ago
- Jerry Tworek / ex OpenAI VP of RL478.This is true for AI in almost any field.›LLM score 75 · about 1 month ago
- Damek Davis / Assoc. Professor Wharton Stats
- Jeff Dean / Chief Scientist at DeepMind480.