- Ben Burtenshaw / Hugging Face Researcher · 1231. your new Qwen 3.5 workhorse is here. › LLM score 12 · 4 months ago
- Merve Noyan / Hugging Face ML Engineer · 1232. Qwen3.5 @Alibaba_Qwen is out! > largest model (A17B/397B) in series, context window of 262k tokens › LLM score 25 · 4 months ago
- Lucas Beyer / Meta Researcher
- Lewis Tunstall / Hugging Face Researcher · 1234. A few thoughts after reading the @OpenAI paper on scattering amplitudes over the weekend: › LLM score 85 · 4 months ago
- Merve Noyan / Hugging Face ML Engineer · 1235. upcoming months we (me + @ariG23498) will focus on following › LLM score 92 · 4 months ago
- Igor Babuschkin / Cofounder of xAI
- Omar Khattab / MIT CSAIL Asst professor · 1237.
- alphaXiv · 1238.
- Damek Davis / Assoc. Professor Wharton Stats · 1239. This is a really cool project. My first thought was obviously to ask microagent to make picoagent. › LLM score 92 · 4 months ago
- Cameron Wolfe / Researcher at Netflix · 1240. I’m publishing a long-form overview of using rubrics for RL tomorrow. › LLM score 92 · 4 months ago
- Damek Davis / Assoc. Professor Wharton Stats · 1241. The second class is a crash course on stochastic optimization in machine learning. › LLM score 85 · 4 months ago
- Thang Luong / DeepMind Principal Scientist · 1242.
- Ben Burtenshaw / Hugging Face Researcher · 1243. it's good to know Dario's upper bound for 2026: - <$1tn in compute › LLM score 82 · 4 months ago
- Lewis Tunstall / Hugging Face Researcher · 1244. We trained a tiny 4B model to reason for millions of tokens through IMO-level problems. › LLM score 92 · 4 months ago
- Lucas Beyer / Meta Researcher · 1245. I've seen some people compliment this article being well/clearly written. › LLM score 82 · 4 months ago
- Christian Szegedy / ex xAI Cofounder · 1246. There must be huge low-hanging fruit in figuring out how to train metacognition incrementally. › LLM score 85 · 4 months ago
- Christian Szegedy / ex xAI Cofounder
- Christian Szegedy / ex xAI Cofounder
- Noam Brown / OpenAI Research Scientist
- Omar Khattab / MIT CSAIL Asst professor
- Sebastien Bubeck / OpenAI MTS · 1251.
- Jerry Tworek / ex OpenAI VP of RL
- Jerry Tworek / ex OpenAI VP of RL · 1253. Researchers will literally regularise their models for years instead of doing second-order optimization › LLM score 82 · 4 months ago
- Omar Khattab / MIT CSAIL Asst professor · 1254. RLMs are not sub-agents or the ability to iteratively retrieve context. › LLM score 92 · 4 months ago
- Jerry Tworek / ex OpenAI VP of RL · 1255.
- Jason Phang / OpenAI Researcher · 1256. Our models took a solid swing at the https://t.co/PX4qq5T4Sm problems! › LLM score 85 · 4 months ago
- Clive Chan / OpenAI Hardware
- Noam Brown / OpenAI Research Scientist
- Jakub Pachocki / OpenAI Chief Scientist · 1259. Very excited about the "First Proof" challenge. › LLM score 92 · 4 months ago
- Skyler Miao / MiniMax Head of Engineering · 1260. Appreciate it. The gap is closing fast. › LLM score 82 · 4 months ago