AI300

Ben Burtenshaw / Hugging Face Researcher
1231.
your new Qwen 3.5 workhorse is here.›
LLM score 12 · 4 months ago
Merve Noyan / Hugging Face ML Engineer
1232.
Qwen3.5 @Alibaba_Qwen is out! > largest model (A17B/397B) in series, context window of 262k tokens›
LLM score 25 · 4 months ago
Lucas Beyer / Meta Researcher
1233.
After initially being hyped about the speed, I have to say that 5.3-codex-spark, even on xhigh, is actually quite a bit dumber than 5.3-codex, to the point that I'm back to using the latter most of the time.
LLM score 92 · 4 months ago
Lewis Tunstall / Hugging Face Researcher
1234.
A few thoughts after reading the @OpenAI paper on scattering amplitudes over the weekend:›
LLM score 85 · 4 months ago
Merve Noyan / Hugging Face ML Engineer
1235.
upcoming months we (me + @ariG23498) will focus on following›
LLM score 92 · 4 months ago
Igor Babuschkin / Cofounder of xAI
1236.
What’s the best open alternative to OpenClaw right now? Doesn’t make sense to put all your data into it if it’s owned by OpenAI.›
LLM score 75 · 4 months ago
Omar Khattab / MIT CSAIL Asst professor
1237.
not sure what's going on but OAI folks are going all in on developing new scaffolds/harnesses/programs in the last 24 hours›
LLM score 82 · 4 months ago
alphaXiv
1238.
The first Transformer -> SSM hybrid distillation that proves you only need ~2% of attention heads to keep in-context retrieval!›
LLM score 92 · 4 months ago
Damek Davis / Assoc. Professor Wharton Stats
1239.
This is a really cool project. My first thought was obviously to ask microagent to make picoagent.›
LLM score 92 · 4 months ago
Cameron Wolfe / Researcher at Netflix
1240.
I’m publishing a long-form overview of using rubrics for RL tomorrow.›
LLM score 92 · 4 months ago
Damek Davis / Assoc. Professor Wharton Stats
1241.
The second class is a crash course on stochastic optimization in machine learning.›
LLM score 85 · 4 months ago
Thang Luong / DeepMind Principal Scientist
1242.
Yes, we provided 3 things for AI-assisted math: * Human-AI interaction (HAI) card (photo), inspired by model cards›
LLM score 92 · 4 months ago
Ben Burtenshaw / Hugging Face Researcher
1243.
it's good to know Dario's upper bound for 2026: - <$1tn in compute›
LLM score 82 · 4 months ago
Lewis Tunstall / Hugging Face Researcher
1244.
We trained a tiny 4B model to reason for millions of tokens through IMO-level problems.›
LLM score 92 · 4 months ago
Lucas Beyer / Meta Researcher
1245.
I've seen some people compliment this article being well/clearly written.›
LLM score 82 · 4 months ago
Christian Szegedy / ex xAI Cofounder
1246.
There must be huge low-hanging fruit in figuring out how to train metacognition incrementally.›
LLM score 85 · 4 months ago
Christian Szegedy / ex xAI Cofounder
1247.
In addition to adversarial attacks, the increasing use of LLMs for personal communication and AI-based image/video editing tools makes it even harder to detect whether a communication is authentic or not.›
LLM score 92 · 4 months ago
Christian Szegedy / ex xAI Cofounder
1248.
Super cool! It was a bit chatty, but it focused on getting across the main idea, its motivation, and the thinking behind the results, failed attempts, and fixes.›
LLM score 85 · 4 months ago
Noam Brown / OpenAI Research Scientist
1249.
Perhaps a 🌶️ take but I think the criticisms of @GoogleDeepMind's release are missing the point, and the real problem is that AI labs and safety orgs need to adapt to a world where intelligence is a function of inference compute.›
LLM score 92 · 4 months ago
Omar Khattab / MIT CSAIL Asst professor
1250.
Sadly despite all the chasing, 3/45 of the papers in my SAC batch for ACL ARR need one more emergency reviewer within the next 24 hours.›
LLM score 72 · 4 months ago
Sebastien Bubeck / OpenAI MTS
1251.
Honestly this time I do think it was pretty hard 😅 not breakthrough level yet but wow it's improving fast ...›
LLM score 82 · 4 months ago
Jerry Tworek / ex OpenAI VP of RL
1252.
Amazing artefact about explosive growth of AI industry is that every lab compares their latest unreleased model with competing public model and think they’re ahead›
LLM score 92 · 4 months ago
Jerry Tworek / ex OpenAI VP of RL
1253.
Researchers will literally regularise their models for years instead of doing second-order optimization
LLM score 82 · 4 months ago
Omar Khattab / MIT CSAIL Asst professor
1254.
RLMs are not sub-agents or the ability to iteratively retrieve context.›
LLM score 92 · 4 months ago
Jerry Tworek / ex OpenAI VP of RL
1255.
Agency is the opposite of hopelessness. In some ways, main goal of AI development is to reduce and cure human hopelessness
LLM score 75 · 4 months ago
Jason Phang / OpenAI Researcher
1256.
Our models took a solid swing at the https://t.co/PX4qq5T4Sm problems!›
LLM score 85 · 4 months ago
Clive Chan / OpenAI Hardware
1257.
1stproof didn't even last a week! these models really have gotten to research-level math these past couple months and it's going to be a crazy year ahead for math research.›
LLM score 85 · 4 months ago
Noam Brown / OpenAI Research Scientist
1258.
After the IMO results last summer, some dismissed it as “high school math.” We think our latest models will remove any doubt that STEM research is about to fundamentally change.›
LLM score 92 · 4 months ago
Jakub Pachocki / OpenAI Chief Scientist
1259.
Very excited about the "First Proof" challenge.›
LLM score 92 · 4 months ago
Skyler Miao / MiniMax Head of Engineering
1260.
Appreciate it. The gap is closing fast.›
LLM score 82 · 4 months ago