AI300

Jerry Tworek / ex OpenAI VP of RL
1201.
Your company is defined by the dashboards you look at
LLM score 65 · 3 months ago
Yoshua Bengio
1202.
I went on @BBCNewsnight this week to discuss the recent developments in AI's capabilities, as well as the potential harms and concentration of power they could entail.›
LLM score 85 · 3 months ago
Jonathan Ross / TPU Creator
1203.
In two years, nobody serious will call AI errors hallucinations.›
LLM score 92 · 3 months ago
Merve Noyan / Hugging Face ML Engineer
1204.
we ship the home for your coding and personal agents on hugging face(blog)
LLM score 95 · 3 months ago
Cameron Wolfe / Researcher at Netflix
1205.
Currently doing a write up on scaling laws for RL.›
LLM score 92 · 3 months ago
Sayak Paul / Hugging Face Researcher
1206.
We worked in close collaboration w/ @PyTorch & TorchAO teams to make offloading work with fancy quants 🔥›
LLM score 92 · 3 months ago
Ben Burtenshaw / Hugging Face Researcher
1207.
here's a hands on guide to setup multi-agent autoresearch by @karpathy.›
LLM score 92 · 3 months ago
Tim Dettmers / Research Scientist at Ai2
1208.
So cool to see that open-source, with open experimentation (and with the help of someone posting blog posts about their personal research), can yield a very robust method for MoE balancing.›
LLM score 92 · 3 months ago
Ben Burtenshaw / Hugging Face Researcher
1209.
yea tokenizers! super insightful thread on how and why a model's tokenizers can be changed without pretaining.›
LLM score 82 · 3 months ago
Eric Jang / ex VP of AI at 1X Robotics
1210.
I enjoyed the Jensen + @dwarkesh_sp podcast.›
LLM score 75 · 3 months ago
Albert Gu / Cartesia Chief Scientist
1211.
a dynamical systems point of view, which looks like an SSM applied along the residual stream, informs more principled ways to scale looped architectures https://t.co/Ry09zXCXY1
LLM score 92 · 3 months ago
Ronak Malde / DeepMind Researcher
1212.
Ultra long horizon benchmarks are the next frontier,›
LLM score 82 · 3 months ago
Alex Zhang / MIT CSAIL PhD
1213.
We need more super hard tasks to properly eval our models!›
LLM score 92 · 3 months ago
Sherwin Wu / OpenAI API, Head of Engineering
1214.
Of all the launches in Codex today, this one was the most game-changing for me – computer use, but without taking over your whole screen!›
LLM score 92 · 3 months ago
Hyung Won Chung / Research Scientist at Meta
1215.
Health care was built for a world where humans were the only interpreters of health data.›
LLM score 92 · 3 months ago
Chelsea Finn / Physical Intelligence Cofounder
1216.
LLM post-training used to mean fine-tuning to a downstream task›
LLM score 92 · 3 months ago
Karol Hausman / Physical Intelligence Cofounder
1217.
Today we're releasing our newest model π0.7: https://t.co/lk5H3Yv7Ru›
LLM score 92 · 3 months ago
Boris Cherny / Creator of Claude Code
1218.
Dogfooding Opus 4.7 the last few weeks, I've been feeling incredibly productive.›
LLM score 92 · 3 months ago
Ben Burtenshaw / Hugging Face Researcher
1219.
crucial new benchmark dropped for long reasoning we need this because even frontier models still can't think long even with 1M context.›
LLM score 92 · 3 months ago
Tri Dao / Chief Scientist at Together
1220.
The dynamical system view gives very clean conditions for looped transformer to be stable https://t.co/md8fispK0Q
LLM score 92 · 3 months ago
Merve Noyan / Hugging Face ML Engineer
1221.
new open-source Bonsai models are out 🔥 > ternary weights in 8B (1.75 GB), 4B (0.86 GB), and 1.7B (0.37 GB)›
LLM score 25 · 3 months ago
Sergey Levine / Physical Intelligence Cofounder
1222.
We finished evaluating π0.7, our new model at Physical Intelligence.›
LLM score 95 · 3 months ago
Andrew White / Edison Scientific Cofounder
1223.
Been using Opus 4.7 for a bit on vulnerability testing (mostly because of the unrelated mythos hype).›
LLM score 92 · 3 months ago
Jason Wei / AI Researcher at Meta
1224.
Beautifully written piece by @FAbnousi about how AI for health might look like in the future›
LLM score 85 · 3 months ago
Boris Cherny / Creator of Claude Code
1225.
Opus 4.7 feels more intelligent, agentic, and precise than 4.6.›
LLM score 92 · 3 months ago
Sander Dieleman / DeepMind Research Scientist
1226.
A while back we had the "rotation trick" to improve VQ bottlenecks (https://t.co/w4XVeN9L0J), now we have DiVeQ, which seems to improve codebook coverage quite significantly.›
LLM score 92 · 3 months ago
Sayak Paul / Hugging Face Researcher
1227.
Working at Hugging Face over the past 3.5+ years has allowed me to identify what technical areas truly interest me!›
LLM score 65 · 3 months ago
Boris Cherny / Creator of Claude Code
1228.
Opus 4.7 is in Claude Code today.›
LLM score 92 · 3 months ago
Ben Burtenshaw / Hugging Face Researcher
1229.
port any transformers model to mlx https://t.co/FtS6gnpVID
LLM score 92 · 3 months ago
Sander Dieleman / DeepMind Research Scientist
1230.
FlexTok/Semanticist provided an elegant recipe to learn semantically coarse-to-fine sequence representations of images.›
LLM score 92 · 3 months ago