AI300

Behnam Neyshabur / Anthropic Researcher
1411.
We have been heads down but wanted to share a bit about what we are doing 🧵 https://t.co/ICZlLwZ3Fq
LLM score 75 · 4 months ago
Andrej Karpathy / AI researcher
1412.
One common issue with personalization in all LLMs is how distracting memory seems to be for the models.›
LLM score 92 · 4 months ago
Merve Noyan / Hugging Face ML Engineer
1413.
does anyone know of a good multimodal tool calling dataset to fine-tune models on?›
LLM score 92 · 4 months ago
Ben Burtenshaw / Hugging Face Researcher
1414.
Give coding agent a sharable workspace with persistent storage.›
LLM score 92 · 4 months ago
Merve Noyan / Hugging Face ML Engineer
1415.
fav papers from 3DV #2 SAIL-Recon does reconstruction for inputs above 100 images (video)›
LLM score 92 · 4 months ago
Sander Dieleman / DeepMind Research Scientist
1416.
This looks like it will be really good! Everything you've ever wanted to know about generalisation in diffusion models.›
LLM score 75 · 4 months ago
Leandro von Werra / Hugging Face Head of Research
1417.
Which LLM would be better: - today's best architecture trained on 2023's best data›
LLM score 82 · 4 months ago
Thomas Wolf / Hugging Face Cofounder
1418.
What are the best current techniques to have autoresearch behave better than (slightly improved) random search?›
LLM score 92 · 4 months ago
Sayak Paul / Hugging Face Researcher
1419.
Introducing the first discrete diffusion pipeline for text in Diffusers -- LLaDA2 by @TheInclusionAI 🔥›
LLM score 75 · 4 months ago
Demis Hassabis / CEO of DeepMind
1420.
Excited to partner with Agile Robots! Looking forward to seeing our models being deployed through Agile Robots incredible platform to help solve some of the most complex industrial challenges https://t.co/gsx6eGWwUI
LLM score 75 · 4 months ago
Naman Jain / Researcher at Cursor
1421.
Check out the tech report detailing our continued pre-training and RL setup behind Composer2! Also sharing some example CursorBench problems by popular demand https://t.co/Ki9dDLcFX7 https://t.co/8FX9UfwnMx
LLM score 85 · 4 months ago
Dan Fu / VP of Kernels at Together
1422.
As the COLM deadline creeps up - check out @NeelGuha's blog on how to write ML research papers! https://t.co/S3tyJspX2k
LLM score 92 · 4 months ago
Niklas Muennighoff / AI Researcher at Stanford
1423.
One gem from Composer paper is that RL improved both pass@k & pass@1.›
LLM score 92 · 4 months ago
Cameron Wolfe / Researcher at Netflix
1424.
I’m happy to share that I’ve been promoted to Staff Research Scientist at Netflix! It's hard to overstate how rewarding my journey at Netflix has been.›
LLM score 15 · 4 months ago
Damek Davis / Assoc. Professor Wharton Stats
1425.
Codex took less than a week to formalize these 41000 lines and find the error.›
LLM score 95 · 4 months ago
Jim Fan / NVIDIA Director of Robotics
1426.
This is pure nightmare fuel. Identity theft of the past would be nothing compared to what vibe agents can do.›
LLM score 92 · 4 months ago
Jonathan Ross / TPU Creator
1427.
A pilot operates $100M in equipment and nobody blinks.›
LLM score 82 · 4 months ago
Leandro von Werra / Hugging Face Head of Research
1428.
Auto-research for ML training models is all the rage now, but underrated is: auto-research for data!›
LLM score 85 · 4 months ago
Damek Davis / Assoc. Professor Wharton Stats
1429.
'Proved' something new and had codex formalize it lean.›
LLM score 92 · 4 months ago
Andrej Karpathy / AI researcher
1430.
Software horror: litellm PyPI supply chain attack.›
LLM score 97 · 4 months ago
Jerry Tworek / ex OpenAI VP of RL
1431.
Todays AIs have a taste of maximally bland and median appeal RLHF.›
LLM score 72 · 4 months ago
Merve Noyan / Hugging Face ML Engineer
1432.
my fav papers from 3DV because why not 🤝 MapAnything by Meta›
LLM score 92 · 4 months ago
Sayak Paul / Hugging Face Researcher
1433.
Last year, I got to collaborate on a number of serious projects at the intersection of Diffusers x optimization ⚡️›
LLM score 75 · 4 months ago
Boris Cherny / Creator of Claude Code
1434.
Little known fact, the Anthropic Labs team (the team I joined Anthropic to be on) shipped:›
LLM score 85 · 4 months ago
Cameron Wolfe / Researcher at Netflix
1435.
Interesting observation on instruction following behavior induced by preference tuning versus RLVR.›
LLM score 92 · 4 months ago
Yang Chen / Nvidia Research Scientist
1436.
We released Nemotron Cascade 2 30B A3B. What makes this release especially meaningful to me is that it reflects a 1.5-year journey at NVIDIA around one core idea: improving AI math reasoning through self-improvement at test time.›
LLM score 92 · 4 months ago
Jim Fan / NVIDIA Director of Robotics
1437.
Teleop is so 2025. Ever since we unveiled EgoScale and the dexterity scaling law, it's been clear to us and the ecosystem that behavior cloning directly from humans is the way to break the curse of teleop.›
LLM score 92 · 4 months ago
Merve Noyan / Hugging Face ML Engineer
1438.
"train rt-detrv2 on mobile-ui-design" is all it takes to train an object detector 🔥 ›
LLM score 92 · 4 months ago
Horace He / Thinking Machines Founding Engineer
1439.
A conversation I've had several times lately: Them: "LLMs are so good for learning whatever you want — there's never been a better time to learn!"›
LLM score 92 · 4 months ago
Jason Weston / Meta Research Scientist
1440.
🌐Unified Post-Training via On-Policy-Trained LM-as-RM🔧›
LLM score 92 · 4 months ago