AI300

Jeff Dean / Chief Scientist at DeepMind
1861.
Many have really enjoyed our recent Nano Banana (+Pro) generative models for images and Veo models for video.
LLM score 75 · 5 months ago
Andrew White / Edison Scientific Cofounder
1862.
After a surprisingly long amount of work, our literature agent can finally read figures and tables from >150M papers and patents.
LLM score 95 · 5 months ago
Sander Dieleman / DeepMind Research Scientist
1863.
Several new methods to shape the latent distributions of autoencoders have popped up recently.
LLM score 95 · 5 months ago
Andrew Lampinen / Research Scientist at DeepMind
1864.
What is the relationship between memorization and generalization in AI? Is there a fundamental tradeoff? In a new blog post I’ve reviewed some of the evolving perspectives on memorization & generalization in machine learning, from classic perspectives through LLMs.
LLM score 92 · 5 months ago
alphaXiv
1865.
GLM-5: A new SoTA open-weights model that BEATS Gemini-3-Pro!?
LLM score 85 · 5 months ago
Sander Dieleman / DeepMind Research Scientist
1866.
Neat idea: jointly diffuse pixels and DINO features with separate noise levels.
LLM score 92 · 5 months ago
Merve Noyan / Hugging Face ML Engineer
1867.
you can just speak to language models with a microphone (natively, no TTS layer, runs on-device)
LLM score 85 · 5 months ago
Jie Tang / Z.ai Cofounder
1868.
We just uploaded our GLM-5 tech report to arXiv.
LLM score 92 · 5 months ago
Alex Zhang / MIT CSAIL PhD
1869.
This is very exciting to see!! I'm hoping they also eventually move sub-agents to be integrated into their code execution sandbox as functions, and finally I will no longer have to compare CC to RLMs :) https://t.co/3hL2pXjqVL
LLM score 72 · 5 months ago
Sergey Levine / Physical Intelligence Cofounder
1870.
Check out Noriaki's thread about a way to get VLAs to run in real time with a fast "edge adapter"! https://t.co/HUzoSsrXzn
LLM score 75 · 5 months ago
Fei-Fei Li
1871.
Order matters in diffusion. Check out our latest work! https://t.co/EL1CQM2xHM
LLM score 92 · 5 months ago
Adams Wei Yu / DeepMind Research Scientist
1872.
It is consistent with what was shown last week on Codeforces Elo.
LLM score 75 · 5 months ago
Damek Davis / Assoc. Professor Wharton Stats
1873.
Yes, compared to tools available in April, Codex is very good at Lean.
LLM score 82 · 5 months ago
Leandro von Werra / Hugging Face Head of Research
1874.
Asked the web instead of the code agent to make a plot of some data and it came up with a creative use of the website screenshot tool: https://t.co/AhOnpTwBQp
LLM score 82 · 5 months ago
Chelsea Finn / Physical Intelligence Cofounder
1875.
Video gen models make pretty videos, but lack physical accuracy
LLM score 92 · 5 months ago
Lucas Beyer / Meta Researcher
1876.
So, when I have GitHub Copilot point out a typo in my PR, and I'm lazy, I click "Commit suggestion" and it makes a commit.
LLM score 82 · 5 months ago
Boris Cherny / Creator of Claude Code
1877.
Sonnet 4.6 is now live in Claude Code.
LLM score 85 · 5 months ago
Boris Cherny / Creator of Claude Code
1878.
Loving Sonnet 4.6 in Cowork -- great balance of capability, speed, and token efficiency https://t.co/ZQErYmBgtE
LLM score 65 · 5 months ago
Lucas Beyer / Meta Researcher
1879.
I usually look at which benches the small model surpasses its previous big brother on; if it's only a few, I think it gives a hint as to what they focus on.
LLM score 82 · 5 months ago
Eric Jang / ex VP of AI at 1X Robotics
1880.
Incredibly impressive. We are in the early days of seeing how much there is yet to be unlocked by the humanoid form factor https://t.co/4dXbAPD48x
LLM score 25 · 5 months ago
Lewis Tunstall / Hugging Face Researcher
1881.
Haha, I tried the car wash riddle on QED-Nano and not only did it get it right, but it made the creative suggestion to avoid driving altogether!
LLM score 82 · 5 months ago
John Carmack
1882.
The glory work of GPU scheduling is in the frontier data centers with hundreds of thousands of GPUs, but a lot of research work is done with single GPU jobs on modest clusters, and the scheduling leaves much to be desired.
LLM score 85 · 5 months ago
alphaXiv
1883.
Making LLMs truly learn from their experience: "Experiential Reinforcement Learning (ERL)"
LLM score 92 · 5 months ago
Cameron Wolfe / Researcher at Netflix
1884.
Rubric-based RL is one of the most active topics in AI research because it extends the benefits of large-scale RL training to non-verifiable domains.
LLM score 95 · 5 months ago
Ben Burtenshaw / Hugging Face Researcher
1885.
Bold small-model release from @Cohere_Labs, adding region-focused versions of their Tiny Aya model.
LLM score 85 · 5 months ago
Wenting Zhao / Alibaba Qwen Researcher
1886.
For Qwen-3.5, we took the training recipe from Qwen3-Coder-Next and scaled up the model parameters, which results in a much stronger coding agent.
LLM score 92 · 5 months ago
Andrew White / Edison Scientific Cofounder
1887.
I've been working on and off over the last year to update the book.
LLM score 85 · 5 months ago
Sergey Levine / Physical Intelligence Cofounder
1888.
VLAs can enable vehicles to better handle complex edge cases: a VLM can "think through" a complex interaction, deduce a common sense behavior, and then a VLA can carry that out to maintain safe(r) behavior even in unusual situations.
LLM score 92 · 5 months ago
Sander Dieleman / DeepMind Research Scientist
1889.
This post provides a comprehensive overview of the intersection of generative modelling and representation learning (think REPA, VA-VAE, RAE).
LLM score 92 · 5 months ago
Lucas Beyer / Meta Researcher
1890.
Here's another prompt I quite like using, I call it `lbreview`, because while there are tons of "review pls" prompts, this one is mine, and I like it!
LLM score 85 · 5 months ago