- Zhaocheng Zhu / Nvidia Research Scientist1501.ICML bidding observations: LLMs are emerging as a field separate from deep learning.›LLM score 80 · 4 months ago
- Hang Gao / ex MTS at xAI1502.
- Sherwin Wu / OpenAI API, Head of Engineering
- Omar Khattab / MIT CSAIL Asst professor1504.New updates for the RLM paper: We post-trained RLM-Qwen3-8B at tiny scale, the first natively recursive LM.›LLM score 85 · 4 months ago
- Alex Zhang / MIT CSAIL PhD1505.We just updated the RLM paper with some new stuff.›LLM score 85 · 4 months ago
- alphaXiv1506.2026 is the year of continual learning And we are getting some amazing papers towards that›LLM score 85 · 4 months ago
- Andrew Lampinen / Research Scientist at DeepMind1507.
- Sayak Paul / Hugging Face Researcher
- Cameron Wolfe / Researcher at Netflix1509.Trinity large is very sparse (400B-A13B, 256 experts w/ 4 active per token).›LLM score 85 · 4 months ago
- Roland Gavrilescu / ex MTS at xAI1510.Models haven’t been post-trained on progressive disclosure yet.›LLM score 92 · 4 months ago
- John Carmack1511.#PaperADay 13 2020: DREAM TO CONTROL: LEARNING BEHAVIORS BY LATENT IMAGINATION›LLM score 85 · 4 months ago
- Asher Spector / Cofounder of Flapping Airplanes
- Cyris Kissane / Researcher at Flapping Airplanes1513.I use muon when making decisions to minimize my regret Adam and SGD just weren't fast enough.LLM score 70 · 4 months ago
- Kushal Thaman / Researcher at Flapping Airplanes1514.I spent a bunch of time a year ago thinking about the data wall.›LLM score 85 · 4 months ago
- Mehtaab Sawhney / OpenAI for Science1515.I've recently gone on leave from Columbia to join OpenAI, working on OpenAI for Science.›LLM score 75 · 4 months ago
- Ben Burtenshaw / Hugging Face Researcher1516.We got Claude to teach open models how to write CUDA kernels.›LLM score 85 · 4 months ago
- alphaXiv1517.BIG new idea in interpretability called Patterning›LLM score 75 · 4 months ago
- John Carmack1518.#PaperADay 12 2019: Learning Latent Dynamics for Planning from Pixels (PlaNet)›LLM score 85 · 4 months ago
- Boris Cherny / Creator of Claude Code
- Lucas Beyer / Meta Researcher
- alphaXiv1521."LLM-in-Sandbox Elicits General Agentic Intelligence"›LLM score 85 · 4 months ago
- Ethan Shen / Ai2 Researcher1522.
- Lucas Beyer / Meta Researcher
- Tim Dettmers / Research Scientist at Ai21524.his work was mostly the genius of Ethan Shen.›LLM score 70 · 4 months ago
- Tim Dettmers / Research Scientist at Ai2
- Tim Dettmers / Research Scientist at Ai2
- Tim Dettmers / Research Scientist at Ai21527.
- Cameron Wolfe / Researcher at Netflix1528.Continual learning is a popular topic in LLM research, but it might not be as far away as we think.›LLM score 65 · 4 months ago
- Ethan Shen / Ai2 Researcher1529.Finally, we conduct an analysis of variance across SWE-Bench runs.›LLM score 95 · 4 months ago
- Ethan Shen / Ai2 Researcher