- Xiang Fu / Researcher at Periodic Labs811.Context management is error management.›LLM score 92 · 2 months ago
- alphaXiv812.Following Google's Genie 3, Ant Group released an open source alternative: LingBot-World!›LLM score 82 · 2 months ago
- Andrej Karpathy / AI researcher813.Finding myself going back to RSS/Atom feeds a lot more recently.›LLM score 72 · 2 months ago
- Merve Noyan / Hugging Face ML Engineer
- Richard Song / DeepMind Research Scientist
- Andrej Karpathy / AI researcher816.nanochat can now train GPT-2 grade LLM for <<$100 (~$73, 3 hours on a single 8XH100 node).›LLM score 95 · 2 months ago
- John Carmack817.#PaperADay 15 2024: Mastering Diverse Domains through World Models›LLM score 85 · 2 months ago
- Eric Jang / ex VP of AI at 1X Robotics818.
- Thang Luong / DeepMind Principal Scientist819.There has been so much noise on AI for Math research.›LLM score 92 · 2 months ago
- Thomas Wolf / Hugging Face Cofounder820.who's doing serious ai-thropology research on @moltbook rn?›LLM score 72 · 2 months ago
- Lucas Beyer / Meta Researcher821.PSA: never, ever write "we use the same learning rate across all methods for fair comparison"›LLM score 85 · 2 months ago
- Andrew White / Edison Scientific Cofounder822.Another nice example showing how our agents can reproduce analysis and figures from papers.›LLM score 92 · 2 months ago
- Ben Burtenshaw / Hugging Face Researcher823.PSA: skills are not docs. skills are for the hardest problems an agent can solve.›LLM score 65 · 2 months ago
- alphaXiv
- Jason Weston / Meta Research Scientist825.📈Self-Improving Pretraining 📈 ✍️: https://t.co/GsvYMuMT4b›LLM score 92 · 2 months ago
- Cyris Kissane / Researcher at Flapping Airplanes826.Hot take: the creation of Adam put AI research back multiple years.›LLM score 65 · 2 months ago
- Zhaocheng Zhu / Nvidia Research Scientist827.ICML bidding observations: LLMs are emerging as a field separate from deep learning.›LLM score 80 · 2 months ago
- Hang Gao / ex MTS at xAI828.
- Sherwin Wu / OpenAI API, Head of Engineering
- Omar Khattab / MIT CSAIL Asst professor830.New updates for the RLM paper: We post-trained RLM-Qwen3-8B at tiny scale, the first natively recursive LM.›LLM score 85 · 2 months ago
- Alex Zhang / MIT CSAIL PhD831.We just updated the RLM paper with some new stuff.›LLM score 85 · 2 months ago
- alphaXiv832.2026 is the year of continual learning And we are getting some amazing papers towards that›LLM score 85 · 2 months ago
- Andrew Lampinen / Research Scientist at DeepMind833.
- Sayak Paul / Hugging Face Researcher
- Cameron Wolfe / Researcher at Netflix835.Trinity large is very sparse (400B-A13B, 256 experts w/ 4 active per token).›LLM score 85 · 2 months ago
- Roland Gavrilescu / ex MTS at xAI836.Models haven’t been post-trained on progressive disclosure yet.›LLM score 92 · 2 months ago
- John Carmack837.#PaperADay 13 2020: DREAM TO CONTROL: LEARNING BEHAVIORS BY LATENT IMAGINATION›LLM score 85 · 2 months ago
- Asher Spector / Cofounder of Flapping Airplanes
- Cyris Kissane / Researcher at Flapping Airplanes839.I use muon when making decisions to minimize my regret Adam and SGD just weren't fast enough.LLM score 70 · 2 months ago
- Kushal Thaman / Researcher at Flapping Airplanes840.I spent a bunch of time a year ago thinking about the data wall.›LLM score 85 · 2 months ago