- Omar Khattab / MIT CSAIL Asst professor1.New updates for the RLM paper: We post-trained RLM-Qwen3-8B at tiny scale, the first natively recursive LM.›LLM score 85 · about 12 hours ago
- Alex Zhang / MIT CSAIL PhD2.We just updated the RLM paper with some new stuff.›LLM score 85 · about 13 hours ago
- alphaXiv3.2026 is the year of continual learning And we are getting some amazing papers towards that›LLM score 85 · about 15 hours ago
- Andrew Lampinen / Research Scientist at DeepMind4.New paper studying how language models representations of things like factuality evolve over a conversation.›LLM score 85 · about 16 hours ago
- Cameron Wolfe / Researcher at Netflix5.Trinity large is very sparse (400B-A13B, 256 experts w/ 4 active per token).›LLM score 85 · about 21 hours ago
- Ben Burtenshaw / Hugging Face Researcher6.We got Claude to teach open models how to write CUDA kernels.›LLM score 85 · 1 day ago
- alphaXiv7.BIG new idea in interpretability called Patterning›LLM score 75 · 1 day ago
- Boris Cherny / Creator of Claude Code
- alphaXiv9."LLM-in-Sandbox Elicits General Agentic Intelligence"›LLM score 85 · 2 days ago
- Ethan Shen / Ai2 Researcher
- Tim Dettmers / Research Scientist at Ai211.his work was mostly the genius of Ethan Shen.›LLM score 70 · 3 days ago
- Tim Dettmers / Research Scientist at Ai2
- Tim Dettmers / Research Scientist at Ai2
- Tim Dettmers / Research Scientist at Ai2
- Cameron Wolfe / Researcher at Netflix15.Continual learning is a popular topic in LLM research, but it might not be as far away as we think.›LLM score 65 · 3 days ago
- Ethan Shen / Ai2 Researcher16.Finally, we conduct an analysis of variance across SWE-Bench runs.›LLM score 95 · 3 days ago
- Ethan Shen / Ai2 Researcher
- Ethan Shen / Ai2 Researcher
- Niklas Muennighoff / AI Researcher at Stanford19.Community-built open benchmarks work really well, e.g., Terminal-Bench, HLE, MMTEB.›LLM score 80 · 3 days ago
- Noam Brown / OpenAI Research Scientist20.Had to cut this one for space: 2019: AI can't create art—creativity is uniquely humanLLM score 20 · 3 days ago
- Andrej Karpathy / AI researcher21.@0xabi96 It feels like I’m cheating.›LLM score 70 · 3 days ago
- Andrej Karpathy / AI researcher22.@ChiragLathiya The nearest neighbor really is some kind of a junior engineer.›LLM score 80 · 3 days ago
- Andrej Karpathy / AI researcher23.@jeremytwei Love the word "comprehension debt", haven't encountered it so far, it's very accurate.›LLM score 30 · 3 days ago
- Andrej Karpathy / AI researcher
- Andrej Karpathy / AI researcher25.A few random notes from claude coding quite a bit last few weeks.›LLM score 20 · 3 days ago
- Noam Brown / OpenAI Research Scientist26.1987: AI can't win at chess—planning is uniquely human›LLM score 70 · 3 days ago
- Yoshua Bengio
- Jonathan Ross / TPU Creator28.Success in the Information Age was about being able to answer questions.›LLM score 20 · 3 days ago
- Ben Burtenshaw / Hugging Face Researcher29.this is a blog post on claude + llama.cpp https://t.co/yej6WsNnQALLM score 20 · 4 days ago
- Yoshua Bengio