- Ethan Shen / Ai2 Researcher 1531. Finally, we conduct an analysis of variance across SWE-Bench runs. › LLM score 95 · 4 months ago
- Ethan Shen / Ai2 Researcher
- Ethan Shen / Ai2 Researcher
- Niklas Muennighoff / AI Researcher at Stanford 1534. Community-built open benchmarks work really well, e.g., Terminal-Bench, HLE, MMTEB. › LLM score 80 · 4 months ago
- John Carmack 1535. #PaperADay 11 Discovering state-of-the-art reinforcement learning algorithms › LLM score 80 · 4 months ago
- Noam Brown / OpenAI Research Scientist 1536. Had to cut this one for space: 2019: AI can't create art—creativity is uniquely human › LLM score 20 · 4 months ago
- Andrej Karpathy / AI researcher 1537. @0xabi96 It feels like I’m cheating. › LLM score 70 · 4 months ago
- Andrej Karpathy / AI researcher 1538. @ChiragLathiya The nearest neighbor really is some kind of a junior engineer. › LLM score 80 · 4 months ago
- Andrej Karpathy / AI researcher 1539. @jeremytwei Love the word "comprehension debt", haven't encountered it so far, it's very accurate. › LLM score 30 · 4 months ago
- Andrej Karpathy / AI researcher
- Andrej Karpathy / AI researcher 1541. A few random notes from claude coding quite a bit last few weeks. › LLM score 20 · 4 months ago
- Noam Brown / OpenAI Research Scientist 1542. 1987: AI can't win at chess—planning is uniquely human › LLM score 70 · 4 months ago
- Yoshua Bengio 1543.
- Jonathan Ross / TPU Creator 1544. Success in the Information Age was about being able to answer questions. › LLM score 20 · 4 months ago
- Ben Burtenshaw / Hugging Face Researcher 1545. this is a blog post on claude + llama.cpp https://t.co/yej6WsNnQA › LLM score 20 · 4 months ago
- Yoshua Bengio
- Ben Burtenshaw / Hugging Face Researcher
- Ronak Malde / DeepMind Researcher 1548.
- Clive Chan / OpenAI Hardware
- Cameron Wolfe / Researcher at Netflix 1550. Continual learning is being positioned as a prerequisite for AGI (i.e., general systems must be adaptable). › LLM score 70 · 4 months ago
- Geoffrey Hinton 1551. I just watched a really great conversation about the future of AI. › LLM score 30 · 4 months ago
- Ben Burtenshaw / Hugging Face Researcher 1552. The only thing I don't get is why it has to be a Mac mini. › LLM score 20 · 4 months ago
- Chelsea Finn / Physical Intelligence Cofounder 1553. Video models serve as a good pretrained backbone for robot policies. › LLM score 70 · 4 months ago
- Sander Dieleman / DeepMind Research Scientist
- Shane Legg / Chief AGI Scientist at DeepMind
- Sergey Levine / Physical Intelligence Cofounder
- John Carmack 1557. #PaperADay 10 LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics › LLM score 80 · 4 months ago
- Boris Cherny / Creator of Claude Code 1558. @ariccio Use it for when the main agent doesn't need to see the work that a skill did, only its output. › LLM score 40 · 4 months ago
- Michael Elabd / DeepMind Researcher 1559. Honestly, I think memory is the biggest blocker to continual learning right now. › LLM score 85 · 4 months ago
- Andrew Lampinen / Research Scientist at DeepMind