- Ben Burtenshaw / Hugging Face Researcher481.this > llama-server > run my vanilla python harness›LLM score 75 · about 1 month ago
- Igor Babuschkin / Cofounder of xAI482.Building great AI products requires excellence in both creativity and technical execution.›LLM score 85 · about 1 month ago
- Christian Szegedy / ex xAI Cofounder483.
- Jack Clark / Anthropic Cofounder484.
- Cameron Wolfe / Researcher at Netflix485.Really interesting reward modeling approach that works well for small LLM judges: https://t.co/7Wqd9tiUsN›LLM score 85 · about 1 month ago
- Simon Zhai / ex MTS at xAI
- Andrew White / Edison Scientific Cofounder487.After a few years of procrastination, I've updated my textbook.›LLM score 92 · about 1 month ago
- Boris Cherny / Creator of Claude Code488.Introducing: built-in git worktree support for Claude Code ›LLM score 85 · about 1 month ago
- Andrej Karpathy / AI researcher489.Bought a new Mac mini to properly tinker with claws over the weekend.›LLM score 72 · about 1 month ago
- Boris Cherny / Creator of Claude Code490.We've been working on this for a while -- it's impressive (and scary) to see the kinds of security issues it has identified.›LLM score 92 · about 1 month ago
- Boris Cherny / Creator of Claude Code491.A massive ship from the Claude Code Desktop team.›LLM score 75 · about 1 month ago
- Sander Dieleman / DeepMind Research Scientist492.Reports of the extinction of continuous language diffusion have been greatly exaggerated😮›LLM score 92 · about 1 month ago
- Kevin Murphy / DeepMind Research Scientist493.I absolutely love using #ClaudeCode (thanks @bcherny and team!) for noodling around with ideas.›LLM score 85 · about 1 month ago
- Andrew White / Edison Scientific Cofounder494.
- Jim Fan / NVIDIA Director of Robotics495.
- alphaXiv
- Ben Burtenshaw / Hugging Face Researcher497.nano harness is a code first agent harness in 223 lines of code.›LLM score 92 · about 1 month ago
- Sander Dieleman / DeepMind Research Scientist
- Ben Burtenshaw / Hugging Face Researcher499.codex --yolo "fine tune LFM2.5-1.2B-Instruct for $2"›LLM score 92 · about 1 month ago
- Lucas Beyer / Meta Researcher500.So i asked Opus to benchmark multiple versions of a function for me.›LLM score 82 · about 1 month ago
- Lewis Tunstall / Hugging Face Researcher
- Eric Jang / ex VP of AI at 1X Robotics502.Google is currently missing an opportunity to have "Review with Gemini" for legal documents in Gmail.›LLM score 85 · about 1 month ago
- Demis Hassabis / CEO of DeepMind503.This is incredible btw - using Gemini 3.1 as a city builder.›LLM score 85 · about 1 month ago
- alphaXiv504.MaxRL is a slick one line change to GRPO that optimizes a maximum likelihood objective instead of expected reward›LLM score 92 · about 1 month ago
- Emmanuel Ameisen / Anthropic Interpretability Researcher505.Late last year, we found a precise counting mechanism in Claude.›LLM score 92 · about 1 month ago
- Jack Clark / Anthropic Cofounder506.
- alphaXiv507.The tools we use are just as critical as the algorithms we write›LLM score 75 · about 1 month ago
- alphaXiv508.📈 now trending on alphaXiv when using non-reasoning model, simply repeating your prompt improves performance.›LLM score 92 · about 1 month ago
- Ben Burtenshaw / Hugging Face Researcher509.You can finetune AI models with Unsloth + Hugging Face, and right now it’s free!›LLM score 85 · about 1 month ago
- Jeff Dean / Chief Scientist at DeepMind510.Today, we’re continuing to push the boundaries of AI with our release of Gemini 3.1 Pro.›LLM score 15 · about 1 month ago