- Jason Weston / Meta Research Scientist631.🏋️Thinking Mid-training: RL of Interleaved Reasoning🎗️›LLM score 92 · about 2 months ago
- Jie Tang / Z.ai Cofounder632.GLM 5.1 is coming https://t.co/rSnHYW95Dk.›LLM score 85 · about 2 months ago
- Zixuan Li / Lead Z.ai633.If you’ve encountered garbled output like this while using GLM-5 or GLM-5.1 on our official service, the issue is now resolved.›LLM score 92 · about 2 months ago
- Dan Fu / VP of Kernels at Together634.Super cool! Small models are super undertrained relative to their test-time scaling properties.›LLM score 75 · about 2 months ago
- Boris Cherny / Creator of Claude Code635.Mythos is very powerful, and should feel terrifying.›LLM score 85 · about 2 months ago
- Ben Burtenshaw / Hugging Face Researcher636.question: if I want to optimize kernels for apple silicon, how do I benchmark them with macos on apple silicon (as-a-service)?›LLM score 92 · about 2 months ago
- Thomas Wolf / Hugging Face Cofounder637.Releasing one of our *largest* robotics project yet in the open›LLM score 92 · about 2 months ago
- Sander Dieleman / DeepMind Research Scientist
- Fei-Fei Li639.Making improvements one step at a time for Marble.›LLM score 85 · about 2 months ago
- Tim Dettmers / Research Scientist at Ai2640.I was going crazy because I could not replicate TurboQuant.›LLM score 92 · about 2 months ago
- Merve Noyan / Hugging Face ML Engineer641.no shade but @huggingface YT channel has so many gem technical content & podcasts that are watched more 😄 https://t.co/oDzrhHaePgLLM score 75 · about 2 months ago
- Eric Jang / ex VP of AI at 1X Robotics
- Mehtaab Sawhney / OpenAI for Science
- Cameron Wolfe / Researcher at Netflix644.Really wonderful paper that perfectly demonstrates a practice people should care more about: setting strong baselines.›LLM score 92 · about 2 months ago
- John Carmack645.So many judging tasks could be improved by aggregating partial orderings, and in the limit, just ordering pairs.›LLM score 72 · about 2 months ago
- Thomas Wolf / Hugging Face Cofounder646.We’re very excited to deepen our work with the @SAIRfoundation co-founded by Terence Tao.›LLM score 75 · about 2 months ago
- Sander Dieleman / DeepMind Research Scientist
- Lewis Tunstall / Hugging Face Researcher648.Terence Tao's SAIR foundation is doing some really cool work on enabling AI4Maths to be open and collaborative›LLM score 82 · about 2 months ago
- Tri Dao / Chief Scientist at Together649.Fast muon optimizer coming to consumer cards.›LLM score 92 · about 2 months ago
- Karol Hausman / Physical Intelligence Cofounder
- Merve Noyan / Hugging Face ML Engineer651.tip: Gemma 4 exposes a thought channel for reasoning›LLM score 92 · about 2 months ago
- Damek Davis / Assoc. Professor Wharton Stats652.
- Sherwin Wu / OpenAI API, Head of Engineering653.Definitely want you building on top of our products! https://t.co/SX5nkJmSfVLLM score 85 · about 2 months ago
- Eric Jang / ex VP of AI at 1X Robotics654.Pretty fun puzzle generator for alignment and safety researchers https://t.co/pclOnYRVFTLLM score 85 · about 2 months ago
- Andrej Karpathy / AI researcher655.Farzapedia, personal wikipedia of Farza, good example following my Wiki LLM tweet.›LLM score 92 · about 2 months ago
- Andrej Karpathy / AI researcher
- Cameron Wolfe / Researcher at Netflix
- Damek Davis / Assoc. Professor Wharton Stats658.I added this problem to the Optimization Constants in Mathematics repository that I'm maintaining with @PI010101 and Terry Tao.›LLM score 92 · about 2 months ago
- Andrej Karpathy / AI researcher659.Wow, this tweet went very viral! I wanted share a possibly slightly improved version of the tweet in an "idea file".›LLM score 85 · about 2 months ago
- Ben Burtenshaw / Hugging Face Researcher660.PSA: models on hugging face are free to use.›LLM score 92 · about 2 months ago