AI300

Ben Burtenshaw / Hugging Face Researcher
451.
kinda can't believe where post-training and RL are right now.
LLM score 65 · about 1 month ago
Damek Davis / Assoc. Professor Wharton Stats
452.
Dima spent way too much time on these over the last couple months.
LLM score 25 · about 1 month ago
Zixuan Li / Lead Z.ai
453.
Open weights + open tooling is something we deeply believe in.
LLM score 35 · about 1 month ago
Thomas Wolf / Hugging Face Cofounder
454.
TIL you can use open-source models in codex https://t.co/k8EXNi7x5t
LLM score 45 · about 1 month ago
Sayak Paul / Hugging Face Researcher
455.
A quick `hf sync` a day keeps the agents at bay 😉
LLM score 72 · about 1 month ago
Jie Tang / Z.ai Cofounder
456.
(Claude, GPT, GLM) GLM-5.2 Tops Artificial Analysis as the #1 Open-Source Model, Ranking Top 3 Globally
LLM score 35 · about 1 month ago
Zixuan Li / Lead Z.ai
457.
Finally, Artificial Analysis Intelligence Index concludes the GLM-5.2 release.
LLM score 35 · about 1 month ago
Yingru Li / xAI Researcher
458.
Scaling RL bigrun on this one. Excited to keep pushing the frontier of intelligence with @cursor_ai https://t.co/HSjZFAcYcc
LLM score 25 · about 1 month ago
Aidan McLaughlin / OpenAI Research Scientist
459.
"will prompting skill differences be a thing with asi" is a way more fun thought experiment than it seems
LLM score 25 · about 1 month ago
Sherwin Wu / OpenAI API, Head of Engineering
460.
Jason going founder mode at OpenAI – and pulling us all along! https://t.co/QEflKKgJer
LLM score 15 · about 1 month ago
Zixuan Li / Lead Z.ai
461.
Gemini Enterprise Agent Platform (formerly Vertex AI) now supports one-click deployment of GLM-5.2-FP8.
LLM score 25 · about 1 month ago
Jie Tang / Z.ai Cofounder
462.
We're introducing GLM-5.2, our latest flagship model for long-horizon tasks.
LLM score 35 · about 1 month ago
Zixuan Li / Lead Z.ai
463.
GLM-5.2 is now available in @NotionHQ.
LLM score 15 · about 1 month ago
Zixuan Li / Lead Z.ai
464.
Curious to hear if this is the kind of content you'd like to see more of.
LLM score 25 · about 1 month ago
Ben Burtenshaw / Hugging Face Researcher
465.
we are so back fam. - MIT license - opus 4.8 level
LLM score 35 · about 1 month ago
Zixuan Li / Lead Z.ai
466.
Haven't spotlighted https://t.co/0kDNwJvxtr in the main post for a while, but the general chat experience (including role-play) is something we never overlook.
LLM score 35 · about 1 month ago
Niklas Muennighoff / AI Researcher at Stanford
467.
ad astra💫 https://t.co/7CxaVLwOC2
LLM score 25 · about 1 month ago
Cameron Wolfe / Researcher at Netflix
468.
Great day to emphasize that Composer 2.5 (from Cursor) is an extremely underrated model.
LLM score 65 · about 1 month ago
Sholto Douglas / Researcher at Anthropic
469.
Thais is a good friend and absolute killer - go help her end slop https://t.co/b60ieXK3OQ
LLM score 25 · about 1 month ago
Merve Noyan / Hugging Face ML Engineer
470.
this is almost 1-1 same as my local setup! I use llama server + Pi + Gemma-4 (interchangeably with Qwen3.6) https://t.co/iU7o8HPFxR
LLM score 65 · about 1 month ago
Ben Burtenshaw / Hugging Face Researcher
471.
ICYMI, your agents can one-shot massive data-pipelines with sandboxes and petabytes of storage.
LLM score 72 · about 1 month ago
Merve Noyan / Hugging Face ML Engineer
472.
this is going to be an eventful week in open-source AI, tune in 👀
LLM score 15 · about 1 month ago
Merve Noyan / Hugging Face ML Engineer
473.
my first finding with this pipeline is that it works well but (rarely) there's a false positive tendency, I don't pass bboxes as tokens but rather overlaid masks/bboxes to judges
LLM score 72 · about 1 month ago
Cameron Wolfe / Researcher at Netflix
474.
There are three primary grading strategies we can use when evaluating an LLM agent…
LLM score 72 · about 1 month ago
Alex Zhang / MIT CSAIL PhD
475.
Good harness designs can get around extreme token costs when information is structured.
LLM score 72 · about 1 month ago
Jeff Dean / Chief Scientist at DeepMind
476.
A good essay by @pgasawa and @profjoeyg on a more nuanced view of AI advances.
LLM score 65 · about 1 month ago
Dan Fu / VP of Kernels at Together
477.
This is a crazy demo, congrats @krandiash @_albertgu @cartesia on the launch!! https://t.co/U8WClKOS3o
LLM score 25 · about 1 month ago
John Carmack
478.
In the tradition of @fabynou 's game engine books, Bas Smits (on X?) has made the comprehensive book on the Commander Keen games.
LLM score 15 · about 1 month ago
Albert Gu / Cartesia Chief Scientist
479.
Within the span of a week, we launched streaming TTS (text-to-speech) and STT (speech-to-text) models that topped the leaderboards.
LLM score 35 · about 1 month ago
Jonas Geiping / AI Researcher in Tübingen
480.
Make your models safer with this one weird trick:
LLM score 72 · about 1 month ago