- Siyan Zhao / UCLA PhD
- Noam Brown / OpenAI Research Scientist · 1862. When we at @OpenAI released o1-preview a year ago, it would think for seconds. › LLM score 75 · 9 months ago
- Noam Brown / OpenAI Research Scientist · 1863. @isthisnessicary @emollick Some evals are harder to beat even with targeted data. › LLM score 70 · 9 months ago
- Noam Brown / OpenAI Research Scientist · 1864.
- Noam Brown / OpenAI Research Scientist
- Shuchao Bi / Meta Researcher
- Noam Brown / OpenAI Research Scientist · 1867. @sriramk Though by "long time" I mean 3+ years. › LLM score 20 · 9 months ago
- Noam Brown / OpenAI Research Scientist
- Noam Brown / OpenAI Research Scientist
- Noam Brown / OpenAI Research Scientist · 1870.
- Noam Brown / OpenAI Research Scientist · 1871. @emollick Also, those forecasts were for *any* AI system to get an IMO gold. › LLM score 70 · 9 months ago
- Noam Brown / OpenAI Research Scientist
- Noam Brown / OpenAI Research Scientist · 1873.
- Tri Dao / Chief Scientist at Together · 1874.
- Sheryl Hsu / OpenAI Researcher
- Alexander Wei / OpenAI Researcher · 1876. 3/ We’ve come a long way since last summer. › LLM score 70 · 10 months ago
- Sheryl Hsu / OpenAI Researcher
- Sheryl Hsu / OpenAI Researcher
- Juntang Zhuang / MTS at xAI (pre-training lead) · 1879. It’s extremely fun though tough to train the first natively multimodal model ever in xAI. › LLM score 92 · 10 months ago
- Geoffrey Hinton · 1880. A major cut to the funding of the National Science Foundation would be very bad for the future of the US. › LLM score 20 · 10 months ago
- Ted Sanders / OpenAI Researcher · 1881. a cool thing you get to see building AI products: › LLM score 75 · 10 months ago
- Ted Sanders / OpenAI Researcher · 1882. GPT-5 is here! it's way better at coding - not just in pointless evals, but real usage. › LLM score 70 · 10 months ago
- Jeremy Bernstein / Thinking Machines Researcher · 1883. I had wondered why there was no official Dion implementation by the authors... › LLM score 75 · 10 months ago
- Sally Zhu / Researcher at Flapping Airplanes · 1884.
- Tri Dao / Chief Scientist at Together · 1885. Hierarchical layout is super elegant. › LLM score 85 · 11 months ago
- Jason Wei / AI Researcher at Meta
- Jason Wei / AI Researcher at Meta · 1887. New blog post about asymmetry of verification and "verifier's law": https://t.co/bvS8HrX1jP › LLM score 80 · 11 months ago
- Jakub Pachocki / OpenAI Chief Scientist · 1888. I am extremely excited about the potential of chain-of-thought faithfulness & interpretability. › LLM score 80 · 11 months ago
- Lilian Weng / Thinking Machines Cofounder
- Tri Dao / Chief Scientist at Together · 1890. I played w it for 1h. Went through my usual prompts (math derivations, floating point optimizations, …). › LLM score 35 · 11 months ago