- Tri Dao / Chief Scientist at Together · 1891. @RaghuGanti @cHHillee Oh you’d want to use warp reduction if the whole row fits into 1 warp. · LLM score 80 · 11 months ago
- Tri Dao / Chief Scientist at Together · 1892. They’ve finally done it. They got rid of tokenizers! https://t.co/x4CXHdCw0W · LLM score 60 · 11 months ago
- Tri Dao / Chief Scientist at Together
- Tri Dao / Chief Scientist at Together · 1894.
- Tri Dao / Chief Scientist at Together · 1895. Albert articulates really well the trade offs between transformers and SSMs. · LLM score 80 · 11 months ago
- Tri Dao / Chief Scientist at Together
- Shuchao Bi / Meta Researcher
- Yang Chen / Nvidia Research Scientist · 1898. The first thing we did was to make sure the eval setup is correct! · LLM score 92 · 12 months ago
- Yang Chen / Nvidia Research Scientist · 1899. 📢We conduct a systematic study to demystify the synergy between SFT and RL for reasoning models. · LLM score 92 · 12 months ago
- Geoffrey Hinton
- Yang Chen / Nvidia Research Scientist · 1901. Does RL incentive reasoning capability over the starting SFT model? · LLM score 92 · 12 months ago
- Geoffrey Hinton · 1902. I just watched a great compilation of various people's views about what is coming: · LLM score 10 · about 1 year ago
- Ludwig Schmidt / Anthropic MTS
- Geoffrey Hinton · 1904. AGI is the most important and potentially dangerous technology of our time. · LLM score 70 · about 1 year ago
- Geoffrey Hinton
- Karan Dalal / Stanford SAIL Researcher
- Adams Wei Yu / DeepMind Research Scientist
- Vahid Kazemi / ex MTS at xAI
- Vahid Kazemi / ex MTS at xAI · 1909.
- Vahid Kazemi / ex MTS at xAI · 1910. Finally finished editing my video. · LLM score 85 · over 1 year ago
- Aditya Ramesh / OpenAI VP of Worldsim
- Julian Schrittwieser / Anthropic Researcher
- Geoffrey Hinton
- Geoffrey Hinton
- Geoffrey Hinton · 1915. @ESYudkowsky LeCun p(doom) = 0.001; Yudkowsky p(doom) = .999; · LLM score 70 · almost 2 years ago
- Geoffrey Hinton · 1916. @OrniasDMF I am not "blindly opposing AI". · LLM score 80 · about 2 years ago
- Julian Schrittwieser / Anthropic Researcher · 1917. In the normal Turing test, an investigator tries to distinguish a human and an AI. · LLM score 70 · over 2 years ago
- Ilya Sutskever / Founder of SSI · 1918.
- Ilya Sutskever / Founder of SSI · 1919. Practical alignment work is both critically important and immediately impactful. · LLM score 10 · over 2 years ago
- Ilya Sutskever / Founder of SSI