- Ilya Sutskever / Founder of SSI1261.The transformer is well named, as it transformed everythingLLM score 10 · over 3 years ago
- Ilya Sutskever / Founder of SSI1262.The S of our collective LSTM is quite shortLLM score 70 · over 3 years ago
- Ilya Sutskever / Founder of SSI1263.Would be cool to implement dasher using today’s LMs:›LLM score 20 · over 3 years ago
- Ilya Sutskever / Founder of SSI
- Ilya Sutskever / Founder of SSI1265.Deep learning is based on the audacious conjecture that biological neurons and artificial neurons are not that different.›LLM score 80 · over 3 years ago
- Ilya Sutskever / Founder of SSI1266.I wonder which insights from developmental psychology will apply to our future NNsLLM score 70 · over 3 years ago
- Ilya Sutskever / Founder of SSI1267.“prompting” is a transitory term that’s relevant only thanks to flaws in our modelsLLM score 80 · over 3 years ago
- Ilya Sutskever / Founder of SSI1268.it's not the worst for an AGI effort to contribute to a future plurality of AGIs all of whom love humanityLLM score 10 · over 3 years ago
- Ilya Sutskever / Founder of SSI1269.Human culture is critical civilization-enabling infrastructure.›LLM score 20 · over 3 years ago
- Ilya Sutskever / Founder of SSI
- Ilya Sutskever / Founder of SSI
- Ilya Sutskever / Founder of SSI1272.a near term effect of human level AGI could be not unlike that of massive scale very high skilled immigration ›LLM score 85 · over 3 years ago
- Ilya Sutskever / Founder of SSI1273.working towards AGI while not feeling the AGI is the real riskLLM score 20 · over 3 years ago
- Ilya Sutskever / Founder of SSI1274.People who understand math often think that simple logical reasoning applies to all areas of life.›LLM score 30 · over 3 years ago
- Ilya Sutskever / Founder of SSI1275.a big mistake in old school ML was the belief that the logarithm is bounded from aboveLLM score 75 · over 3 years ago
- Zihang Dai / xAI Cofounder1276.In NLP, the O(TD^2) linear projections in Transformer often cost more FLOPs than the O(T^2D) attention as commonly D > T.›LLM score 95 · over 5 years ago
- Alec Radford / OpenAI Researcher
- Alec Radford / OpenAI Researcher1278.
- Alec Radford / OpenAI Researcher
- Alec Radford / OpenAI Researcher