235 B
235 B
- 11:40 quick capture: Beyond Self-Attention: How a Small Language Model Predicts the Next Token | Shyam's Blog #llm
- I numeri di 3 cifre il cui cubo finisce in 888 #teaching