DeepLearning 1 Understanding Transformer Architecture: From Attention Mechanism to LLM Foundation Apr 12, 2026