Transformer

1 revision
#11 week ago
+6
Migrated from pages table
+A Transformer is a novel [neural network](/wiki/Neural_Networks) architecture, revolutionizing models in [Deep Learning](/wiki/Deep_Learning) by processing entire sequences simultaneously. It achieves this primarily through its self-attention mechanism, enabling parallel computation and exceptional performance in tasks like natural language understanding.
+## See also
+- [Attention Mechanism](/wiki/Attention_Mechanism)
+- [Natural Language Processing](/wiki/Natural_Language_Processing)
+- [Large Language Models](/wiki/Large_Language_Models)
... 1 more lines