Transformer
A transformer is a type of neural network design that powers modern language models - it is especially good at processing sequences such as text and understanding context.
In this guide
What Transformer means
The transformer is a model design introduced in 2017 that changed AI dramatically. Its key strength is an attention mechanism, which lets the model weigh how important every word is to every other word when working out meaning.
For example, in the sentence "she put the book on the table because it was sturdy," a transformer can work out that "it" refers to the table, not the book - by paying attention to how the words relate across the whole sentence.
Why Transformer matters
The transformer is the breakthrough that made today's AI tools possible - the "T" in GPT stands for transformer. Knowing the term helps you understand where modern AI came from.
Frequently asked questions
More AI terms
Ready to build the AI skills your future depends on?
Take the free 5-minute quiz and get a personalized learning plan built around your goals, schedule, and experience.