What is Generative Pre-trained Transformer?

Generative AI

Generative Pre-trained Transformer

A Generative Pre-trained Transformer (GPT) is a type of large language model that generates text by predicting the next token in a sequence. Pre-trained on vast text corpora, GPT models exhibit broad language understanding and generation capabilities.

Understanding Generative Pre-trained Transformer

A Generative Pre-trained Transformer (GPT) is a type of large language model that uses the transformer architecture to generate coherent, contextually relevant text after being pre-trained on vast text corpora in a self-supervised manner. The "pre-trained" aspect means the model first learns general language patterns from billions of documents before being adapted through fine-tuning or prompt engineering for specific tasks like summarization, translation, question answering, and code generation. OpenAI's GPT series, from GPT-2 to GPT-4 and beyond, demonstrated that scaling model size, training data, and compute leads to emergent capabilities including few-shot learning and complex reasoning. GPTs use an autoregressive approach, predicting one token at a time based on all preceding tokens. These foundation models have catalyzed the generative AI revolution and underpin applications like ChatGPT, transforming how people interact with AI across industries.

Is AI recommending your brand?

Find out if ChatGPT, Perplexity, and Gemini mention you when people search your industry.

Check your brand — $9

Genetic Algorithm

Back to full glossary

Generative Pre-trained Transformer

Understanding Generative Pre-trained Transformer

Is AI recommending your brand?

Related Generative AI Terms

Chain of Thought

ChatGPT

Claude

Diffusion Model

Discriminator

Few-Shot Prompting

Foundation Model

GAN