Natural Language Processing

BERT

BERT (Bidirectional Encoder Representations from Transformers) is a language model developed by Google that reads text in both directions simultaneously. BERT revolutionized NLP by enabling deep bidirectional pre-training for language understanding tasks.

Understanding BERT

BERT, or Bidirectional Encoder Representations from Transformers, fundamentally changed the landscape of natural language processing when Google released it in 2018 by demonstrating that pre-training a deep bidirectional transformer on massive text corpora could produce universally useful language representations. Unlike previous models that read text left-to-right or right-to-left, BERT's masked language modeling objective trains the model to predict randomly hidden words based on their full surrounding context in both directions. This bidirectional understanding allows BERT to capture nuanced meanings that depend on both preceding and following words. After pre-training, BERT can be fine-tuned with relatively small amounts of labeled data for tasks like classification, question answering, and named entity recognition. BERT's architecture built directly on the attention mechanism and transformer innovations, and its success spawned a family of variants including RoBERTa, DistilBERT, and ALBERT. BERT also powers Google Search's understanding of queries, demonstrating its massive real-world impact.

Is AI recommending your brand?

Find out if ChatGPT, Perplexity, and Gemini mention you when people search your industry.

Check your brand — $9

Bias in AI

Back to full glossary

BERT

Understanding BERT

Is AI recommending your brand?

Related Natural Language Processing Terms

Abstractive Summarization

Beam Search

Bigram

Byte Pair Encoding

Corpus

Extractive Summarization

Grounding

Language Model