What is Adapter Layers?

Deep Learning

Adapter Layers

Adapter layers are small trainable modules inserted into a pre-trained model to enable parameter-efficient fine-tuning. They allow task adaptation while keeping the original model weights frozen.

Understanding Adapter Layers

Adapter layers are small, trainable modules inserted between the frozen layers of a pre-trained neural network, enabling efficient fine-tuning without modifying the original model weights. This technique is especially valuable when working with large language models where full fine-tuning would be prohibitively expensive in terms of compute and memory. Each adapter typically consists of a down-projection, a nonlinearity, and an up-projection, adding only a fraction of the total parameters. Organizations use adapter layers to customize foundation models for domain-specific tasks like legal document analysis or medical diagnosis while preserving the general knowledge encoded during unsupervised pre-training. This approach is closely related to parameter-efficient fine-tuning methods such as LoRA and prompt tuning, and helps mitigate catastrophic forgetting by keeping the base model intact.

Is AI recommending your brand?

Find out if ChatGPT, Perplexity, and Gemini mention you when people search your industry.

Check your brand — $9

Adversarial Attack

Back to full glossary

Adapter Layers

Understanding Adapter Layers

Is AI recommending your brand?

Related Deep Learning Terms

Activation Function

Adam Optimizer

Attention Mechanism

Autoencoder

Backpropagation

Batch Normalization

Batch Size

Boltzmann Machine