Transformers documentation
Auto Classes
In many cases, the architecture you want to use can be guessed from the name or the path of the pretrained model you
are supplying to the from_pretrained() method. AutoClasses are here to do this job for you so that you
automatically retrieve the relevant model given the name/path to the pretrained weights/config/vocabulary.
Instantiating one of AutoConfig, AutoModel, and AutoTokenizer will directly create a class of the relevant architecture. For instance
model = AutoModel.from_pretrained("google-bert/bert-base-cased")
will create a model that is an instance of BertModel.
There is one AutoModel class for each task.
Extending the Auto Classes
Each of the auto classes has a method that lets you extend it with your own custom classes. For instance, if you have defined a custom model class NewModel, make sure you also have a NewModelConfig; you can then add them to the auto classes like this:
from transformers import AutoConfig, AutoModel
AutoConfig.register("new-model", NewModelConfig)
AutoModel.register(NewModelConfig, NewModel)
You will then be able to use the auto classes like you usually would!
If your NewModelConfig is a subclass of PreTrainedConfig, make sure its model_type attribute is set to the same key you use when registering the config (here "new-model"). Likewise, if your NewModel is a subclass of PreTrainedModel, make sure its config_class attribute is set to the same class you use when registering the model (here NewModelConfig).
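Putting the registration steps above together, a minimal, self-contained sketch (the NewModelConfig/NewModel classes here are toy placeholders, not real library classes; assumes transformers and torch are installed):

```python
import torch.nn as nn
from transformers import AutoConfig, AutoModel, PretrainedConfig, PreTrainedModel

class NewModelConfig(PretrainedConfig):
    # model_type must match the key passed to AutoConfig.register below
    model_type = "new-model"

    def __init__(self, hidden_size=16, **kwargs):
        self.hidden_size = hidden_size
        super().__init__(**kwargs)

class NewModel(PreTrainedModel):
    # config_class must match the config class registered for this model
    config_class = NewModelConfig

    def __init__(self, config):
        super().__init__(config)
        self.linear = nn.Linear(config.hidden_size, config.hidden_size)

    def forward(self, x):
        return self.linear(x)

# Register the pair so the auto classes can resolve them
AutoConfig.register("new-model", NewModelConfig)
AutoModel.register(NewModelConfig, NewModel)

# The auto classes now resolve "new-model" to the custom classes
config = AutoConfig.for_model("new-model", hidden_size=32)
model = AutoModel.from_config(config)
print(type(model).__name__)
```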
AutoConfig
This is a generic configuration class that will be instantiated as one of the configuration classes of the library when created with the from_pretrained() class method.
This class cannot be instantiated directly using __init__() (throws an error).
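A quick sketch of that behavior, assuming transformers is installed (AutoConfig raises EnvironmentError when constructed directly):

```python
from transformers import AutoConfig

# AutoConfig is a factory: build instances via from_pretrained(), never directly.
try:
    AutoConfig()
except EnvironmentError as err:
    print(f"direct instantiation failed as expected: {err}")
```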
from_pretrained
< source >( pretrained_model_name_or_path: str | os.PathLike[str], **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model configuration hosted inside a model repo on huggingface.co.
  - A path to a directory containing a configuration file saved using the save_pretrained() method, e.g., ./my_model_directory/.
  - A path to a saved configuration JSON file, e.g., ./my_model_directory/configuration.json.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- return_unused_kwargs (bool, optional, defaults to False) — If False, this function returns just the final configuration object. If True, this function returns a tuple (config, unused_kwargs) where unused_kwargs is a dictionary consisting of the key/value pairs whose keys are not configuration attributes: i.e., the part of kwargs which has not been used to update config and is otherwise ignored.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- kwargs (additional keyword arguments, optional) — The values in kwargs of any keys which are configuration attributes will be used to override the loaded values. Behavior concerning key/value pairs whose keys are not configuration attributes is controlled by the return_unused_kwargs keyword parameter.
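To illustrate how kwargs overriding and return_unused_kwargs interact without a network call, a minimal sketch that first saves a BertConfig locally (the key my_extra_flag is made up for illustration; it is not a real configuration attribute):

```python
from tempfile import TemporaryDirectory

from transformers import AutoConfig, BertConfig

with TemporaryDirectory() as tmp:
    # Save a config with a known model_type so AutoConfig can resolve it offline.
    BertConfig(num_hidden_layers=3).save_pretrained(tmp)

    config, unused = AutoConfig.from_pretrained(
        tmp,
        hidden_dropout_prob=0.2,  # a real BertConfig attribute: overrides the loaded value
        my_extra_flag=True,       # hypothetical key, not a config attribute: left unused
        return_unused_kwargs=True,
    )

print(type(config).__name__)       # the resolved class, selected from model_type
print(config.hidden_dropout_prob)  # the overridden value
print(unused)                      # contains only the non-attribute key/value pairs
```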
Instantiate one of the configuration classes of the library from a pretrained model configuration.
The configuration class to instantiate is selected based on the model_type property of the config object that
is loaded, or when it’s missing, by falling back to using pattern matching on pretrained_model_name_or_path:
- afmoe — AfmoeConfig (AfmoeConfig model)
- aimv2 — Aimv2Config (Aimv2Config model)
- aimv2_text_model — Aimv2TextConfig (Aimv2TextConfig model)
- aimv2_vision_model — Aimv2VisionConfig (Aimv2VisionConfig model)
- albert — AlbertConfig (AlbertConfig model)
- align — AlignConfig (AlignConfig model)
- align_text_model — AlignTextConfig (AlignTextConfig model)
- align_vision_model — AlignVisionConfig (AlignVisionConfig model)
- altclip — AltCLIPConfig (AltCLIPConfig model)
- altclip_text_model — AltCLIPTextConfig (AltCLIPTextConfig model)
- altclip_vision_model — AltCLIPVisionConfig (AltCLIPVisionConfig model)
- apertus — ApertusConfig (ApertusConfig model)
- arcee — ArceeConfig (ArceeConfig model)
- aria — AriaConfig (AriaConfig model)
- aria_text — AriaTextConfig (AriaTextConfig model)
- audio-spectrogram-transformer — ASTConfig (ASTConfig model)
- audioflamingo3 — AudioFlamingo3Config (AudioFlamingo3Config model)
- audioflamingo3_encoder — AudioFlamingo3EncoderConfig (AudioFlamingo3EncoderConfig model)
- autoformer — AutoformerConfig (AutoformerConfig model)
- aya_vision — AyaVisionConfig (AyaVisionConfig model)
- bamba — BambaConfig (BambaConfig model)
- bark — BarkConfig (BarkConfig model)
- bart — BartConfig (BartConfig model)
- beit — BeitConfig (BeitConfig model)
- bert — BertConfig (BertConfig model)
- bert-generation — BertGenerationConfig (BertGenerationConfig model)
- big_bird — BigBirdConfig (BigBirdConfig model)
- bigbird_pegasus — BigBirdPegasusConfig (BigBirdPegasusConfig model)
- biogpt — BioGptConfig (BioGptConfig model)
- bit — BitConfig (BitConfig model)
- bitnet — BitNetConfig (BitNetConfig model)
- blenderbot — BlenderbotConfig (BlenderbotConfig model)
- blenderbot-small — BlenderbotSmallConfig (BlenderbotSmallConfig model)
- blip — BlipConfig (BlipConfig model)
- blip-2 — Blip2Config (Blip2Config model)
- blip_2_qformer — Blip2QFormerConfig (Blip2QFormerConfig model)
- blip_2_vision_model — Blip2VisionConfig (Blip2VisionConfig model)
- blip_text_model — BlipTextConfig (BlipTextConfig model)
- blip_vision_model — BlipVisionConfig (BlipVisionConfig model)
- bloom — BloomConfig (BloomConfig model)
- blt — BltConfig (BltConfig model)
- blt_global_transformer — BltGlobalTransformerConfig (BltGlobalTransformerConfig model)
- blt_local_decoder — BltLocalDecoderConfig (BltLocalDecoderConfig model)
- blt_local_encoder — BltLocalEncoderConfig (BltLocalEncoderConfig model)
- blt_patcher — BltPatcherConfig (BltPatcherConfig model)
- bridgetower — BridgeTowerConfig (BridgeTowerConfig model)
- bridgetower_text_model — BridgeTowerTextConfig (BridgeTowerTextConfig model)
- bridgetower_vision_model — BridgeTowerVisionConfig (BridgeTowerVisionConfig model)
- bros — BrosConfig (BrosConfig model)
- camembert — CamembertConfig (CamembertConfig model)
- canine — CanineConfig (CanineConfig model)
- chameleon — ChameleonConfig (ChameleonConfig model)
- chameleon_vqgan — ChameleonVQVAEConfig (ChameleonVQVAEConfig model)
- chinese_clip — ChineseCLIPConfig (ChineseCLIPConfig model)
- chinese_clip_text_model — ChineseCLIPTextConfig (ChineseCLIPTextConfig model)
- chinese_clip_vision_model — ChineseCLIPVisionConfig (ChineseCLIPVisionConfig model)
- chmv2 — CHMv2Config (CHMv2Config model)
- clap — ClapConfig (ClapConfig model)
- clap_audio_model — ClapAudioConfig (ClapAudioConfig model)
- clap_text_model — ClapTextConfig (ClapTextConfig model)
- clip — CLIPConfig (CLIPConfig model)
- clip_text_model — CLIPTextConfig (CLIPTextConfig model)
- clip_vision_model — CLIPVisionConfig (CLIPVisionConfig model)
- clipseg — CLIPSegConfig (CLIPSegConfig model)
- clipseg_text_model — CLIPSegTextConfig (CLIPSegTextConfig model)
- clipseg_vision_model — CLIPSegVisionConfig (CLIPSegVisionConfig model)
- clvp — ClvpConfig (ClvpConfig model)
- clvp_decoder — ClvpDecoderConfig (ClvpDecoderConfig model)
- clvp_encoder — ClvpEncoderConfig (ClvpEncoderConfig model)
- codegen — CodeGenConfig (CodeGenConfig model)
- cohere — CohereConfig (CohereConfig model)
- cohere2 — Cohere2Config (Cohere2Config model)
- cohere2_vision — Cohere2VisionConfig (Cohere2VisionConfig model)
- cohere_asr — CohereAsrConfig (CohereAsrConfig model)
- colmodernvbert — ColModernVBertConfig (ColModernVBertConfig model)
- colpali — ColPaliConfig (ColPaliConfig model)
- colqwen2 — ColQwen2Config (ColQwen2Config model)
- conditional_detr — ConditionalDetrConfig (ConditionalDetrConfig model)
- convbert — ConvBertConfig (ConvBertConfig model)
- convnext — ConvNextConfig (ConvNextConfig model)
- convnextv2 — ConvNextV2Config (ConvNextV2Config model)
- cpmant — CpmAntConfig (CpmAntConfig model)
- csm — CsmConfig (CsmConfig model)
- csm_depth_decoder_model — CsmDepthDecoderConfig (CsmDepthDecoderConfig model)
- ctrl — CTRLConfig (CTRLConfig model)
- cvt — CvtConfig (CvtConfig model)
- cwm — CwmConfig (CwmConfig model)
- d_fine — DFineConfig (DFineConfig model)
- dab-detr — DabDetrConfig (DabDetrConfig model)
- dac — DacConfig (DacConfig model)
- data2vec-audio — Data2VecAudioConfig (Data2VecAudioConfig model)
- data2vec-text — Data2VecTextConfig (Data2VecTextConfig model)
- data2vec-vision — Data2VecVisionConfig (Data2VecVisionConfig model)
- dbrx — DbrxConfig (DbrxConfig model)
- deberta — DebertaConfig (DebertaConfig model)
- deberta-v2 — DebertaV2Config (DebertaV2Config model)
- decision_transformer — DecisionTransformerConfig (DecisionTransformerConfig model)
- deepseek_v2 — DeepseekV2Config (DeepseekV2Config model)
- deepseek_v3 — DeepseekV3Config (DeepseekV3Config model)
- deepseek_vl — DeepseekVLConfig (DeepseekVLConfig model)
- deepseek_vl_hybrid — DeepseekVLHybridConfig (DeepseekVLHybridConfig model)
- deformable_detr — DeformableDetrConfig (DeformableDetrConfig model)
- deit — DeiTConfig (DeiTConfig model)
- depth_anything — DepthAnythingConfig (DepthAnythingConfig model)
- depth_pro — DepthProConfig (DepthProConfig model)
- detr — DetrConfig (DetrConfig model)
- dia — DiaConfig (DiaConfig model)
- dia_decoder — DiaDecoderConfig (DiaDecoderConfig model)
- dia_encoder — DiaEncoderConfig (DiaEncoderConfig model)
- diffllama — DiffLlamaConfig (DiffLlamaConfig model)
- dinat — DinatConfig (DinatConfig model)
- dinov2 — Dinov2Config (Dinov2Config model)
- dinov2_with_registers — Dinov2WithRegistersConfig (Dinov2WithRegistersConfig model)
- dinov3_convnext — DINOv3ConvNextConfig (DINOv3ConvNextConfig model)
- dinov3_vit — DINOv3ViTConfig (DINOv3ViTConfig model)
- distilbert — DistilBertConfig (DistilBertConfig model)
- doge — DogeConfig (DogeConfig model)
- donut-swin — DonutSwinConfig (DonutSwinConfig model)
- dots1 — Dots1Config (Dots1Config model)
- dpr — DPRConfig (DPRConfig model)
- dpt — DPTConfig (DPTConfig model)
- edgetam — EdgeTamConfig (EdgeTamConfig model)
- edgetam_video — EdgeTamVideoConfig (EdgeTamVideoConfig model)
- edgetam_vision_model — EdgeTamVisionConfig (EdgeTamVisionConfig model)
- efficientloftr — EfficientLoFTRConfig (EfficientLoFTRConfig model)
- efficientnet — EfficientNetConfig (EfficientNetConfig model)
- electra — ElectraConfig (ElectraConfig model)
- emu3 — Emu3Config (Emu3Config model)
- emu3_text_model — Emu3TextConfig (Emu3TextConfig model)
- emu3_vqgan — Emu3VQVAEConfig (Emu3VQVAEConfig model)
- encodec — EncodecConfig (EncodecConfig model)
- encoder-decoder — EncoderDecoderConfig (EncoderDecoderConfig model)
- eomt — EomtConfig (EomtConfig model)
- eomt_dinov3 — EomtDinov3Config (EomtDinov3Config model)
- ernie — ErnieConfig (ErnieConfig model)
- ernie4_5 — Ernie4_5Config (Ernie4_5Config model)
- ernie4_5_moe — Ernie4_5_MoeConfig (Ernie4_5_MoeConfig model)
- ernie4_5_vl_moe — Ernie4_5_VLMoeConfig (Ernie4_5_VLMoeConfig model)
- ernie4_5_vl_moe_text — Ernie4_5_VLMoeTextConfig (Ernie4_5_VLMoeTextConfig model)
- ernie4_5_vl_moe_vision — Ernie4_5_VLMoeVisionConfig (Ernie4_5_VLMoeVisionConfig model)
- esm — EsmConfig (EsmConfig model)
- eurobert — EuroBertConfig (EuroBertConfig model)
- evolla — EvollaConfig (EvollaConfig model)
- exaone4 — Exaone4Config (Exaone4Config model)
- exaone_moe — ExaoneMoeConfig (ExaoneMoeConfig model)
- falcon — FalconConfig (FalconConfig model)
- falcon_h1 — FalconH1Config (FalconH1Config model)
- falcon_mamba — FalconMambaConfig (FalconMambaConfig model)
- fast_vlm — FastVlmConfig (FastVlmConfig model)
- fastspeech2_conformer — FastSpeech2ConformerConfig (FastSpeech2ConformerConfig model)
- fastspeech2_conformer_hifigan — FastSpeech2ConformerHifiGanConfig (FastSpeech2ConformerHifiGanConfig model)
- fastspeech2_conformer_with_hifigan — FastSpeech2ConformerWithHifiGanConfig (FastSpeech2ConformerWithHifiGanConfig model)
- flaubert — FlaubertConfig (FlaubertConfig model)
- flava — FlavaConfig (FlavaConfig model)
- flava_image_model — FlavaImageConfig (FlavaImageConfig model)
- flava_multimodal_model — FlavaMultimodalConfig (FlavaMultimodalConfig model)
- flava_text_model — FlavaTextConfig (FlavaTextConfig model)
- flex_olmo — FlexOlmoConfig (FlexOlmoConfig model)
- florence2 — Florence2Config (Florence2Config model)
- florence_vision — Florence2VisionConfig (Florence2VisionConfig model)
- fnet — FNetConfig (FNetConfig model)
- focalnet — FocalNetConfig (FocalNetConfig model)
- fsmt — FSMTConfig (FSMTConfig model)
- funnel — FunnelConfig (FunnelConfig model)
- fuyu — FuyuConfig (FuyuConfig model)
- gemma — GemmaConfig (GemmaConfig model)
- gemma2 — Gemma2Config (Gemma2Config model)
- gemma3 — Gemma3Config (Gemma3Config model)
- gemma3_text — Gemma3TextConfig (Gemma3TextConfig model)
- gemma3n — Gemma3nConfig (Gemma3nConfig model)
- gemma3n_audio — Gemma3nAudioConfig (Gemma3nAudioConfig model)
- gemma3n_text — Gemma3nTextConfig (Gemma3nTextConfig model)
- gemma3n_vision — Gemma3nVisionConfig (Gemma3nVisionConfig model)
- gemma4 — Gemma4Config (Gemma4Config model)
- gemma4_audio — Gemma4AudioConfig (Gemma4AudioConfig model)
- gemma4_text — Gemma4TextConfig (Gemma4TextConfig model)
- gemma4_vision — Gemma4VisionConfig (Gemma4VisionConfig model)
- git — GitConfig (GitConfig model)
- git_vision_model — GitVisionConfig (GitVisionConfig model)
- glm — GlmConfig (GlmConfig model)
- glm4 — Glm4Config (Glm4Config model)
- glm46v — Glm46VConfig (Glm46VConfig model)
- glm4_moe — Glm4MoeConfig (Glm4MoeConfig model)
- glm4_moe_lite — Glm4MoeLiteConfig (Glm4MoeLiteConfig model)
- glm4v — Glm4vConfig (Glm4vConfig model)
- glm4v_moe — Glm4vMoeConfig (Glm4vMoeConfig model)
- glm4v_moe_text — Glm4vMoeTextConfig (Glm4vMoeTextConfig model)
- glm4v_moe_vision — Glm4vMoeVisionConfig (Glm4vMoeVisionConfig model)
- glm4v_text — Glm4vTextConfig (Glm4vTextConfig model)
- glm4v_vision — Glm4vVisionConfig (Glm4vVisionConfig model)
- glm_image — GlmImageConfig (GlmImageConfig model)
- glm_image_text — GlmImageTextConfig (GlmImageTextConfig model)
- glm_image_vision — GlmImageVisionConfig (GlmImageVisionConfig model)
- glm_image_vqmodel — GlmImageVQVAEConfig (GlmImageVQVAEConfig model)
- glm_moe_dsa — GlmMoeDsaConfig (GlmMoeDsaConfig model)
- glm_ocr — GlmOcrConfig (GlmOcrConfig model)
- glm_ocr_text — GlmOcrTextConfig (GlmOcrTextConfig model)
- glm_ocr_vision — GlmOcrVisionConfig (GlmOcrVisionConfig model)
- glmasr — GlmAsrConfig (GlmAsrConfig model)
- glmasr_encoder — GlmAsrEncoderConfig (GlmAsrEncoderConfig model)
- glpn — GLPNConfig (GLPNConfig model)
- got_ocr2 — GotOcr2Config (GotOcr2Config model)
- gpt-sw3 — GPT2Config (GPT2Config model)
- gpt2 — GPT2Config (GPT2Config model)
- gpt_bigcode — GPTBigCodeConfig (GPTBigCodeConfig model)
- gpt_neo — GPTNeoConfig (GPTNeoConfig model)
- gpt_neox — GPTNeoXConfig (GPTNeoXConfig model)
- gpt_neox_japanese — GPTNeoXJapaneseConfig (GPTNeoXJapaneseConfig model)
- gpt_oss — GptOssConfig (GptOssConfig model)
- gptj — GPTJConfig (GPTJConfig model)
- granite — GraniteConfig (GraniteConfig model)
- granite_speech — GraniteSpeechConfig (GraniteSpeechConfig model)
- granite_speech_encoder — GraniteSpeechEncoderConfig (GraniteSpeechEncoderConfig model)
- granitemoe — GraniteMoeConfig (GraniteMoeConfig model)
- granitemoehybrid — GraniteMoeHybridConfig (GraniteMoeHybridConfig model)
- granitemoeshared — GraniteMoeSharedConfig (GraniteMoeSharedConfig model)
- grounding-dino — GroundingDinoConfig (GroundingDinoConfig model)
- groupvit — GroupViTConfig (GroupViTConfig model)
- groupvit_text_model — GroupViTTextConfig (GroupViTTextConfig model)
- groupvit_vision_model — GroupViTVisionConfig (GroupViTVisionConfig model)
- helium — HeliumConfig (HeliumConfig model)
- hgnet_v2 — HGNetV2Config (HGNetV2Config model)
- hiera — HieraConfig (HieraConfig model)
- higgs_audio_v2 — HiggsAudioV2Config (HiggsAudioV2Config model)
- higgs_audio_v2_tokenizer — HiggsAudioV2TokenizerConfig (HiggsAudioV2TokenizerConfig model)
- hubert — HubertConfig (HubertConfig model)
- hunyuan_v1_dense — HunYuanDenseV1Config (HunYuanDenseV1Config model)
- hunyuan_v1_moe — HunYuanMoEV1Config (HunYuanMoEV1Config model)
- ibert — IBertConfig (IBertConfig model)
- idefics — IdeficsConfig (IdeficsConfig model)
- idefics2 — Idefics2Config (Idefics2Config model)
- idefics2_perceiver — Idefics2PerceiverConfig (Idefics2PerceiverConfig model)
- idefics2_vision — Idefics2VisionConfig (Idefics2VisionConfig model)
- idefics3 — Idefics3Config (Idefics3Config model)
- idefics3_vision — Idefics3VisionConfig (Idefics3VisionConfig model)
- idefics_perciever — IdeficsPerceiverConfig (IdeficsPerceiverConfig model)
- idefics_vision — IdeficsVisionConfig (IdeficsVisionConfig model)
- ijepa — IJepaConfig (IJepaConfig model)
- imagegpt — ImageGPTConfig (ImageGPTConfig model)
- informer — InformerConfig (InformerConfig model)
- instructblip — InstructBlipConfig (InstructBlipConfig model)
- instructblip_qformer — InstructBlipQFormerConfig (InstructBlipQFormerConfig model)
- instructblip_vision_model — InstructBlipVisionConfig (InstructBlipVisionConfig model)
- instructblipvideo — InstructBlipVideoConfig (InstructBlipVideoConfig model)
- instructblipvideo_qformer — InstructBlipVideoQFormerConfig (InstructBlipVideoQFormerConfig model)
- instructblipvideo_vision_model — InstructBlipVideoVisionConfig (InstructBlipVideoVisionConfig model)
- internvl — InternVLConfig (InternVLConfig model)
- internvl_vision — InternVLVisionConfig (InternVLVisionConfig model)
- jais2 — Jais2Config (Jais2Config model)
- jamba — JambaConfig (JambaConfig model)
- janus — JanusConfig (JanusConfig model)
- janus_vision_model — JanusVisionConfig (JanusVisionConfig model)
- janus_vqgan — JanusVQVAEConfig (JanusVQVAEConfig model)
- jetmoe — JetMoeConfig (JetMoeConfig model)
- jina_embeddings_v3 — JinaEmbeddingsV3Config (JinaEmbeddingsV3Config model)
- kosmos-2 — Kosmos2Config (Kosmos2Config model)
- kosmos-2.5 — Kosmos2_5Config (Kosmos2_5Config model)
- kosmos_2_5_text_model — Kosmos2_5TextConfig (Kosmos2_5TextConfig model)
- kosmos_2_5_vision_model — Kosmos2_5VisionConfig (Kosmos2_5VisionConfig model)
- kosmos_2_text_model — Kosmos2TextConfig (Kosmos2TextConfig model)
- kosmos_2_vision_model — Kosmos2VisionConfig (Kosmos2VisionConfig model)
- kyutai_speech_to_text — KyutaiSpeechToTextConfig (KyutaiSpeechToTextConfig model)
- lasr_ctc — LasrCTCConfig (LasrCTCConfig model)
- lasr_encoder — LasrEncoderConfig (LasrEncoderConfig model)
- layoutlm — LayoutLMConfig (LayoutLMConfig model)
- layoutlmv2 — LayoutLMv2Config (LayoutLMv2Config model)
- layoutlmv3 — LayoutLMv3Config (LayoutLMv3Config model)
- layoutxlm — LayoutXLMConfig (LayoutXLMConfig model)
- led — LEDConfig (LEDConfig model)
- levit — LevitConfig (LevitConfig model)
- lfm2 — Lfm2Config (Lfm2Config model)
- lfm2_moe — Lfm2MoeConfig (Lfm2MoeConfig model)
- lfm2_vl — Lfm2VlConfig (Lfm2VlConfig model)
- lightglue — LightGlueConfig (LightGlueConfig model)
- lighton_ocr — LightOnOcrConfig (LightOnOcrConfig model)
- lilt — LiltConfig (LiltConfig model)
- llama — LlamaConfig (LlamaConfig model)
- llama4 — Llama4Config (Llama4Config model)
- llama4_text — Llama4TextConfig (Llama4TextConfig model)
- llama4_vision_model — Llama4VisionConfig (Llama4VisionConfig model)
- llava — LlavaConfig (LlavaConfig model)
- llava_next — LlavaNextConfig (LlavaNextConfig model)
- llava_next_video — LlavaNextVideoConfig (LlavaNextVideoConfig model)
- llava_onevision — LlavaOnevisionConfig (LlavaOnevisionConfig model)
- longcat_flash — LongcatFlashConfig (LongcatFlashConfig model)
- longformer — LongformerConfig (LongformerConfig model)
- longt5 — LongT5Config (LongT5Config model)
- luke — LukeConfig (LukeConfig model)
- lw_detr — LwDetrConfig (LwDetrConfig model)
- lw_detr_vit — LwDetrViTConfig (LwDetrViTConfig model)
- lxmert — LxmertConfig (LxmertConfig model)
- m2m_100 — M2M100Config (M2M100Config model)
- mamba — MambaConfig (MambaConfig model)
- mamba2 — Mamba2Config (Mamba2Config model)
- marian — MarianConfig (MarianConfig model)
- markuplm — MarkupLMConfig (MarkupLMConfig model)
- mask2former — Mask2FormerConfig (Mask2FormerConfig model)
- maskformer — MaskFormerConfig (MaskFormerConfig model)
- maskformer-swin — MaskFormerSwinConfig (MaskFormerSwinConfig model)
- mbart — MBartConfig (MBartConfig model)
- megatron-bert — MegatronBertConfig (MegatronBertConfig model)
- metaclip_2 — MetaClip2Config (MetaClip2Config model)
- metaclip_2_text_model — MetaClip2TextConfig (MetaClip2TextConfig model)
- metaclip_2_vision_model — MetaClip2VisionConfig (MetaClip2VisionConfig model)
- mgp-str — MgpstrConfig (MgpstrConfig model)
- mimi — MimiConfig (MimiConfig model)
- minimax — MiniMaxConfig (MiniMaxConfig model)
- minimax_m2 — MiniMaxM2Config (MiniMaxM2Config model)
- ministral — MinistralConfig (MinistralConfig model)
- ministral3 — Ministral3Config (Ministral3Config model)
- mistral — MistralConfig (MistralConfig model)
- mistral3 — Mistral3Config (Mistral3Config model)
- mistral4 — Mistral4Config (Mistral4Config model)
- mixtral — MixtralConfig (MixtralConfig model)
- mlcd — MLCDVisionConfig (MLCDVisionConfig model)
- mlcd_vision_model — MLCDVisionConfig (MLCDVisionConfig model)
- mllama — MllamaConfig (MllamaConfig model)
- mllama_text_model — MllamaTextConfig (MllamaTextConfig model)
- mllama_vision_model — MllamaVisionConfig (MllamaVisionConfig model)
- mm-grounding-dino — MMGroundingDinoConfig (MMGroundingDinoConfig model)
- mobilebert — MobileBertConfig (MobileBertConfig model)
- mobilenet_v1 — MobileNetV1Config (MobileNetV1Config model)
- mobilenet_v2 — MobileNetV2Config (MobileNetV2Config model)
- mobilevit — MobileViTConfig (MobileViTConfig model)
- mobilevitv2 — MobileViTV2Config (MobileViTV2Config model)
- modernbert — ModernBertConfig (ModernBertConfig model)
- modernbert-decoder — ModernBertDecoderConfig (ModernBertDecoderConfig model)
- modernvbert — ModernVBertConfig (ModernVBertConfig model)
- moonshine — MoonshineConfig (MoonshineConfig model)
- moonshine_streaming — MoonshineStreamingConfig (MoonshineStreamingConfig model)
- moonshine_streaming_encoder — MoonshineStreamingEncoderConfig (MoonshineStreamingEncoderConfig model)
- moshi — MoshiConfig (MoshiConfig model)
- moshi_depth — MoshiDepthConfig (MoshiDepthConfig model)
- mpnet — MPNetConfig (MPNetConfig model)
- mpt — MptConfig (MptConfig model)
- mra — MraConfig (MraConfig model)
- mt5 — MT5Config (MT5Config model)
- musicflamingo — MusicFlamingoConfig (MusicFlamingoConfig model)
- musicgen — MusicgenConfig (MusicgenConfig model)
- musicgen_decoder — MusicgenDecoderConfig (MusicgenDecoderConfig model)
- musicgen_melody — MusicgenMelodyConfig (MusicgenMelodyConfig model)
- musicgen_melody_decoder — MusicgenMelodyDecoderConfig (MusicgenMelodyDecoderConfig model)
- mvp — MvpConfig (MvpConfig model)
- nanochat — NanoChatConfig (NanoChatConfig model)
- nemotron — NemotronConfig (NemotronConfig model)
- nemotron_h — NemotronHConfig (NemotronHConfig model)
- nllb-moe — NllbMoeConfig (NllbMoeConfig model)
- nomic_bert — NomicBertConfig (NomicBertConfig model)
- nougat — NougatConfig (NougatConfig model)
- nystromformer — NystromformerConfig (NystromformerConfig model)
- olmo — OlmoConfig (OlmoConfig model)
- olmo2 — Olmo2Config (Olmo2Config model)
- olmo3 — Olmo3Config (Olmo3Config model)
- olmo_hybrid — OlmoHybridConfig (OlmoHybridConfig model)
- olmoe — OlmoeConfig (OlmoeConfig model)
- omdet-turbo — OmDetTurboConfig (OmDetTurboConfig model)
- oneformer — OneFormerConfig (OneFormerConfig model)
- openai-gpt — OpenAIGPTConfig (OpenAIGPTConfig model)
- opt — OPTConfig (OPTConfig model)
- ovis2 — Ovis2Config (Ovis2Config model)
- owlv2 — Owlv2Config (Owlv2Config model)
- owlv2_text_model — Owlv2TextConfig (Owlv2TextConfig model)
- owlv2_vision_model — Owlv2VisionConfig (Owlv2VisionConfig model)
- owlvit — OwlViTConfig (OwlViTConfig model)
- owlvit_text_model — OwlViTTextConfig (OwlViTTextConfig model)
- owlvit_vision_model — OwlViTVisionConfig (OwlViTVisionConfig model)
- paddleocr_vl — PaddleOCRVLConfig (PaddleOCRVLConfig model)
- paddleocr_vl_text — PaddleOCRTextConfig (PaddleOCRTextConfig model)
- paddleocr_vl_vision — PaddleOCRVisionConfig (PaddleOCRVisionConfig model)
- paligemma — PaliGemmaConfig (PaliGemmaConfig model)
- parakeet_ctc — ParakeetCTCConfig (ParakeetCTCConfig model)
- parakeet_encoder — ParakeetEncoderConfig (ParakeetEncoderConfig model)
- patchtsmixer — PatchTSMixerConfig (PatchTSMixerConfig model)
- patchtst — PatchTSTConfig (PatchTSTConfig model)
- pe_audio — PeAudioConfig (PeAudioConfig model)
- pe_audio_encoder — PeAudioEncoderConfig (PeAudioEncoderConfig model)
- pe_audio_video — PeAudioVideoConfig (PeAudioVideoConfig model)
- pe_audio_video_encoder — PeAudioVideoEncoderConfig (PeAudioVideoEncoderConfig model)
- pe_video — PeVideoConfig (PeVideoConfig model)
- pe_video_encoder — PeVideoEncoderConfig (PeVideoEncoderConfig model)
- pegasus — PegasusConfig (PegasusConfig model)
- pegasus_x — PegasusXConfig (PegasusXConfig model)
- perceiver — PerceiverConfig (PerceiverConfig model)
- perception_lm — PerceptionLMConfig (PerceptionLMConfig model)
- persimmon — PersimmonConfig (PersimmonConfig model)
- phi — PhiConfig (PhiConfig model)
- phi3 — Phi3Config (Phi3Config model)
- phi4_multimodal — Phi4MultimodalConfig (Phi4MultimodalConfig model)
- phi4_multimodal_audio — Phi4MultimodalAudioConfig (Phi4MultimodalAudioConfig model)
- phi4_multimodal_vision — Phi4MultimodalVisionConfig (Phi4MultimodalVisionConfig model)
- phimoe — PhimoeConfig (PhimoeConfig model)
- pi0 — PI0Config (PI0Config model)
- pix2struct — Pix2StructConfig (Pix2StructConfig model)
- pix2struct_text_model — Pix2StructTextConfig (Pix2StructTextConfig model)
- pix2struct_vision_model — Pix2StructVisionConfig (Pix2StructVisionConfig model)
- pixio — PixioConfig (PixioConfig model)
- pixtral — PixtralVisionConfig (PixtralVisionConfig model)
- plbart — PLBartConfig (PLBartConfig model)
- poolformer — PoolFormerConfig (PoolFormerConfig model)
- pop2piano — Pop2PianoConfig (Pop2PianoConfig model)
- pp_chart2table — PPChart2TableConfig (PPChart2TableConfig model)
- pp_doclayout_v2 — PPDocLayoutV2Config (PPDocLayoutV2Config model)
- pp_doclayout_v3 — PPDocLayoutV3Config (PPDocLayoutV3Config model)
- pp_lcnet — PPLCNetConfig (PPLCNetConfig model)
- pp_lcnet_v3 — PPLCNetV3Config (PPLCNetV3Config model)
- pp_ocrv5_mobile_det — PPOCRV5MobileDetConfig (PPOCRV5MobileDetConfig model)
- pp_ocrv5_mobile_rec — PPOCRV5MobileRecConfig (PPOCRV5MobileRecConfig model)
- pp_ocrv5_server_det — PPOCRV5ServerDetConfig (PPOCRV5ServerDetConfig model)
- pp_ocrv5_server_rec — PPOCRV5ServerRecConfig (PPOCRV5ServerRecConfig model)
- prompt_depth_anything — PromptDepthAnythingConfig (PromptDepthAnythingConfig model)
- prophetnet — ProphetNetConfig (ProphetNetConfig model)
- pvt — PvtConfig (PvtConfig model)
- pvt_v2 — PvtV2Config (PvtV2Config model)
- qwen2 — Qwen2Config (Qwen2Config model)
- qwen2_5_omni — Qwen2_5OmniConfig (Qwen2_5OmniConfig model)
- qwen2_5_omni_audio_encoder — Qwen2_5OmniAudioEncoderConfig (Qwen2_5OmniAudioEncoderConfig model)
- qwen2_5_omni_bigvgan — Qwen2_5OmniBigVGANConfig (Qwen2_5OmniBigVGANConfig model)
- qwen2_5_omni_dit — Qwen2_5OmniDiTConfig (Qwen2_5OmniDiTConfig model)
- qwen2_5_omni_talker — Qwen2_5OmniTalkerConfig (Qwen2_5OmniTalkerConfig model)
- qwen2_5_omni_text — Qwen2_5OmniTextConfig (Qwen2_5OmniTextConfig model)
- qwen2_5_omni_thinker — Qwen2_5OmniThinkerConfig (Qwen2_5OmniThinkerConfig model)
- qwen2_5_omni_token2wav — Qwen2_5OmniToken2WavConfig (Qwen2_5OmniToken2WavConfig model)
- qwen2_5_omni_vision_encoder — Qwen2_5OmniVisionEncoderConfig (Qwen2_5OmniVisionEncoderConfig model)
- qwen2_5_vl — Qwen2_5_VLConfig (Qwen2_5_VLConfig model)
- qwen2_5_vl_text — Qwen2_5_VLTextConfig (Qwen2_5_VLTextConfig model)
- qwen2_5_vl_vision — Qwen2_5_VLVisionConfig (Qwen2_5_VLVisionConfig model)
- qwen2_audio — Qwen2AudioConfig (Qwen2AudioConfig model)
- qwen2_audio_encoder — Qwen2AudioEncoderConfig (Qwen2AudioEncoderConfig model)
- qwen2_moe — Qwen2MoeConfig (Qwen2MoeConfig model)
- qwen2_vl — Qwen2VLConfig (Qwen2VLConfig model)
- qwen2_vl_text — Qwen2VLTextConfig (Qwen2VLTextConfig model)
- qwen2_vl_vision — Qwen2VLVisionConfig (Qwen2VLVisionConfig model)
- qwen3 — Qwen3Config (Qwen3Config model)
- qwen3_5 — Qwen3_5Config (Qwen3_5Config model)
- qwen3_5_moe — Qwen3_5MoeConfig (Qwen3_5MoeConfig model)
- qwen3_5_moe_text — Qwen3_5MoeTextConfig (Qwen3_5MoeTextConfig model)
- qwen3_5_moe_vision — Qwen3_5MoeVisionConfig (Qwen3_5MoeVisionConfig model)
- qwen3_5_text — Qwen3_5TextConfig (Qwen3_5TextConfig model)
- qwen3_5_vision — Qwen3_5VisionConfig (Qwen3_5VisionConfig model)
- qwen3_moe — Qwen3MoeConfig (Qwen3MoeConfig model)
- qwen3_next — Qwen3NextConfig (Qwen3NextConfig model)
- qwen3_omni_moe — Qwen3OmniMoeConfig (Qwen3OmniMoeConfig model)
- qwen3_omni_moe_audio_encoder — Qwen3OmniMoeAudioEncoderConfig (Qwen3OmniMoeAudioEncoderConfig model)
- qwen3_omni_moe_talker_code_predictor — Qwen3OmniMoeTalkerCodePredictorConfig (Qwen3OmniMoeTalkerCodePredictorConfig model)
- qwen3_omni_moe_talker_text — Qwen3OmniMoeTalkerTextConfig (Qwen3OmniMoeTalkerTextConfig model)
- qwen3_omni_moe_text — Qwen3OmniMoeTextConfig (Qwen3OmniMoeTextConfig model)
- qwen3_omni_moe_thinker — Qwen3OmniMoeThinkerConfig (Qwen3OmniMoeThinkerConfig model)
- qwen3_omni_moe_vision_encoder — Qwen3OmniMoeVisionEncoderConfig (Qwen3OmniMoeVisionEncoderConfig model)
- qwen3_vl — Qwen3VLConfig (Qwen3VLConfig model)
- qwen3_vl_moe — Qwen3VLMoeConfig (Qwen3VLMoeConfig model)
- qwen3_vl_moe_text — Qwen3VLMoeTextConfig (Qwen3VLMoeTextConfig model)
- qwen3_vl_moe_vision — Qwen3VLMoeVisionConfig (Qwen3VLMoeVisionConfig model)
- qwen3_vl_text — Qwen3VLTextConfig (Qwen3VLTextConfig model)
- qwen3_vl_vision — Qwen3VLVisionConfig (Qwen3VLVisionConfig model)
- rag — RagConfig (RagConfig model)
- recurrent_gemma — RecurrentGemmaConfig (RecurrentGemmaConfig model)
- reformer — ReformerConfig (ReformerConfig model)
- regnet — RegNetConfig (RegNetConfig model)
- rembert — RemBertConfig (RemBertConfig model)
- resnet — ResNetConfig (ResNetConfig model)
- roberta — RobertaConfig (RobertaConfig model)
- roberta-prelayernorm — RobertaPreLayerNormConfig (RobertaPreLayerNormConfig model)
- roc_bert — RoCBertConfig (RoCBertConfig model)
- roformer — RoFormerConfig (RoFormerConfig model)
- rt_detr — RTDetrConfig (RTDetrConfig model)
- rt_detr_resnet — RTDetrResNetConfig (RTDetrResNetConfig model)
- rt_detr_v2 — RTDetrV2Config (RTDetrV2Config model)
- rwkv — RwkvConfig (RwkvConfig model)
- sam — SamConfig (SamConfig model)
- sam2 — Sam2Config (Sam2Config model)
- sam2_hiera_det_model — Sam2HieraDetConfig (Sam2HieraDetConfig model)
- sam2_video — Sam2VideoConfig (Sam2VideoConfig model)
- sam2_vision_model — Sam2VisionConfig (Sam2VisionConfig model)
- sam3 — Sam3Config (Sam3Config model)
- sam3_detr_decoder — Sam3DETRDecoderConfig (Sam3DETRDecoderConfig model)
- sam3_detr_encoder — Sam3DETREncoderConfig (Sam3DETREncoderConfig model)
- sam3_geometry_encoder — Sam3GeometryEncoderConfig (Sam3GeometryEncoderConfig model)
- sam3_lite_text — Sam3LiteTextConfig (Sam3LiteTextConfig model)
- sam3_lite_text_detr_decoder — Sam3LiteTextDETRDecoderConfig (Sam3LiteTextDETRDecoderConfig model)
- sam3_lite_text_detr_encoder — Sam3LiteTextDETREncoderConfig (Sam3LiteTextDETREncoderConfig model)
- sam3_lite_text_geometry_encoder — Sam3LiteTextGeometryEncoderConfig (Sam3LiteTextGeometryEncoderConfig model)
- sam3_lite_text_mask_decoder — Sam3LiteTextMaskDecoderConfig (Sam3LiteTextMaskDecoderConfig model)
- sam3_lite_text_text_model — Sam3LiteTextTextConfig (Sam3LiteTextTextConfig model)
- sam3_mask_decoder — Sam3MaskDecoderConfig (Sam3MaskDecoderConfig model)
- sam3_tracker — Sam3TrackerConfig (Sam3TrackerConfig model)
- sam3_tracker_video — Sam3TrackerVideoConfig (Sam3TrackerVideoConfig model)
- sam3_video — Sam3VideoConfig (Sam3VideoConfig model)
- sam3_vision_model — Sam3VisionConfig (Sam3VisionConfig model)
- sam3_vit_model — Sam3ViTConfig (Sam3ViTConfig model)
- sam_hq — SamHQConfig (SamHQConfig model)
- sam_hq_vision_model — SamHQVisionConfig (SamHQVisionConfig model)
- sam_vision_model — SamVisionConfig (SamVisionConfig model)
- seamless_m4t — SeamlessM4TConfig (SeamlessM4TConfig model)
- seamless_m4t_v2 — SeamlessM4Tv2Config (SeamlessM4Tv2Config model)
- seed_oss — SeedOssConfig (SeedOssConfig model)
- segformer — SegformerConfig (SegformerConfig model)
- seggpt — SegGptConfig (SegGptConfig model)
- sew — SEWConfig (SEWConfig model)
- sew-d — SEWDConfig (SEWDConfig model)
- shieldgemma2 — ShieldGemma2Config (ShieldGemma2Config model)
- siglip — SiglipConfig (SiglipConfig model)
- siglip2 — Siglip2Config (Siglip2Config model)
- siglip2_text_model — Siglip2TextConfig (Siglip2TextConfig model)
- siglip2_vision_model — Siglip2VisionConfig (Siglip2VisionConfig model)
- siglip_text_model — SiglipTextConfig (SiglipTextConfig model)
- siglip_vision_model — SiglipVisionConfig (SiglipVisionConfig model)
- slanext — SLANeXtConfig (SLANeXtConfig model)
- smollm3 — SmolLM3Config (SmolLM3Config model)
- smolvlm — SmolVLMConfig (SmolVLMConfig model)
- smolvlm_vision — SmolVLMVisionConfig (SmolVLMVisionConfig model)
- solar_open — SolarOpenConfig (SolarOpenConfig model)
- speech-encoder-decoder — SpeechEncoderDecoderConfig (SpeechEncoderDecoderConfig model)
- speech_to_text — Speech2TextConfig (Speech2TextConfig model)
- speecht5 — SpeechT5Config (SpeechT5Config model)
- speecht5_hifigan — SpeechT5HifiGanConfig (SpeechT5HifiGanConfig model)
- splinter — SplinterConfig (SplinterConfig model)
- squeezebert — SqueezeBertConfig (SqueezeBertConfig model)
- stablelm — StableLmConfig (StableLmConfig model)
- starcoder2 — Starcoder2Config (Starcoder2Config model)
- superglue — SuperGlueConfig (SuperGlueConfig model)
- superpoint — SuperPointConfig (SuperPointConfig model)
- swiftformer — SwiftFormerConfig (SwiftFormerConfig model)
- swin — SwinConfig (SwinConfig model)
- swin2sr — Swin2SRConfig (Swin2SRConfig model)
- swinv2 — Swinv2Config (Swinv2Config model)
- switch_transformers — SwitchTransformersConfig (SwitchTransformersConfig model)
- t5 — T5Config (T5Config model)
- t5_gemma_module — T5GemmaModuleConfig (T5GemmaModuleConfig model)
- t5gemma — T5GemmaConfig (T5GemmaConfig model)
- t5gemma2 — T5Gemma2Config (T5Gemma2Config model)
- t5gemma2_decoder — T5Gemma2DecoderConfig (T5Gemma2DecoderConfig model)
- t5gemma2_encoder — T5Gemma2EncoderConfig (T5Gemma2EncoderConfig model)
- t5gemma2_text — T5Gemma2TextConfig (T5Gemma2TextConfig model)
- table-transformer — TableTransformerConfig (TableTransformerConfig model)
- tapas — TapasConfig (TapasConfig model)
- textnet — TextNetConfig (TextNetConfig model)
- time_series_transformer — TimeSeriesTransformerConfig (TimeSeriesTransformerConfig model)
- timesfm — TimesFmConfig (TimesFmConfig model)
- timesfm2_5 — TimesFm2_5Config (TimesFm2_5Config model)
- timesformer — TimesformerConfig (TimesformerConfig model)
- timm_backbone — TimmBackboneConfig (TimmBackboneConfig model)
- timm_wrapper — TimmWrapperConfig (TimmWrapperConfig model)
- trocr — TrOCRConfig (TrOCRConfig model)
- tvp — TvpConfig (TvpConfig model)
- udop — UdopConfig (UdopConfig model)
- umt5 — UMT5Config (UMT5Config model)
- unispeech — UniSpeechConfig (UniSpeechConfig model)
- unispeech-sat — UniSpeechSatConfig (UniSpeechSatConfig model)
- univnet — UnivNetConfig (UnivNetConfig model)
- upernet — UperNetConfig (UperNetConfig model)
- uvdoc — UVDocConfig (UVDocConfig model)
- uvdoc_backbone — UVDocBackboneConfig (UVDocBackboneConfig model)
- vaultgemma — VaultGemmaConfig (VaultGemmaConfig model)
- vibevoice_acoustic_tokenizer — VibeVoiceAcousticTokenizerConfig (VibeVoiceAcousticTokenizerConfig model)
- vibevoice_acoustic_tokenizer_decoder — VibeVoiceAcousticTokenizerDecoderConfig (VibeVoiceAcousticTokenizerDecoderConfig model)
- vibevoice_acoustic_tokenizer_encoder — VibeVoiceAcousticTokenizerEncoderConfig (VibeVoiceAcousticTokenizerEncoderConfig model)
- vibevoice_asr — VibeVoiceAsrConfig (VibeVoiceAsrConfig model)
- video_llama_3 — VideoLlama3Config (VideoLlama3Config model)
- video_llama_3_vision — VideoLlama3VisionConfig (VideoLlama3VisionConfig model)
- video_llava — VideoLlavaConfig (VideoLlavaConfig model)
- videomae — VideoMAEConfig (VideoMAEConfig model)
- videomt — VideomtConfig (VideomtConfig model)
- vilt — ViltConfig (ViltConfig model)
- vipllava — VipLlavaConfig (VipLlavaConfig model)
- vision-encoder-decoder — VisionEncoderDecoderConfig (VisionEncoderDecoderConfig model)
- vision-text-dual-encoder — VisionTextDualEncoderConfig (VisionTextDualEncoderConfig model)
- visual_bert — VisualBertConfig (VisualBertConfig model)
- vit — ViTConfig (ViTConfig model)
- vit_mae — ViTMAEConfig (ViTMAEConfig model)
- vit_msn — ViTMSNConfig (ViTMSNConfig model)
- vitdet — VitDetConfig (VitDetConfig model)
- vitmatte — VitMatteConfig (VitMatteConfig model)
- vitpose — VitPoseConfig (VitPoseConfig model)
- vitpose_backbone — VitPoseBackboneConfig (VitPoseBackboneConfig model)
- vits — VitsConfig (VitsConfig model)
- vivit — VivitConfig (VivitConfig model)
- vjepa2 — VJEPA2Config (VJEPA2Config model)
- voxtral — VoxtralConfig (VoxtralConfig model)
- voxtral_encoder — VoxtralEncoderConfig (VoxtralEncoderConfig model)
- voxtral_realtime — VoxtralRealtimeConfig (VoxtralRealtimeConfig model)
- voxtral_realtime_encoder — VoxtralRealtimeEncoderConfig (VoxtralRealtimeEncoderConfig model)
- voxtral_realtime_text — VoxtralRealtimeTextConfig (VoxtralRealtimeTextConfig model)
- wav2vec2 — Wav2Vec2Config (Wav2Vec2Config model)
- wav2vec2-bert — Wav2Vec2BertConfig (Wav2Vec2BertConfig model)
- wav2vec2-conformer — Wav2Vec2ConformerConfig (Wav2Vec2ConformerConfig model)
- wavlm — WavLMConfig (WavLMConfig model)
- whisper — WhisperConfig (WhisperConfig model)
- xclip — XCLIPConfig (XCLIPConfig model)
- xclip_text_model — XCLIPTextConfig (XCLIPTextConfig model)
- xclip_vision_model — XCLIPVisionConfig (XCLIPVisionConfig model)
- xcodec — XcodecConfig (XcodecConfig model)
- xglm — XGLMConfig (XGLMConfig model)
- xlm — XLMConfig (XLMConfig model)
- xlm-roberta — XLMRobertaConfig (XLMRobertaConfig model)
- xlm-roberta-xl — XLMRobertaXLConfig (XLMRobertaXLConfig model)
- xlnet — XLNetConfig (XLNetConfig model)
- xlstm — xLSTMConfig (xLSTMConfig model)
- xmod — XmodConfig (XmodConfig model)
- yolos — YolosConfig (YolosConfig model)
- yoso — YosoConfig (YosoConfig model)
- youtu — YoutuConfig (YoutuConfig model)
- zamba — ZambaConfig (ZambaConfig model)
- zamba2 — Zamba2Config (Zamba2Config model)
- zoedepth — ZoeDepthConfig (ZoeDepthConfig model)
Examples:
>>> from transformers import AutoConfig
>>> # Download configuration from huggingface.co and cache.
>>> config = AutoConfig.from_pretrained("google-bert/bert-base-uncased")
>>> # Download configuration from huggingface.co (user-uploaded) and cache.
>>> config = AutoConfig.from_pretrained("dbmdz/bert-base-german-cased")
>>> # If configuration file is in a directory (e.g., was saved using *save_pretrained('./test/saved_model/')*).
>>> config = AutoConfig.from_pretrained("./test/bert_saved_model/")
>>> # Load a specific configuration file.
>>> config = AutoConfig.from_pretrained("./test/bert_saved_model/my_configuration.json")
>>> # Change some config attributes when loading a pretrained config.
>>> config = AutoConfig.from_pretrained("google-bert/bert-base-uncased", output_attentions=True, foo=False)
>>> config.output_attentions
True
>>> config, unused_kwargs = AutoConfig.from_pretrained(
... "google-bert/bert-base-uncased", output_attentions=True, foo=False, return_unused_kwargs=True
... )
>>> config.output_attentions
True
>>> unused_kwargs
{'foo': False}
register
< source >( model_type config exist_ok = False )
Parameters
- model_type (str) — The model type, like “bert” or “gpt”.
- config (PreTrainedConfig) — The config to register.
Register a new configuration for this class.
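As a minimal sketch of the registration flow (the `NewModelConfig` class, the `"new-model"` key, and the `hidden_size` attribute are placeholders for your own custom config):

```python
from transformers import AutoConfig, PretrainedConfig

# Hypothetical custom configuration; model_type must match the key
# passed to AutoConfig.register below.
class NewModelConfig(PretrainedConfig):
    model_type = "new-model"

    def __init__(self, hidden_size=64, **kwargs):
        self.hidden_size = hidden_size
        super().__init__(**kwargs)

# Register the config under its model_type key.
AutoConfig.register("new-model", NewModelConfig)

# AutoConfig.for_model now resolves the key to the registered class,
# forwarding any keyword arguments to its __init__.
config = AutoConfig.for_model("new-model", hidden_size=128)
print(type(config).__name__, config.hidden_size)
```

Once registered, `AutoConfig.from_pretrained()` will likewise resolve any checkpoint whose `config.json` declares `"model_type": "new-model"` to this class.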
AutoTokenizer
This is a generic tokenizer class that will be instantiated as one of the tokenizer classes of the library when created with the AutoTokenizer.from_pretrained() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_pretrained
< source >( pretrained_model_name_or_path *inputs **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a predefined tokenizer hosted inside a model repo on huggingface.co.
  - A path to a directory containing vocabulary files required by the tokenizer, for instance saved using the save_pretrained() method, e.g., ./my_model_directory/.
  - A path to a single saved vocabulary file, if and only if the tokenizer only requires a single vocabulary file (like Bert or XLNet), e.g., ./my_model_directory/vocab.txt. (Not applicable to all derived classes)
- inputs (additional positional arguments, optional) — Will be passed along to the Tokenizer __init__() method.
- config (PreTrainedConfig, optional) — The configuration object used to determine the tokenizer class to instantiate.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- subfolder (str, optional) — In case the relevant files are located inside a subfolder of the model repo on huggingface.co (e.g. for facebook/rag-token-base), specify it here.
- tokenizer_type (str, optional) — Tokenizer type to be loaded.
- backend (str, optional, defaults to "tokenizers") — Backend to use for tokenization. Valid options are:
  - "tokenizers": Use the Hugging Face tokenizers library backend (default)
  - "sentencepiece": Use the SentencePiece backend
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- kwargs (additional keyword arguments, optional) — Will be passed to the Tokenizer __init__() method. Can be used to set special tokens like bos_token, eos_token, unk_token, sep_token, pad_token, cls_token, mask_token, additional_special_tokens. See parameters in the __init__() for more details.
Instantiate one of the tokenizer classes of the library from a pretrained model vocabulary.
The tokenizer class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- aimv2 — CLIPTokenizer (Aimv2Config model)
- albert — AlbertTokenizer (AlbertConfig model)
- align — BertTokenizer (AlignConfig model)
- audioflamingo3 — Qwen2Tokenizer (AudioFlamingo3Config model)
- aya_vision — CohereTokenizer (AyaVisionConfig model)
- bark — BertTokenizer (BarkConfig model)
- bart — RobertaTokenizer (BartConfig model)
- bert — BertTokenizer (BertConfig model)
- bert-generation — BertGenerationTokenizer (BertGenerationConfig model)
- big_bird — BigBirdTokenizer (BigBirdConfig model)
- bigbird_pegasus — PegasusTokenizer (BigBirdPegasusConfig model)
- biogpt — BioGptTokenizer (BioGptConfig model)
- blenderbot — BlenderbotTokenizer (BlenderbotConfig model)
- blenderbot-small — BlenderbotSmallTokenizer (BlenderbotSmallConfig model)
- blip — BertTokenizer (BlipConfig model)
- blip-2 — GPT2Tokenizer (Blip2Config model)
- bridgetower — RobertaTokenizer (BridgeTowerConfig model)
- bros — BertTokenizer (BrosConfig model)
- camembert — CamembertTokenizer (CamembertConfig model)
- canine — CanineTokenizer (CanineConfig model)
- chameleon — TokenizersBackend (ChameleonConfig model)
- chinese_clip — BertTokenizer (ChineseCLIPConfig model)
- clap — RobertaTokenizer (ClapConfig model)
- clip — CLIPTokenizer (CLIPConfig model)
- clipseg — CLIPTokenizer (CLIPSegConfig model)
- clvp — ClvpTokenizer (ClvpConfig model)
- codegen — GPT2Tokenizer (CodeGenConfig model)
- cohere — CohereTokenizer (CohereConfig model)
- cohere2 — CohereTokenizer (Cohere2Config model)
- cohere_asr — TokenizersBackend (CohereAsrConfig model)
- colqwen2 — Qwen2Tokenizer (ColQwen2Config model)
- convbert — BertTokenizer (ConvBertConfig model)
- cpmant — CpmAntTokenizer (CpmAntConfig model)
- ctrl — CTRLTokenizer (CTRLConfig model)
- data2vec-audio — Wav2Vec2CTCTokenizer (Data2VecAudioConfig model)
- data2vec-text — RobertaTokenizer (Data2VecTextConfig model)
- dbrx — GPT2Tokenizer (DbrxConfig model)
- deberta — DebertaTokenizer (DebertaConfig model)
- deberta-v2 — DebertaV2Tokenizer (DebertaV2Config model)
- deepseek_v2 — TokenizersBackend (DeepseekV2Config model)
- deepseek_v3 — TokenizersBackend (DeepseekV3Config model)
- deepseek_vl — TokenizersBackend (DeepseekVLConfig model)
- deepseek_vl_hybrid — TokenizersBackend (DeepseekVLHybridConfig model)
- dia — DiaTokenizer (DiaConfig model)
- distilbert — BertTokenizer (DistilBertConfig model)
- dpr — DPRQuestionEncoderTokenizer (DPRConfig model)
- electra — BertTokenizer (ElectraConfig model)
- emu3 — GPT2Tokenizer (Emu3Config model)
- ernie — BertTokenizer (ErnieConfig model)
- esm — EsmTokenizer (EsmConfig model)
- falcon_mamba — GPTNeoXTokenizer (FalconMambaConfig model)
- fastspeech2_conformer — None (FastSpeech2ConformerConfig model)
- flaubert — FlaubertTokenizer (FlaubertConfig model)
- flava — BertTokenizer (FlavaConfig model)
- flex_olmo — GPT2Tokenizer (FlexOlmoConfig model)
- florence2 — BartTokenizer (Florence2Config model)
- fnet — FNetTokenizer (FNetConfig model)
- fsmt — FSMTTokenizer (FSMTConfig model)
- funnel — FunnelTokenizer (FunnelConfig model)
- fuyu — TokenizersBackend (FuyuConfig model)
- gemma — GemmaTokenizer (GemmaConfig model)
- gemma2 — GemmaTokenizer (Gemma2Config model)
- gemma3 — GemmaTokenizer (Gemma3Config model)
- gemma3_text — GemmaTokenizer (Gemma3TextConfig model)
- gemma3n — GemmaTokenizer (Gemma3nConfig model)
- gemma3n_text — GemmaTokenizer (Gemma3nTextConfig model)
- git — BertTokenizer (GitConfig model)
- glm — TokenizersBackend (GlmConfig model)
- glm4 — TokenizersBackend (Glm4Config model)
- glm4_moe — TokenizersBackend (Glm4MoeConfig model)
- glm4_moe_lite — TokenizersBackend (Glm4MoeLiteConfig model)
- glm4v — TokenizersBackend (Glm4vConfig model)
- glm4v_moe — TokenizersBackend (Glm4vMoeConfig model)
- glm_image — TokenizersBackend (GlmImageConfig model)
- glmasr — TokenizersBackend (GlmAsrConfig model)
- got_ocr2 — TokenizersBackend (GotOcr2Config model)
- gpt-sw3 — GPTSw3Tokenizer (GPT2Config model)
- gpt2 — GPT2Tokenizer (GPT2Config model)
- gpt_bigcode — GPT2Tokenizer (GPTBigCodeConfig model)
- gpt_neo — GPT2Tokenizer (GPTNeoConfig model)
- gpt_neox — GPTNeoXTokenizer (GPTNeoXConfig model)
- gpt_neox_japanese — GPTNeoXJapaneseTokenizer (GPTNeoXJapaneseConfig model)
- gptj — GPT2Tokenizer (GPTJConfig model)
- granite — GPT2Tokenizer (GraniteConfig model)
- granitemoe — GPT2Tokenizer (GraniteMoeConfig model)
- granitemoehybrid — GPT2Tokenizer (GraniteMoeHybridConfig model)
- granitemoeshared — GPT2Tokenizer (GraniteMoeSharedConfig model)
- grounding-dino — BertTokenizer (GroundingDinoConfig model)
- groupvit — CLIPTokenizer (GroupViTConfig model)
- hubert — Wav2Vec2CTCTokenizer (HubertConfig model)
- ibert — RobertaTokenizer (IBertConfig model)
- idefics — LlamaTokenizer (IdeficsConfig model)
- idefics2 — LlamaTokenizer (Idefics2Config model)
- instructblip — GPT2Tokenizer (InstructBlipConfig model)
- instructblipvideo — GPT2Tokenizer (InstructBlipVideoConfig model)
- internvl — Qwen2Tokenizer (InternVLConfig model)
- jais2 — GPT2Tokenizer (Jais2Config model)
- jamba — TokenizersBackend (JambaConfig model)
- janus — TokenizersBackend (JanusConfig model)
- jina_embeddings_v3 — XLMRobertaTokenizer (JinaEmbeddingsV3Config model)
- kosmos-2 — XLMRobertaTokenizer (Kosmos2Config model)
- lasr_ctc — LasrTokenizer (LasrCTCConfig model)
- lasr_encoder — LasrTokenizer (LasrEncoderConfig model)
- layoutlm — BertTokenizer (LayoutLMConfig model)
- layoutlmv2 — LayoutLMv2Tokenizer (LayoutLMv2Config model)
- layoutlmv3 — LayoutLMv3Tokenizer (LayoutLMv3Config model)
- layoutxlm — LayoutXLMTokenizer (LayoutXLMConfig model)
- led — LEDTokenizer (LEDConfig model)
- lighton_ocr — Qwen2TokenizerFast (LightOnOcrConfig model)
- lilt — RobertaTokenizer (LiltConfig model)
- llava — TokenizersBackend (LlavaConfig model)
- llava_next — TokenizersBackend (LlavaNextConfig model)
- longformer — RobertaTokenizer (LongformerConfig model)
- luke — LukeTokenizer (LukeConfig model)
- lxmert — LxmertTokenizer (LxmertConfig model)
- m2m_100 — M2M100Tokenizer (M2M100Config model)
- mamba — GPTNeoXTokenizer (MambaConfig model)
- mamba2 — GPTNeoXTokenizer (Mamba2Config model)
- marian — MarianTokenizer (MarianConfig model)
- markuplm — MarkupLMTokenizer (MarkupLMConfig model)
- mbart — MBartTokenizer (MBartConfig model)
- megatron-bert — BertTokenizer (MegatronBertConfig model)
- metaclip_2 — XLMRobertaTokenizer (MetaClip2Config model)
- mgp-str — MgpstrTokenizer (MgpstrConfig model)
- minimax_m2 — TokenizersBackend (MiniMaxM2Config model)
- ministral — MistralCommonBackend (MinistralConfig model)
- ministral3 — MistralCommonBackend (Ministral3Config model)
- mistral — MistralCommonBackend (MistralConfig model)
- mistral3 — MistralCommonBackend (Mistral3Config model)
- mixtral — MistralCommonBackend (MixtralConfig model)
- mm-grounding-dino — BertTokenizer (MMGroundingDinoConfig model)
- mobilebert — MobileBertTokenizer (MobileBertConfig model)
- modernbert — TokenizersBackend (ModernBertConfig model)
- mpnet — MPNetTokenizer (MPNetConfig model)
- mpt — GPTNeoXTokenizer (MptConfig model)
- mra — RobertaTokenizer (MraConfig model)
- mt5 — T5Tokenizer (MT5Config model)
- musicgen — T5Tokenizer (MusicgenConfig model)
- musicgen_melody — T5Tokenizer (MusicgenMelodyConfig model)
- mvp — MvpTokenizer (MvpConfig model)
- nemotron — TokenizersBackend (NemotronConfig model)
- nllb-moe — NllbTokenizer (NllbMoeConfig model)
- nomic_bert — BertTokenizer (NomicBertConfig model)
- nougat — NougatTokenizer (NougatConfig model)
- nystromformer — AlbertTokenizer (NystromformerConfig model)
- olmo — GPTNeoXTokenizer (OlmoConfig model)
- olmo2 — GPTNeoXTokenizer (Olmo2Config model)
- olmo3 — TokenizersBackend (Olmo3Config model)
- olmo_hybrid — TokenizersBackend (OlmoHybridConfig model)
- olmoe — GPTNeoXTokenizer (OlmoeConfig model)
- omdet-turbo — CLIPTokenizer (OmDetTurboConfig model)
- oneformer — CLIPTokenizer (OneFormerConfig model)
- openai-gpt — OpenAIGPTTokenizer (OpenAIGPTConfig model)
- opt — GPT2Tokenizer (OPTConfig model)
- ovis2 — Qwen2Tokenizer (Ovis2Config model)
- owlv2 — CLIPTokenizer (Owlv2Config model)
- owlvit — CLIPTokenizer (OwlViTConfig model)
- pegasus — PegasusTokenizer (PegasusConfig model)
- pegasus_x — PegasusTokenizer (PegasusXConfig model)
- perceiver — PerceiverTokenizer (PerceiverConfig model)
- phi — GPT2Tokenizer (PhiConfig model)
- phi3 — TokenizersBackend (Phi3Config model)
- phimoe — TokenizersBackend (PhimoeConfig model)
- pix2struct — T5Tokenizer (Pix2StructConfig model)
- pixtral — MistralCommonBackend (PixtralVisionConfig model)
- plbart — PLBartTokenizer (PLBartConfig model)
- prophetnet — ProphetNetTokenizer (ProphetNetConfig model)
- qwen2 — Qwen2Tokenizer (Qwen2Config model)
- qwen2_5_omni — Qwen2Tokenizer (Qwen2_5OmniConfig model)
- qwen2_5_vl — Qwen2Tokenizer (Qwen2_5_VLConfig model)
- qwen2_audio — Qwen2Tokenizer (Qwen2AudioConfig model)
- qwen2_moe — Qwen2Tokenizer (Qwen2MoeConfig model)
- qwen2_vl — Qwen2Tokenizer (Qwen2VLConfig model)
- qwen3 — Qwen2Tokenizer (Qwen3Config model)
- qwen3_5 — Qwen3_5Tokenizer (Qwen3_5Config model)
- qwen3_5_moe — Qwen3_5Tokenizer (Qwen3_5MoeConfig model)
- qwen3_moe — Qwen2Tokenizer (Qwen3MoeConfig model)
- qwen3_next — Qwen2Tokenizer (Qwen3NextConfig model)
- qwen3_omni_moe — Qwen2Tokenizer (Qwen3OmniMoeConfig model)
- qwen3_vl — Qwen2Tokenizer (Qwen3VLConfig model)
- qwen3_vl_moe — Qwen2Tokenizer (Qwen3VLMoeConfig model)
- rag — RagTokenizer (RagConfig model)
- recurrent_gemma — GemmaTokenizer (RecurrentGemmaConfig model)
- reformer — ReformerTokenizer (ReformerConfig model)
- rembert — RemBertTokenizer (RemBertConfig model)
- roberta — RobertaTokenizer (RobertaConfig model)
- roberta-prelayernorm — RobertaTokenizer (RobertaPreLayerNormConfig model)
- roc_bert — RoCBertTokenizer (RoCBertConfig model)
- roformer — RoFormerTokenizer (RoFormerConfig model)
- rwkv — GPTNeoXTokenizer (RwkvConfig model)
- sam3 — CLIPTokenizer (Sam3Config model)
- sam3_video — CLIPTokenizer (Sam3VideoConfig model)
- seamless_m4t — SeamlessM4TTokenizer (SeamlessM4TConfig model)
- seamless_m4t_v2 — SeamlessM4TTokenizer (SeamlessM4Tv2Config model)
- shieldgemma2 — GemmaTokenizer (ShieldGemma2Config model)
- siglip — SiglipTokenizer (SiglipConfig model)
- siglip2 — Siglip2Tokenizer (Siglip2Config model)
- speech_to_text — Speech2TextTokenizer (Speech2TextConfig model)
- speecht5 — SpeechT5Tokenizer (SpeechT5Config model)
- splinter — SplinterTokenizer (SplinterConfig model)
- squeezebert — BertTokenizer (SqueezeBertConfig model)
- stablelm — GPTNeoXTokenizer (StableLmConfig model)
- starcoder2 — GPT2Tokenizer (Starcoder2Config model)
- switch_transformers — T5Tokenizer (SwitchTransformersConfig model)
- t5 — T5Tokenizer (T5Config model)
- t5gemma — GemmaTokenizer (T5GemmaConfig model)
- tapas — TapasTokenizer (TapasConfig model)
- trocr — XLMRobertaTokenizer (TrOCRConfig model)
- tvp — BertTokenizer (TvpConfig model)
- udop — UdopTokenizer (UdopConfig model)
- umt5 — T5Tokenizer (UMT5Config model)
- unispeech — Wav2Vec2CTCTokenizer (UniSpeechConfig model)
- unispeech-sat — Wav2Vec2CTCTokenizer (UniSpeechSatConfig model)
- vilt — BertTokenizer (ViltConfig model)
- vipllava — TokenizersBackend (VipLlavaConfig model)
- visual_bert — BertTokenizer (VisualBertConfig model)
- vits — VitsTokenizer (VitsConfig model)
- voxtral — MistralCommonBackend (VoxtralConfig model)
- voxtral_realtime — MistralCommonBackend (VoxtralRealtimeConfig model)
- wav2vec2 — Wav2Vec2CTCTokenizer (Wav2Vec2Config model)
- wav2vec2-bert — Wav2Vec2CTCTokenizer (Wav2Vec2BertConfig model)
- wav2vec2-conformer — Wav2Vec2CTCTokenizer (Wav2Vec2ConformerConfig model)
- whisper — WhisperTokenizer (WhisperConfig model)
- xclip — CLIPTokenizer (XCLIPConfig model)
- xglm — XGLMTokenizer (XGLMConfig model)
- xlm — XLMTokenizer (XLMConfig model)
- xlm-roberta — XLMRobertaTokenizer (XLMRobertaConfig model)
- xlm-roberta-xl — XLMRobertaTokenizer (XLMRobertaXLConfig model)
- xlnet — XLNetTokenizer (XLNetConfig model)
- xlstm — GPTNeoXTokenizer (xLSTMConfig model)
- xmod — XLMRobertaTokenizer (XmodConfig model)
- yoso — AlbertTokenizer (YosoConfig model)
Examples:
>>> from transformers import AutoTokenizer
>>> # Download vocabulary from huggingface.co and cache.
>>> tokenizer = AutoTokenizer.from_pretrained("google-bert/bert-base-uncased")
>>> # Download vocabulary from huggingface.co (user-uploaded) and cache.
>>> tokenizer = AutoTokenizer.from_pretrained("dbmdz/bert-base-german-cased")
>>> # If vocabulary files are in a directory (e.g. tokenizer was saved using *save_pretrained('./test/saved_model/')*)
>>> # tokenizer = AutoTokenizer.from_pretrained("./test/bert_saved_model/")
>>> # Download vocabulary from huggingface.co and define model-specific arguments
>>> tokenizer = AutoTokenizer.from_pretrained("FacebookAI/roberta-base", add_prefix_space=True)
>>> # Explicitly use the tokenizers backend
>>> tokenizer = AutoTokenizer.from_pretrained("hf-internal-testing/llama-tokenizer", backend="tokenizers")
>>> # Explicitly use the sentencepiece backend
>>> tokenizer = AutoTokenizer.from_pretrained("hf-internal-testing/llama-tokenizer", backend="sentencepiece")
register
< source >( config_class tokenizer_class = None slow_tokenizer_class = None fast_tokenizer_class = None exist_ok = False )
Parameters
- config_class (PreTrainedConfig) — The configuration corresponding to the model to register.
- tokenizer_class — The tokenizer class to register (preferred parameter as of v5).
- slow_tokenizer_class — (Deprecated) The slow tokenizer class to register.
- fast_tokenizer_class — (Deprecated) The fast tokenizer class to register.
Register a new tokenizer in this mapping.
AutoFeatureExtractor
This is a generic feature extractor class that will be instantiated as one of the feature extractor classes of the library when created with the AutoFeatureExtractor.from_pretrained() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_pretrained
< source >( pretrained_model_name_or_path **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — This can be either:
  - a string, the model id of a pretrained feature_extractor hosted inside a model repo on huggingface.co.
  - a path to a directory containing a feature extractor file saved using the save_pretrained() method, e.g., ./my_model_directory/.
  - a path to a saved feature extractor JSON file, e.g., ./my_model_directory/preprocessor_config.json.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model feature extractor should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the feature extractor files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- token (str or bool, optional) — The token to use as HTTP bearer authorization for remote files. If True, will use the token generated when running hf auth login (stored in ~/.huggingface).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- return_unused_kwargs (bool, optional, defaults to False) — If False, this function returns just the final feature extractor object. If True, it returns a tuple (feature_extractor, unused_kwargs) where unused_kwargs is a dictionary consisting of the key/value pairs whose keys are not feature extractor attributes: i.e., the part of kwargs which has not been used to update feature_extractor and is otherwise ignored.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- kwargs (dict[str, Any], optional) — The values in kwargs for any keys which are feature extractor attributes will be used to override the loaded values. Behavior concerning key/value pairs whose keys are not feature extractor attributes is controlled by the return_unused_kwargs keyword parameter.
Instantiate one of the feature extractor classes of the library from a pretrained model vocabulary.
The feature extractor class to instantiate is selected based on the model_type property of the config object
(either passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s
missing, by falling back to using pattern matching on pretrained_model_name_or_path:
- audio-spectrogram-transformer — ASTFeatureExtractor (ASTConfig model)
- audioflamingo3 — WhisperFeatureExtractor (AudioFlamingo3Config model)
- clap — ClapFeatureExtractor (ClapConfig model)
- clvp — ClvpFeatureExtractor (ClvpConfig model)
- cohere_asr — CohereAsrFeatureExtractor (CohereAsrConfig model)
- csm — EncodecFeatureExtractor (CsmConfig model)
- dac — DacFeatureExtractor (DacConfig model)
- data2vec-audio — Wav2Vec2FeatureExtractor (Data2VecAudioConfig model)
- dia — DiaFeatureExtractor (DiaConfig model)
- encodec — EncodecFeatureExtractor (EncodecConfig model)
- gemma3n — Gemma3nAudioFeatureExtractor (Gemma3nConfig model)
- gemma4 — Gemma4AudioFeatureExtractor (Gemma4Config model)
- glmasr — WhisperFeatureExtractor (GlmAsrConfig model)
- granite_speech — GraniteSpeechFeatureExtractor (GraniteSpeechConfig model)
- higgs_audio_v2_tokenizer — DacFeatureExtractor (HiggsAudioV2TokenizerConfig model)
- hubert — Wav2Vec2FeatureExtractor (HubertConfig model)
- kyutai_speech_to_text — KyutaiSpeechToTextFeatureExtractor (KyutaiSpeechToTextConfig model)
- lasr_ctc — LasrFeatureExtractor (LasrCTCConfig model)
- lasr_encoder — LasrFeatureExtractor (LasrEncoderConfig model)
- markuplm — MarkupLMFeatureExtractor (MarkupLMConfig model)
- mimi — EncodecFeatureExtractor (MimiConfig model)
- moonshine — Wav2Vec2FeatureExtractor (MoonshineConfig model)
- moshi — EncodecFeatureExtractor (MoshiConfig model)
- musicgen — EncodecFeatureExtractor (MusicgenConfig model)
- musicgen_melody — MusicgenMelodyFeatureExtractor (MusicgenMelodyConfig model)
- parakeet_ctc — ParakeetFeatureExtractor (ParakeetCTCConfig model)
- parakeet_encoder — ParakeetFeatureExtractor (ParakeetEncoderConfig model)
- pe_audio — PeAudioFeatureExtractor (PeAudioConfig model)
- pe_audio_video — PeAudioFeatureExtractor (PeAudioVideoConfig model)
- phi4_multimodal — Phi4MultimodalFeatureExtractor (Phi4MultimodalConfig model)
- pop2piano — Pop2PianoFeatureExtractor (Pop2PianoConfig model)
- qwen2_5_omni — WhisperFeatureExtractor (Qwen2_5OmniConfig model)
- qwen2_audio — WhisperFeatureExtractor (Qwen2AudioConfig model)
- qwen3_omni_moe — WhisperFeatureExtractor (Qwen3OmniMoeConfig model)
- seamless_m4t — SeamlessM4TFeatureExtractor (SeamlessM4TConfig model)
- seamless_m4t_v2 — SeamlessM4TFeatureExtractor (SeamlessM4Tv2Config model)
- sew — Wav2Vec2FeatureExtractor (SEWConfig model)
- sew-d — Wav2Vec2FeatureExtractor (SEWDConfig model)
- speech_to_text — Speech2TextFeatureExtractor (Speech2TextConfig model)
- speecht5 — SpeechT5FeatureExtractor (SpeechT5Config model)
- unispeech — Wav2Vec2FeatureExtractor (UniSpeechConfig model)
- unispeech-sat — Wav2Vec2FeatureExtractor (UniSpeechSatConfig model)
- univnet — UnivNetFeatureExtractor (UnivNetConfig model)
- vibevoice_acoustic_tokenizer — VibeVoiceAcousticTokenizerFeatureExtractor (VibeVoiceAcousticTokenizerConfig model)
- vibevoice_asr — VibeVoiceAcousticTokenizerFeatureExtractor (VibeVoiceAsrConfig model)
- voxtral — WhisperFeatureExtractor (VoxtralConfig model)
- voxtral_realtime — VoxtralRealtimeFeatureExtractor (VoxtralRealtimeConfig model)
- wav2vec2 — Wav2Vec2FeatureExtractor (Wav2Vec2Config model)
- wav2vec2-bert — Wav2Vec2FeatureExtractor (Wav2Vec2BertConfig model)
- wav2vec2-conformer — Wav2Vec2FeatureExtractor (Wav2Vec2ConformerConfig model)
- wavlm — Wav2Vec2FeatureExtractor (WavLMConfig model)
- whisper — WhisperFeatureExtractor (WhisperConfig model)
- xcodec — DacFeatureExtractor (XcodecConfig model)
Passing token=True is required when you want to use a private model.

Examples:

>>> from transformers import AutoFeatureExtractor

>>> # Download feature extractor from huggingface.co and cache.
>>> feature_extractor = AutoFeatureExtractor.from_pretrained("facebook/wav2vec2-base-960h")

>>> # If feature extractor files are in a directory (e.g. feature extractor was saved using *save_pretrained('./test/saved_model/')*)
>>> # feature_extractor = AutoFeatureExtractor.from_pretrained("./test/saved_model/")

register
< source >( config_class feature_extractor_class exist_ok = False )
Parameters
- config_class (PreTrainedConfig) — The configuration corresponding to the model to register.
- feature_extractor_class (FeatureExtractorMixin) — The feature extractor to register.
Register a new feature extractor for this class.
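The registration pattern can be sketched with a minimal stand-in registry. This is illustrative only, not the transformers implementation; MiniAutoFeatureExtractor, NewModelConfig, and NewModelFeatureExtractor are hypothetical names:

```python
# Illustrative sketch only -- NOT the transformers implementation.
# NewModelConfig and NewModelFeatureExtractor are hypothetical names.

class NewModelConfig:
    model_type = "new-model"

class NewModelFeatureExtractor:
    pass

class MiniAutoFeatureExtractor:
    """Minimal stand-in for an Auto class holding a config -> extractor registry."""

    _registry: dict = {}

    @classmethod
    def register(cls, config_class, feature_extractor_class, exist_ok=False):
        # Mirror the exist_ok parameter: refuse to overwrite silently.
        if config_class in cls._registry and not exist_ok:
            raise ValueError(f"{config_class.__name__} is already registered")
        cls._registry[config_class] = feature_extractor_class

    @classmethod
    def for_config(cls, config):
        # Resolution step: the config's class selects the extractor class.
        return cls._registry[type(config)]

MiniAutoFeatureExtractor.register(NewModelConfig, NewModelFeatureExtractor)
extractor_cls = MiniAutoFeatureExtractor.for_config(NewModelConfig())
print(extractor_cls.__name__)  # NewModelFeatureExtractor
```

The real AutoFeatureExtractor.register call has the same shape: the config class is the key, the extractor class is the value, and exist_ok guards against accidental overwrites.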
AutoImageProcessor
This is a generic image processor class that will be instantiated as one of the image processor classes of the library when created with the AutoImageProcessor.from_pretrained() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_pretrained
< source >( pretrained_model_name_or_path *inputs **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — This can be either:
  - a string, the model id of a pretrained image_processor hosted inside a model repo on huggingface.co.
  - a path to a directory containing an image processor file saved using the save_pretrained() method, e.g., ./my_model_directory/.
  - a path to a saved image processor JSON file, e.g., ./my_model_directory/preprocessor_config.json.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model image processor should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force (re-)downloading the image processor files and override the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- token (str or bool, optional) — The token to use as HTTP bearer authorization for remote files. If True, will use the token generated when running hf auth login (stored in ~/.huggingface).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- use_fast (bool, optional, defaults to False) — Deprecated: use backend="torchvision" instead. This parameter is kept for backward compatibility. Use a fast torchvision-based image processor if it is supported for a given model. If a fast image processor is not available for a given model, a normal numpy-based image processor is returned instead.
- backend (str, optional, defaults to None) — The backend to use for image processing. Can be:
  - None: automatically select the best available backend (torchvision if available, otherwise pil)
  - "torchvision": use the torchvision backend (GPU-accelerated, faster)
  - "pil": use the PIL backend (portable, CPU-only)
  - any custom backend name registered via the register() method
- return_unused_kwargs (bool, optional, defaults to False) — If False, this function returns just the final image processor object. If True, it returns a tuple (image_processor, unused_kwargs) where unused_kwargs is a dictionary of the key/value pairs whose keys are not image processor attributes, i.e., the part of kwargs which has not been used to update image_processor and is otherwise ignored.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- image_processor_filename (str, optional, defaults to "config.json") — The name of the file in the model directory to use for the image processor config.
- kwargs (dict[str, Any], optional) — The values in kwargs for any keys which are image processor attributes will be used to override the loaded values. Behavior concerning key/value pairs whose keys are not image processor attributes is controlled by the return_unused_kwargs keyword parameter.
Instantiate one of the image processor classes of the library from a pretrained model vocabulary.
The image processor class to instantiate is selected based on the model_type property of the config object
(either passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s
missing, by falling back to using pattern matching on pretrained_model_name_or_path:
- aimv2 — {'torchvision': 'CLIPImageProcessor', 'pil': 'CLIPImageProcessorPil'} (Aimv2Config model)
- aimv2_vision_model — {'torchvision': 'CLIPImageProcessor', 'pil': 'CLIPImageProcessorPil'} (Aimv2VisionConfig model)
- align — {'torchvision': 'EfficientNetImageProcessor', 'pil': 'EfficientNetImageProcessorPil'} (AlignConfig model)
- altclip — {'torchvision': 'CLIPImageProcessor', 'pil': 'CLIPImageProcessorPil'} (AltCLIPConfig model)
- aria — {'pil': 'AriaImageProcessorPil', 'torchvision': 'AriaImageProcessor'} (AriaConfig model)
- aya_vision — {'torchvision': 'GotOcr2ImageProcessor', 'pil': 'GotOcr2ImageProcessorPil'} (AyaVisionConfig model)
- beit — {'pil': 'BeitImageProcessorPil', 'torchvision': 'BeitImageProcessor'} (BeitConfig model)
- bit — {'pil': 'BitImageProcessorPil', 'torchvision': 'BitImageProcessor'} (BitConfig model)
- blip — {'pil': 'BlipImageProcessorPil', 'torchvision': 'BlipImageProcessor'} (BlipConfig model)
- blip-2 — {'torchvision': 'BlipImageProcessor', 'pil': 'BlipImageProcessorPil'} (Blip2Config model)
- bridgetower — {'pil': 'BridgeTowerImageProcessorPil', 'torchvision': 'BridgeTowerImageProcessor'} (BridgeTowerConfig model)
- chameleon — {'pil': 'ChameleonImageProcessorPil', 'torchvision': 'ChameleonImageProcessor'} (ChameleonConfig model)
- chinese_clip — {'pil': 'ChineseCLIPImageProcessorPil', 'torchvision': 'ChineseCLIPImageProcessor'} (ChineseCLIPConfig model)
- chmv2 — {'torchvision': 'CHMv2ImageProcessor'} (CHMv2Config model)
- clip — {'pil': 'CLIPImageProcessorPil', 'torchvision': 'CLIPImageProcessor'} (CLIPConfig model)
- clipseg — {'torchvision': 'ViTImageProcessor', 'pil': 'ViTImageProcessorPil'} (CLIPSegConfig model)
- cohere2_vision — {'torchvision': 'Cohere2VisionImageProcessor'} (Cohere2VisionConfig model)
- colpali — {'torchvision': 'SiglipImageProcessor', 'pil': 'SiglipImageProcessorPil'} (ColPaliConfig model)
- colqwen2 — {'torchvision': 'Qwen2VLImageProcessor', 'pil': 'Qwen2VLImageProcessorPil'} (ColQwen2Config model)
- conditional_detr — {'pil': 'ConditionalDetrImageProcessorPil', 'torchvision': 'ConditionalDetrImageProcessor'} (ConditionalDetrConfig model)
- convnext — {'pil': 'ConvNextImageProcessorPil', 'torchvision': 'ConvNextImageProcessor'} (ConvNextConfig model)
- convnextv2 — {'torchvision': 'ConvNextImageProcessor', 'pil': 'ConvNextImageProcessorPil'} (ConvNextV2Config model)
- cvt — {'torchvision': 'ConvNextImageProcessor', 'pil': 'ConvNextImageProcessorPil'} (CvtConfig model)
- data2vec-vision — {'torchvision': 'BeitImageProcessor', 'pil': 'BeitImageProcessorPil'} (Data2VecVisionConfig model)
- deepseek_vl — {'pil': 'DeepseekVLImageProcessorPil', 'torchvision': 'DeepseekVLImageProcessor'} (DeepseekVLConfig model)
- deepseek_vl_hybrid — {'pil': 'DeepseekVLHybridImageProcessorPil', 'torchvision': 'DeepseekVLHybridImageProcessor'} (DeepseekVLHybridConfig model)
- deformable_detr — {'pil': 'DeformableDetrImageProcessorPil', 'torchvision': 'DeformableDetrImageProcessor'} (DeformableDetrConfig model)
- deit — {'pil': 'DeiTImageProcessorPil', 'torchvision': 'DeiTImageProcessor'} (DeiTConfig model)
- depth_anything — {'torchvision': 'DPTImageProcessor', 'pil': 'DPTImageProcessorPil'} (DepthAnythingConfig model)
- depth_pro — {'torchvision': 'DepthProImageProcessor'} (DepthProConfig model)
- detr — {'pil': 'DetrImageProcessorPil', 'torchvision': 'DetrImageProcessor'} (DetrConfig model)
- dinat — {'torchvision': 'ViTImageProcessor', 'pil': 'ViTImageProcessorPil'} (DinatConfig model)
- dinov2 — {'torchvision': 'BitImageProcessor', 'pil': 'BitImageProcessorPil'} (Dinov2Config model)
- dinov3_vit — {'torchvision': 'DINOv3ViTImageProcessor'} (DINOv3ViTConfig model)
- donut-swin — {'torchvision': 'DonutImageProcessor', 'pil': 'DonutImageProcessorPil'} (DonutSwinConfig model)
- dpt — {'pil': 'DPTImageProcessorPil', 'torchvision': 'DPTImageProcessor'} (DPTConfig model)
- edgetam — {'torchvision': 'Sam2ImageProcessor'} (EdgeTamConfig model)
- efficientloftr — {'pil': 'EfficientLoFTRImageProcessorPil', 'torchvision': 'EfficientLoFTRImageProcessor'} (EfficientLoFTRConfig model)
- efficientnet — {'pil': 'EfficientNetImageProcessorPil', 'torchvision': 'EfficientNetImageProcessor'} (EfficientNetConfig model)
- emu3 — {'pil': 'Emu3ImageProcessor'} (Emu3Config model)
- eomt — {'pil': 'EomtImageProcessorPil', 'torchvision': 'EomtImageProcessor'} (EomtConfig model)
- eomt_dinov3 — {'torchvision': 'EomtImageProcessor', 'pil': 'EomtImageProcessorPil'} (EomtDinov3Config model)
- ernie4_5_vl_moe — {'pil': 'Ernie4_5_VLMoeImageProcessorPil', 'torchvision': 'Ernie4_5_VLMoeImageProcessor'} (Ernie4_5_VLMoeConfig model)
- flava — {'pil': 'FlavaImageProcessorPil', 'torchvision': 'FlavaImageProcessor'} (FlavaConfig model)
- florence2 — {'torchvision': 'CLIPImageProcessor', 'pil': 'CLIPImageProcessorPil'} (Florence2Config model)
- focalnet — {'torchvision': 'BitImageProcessor', 'pil': 'BitImageProcessorPil'} (FocalNetConfig model)
- fuyu — {'pil': 'FuyuImageProcessorPil', 'torchvision': 'FuyuImageProcessor'} (FuyuConfig model)
- gemma3 — {'pil': 'Gemma3ImageProcessorPil', 'torchvision': 'Gemma3ImageProcessor'} (Gemma3Config model)
- gemma3n — {'torchvision': 'SiglipImageProcessor', 'pil': 'SiglipImageProcessorPil'} (Gemma3nConfig model)
- gemma4 — {'pil': 'Gemma4ImageProcessorPil', 'torchvision': 'Gemma4ImageProcessor'} (Gemma4Config model)
- git — {'torchvision': 'CLIPImageProcessor', 'pil': 'CLIPImageProcessorPil'} (GitConfig model)
- glm46v — {'pil': 'Glm46VImageProcessorPil', 'torchvision': 'Glm46VImageProcessor'} (Glm46VConfig model)
- glm4v — {'pil': 'Glm4vImageProcessorPil', 'torchvision': 'Glm4vImageProcessor'} (Glm4vConfig model)
- glm_image — {'pil': 'GlmImageImageProcessorPil', 'torchvision': 'GlmImageImageProcessor'} (GlmImageConfig model)
- glpn — {'pil': 'GLPNImageProcessorPil', 'torchvision': 'GLPNImageProcessor'} (GLPNConfig model)
- got_ocr2 — {'pil': 'GotOcr2ImageProcessorPil', 'torchvision': 'GotOcr2ImageProcessor'} (GotOcr2Config model)
- grounding-dino — {'pil': 'GroundingDinoImageProcessorPil', 'torchvision': 'GroundingDinoImageProcessor'} (GroundingDinoConfig model)
- groupvit — {'torchvision': 'CLIPImageProcessor', 'pil': 'CLIPImageProcessorPil'} (GroupViTConfig model)
- hiera — {'torchvision': 'BitImageProcessor', 'pil': 'BitImageProcessorPil'} (HieraConfig model)
- idefics — {'pil': 'IdeficsImageProcessorPil', 'torchvision': 'IdeficsImageProcessor'} (IdeficsConfig model)
- idefics2 — {'pil': 'Idefics2ImageProcessorPil', 'torchvision': 'Idefics2ImageProcessor'} (Idefics2Config model)
- idefics3 — {'pil': 'Idefics3ImageProcessorPil', 'torchvision': 'Idefics3ImageProcessor'} (Idefics3Config model)
- ijepa — {'torchvision': 'ViTImageProcessor', 'pil': 'ViTImageProcessorPil'} (IJepaConfig model)
- imagegpt — {'pil': 'ImageGPTImageProcessorPil', 'torchvision': 'ImageGPTImageProcessor'} (ImageGPTConfig model)
- instructblip — {'torchvision': 'BlipImageProcessor', 'pil': 'BlipImageProcessorPil'} (InstructBlipConfig model)
- internvl — {'torchvision': 'GotOcr2ImageProcessor', 'pil': 'GotOcr2ImageProcessorPil'} (InternVLConfig model)
- janus — {'pil': 'JanusImageProcessorPil', 'torchvision': 'JanusImageProcessor'} (JanusConfig model)
- kosmos-2 — {'torchvision': 'CLIPImageProcessor', 'pil': 'CLIPImageProcessorPil'} (Kosmos2Config model)
- kosmos-2.5 — {'torchvision': 'Kosmos2_5ImageProcessor', 'pil': 'Kosmos2_5ImageProcessorPil'} (Kosmos2_5Config model)
- layoutlmv2 — {'pil': 'LayoutLMv2ImageProcessorPil', 'torchvision': 'LayoutLMv2ImageProcessor'} (LayoutLMv2Config model)
- layoutlmv3 — {'pil': 'LayoutLMv3ImageProcessorPil', 'torchvision': 'LayoutLMv3ImageProcessor'} (LayoutLMv3Config model)
- layoutxlm — {'torchvision': 'LayoutLMv2ImageProcessor', 'pil': 'LayoutLMv2ImageProcessorPil'} (LayoutXLMConfig model)
- levit — {'pil': 'LevitImageProcessorPil', 'torchvision': 'LevitImageProcessor'} (LevitConfig model)
- lfm2_vl — {'torchvision': 'Lfm2VlImageProcessor'} (Lfm2VlConfig model)
- lightglue — {'pil': 'LightGlueImageProcessorPil', 'torchvision': 'LightGlueImageProcessor'} (LightGlueConfig model)
- lighton_ocr — {'torchvision': 'PixtralImageProcessor', 'pil': 'PixtralImageProcessorPil'} (LightOnOcrConfig model)
- llama4 — {'torchvision': 'Llama4ImageProcessor'} (Llama4Config model)
- llava — {'pil': 'LlavaImageProcessorPil', 'torchvision': 'LlavaImageProcessor'} (LlavaConfig model)
- llava_next — {'pil': 'LlavaNextImageProcessorPil', 'torchvision': 'LlavaNextImageProcessor'} (LlavaNextConfig model)
- llava_next_video — {'torchvision': 'LlavaNextImageProcessor', 'pil': 'LlavaNextImageProcessorPil'} (LlavaNextVideoConfig model)
- llava_onevision — {'pil': 'LlavaOnevisionImageProcessorPil', 'torchvision': 'LlavaOnevisionImageProcessor'} (LlavaOnevisionConfig model)
- lw_detr — {'torchvision': 'DeformableDetrImageProcessor', 'pil': 'DeformableDetrImageProcessorPil'} (LwDetrConfig model)
- mask2former — {'pil': 'Mask2FormerImageProcessorPil', 'torchvision': 'Mask2FormerImageProcessor'} (Mask2FormerConfig model)
- maskformer — {'pil': 'MaskFormerImageProcessorPil', 'torchvision': 'MaskFormerImageProcessor'} (MaskFormerConfig model)
- metaclip_2 — {'torchvision': 'CLIPImageProcessor', 'pil': 'CLIPImageProcessorPil'} (MetaClip2Config model)
- mgp-str — {'torchvision': 'ViTImageProcessor', 'pil': 'ViTImageProcessorPil'} (MgpstrConfig model)
- mistral3 — {'torchvision': 'PixtralImageProcessor', 'pil': 'PixtralImageProcessorPil'} (Mistral3Config model)
- mlcd — {'torchvision': 'CLIPImageProcessor', 'pil': 'CLIPImageProcessorPil'} (MLCDVisionConfig model)
- mllama — {'pil': 'MllamaImageProcessorPil', 'torchvision': 'MllamaImageProcessor'} (MllamaConfig model)
- mm-grounding-dino — {'torchvision': 'GroundingDinoImageProcessor', 'pil': 'GroundingDinoImageProcessorPil'} (MMGroundingDinoConfig model)
- mobilenet_v1 — {'pil': 'MobileNetV1ImageProcessorPil', 'torchvision': 'MobileNetV1ImageProcessor'} (MobileNetV1Config model)
- mobilenet_v2 — {'pil': 'MobileNetV2ImageProcessorPil', 'torchvision': 'MobileNetV2ImageProcessor'} (MobileNetV2Config model)
- mobilevit — {'pil': 'MobileViTImageProcessorPil', 'torchvision': 'MobileViTImageProcessor'} (MobileViTConfig model)
- mobilevitv2 — {'torchvision': 'MobileViTImageProcessor', 'pil': 'MobileViTImageProcessorPil'} (MobileViTV2Config model)
- nougat — {'pil': 'NougatImageProcessorPil', 'torchvision': 'NougatImageProcessor'} (NougatConfig model)
- omdet-turbo — {'torchvision': 'DetrImageProcessor', 'pil': 'DetrImageProcessorPil'} (OmDetTurboConfig model)
- oneformer — {'pil': 'OneFormerImageProcessorPil', 'torchvision': 'OneFormerImageProcessor'} (OneFormerConfig model)
- ovis2 — {'pil': 'Ovis2ImageProcessorPil', 'torchvision': 'Ovis2ImageProcessor'} (Ovis2Config model)
- owlv2 — {'pil': 'Owlv2ImageProcessorPil', 'torchvision': 'Owlv2ImageProcessor'} (Owlv2Config model)
- owlvit — {'pil': 'OwlViTImageProcessorPil', 'torchvision': 'OwlViTImageProcessor'} (OwlViTConfig model)
- paddleocr_vl — {'pil': 'PaddleOCRVLImageProcessorPil', 'torchvision': 'PaddleOCRVLImageProcessor'} (PaddleOCRVLConfig model)
- paligemma — {'torchvision': 'SiglipImageProcessor', 'pil': 'SiglipImageProcessorPil'} (PaliGemmaConfig model)
- perceiver — {'pil': 'PerceiverImageProcessorPil', 'torchvision': 'PerceiverImageProcessor'} (PerceiverConfig model)
- perception_lm — {'torchvision': 'PerceptionLMImageProcessor'} (PerceptionLMConfig model)
- phi4_multimodal — {'torchvision': 'Phi4MultimodalImageProcessor'} (Phi4MultimodalConfig model)
- pi0 — {'torchvision': 'PI0ImageProcessor'} (PI0Config model)
- pix2struct — {'pil': 'Pix2StructImageProcessorPil', 'torchvision': 'Pix2StructImageProcessor'} (Pix2StructConfig model)
- pixio — {'torchvision': 'BitImageProcessor', 'pil': 'BitImageProcessorPil'} (PixioConfig model)
- pixtral — {'pil': 'PixtralImageProcessorPil', 'torchvision': 'PixtralImageProcessor'} (PixtralVisionConfig model)
- poolformer — {'pil': 'PoolFormerImageProcessorPil', 'torchvision': 'PoolFormerImageProcessor'} (PoolFormerConfig model)
- pp_chart2table — {'pil': 'PPChart2TableImageProcessorPil', 'torchvision': 'PPChart2TableImageProcessor'} (PPChart2TableConfig model)
- pp_doclayout_v2 — {'torchvision': 'PPDocLayoutV2ImageProcessor'} (PPDocLayoutV2Config model)
- pp_doclayout_v3 — {'torchvision': 'PPDocLayoutV3ImageProcessor'} (PPDocLayoutV3Config model)
- pp_lcnet — {'torchvision': 'PPLCNetImageProcessor'} (PPLCNetConfig model)
- pp_ocrv5_mobile_det — {'torchvision': 'PPOCRV5ServerDetImageProcessor'} (PPOCRV5MobileDetConfig model)
- pp_ocrv5_mobile_rec — {'torchvision': 'PPOCRV5ServerRecImageProcessor'} (PPOCRV5MobileRecConfig model)
- pp_ocrv5_server_det — {'torchvision': 'PPOCRV5ServerDetImageProcessor'} (PPOCRV5ServerDetConfig model)
- pp_ocrv5_server_rec — {'torchvision': 'PPOCRV5ServerRecImageProcessor'} (PPOCRV5ServerRecConfig model)
- prompt_depth_anything — {'pil': 'PromptDepthAnythingImageProcessorPil', 'torchvision': 'PromptDepthAnythingImageProcessor'} (PromptDepthAnythingConfig model)
- pvt — {'pil': 'PvtImageProcessorPil', 'torchvision': 'PvtImageProcessor'} (PvtConfig model)
- pvt_v2 — {'torchvision': 'PvtImageProcessor', 'pil': 'PvtImageProcessorPil'} (PvtV2Config model)
- qwen2_5_omni — {'torchvision': 'Qwen2VLImageProcessor', 'pil': 'Qwen2VLImageProcessorPil'} (Qwen2_5OmniConfig model)
- qwen2_5_vl — {'torchvision': 'Qwen2VLImageProcessor', 'pil': 'Qwen2VLImageProcessorPil'} (Qwen2_5_VLConfig model)
- qwen2_vl — {'pil': 'Qwen2VLImageProcessorPil', 'torchvision': 'Qwen2VLImageProcessor'} (Qwen2VLConfig model)
- qwen3_5 — {'torchvision': 'Qwen2VLImageProcessor', 'pil': 'Qwen2VLImageProcessorPil'} (Qwen3_5Config model)
- qwen3_5_moe — {'torchvision': 'Qwen2VLImageProcessor', 'pil': 'Qwen2VLImageProcessorPil'} (Qwen3_5MoeConfig model)
- qwen3_omni_moe — {'torchvision': 'Qwen2VLImageProcessor', 'pil': 'Qwen2VLImageProcessorPil'} (Qwen3OmniMoeConfig model)
- qwen3_vl — {'torchvision': 'Qwen2VLImageProcessor', 'pil': 'Qwen2VLImageProcessorPil'} (Qwen3VLConfig model)
- regnet — {'torchvision': 'ConvNextImageProcessor', 'pil': 'ConvNextImageProcessorPil'} (RegNetConfig model)
- resnet — {'torchvision': 'ConvNextImageProcessor', 'pil': 'ConvNextImageProcessorPil'} (ResNetConfig model)
- rt_detr — {'pil': 'RTDetrImageProcessorPil', 'torchvision': 'RTDetrImageProcessor'} (RTDetrConfig model)
- sam — {'pil': 'SamImageProcessorPil', 'torchvision': 'SamImageProcessor'} (SamConfig model)
- sam2 — {'torchvision': 'Sam2ImageProcessor'} (Sam2Config model)
- sam2_video — {'torchvision': 'Sam2ImageProcessor'} (Sam2VideoConfig model)
- sam3 — {'torchvision': 'Sam3ImageProcessor'} (Sam3Config model)
- sam3_lite_text — {'torchvision': 'Sam3ImageProcessor'} (Sam3LiteTextConfig model)
- sam3_tracker — {'torchvision': 'Sam3ImageProcessor'} (Sam3TrackerConfig model)
- sam3_tracker_video — {'torchvision': 'Sam3ImageProcessor'} (Sam3TrackerVideoConfig model)
- sam3_video — {'torchvision': 'Sam3ImageProcessor'} (Sam3VideoConfig model)
- sam_hq — {'torchvision': 'SamImageProcessor', 'pil': 'SamImageProcessorPil'} (SamHQConfig model)
- segformer — {'pil': 'SegformerImageProcessorPil', 'torchvision': 'SegformerImageProcessor'} (SegformerConfig model)
- seggpt — {'pil': 'SegGptImageProcessorPil', 'torchvision': 'SegGptImageProcessor'} (SegGptConfig model)
- shieldgemma2 — {'torchvision': 'Gemma3ImageProcessor', 'pil': 'Gemma3ImageProcessorPil'} (ShieldGemma2Config model)
- siglip — {'pil': 'SiglipImageProcessorPil', 'torchvision': 'SiglipImageProcessor'} (SiglipConfig model)
- siglip2 — {'pil': 'Siglip2ImageProcessorPil', 'torchvision': 'Siglip2ImageProcessor'} (Siglip2Config model)
- slanext — {'torchvision': 'SLANeXtImageProcessor'} (SLANeXtConfig model)
- smolvlm — {'pil': 'SmolVLMImageProcessorPil', 'torchvision': 'SmolVLMImageProcessor'} (SmolVLMConfig model)
- superglue — {'pil': 'SuperGlueImageProcessorPil', 'torchvision': 'SuperGlueImageProcessor'} (SuperGlueConfig model)
- superpoint — {'pil': 'SuperPointImageProcessorPil', 'torchvision': 'SuperPointImageProcessor'} (SuperPointConfig model)
- swiftformer — {'torchvision': 'ViTImageProcessor', 'pil': 'ViTImageProcessorPil'} (SwiftFormerConfig model)
- swin — {'torchvision': 'ViTImageProcessor', 'pil': 'ViTImageProcessorPil'} (SwinConfig model)
- swin2sr — {'pil': 'Swin2SRImageProcessorPil', 'torchvision': 'Swin2SRImageProcessor'} (Swin2SRConfig model)
- swinv2 — {'torchvision': 'ViTImageProcessor', 'pil': 'ViTImageProcessorPil'} (Swinv2Config model)
- t5gemma2 — {'torchvision': 'Gemma3ImageProcessor', 'pil': 'Gemma3ImageProcessorPil'} (T5Gemma2Config model)
- t5gemma2_encoder — {'torchvision': 'Gemma3ImageProcessor', 'pil': 'Gemma3ImageProcessorPil'} (T5Gemma2EncoderConfig model)
- table-transformer — {'torchvision': 'DetrImageProcessor', 'pil': 'DetrImageProcessorPil'} (TableTransformerConfig model)
- textnet — {'pil': 'TextNetImageProcessorPil', 'torchvision': 'TextNetImageProcessor'} (TextNetConfig model)
- timesformer — {'pil': 'VideoMAEImageProcessorPil', 'torchvision': 'VideoMAEImageProcessor'} (TimesformerConfig model)
- timm_wrapper — {'pil': 'TimmWrapperImageProcessor'} (TimmWrapperConfig model)
- trocr — {'torchvision': 'ViTImageProcessor', 'pil': 'ViTImageProcessorPil'} (TrOCRConfig model)
- tvp — {'pil': 'TvpImageProcessorPil', 'torchvision': 'TvpImageProcessor'} (TvpConfig model)
- udop — {'torchvision': 'LayoutLMv3ImageProcessor', 'pil': 'LayoutLMv3ImageProcessorPil'} (UdopConfig model)
- upernet — {'torchvision': 'SegformerImageProcessor', 'pil': 'SegformerImageProcessorPil'} (UperNetConfig model)
- uvdoc — {'torchvision': 'UVDocImageProcessor'} (UVDocConfig model)
- video_llama_3 — {'pil': 'VideoLlama3ImageProcessorPil', 'torchvision': 'VideoLlama3ImageProcessor'} (VideoLlama3Config model)
- video_llava — {'pil': 'VideoLlavaImageProcessor'} (VideoLlavaConfig model)
- videomae — {'pil': 'VideoMAEImageProcessorPil', 'torchvision': 'VideoMAEImageProcessor'} (VideoMAEConfig model)
- vilt — {'pil': 'ViltImageProcessorPil', 'torchvision': 'ViltImageProcessor'} (ViltConfig model)
- vipllava — {'torchvision': 'CLIPImageProcessor', 'pil': 'CLIPImageProcessorPil'} (VipLlavaConfig model)
- vit — {'pil': 'ViTImageProcessorPil', 'torchvision': 'ViTImageProcessor'} (ViTConfig model)
- vit_mae — {'torchvision': 'ViTImageProcessor', 'pil': 'ViTImageProcessorPil'} (ViTMAEConfig model)
- vit_msn — {'torchvision': 'ViTImageProcessor', 'pil': 'ViTImageProcessorPil'} (ViTMSNConfig model)
- vitmatte — {'pil': 'VitMatteImageProcessorPil', 'torchvision': 'VitMatteImageProcessor'} (VitMatteConfig model)
- vitpose — {'pil': 'VitPoseImageProcessorPil', 'torchvision': 'VitPoseImageProcessor'} (VitPoseConfig model)
- vivit — {'torchvision': 'VivitImageProcessor'} (VivitConfig model)
- xclip — {'torchvision': 'CLIPImageProcessor', 'pil': 'CLIPImageProcessorPil'} (XCLIPConfig model)
- yolos — {'pil': 'YolosImageProcessorPil', 'torchvision': 'YolosImageProcessor'} (YolosConfig model)
- zoedepth — {'pil': 'ZoeDepthImageProcessorPil', 'torchvision': 'ZoeDepthImageProcessor'} (ZoeDepthConfig model)
Passing token=True is required when you want to use a private model.

Examples:

>>> from transformers import AutoImageProcessor

>>> # Download image processor from huggingface.co and cache.
>>> image_processor = AutoImageProcessor.from_pretrained("google/vit-base-patch16-224-in21k")

>>> # If image processor files are in a directory (e.g. image processor was saved using *save_pretrained('./test/saved_model/')*)
>>> # image_processor = AutoImageProcessor.from_pretrained("./test/saved_model/")

register
< source >( config_class slow_image_processor_class: type | None = None fast_image_processor_class: type | None = None image_processor_classes: dict[str, type] | None = None exist_ok: bool = False )
Parameters
- config_class (PreTrainedConfig) — The configuration corresponding to the model to register.
- slow_image_processor_class (type, optional) — The PIL backend image processor class (deprecated, use image_processor_classes={"pil": ...}).
- fast_image_processor_class (type, optional) — The torchvision backend image processor class (deprecated, use image_processor_classes={"torchvision": ...}).
- image_processor_classes (dict[str, type], optional) — Dictionary mapping backend names to image processor classes, which allows registering custom backends. Example: {"pil": MyPilProcessor, "torchvision": MyTorchvisionProcessor, "custom": MyCustomProcessor}
- exist_ok (bool, optional, defaults to False) — If True, allow overwriting existing registrations.
Register a new image processor for this class.
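The backend-keyed registration described above (a dict of backend name to processor class per model, plus automatic backend selection) can be sketched with a small stand-in. This is illustrative only, not the transformers implementation; MyPilProcessor and MyTorchvisionProcessor are hypothetical classes:

```python
# Illustrative sketch only -- NOT the transformers implementation.
# MyPilProcessor and MyTorchvisionProcessor are hypothetical names.

class MyPilProcessor:
    pass

class MyTorchvisionProcessor:
    pass

_IMAGE_PROCESSOR_REGISTRY = {}

def register(model_type, image_processor_classes, exist_ok=False):
    # image_processor_classes maps backend names to processor classes,
    # e.g. {"pil": ..., "torchvision": ...}, as in the parameter above.
    if model_type in _IMAGE_PROCESSOR_REGISTRY and not exist_ok:
        raise ValueError(f"{model_type!r} is already registered")
    _IMAGE_PROCESSOR_REGISTRY[model_type] = dict(image_processor_classes)

def resolve(model_type, backend=None, torchvision_available=True):
    backends = _IMAGE_PROCESSOR_REGISTRY[model_type]
    if backend is None:
        # backend=None: prefer torchvision when available, else fall back to pil.
        backend = "torchvision" if torchvision_available and "torchvision" in backends else "pil"
    return backends[backend]

register("my-model", {"pil": MyPilProcessor, "torchvision": MyTorchvisionProcessor})
print(resolve("my-model").__name__)                               # MyTorchvisionProcessor
print(resolve("my-model", torchvision_available=False).__name__)  # MyPilProcessor
```

Passing an explicit backend= name skips the automatic preference, which is what AutoImageProcessor.from_pretrained(..., backend="pil") does for a registered model.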
AutoVideoProcessor
This is a generic video processor class that will be instantiated as one of the video processor classes of the library when created with the AutoVideoProcessor.from_pretrained() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_pretrained
< source >( pretrained_model_name_or_path *inputs **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — This can be either:
  - a string, the model id of a pretrained video_processor hosted inside a model repo on huggingface.co.
  - a path to a directory containing a video processor file saved using the save_pretrained() method, e.g., ./my_model_directory/.
  - a path to a saved video processor JSON file, e.g., ./my_model_directory/preprocessor_config.json.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model video processor should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force (re-)downloading the video processor files and override the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- token (str or bool, optional) — The token to use as HTTP bearer authorization for remote files. If True, will use the token generated when running hf auth login (stored in ~/.huggingface).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- return_unused_kwargs (bool, optional, defaults to False) — If False, this function returns just the final video processor object. If True, it returns a tuple (video_processor, unused_kwargs) where unused_kwargs is a dictionary of the key/value pairs whose keys are not video processor attributes, i.e., the part of kwargs which has not been used to update video_processor and is otherwise ignored.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- kwargs (dict[str, Any], optional) — The values in kwargs for any keys which are video processor attributes will be used to override the loaded values. Behavior concerning key/value pairs whose keys are not video processor attributes is controlled by the return_unused_kwargs keyword parameter.
Instantiate one of the video processor classes of the library from a pretrained model vocabulary.
The video processor class to instantiate is selected based on the model_type property of the config object
(either passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s
missing, by falling back to using pattern matching on pretrained_model_name_or_path:
- ernie4_5_vl_moe — Ernie4_5_VLMoeVideoProcessor (Ernie4_5_VLMoeConfig model)
- gemma4 — Gemma4VideoProcessor (Gemma4Config model)
- glm46v — Glm46VVideoProcessor (Glm46VConfig model)
- glm4v — Glm4vVideoProcessor (Glm4vConfig model)
- instructblip — InstructBlipVideoVideoProcessor (InstructBlipConfig model)
- instructblipvideo — InstructBlipVideoVideoProcessor (InstructBlipVideoConfig model)
- internvl — InternVLVideoProcessor (InternVLConfig model)
- llava_next_video — LlavaNextVideoVideoProcessor (LlavaNextVideoConfig model)
- llava_onevision — LlavaOnevisionVideoProcessor (LlavaOnevisionConfig model)
- pe_audio_video — PeVideoVideoProcessor (PeAudioVideoConfig model)
- pe_video — PeVideoVideoProcessor (PeVideoConfig model)
- perception_lm — PerceptionLMVideoProcessor (PerceptionLMConfig model)
- qwen2_5_omni — Qwen2VLVideoProcessor (Qwen2_5OmniConfig model)
- qwen2_5_vl — Qwen2VLVideoProcessor (Qwen2_5_VLConfig model)
- qwen2_vl — Qwen2VLVideoProcessor (Qwen2VLConfig model)
- qwen3_5 — Qwen3VLVideoProcessor (Qwen3_5Config model)
- qwen3_5_moe — Qwen3VLVideoProcessor (Qwen3_5MoeConfig model)
- qwen3_omni_moe — Qwen2VLVideoProcessor (Qwen3OmniMoeConfig model)
- qwen3_vl — Qwen3VLVideoProcessor (Qwen3VLConfig model)
- qwen3_vl_moe — Qwen3VLVideoProcessor (Qwen3VLMoeConfig model)
- sam2_video — Sam2VideoVideoProcessor (Sam2VideoConfig model)
- smolvlm — SmolVLMVideoProcessor (SmolVLMConfig model)
- video_llama_3 — VideoLlama3VideoProcessor (VideoLlama3Config model)
- video_llava — VideoLlavaVideoProcessor (VideoLlavaConfig model)
- videomae — VideoMAEVideoProcessor (VideoMAEConfig model)
- videomt — VideomtVideoProcessor (VideomtConfig model)
- vjepa2 — VJEPA2VideoProcessor (VJEPA2Config model)
Passing token=True is required when you want to use a private model.

Examples:

>>> from transformers import AutoVideoProcessor

>>> # Download video processor from huggingface.co and cache.
>>> video_processor = AutoVideoProcessor.from_pretrained("llava-hf/llava-onevision-qwen2-0.5b-ov-hf")

>>> # If video processor files are in a directory (e.g. video processor was saved using *save_pretrained('./test/saved_model/')*)
>>> # video_processor = AutoVideoProcessor.from_pretrained("./test/saved_model/")

register
< source >( config_class video_processor_class exist_ok = False )
Parameters
- config_class (PreTrainedConfig) — The configuration corresponding to the model to register.
- video_processor_class (BaseVideoProcessor) — The video processor to register.
Register a new video processor for this class.
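The two-step selection described in the from_pretrained docs above (use the config's model_type when available, otherwise pattern-match the name/path) can be sketched like this. The mapping below is a tiny illustrative subset of the full table, not the transformers implementation:

```python
# Illustrative sketch only -- NOT the transformers implementation.
# A tiny subset of the model_type -> video processor mapping from the docs above.
VIDEO_PROCESSOR_MAPPING = {
    "qwen2_vl": "Qwen2VLVideoProcessor",
    "smolvlm": "SmolVLMVideoProcessor",
}

def select_video_processor(name_or_path, model_type=None):
    # Prefer the explicit model_type read from the loaded config.
    if model_type is not None:
        return VIDEO_PROCESSOR_MAPPING[model_type]
    # Fallback: pattern matching on the pretrained name/path.
    normalized = name_or_path.lower().replace("-", "_")
    for key, processor in VIDEO_PROCESSOR_MAPPING.items():
        if key in normalized:
            return processor
    raise ValueError(f"Could not infer a video processor for {name_or_path!r}")

print(select_video_processor("my-org/qwen2-vl-7b"))  # Qwen2VLVideoProcessor
```

In practice the config lookup almost always succeeds, so the pattern-matching fallback only matters when no config can be loaded from the name or path.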
AutoProcessor
This is a generic processor class that will be instantiated as one of the processor classes of the library when created with the AutoProcessor.from_pretrained() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_pretrained
< source >( pretrained_model_name_or_path **kwargs )
Parameters
- pretrained_model_name_or_path (`str` or `os.PathLike`) — This can be either:
  - a string, the model id of a pretrained feature extractor hosted inside a model repo on huggingface.co.
  - a path to a directory containing processor files saved using the save_pretrained() method, e.g., `./my_model_directory/`.
- cache_dir (`str` or `os.PathLike`, optional) — Path to a directory in which a downloaded pretrained model feature extractor should be cached if the standard cache should not be used.
- force_download (`bool`, optional, defaults to `False`) — Whether or not to force (re-)downloading the feature extractor files, overriding the cached versions if they exist.
- proxies (`dict[str, str]`, optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., `{'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}`. The proxies are used on each request.
- token (`str` or `bool`, optional) — The token to use as HTTP bearer authorization for remote files. If `True`, will use the token generated when running `hf auth login` (stored in `~/.huggingface`).
- revision (`str`, optional, defaults to `"main"`) — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so `revision` can be any identifier allowed by git.
- return_unused_kwargs (`bool`, optional, defaults to `False`) — If `False`, this function returns just the final feature extractor object. If `True`, it returns a tuple `(feature_extractor, unused_kwargs)` where `unused_kwargs` is a dictionary consisting of the key/value pairs whose keys are not feature extractor attributes: i.e., the part of `kwargs` which has not been used to update `feature_extractor` and is otherwise ignored.
- trust_remote_code (`bool`, optional, defaults to `False`) — Whether or not to allow custom models defined on the Hub in their own modeling files. This option should only be set to `True` for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- kwargs (`dict[str, Any]`, optional) — The values in `kwargs` for any keys which are feature extractor attributes will be used to override the loaded values. Behavior concerning key/value pairs whose keys are not feature extractor attributes is controlled by the `return_unused_kwargs` keyword parameter.
Instantiate one of the processor classes of the library from a pretrained model vocabulary.
The processor class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible):
- aimv2 — CLIPProcessor (Aimv2Config model)
- align — AlignProcessor (AlignConfig model)
- altclip — AltCLIPProcessor (AltCLIPConfig model)
- aria — AriaProcessor (AriaConfig model)
- audioflamingo3 — AudioFlamingo3Processor (AudioFlamingo3Config model)
- aya_vision — AyaVisionProcessor (AyaVisionConfig model)
- bark — BarkProcessor (BarkConfig model)
- blip — BlipProcessor (BlipConfig model)
- blip-2 — Blip2Processor (Blip2Config model)
- bridgetower — BridgeTowerProcessor (BridgeTowerConfig model)
- chameleon — ChameleonProcessor (ChameleonConfig model)
- chinese_clip — ChineseCLIPProcessor (ChineseCLIPConfig model)
- clap — ClapProcessor (ClapConfig model)
- clip — CLIPProcessor (CLIPConfig model)
- clipseg — CLIPSegProcessor (CLIPSegConfig model)
- clvp — ClvpProcessor (ClvpConfig model)
- cohere2_vision — Cohere2VisionProcessor (Cohere2VisionConfig model)
- cohere_asr — CohereAsrProcessor (CohereAsrConfig model)
- colmodernvbert — ColModernVBertProcessor (ColModernVBertConfig model)
- colpali — ColPaliProcessor (ColPaliConfig model)
- colqwen2 — ColQwen2Processor (ColQwen2Config model)
- deepseek_vl — DeepseekVLProcessor (DeepseekVLConfig model)
- deepseek_vl_hybrid — DeepseekVLHybridProcessor (DeepseekVLHybridConfig model)
- dia — DiaProcessor (DiaConfig model)
- edgetam — Sam2Processor (EdgeTamConfig model)
- emu3 — Emu3Processor (Emu3Config model)
- ernie4_5_vl_moe — Ernie4_5_VLMoeProcessor (Ernie4_5_VLMoeConfig model)
- evolla — EvollaProcessor (EvollaConfig model)
- flava — FlavaProcessor (FlavaConfig model)
- florence2 — Florence2Processor (Florence2Config model)
- fuyu — FuyuProcessor (FuyuConfig model)
- gemma3 — Gemma3Processor (Gemma3Config model)
- gemma3n — Gemma3nProcessor (Gemma3nConfig model)
- gemma4 — Gemma4Processor (Gemma4Config model)
- git — GitProcessor (GitConfig model)
- glm46v — Glm46VProcessor (Glm46VConfig model)
- glm4v — Glm4vProcessor (Glm4vConfig model)
- glm4v_moe — Glm4vProcessor (Glm4vMoeConfig model)
- glm_image — Glm4vProcessor (GlmImageConfig model)
- glmasr — GlmAsrProcessor (GlmAsrConfig model)
- got_ocr2 — GotOcr2Processor (GotOcr2Config model)
- granite_speech — GraniteSpeechProcessor (GraniteSpeechConfig model)
- grounding-dino — GroundingDinoProcessor (GroundingDinoConfig model)
- groupvit — CLIPProcessor (GroupViTConfig model)
- higgs_audio_v2 — HiggsAudioV2Processor (HiggsAudioV2Config model)
- hubert — Wav2Vec2Processor (HubertConfig model)
- idefics — IdeficsProcessor (IdeficsConfig model)
- idefics2 — Idefics2Processor (Idefics2Config model)
- idefics3 — Idefics3Processor (Idefics3Config model)
- instructblip — InstructBlipProcessor (InstructBlipConfig model)
- instructblipvideo — InstructBlipVideoProcessor (InstructBlipVideoConfig model)
- internvl — InternVLProcessor (InternVLConfig model)
- janus — JanusProcessor (JanusConfig model)
- kosmos-2 — Kosmos2Processor (Kosmos2Config model)
- kosmos-2.5 — Kosmos2_5Processor (Kosmos2_5Config model)
- kyutai_speech_to_text — KyutaiSpeechToTextProcessor (KyutaiSpeechToTextConfig model)
- lasr_ctc — LasrProcessor (LasrCTCConfig model)
- lasr_encoder — LasrProcessor (LasrEncoderConfig model)
- layoutlmv2 — LayoutLMv2Processor (LayoutLMv2Config model)
- layoutlmv3 — LayoutLMv3Processor (LayoutLMv3Config model)
- layoutxlm — LayoutXLMProcessor (LayoutXLMConfig model)
- lfm2_vl — Lfm2VlProcessor (Lfm2VlConfig model)
- lighton_ocr — LightOnOcrProcessor (LightOnOcrConfig model)
- llama4 — Llama4Processor (Llama4Config model)
- llava — LlavaProcessor (LlavaConfig model)
- llava_next — LlavaNextProcessor (LlavaNextConfig model)
- llava_next_video — LlavaNextVideoProcessor (LlavaNextVideoConfig model)
- llava_onevision — LlavaOnevisionProcessor (LlavaOnevisionConfig model)
- markuplm — MarkupLMProcessor (MarkupLMConfig model)
- metaclip_2 — CLIPProcessor (MetaClip2Config model)
- mgp-str — MgpstrProcessor (MgpstrConfig model)
- mistral3 — PixtralProcessor (Mistral3Config model)
- mllama — MllamaProcessor (MllamaConfig model)
- mm-grounding-dino — GroundingDinoProcessor (MMGroundingDinoConfig model)
- modernvbert — Idefics3Processor (ModernVBertConfig model)
- moonshine — Wav2Vec2Processor (MoonshineConfig model)
- moonshine_streaming — MoonshineStreamingProcessor (MoonshineStreamingConfig model)
- musicflamingo — MusicFlamingoProcessor (MusicFlamingoConfig model)
- omdet-turbo — OmDetTurboProcessor (OmDetTurboConfig model)
- oneformer — OneFormerProcessor (OneFormerConfig model)
- ovis2 — Ovis2Processor (Ovis2Config model)
- owlv2 — Owlv2Processor (Owlv2Config model)
- owlvit — OwlViTProcessor (OwlViTConfig model)
- paddleocr_vl — PaddleOCRVLProcessor (PaddleOCRVLConfig model)
- paligemma — PaliGemmaProcessor (PaliGemmaConfig model)
- perception_lm — PerceptionLMProcessor (PerceptionLMConfig model)
- phi4_multimodal — Phi4MultimodalProcessor (Phi4MultimodalConfig model)
- pi0 — PI0Processor (PI0Config model)
- pix2struct — Pix2StructProcessor (Pix2StructConfig model)
- pixtral — PixtralProcessor (PixtralVisionConfig model)
- pop2piano — Pop2PianoProcessor (Pop2PianoConfig model)
- pp_chart2table — PPChart2TableProcessor (PPChart2TableConfig model)
- qwen2_5_omni — Qwen2_5OmniProcessor (Qwen2_5OmniConfig model)
- qwen2_5_vl — Qwen2_5_VLProcessor (Qwen2_5_VLConfig model)
- qwen2_audio — Qwen2AudioProcessor (Qwen2AudioConfig model)
- qwen2_vl — Qwen2VLProcessor (Qwen2VLConfig model)
- qwen3_5 — Qwen3VLProcessor (Qwen3_5Config model)
- qwen3_5_moe — Qwen3VLProcessor (Qwen3_5MoeConfig model)
- qwen3_omni_moe — Qwen3OmniMoeProcessor (Qwen3OmniMoeConfig model)
- qwen3_vl — Qwen3VLProcessor (Qwen3VLConfig model)
- qwen3_vl_moe — Qwen3VLProcessor (Qwen3VLMoeConfig model)
- sam — SamProcessor (SamConfig model)
- sam2 — Sam2Processor (Sam2Config model)
- sam3 — Sam3Processor (Sam3Config model)
- sam3_lite_text — Sam3Processor (Sam3LiteTextConfig model)
- sam_hq — SamHQProcessor (SamHQConfig model)
- seamless_m4t — SeamlessM4TProcessor (SeamlessM4TConfig model)
- sew — Wav2Vec2Processor (SEWConfig model)
- sew-d — Wav2Vec2Processor (SEWDConfig model)
- shieldgemma2 — ShieldGemma2Processor (ShieldGemma2Config model)
- siglip — SiglipProcessor (SiglipConfig model)
- siglip2 — Siglip2Processor (Siglip2Config model)
- smolvlm — SmolVLMProcessor (SmolVLMConfig model)
- speech_to_text — Speech2TextProcessor (Speech2TextConfig model)
- speecht5 — SpeechT5Processor (SpeechT5Config model)
- t5gemma2 — Gemma3Processor (T5Gemma2Config model)
- t5gemma2_encoder — Gemma3Processor (T5Gemma2EncoderConfig model)
- trocr — TrOCRProcessor (TrOCRConfig model)
- tvp — TvpProcessor (TvpConfig model)
- udop — UdopProcessor (UdopConfig model)
- unispeech — Wav2Vec2Processor (UniSpeechConfig model)
- unispeech-sat — Wav2Vec2Processor (UniSpeechSatConfig model)
- vibevoice_asr — VibeVoiceAsrProcessor (VibeVoiceAsrConfig model)
- video_llava — VideoLlavaProcessor (VideoLlavaConfig model)
- vilt — ViltProcessor (ViltConfig model)
- vipllava — LlavaProcessor (VipLlavaConfig model)
- vision-text-dual-encoder — VisionTextDualEncoderProcessor (VisionTextDualEncoderConfig model)
- voxtral — VoxtralProcessor (VoxtralConfig model)
- voxtral_realtime — VoxtralRealtimeProcessor (VoxtralRealtimeConfig model)
- wav2vec2 — Wav2Vec2Processor (Wav2Vec2Config model)
- wav2vec2-bert — Wav2Vec2Processor (Wav2Vec2BertConfig model)
- wav2vec2-conformer — Wav2Vec2Processor (Wav2Vec2ConformerConfig model)
- wavlm — Wav2Vec2Processor (WavLMConfig model)
- whisper — WhisperProcessor (WhisperConfig model)
- xclip — XCLIPProcessor (XCLIPConfig model)
Passing `token=True` is required when you want to use a private model.
Examples:
>>> from transformers import AutoProcessor
>>> # Download processor from huggingface.co and cache.
>>> processor = AutoProcessor.from_pretrained("facebook/wav2vec2-base-960h")
>>> # If processor files are in a directory (e.g. processor was saved using *save_pretrained('./test/saved_model/')*)
>>> # processor = AutoProcessor.from_pretrained("./test/saved_model/")
register
< source >( config_class processor_class exist_ok = False )
Parameters
- config_class (PreTrainedConfig) — The configuration corresponding to the model to register.
- processor_class (ProcessorMixin) — The processor to register.
Register a new processor for this class.
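Following the same pattern as the other auto classes, a registration sketch might look like this. `NewModelConfig` and `NewModelProcessor` are hypothetical illustration-only names; the config's `model_type` must match the key passed to `AutoConfig.register`:

```python
# A minimal registration sketch; NewModelConfig and NewModelProcessor are
# hypothetical illustration-only classes, not part of the library.
from transformers import AutoConfig, AutoProcessor, PretrainedConfig
from transformers.processing_utils import ProcessorMixin


class NewModelConfig(PretrainedConfig):
    model_type = "new-model"  # must match the key used with AutoConfig.register


class NewModelProcessor(ProcessorMixin):
    # A real processor would list its components here,
    # e.g. ["image_processor", "tokenizer"]; empty for this toy example.
    attributes = []


AutoConfig.register("new-model", NewModelConfig)
AutoProcessor.register(NewModelConfig, NewModelProcessor)
```

Once registered, `AutoProcessor.from_pretrained` can dispatch to `NewModelProcessor` for any checkpoint whose config declares `"model_type": "new-model"`.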
Generic model classes
The following auto classes are available for instantiating a base model class without a specific head.
AutoModel
This is a generic model class that will be instantiated as one of the base model classes of the library when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
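Before the full mapping below, a small sketch of this config-based dispatch may help; it uses a deliberately tiny `BertConfig` (the hyperparameter values are arbitrary) so the randomly initialized model stays small and nothing is downloaded:

```python
# AutoModel dispatches on the configuration class: a BertConfig yields a
# BertModel. The tiny hyperparameters keep the randomly initialized model small.
from transformers import AutoModel, BertConfig, BertModel

config = BertConfig(
    vocab_size=128,
    hidden_size=32,
    num_hidden_layers=1,
    num_attention_heads=2,
    intermediate_size=64,
)
model = AutoModel.from_config(config)
print(isinstance(model, BertModel))  # True
```

Note that `from_config` only builds the architecture with random weights; use `from_pretrained` to also load trained weights.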
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- ASTConfig configuration class: ASTModel (ASTConfig model)
- AfmoeConfig configuration class: AfmoeModel (AfmoeConfig model)
- Aimv2Config configuration class: Aimv2Model (Aimv2Config model)
- Aimv2VisionConfig configuration class: Aimv2VisionModel (Aimv2VisionConfig model)
- AlbertConfig configuration class: AlbertModel (AlbertConfig model)
- AlignConfig configuration class: AlignModel (AlignConfig model)
AlbertModel(AlbertConfig model) - AlignConfig configuration class: AlignModel (AlignConfig model)
- AltCLIPConfig configuration class: AltCLIPModel (AltCLIPConfig model)
- ApertusConfig configuration class: ApertusModel (ApertusConfig model)
- ArceeConfig configuration class: ArceeModel (ArceeConfig model)
- AriaConfig configuration class: AriaModel (AriaConfig model)
- AriaTextConfig configuration class: AriaTextModel (AriaTextConfig model)
- AudioFlamingo3Config configuration class: AudioFlamingo3ForConditionalGeneration (AudioFlamingo3Config model)
- AudioFlamingo3EncoderConfig configuration class: AudioFlamingo3Encoder (AudioFlamingo3EncoderConfig model)
- AutoformerConfig configuration class: AutoformerModel (AutoformerConfig model)
- AyaVisionConfig configuration class: AyaVisionModel (AyaVisionConfig model)
- BambaConfig configuration class: BambaModel (BambaConfig model)
- BarkConfig configuration class: BarkModel (BarkConfig model)
- BartConfig configuration class: BartModel (BartConfig model)
- BeitConfig configuration class: BeitModel (BeitConfig model)
- BertConfig configuration class: BertModel (BertConfig model)
- BertGenerationConfig configuration class: BertGenerationEncoder (BertGenerationConfig model)
- BigBirdConfig configuration class: BigBirdModel (BigBirdConfig model)
- BigBirdPegasusConfig configuration class: BigBirdPegasusModel (BigBirdPegasusConfig model)
- BioGptConfig configuration class: BioGptModel (BioGptConfig model)
- BitConfig configuration class: BitModel (BitConfig model)
- BitNetConfig configuration class: BitNetModel (BitNetConfig model)
- BlenderbotConfig configuration class: BlenderbotModel (BlenderbotConfig model)
- BlenderbotSmallConfig configuration class: BlenderbotSmallModel (BlenderbotSmallConfig model)
- Blip2Config configuration class: Blip2Model (Blip2Config model)
- Blip2QFormerConfig configuration class: Blip2QFormerModel (Blip2QFormerConfig model)
- BlipConfig configuration class: BlipModel (BlipConfig model)
- BloomConfig configuration class: BloomModel (BloomConfig model)
- BltConfig configuration class: BltModel (BltConfig model)
- BridgeTowerConfig configuration class: BridgeTowerModel (BridgeTowerConfig model)
- BrosConfig configuration class: BrosModel (BrosConfig model)
- CLIPConfig configuration class: CLIPModel (CLIPConfig model)
- CLIPSegConfig configuration class: CLIPSegModel (CLIPSegConfig model)
- CLIPTextConfig configuration class: CLIPTextModel (CLIPTextConfig model)
- CLIPVisionConfig configuration class: CLIPVisionModel (CLIPVisionConfig model)
- CTRLConfig configuration class: CTRLModel (CTRLConfig model)
- CamembertConfig configuration class: CamembertModel (CamembertConfig model)
- CanineConfig configuration class: CanineModel (CanineConfig model)
- ChameleonConfig configuration class: ChameleonModel (ChameleonConfig model)
- ChineseCLIPConfig configuration class: ChineseCLIPModel (ChineseCLIPConfig model)
- ChineseCLIPVisionConfig configuration class: ChineseCLIPVisionModel (ChineseCLIPVisionConfig model)
- ClapConfig configuration class: ClapModel (ClapConfig model)
- ClvpConfig configuration class: ClvpModelForConditionalGeneration (ClvpConfig model)
- CodeGenConfig configuration class: CodeGenModel (CodeGenConfig model)
- Cohere2Config configuration class: Cohere2Model (Cohere2Config model)
- Cohere2VisionConfig configuration class: Cohere2VisionModel (Cohere2VisionConfig model)
- CohereAsrConfig configuration class: CohereAsrModel (CohereAsrConfig model)
- CohereConfig configuration class: CohereModel (CohereConfig model)
- ConditionalDetrConfig configuration class: ConditionalDetrModel (ConditionalDetrConfig model)
- ConvBertConfig configuration class: ConvBertModel (ConvBertConfig model)
- ConvNextConfig configuration class: ConvNextModel (ConvNextConfig model)
- ConvNextV2Config configuration class: ConvNextV2Model (ConvNextV2Config model)
- CpmAntConfig configuration class: CpmAntModel (CpmAntConfig model)
- CsmConfig configuration class: CsmForConditionalGeneration (CsmConfig model)
- CvtConfig configuration class: CvtModel (CvtConfig model)
- CwmConfig configuration class: CwmModel (CwmConfig model)
- DFineConfig configuration class: DFineModel (DFineConfig model)
- DINOv3ConvNextConfig configuration class: DINOv3ConvNextModel (DINOv3ConvNextConfig model)
- DINOv3ViTConfig configuration class: DINOv3ViTModel (DINOv3ViTConfig model)
- DPRConfig configuration class: DPRQuestionEncoder (DPRConfig model)
- DPTConfig configuration class: DPTModel (DPTConfig model)
- DabDetrConfig configuration class: DabDetrModel (DabDetrConfig model)
- DacConfig configuration class: DacModel (DacConfig model)
- Data2VecAudioConfig configuration class: Data2VecAudioModel (Data2VecAudioConfig model)
- Data2VecTextConfig configuration class: Data2VecTextModel (Data2VecTextConfig model)
- Data2VecVisionConfig configuration class: Data2VecVisionModel (Data2VecVisionConfig model)
- DbrxConfig configuration class: DbrxModel (DbrxConfig model)
- DebertaConfig configuration class: DebertaModel (DebertaConfig model)
- DebertaV2Config configuration class: DebertaV2Model (DebertaV2Config model)
- DecisionTransformerConfig configuration class: DecisionTransformerModel (DecisionTransformerConfig model)
- DeepseekV2Config configuration class: DeepseekV2Model (DeepseekV2Config model)
- DeepseekV3Config configuration class: DeepseekV3Model (DeepseekV3Config model)
- DeepseekVLConfig configuration class: DeepseekVLModel (DeepseekVLConfig model)
- DeepseekVLHybridConfig configuration class: DeepseekVLHybridModel (DeepseekVLHybridConfig model)
- DeformableDetrConfig configuration class: DeformableDetrModel (DeformableDetrConfig model)
- DeiTConfig configuration class: DeiTModel (DeiTConfig model)
- DepthProConfig configuration class: DepthProModel (DepthProConfig model)
- DetrConfig configuration class: DetrModel (DetrConfig model)
- DiaConfig configuration class: DiaModel (DiaConfig model)
- DiffLlamaConfig configuration class: DiffLlamaModel (DiffLlamaConfig model)
- DinatConfig configuration class: DinatModel (DinatConfig model)
- Dinov2Config configuration class: Dinov2Model (Dinov2Config model)
- Dinov2WithRegistersConfig configuration class: Dinov2WithRegistersModel (Dinov2WithRegistersConfig model)
- DistilBertConfig configuration class: DistilBertModel (DistilBertConfig model)
- DogeConfig configuration class: DogeModel (DogeConfig model)
- DonutSwinConfig configuration class: DonutSwinModel (DonutSwinConfig model)
- Dots1Config configuration class: Dots1Model (Dots1Config model)
- EdgeTamConfig configuration class: EdgeTamModel (EdgeTamConfig model)
- EdgeTamVideoConfig configuration class: EdgeTamVideoModel (EdgeTamVideoConfig model)
- EdgeTamVisionConfig configuration class: EdgeTamVisionModel (EdgeTamVisionConfig model)
- EfficientLoFTRConfig configuration class: EfficientLoFTRModel (EfficientLoFTRConfig model)
- EfficientNetConfig configuration class: EfficientNetModel (EfficientNetConfig model)
- ElectraConfig configuration class: ElectraModel (ElectraConfig model)
- Emu3Config configuration class: Emu3Model (Emu3Config model)
- EncodecConfig configuration class: EncodecModel (EncodecConfig model)
- Ernie4_5Config configuration class: Ernie4_5Model (Ernie4_5Config model)
- Ernie4_5_MoeConfig configuration class: Ernie4_5_MoeModel (Ernie4_5_MoeConfig model)
- Ernie4_5_VLMoeConfig configuration class: Ernie4_5_VLMoeModel (Ernie4_5_VLMoeConfig model)
- ErnieConfig configuration class: ErnieModel (ErnieConfig model)
- EsmConfig configuration class: EsmModel (EsmConfig model)
- EuroBertConfig configuration class: EuroBertModel (EuroBertConfig model)
- EvollaConfig configuration class: EvollaModel (EvollaConfig model)
- Exaone4Config configuration class: Exaone4Model (Exaone4Config model)
- ExaoneMoeConfig configuration class: ExaoneMoeModel (ExaoneMoeConfig model)
- FNetConfig configuration class: FNetModel (FNetConfig model)
- FSMTConfig configuration class: FSMTModel (FSMTConfig model)
- FalconConfig configuration class: FalconModel (FalconConfig model)
- FalconH1Config configuration class: FalconH1Model (FalconH1Config model)
- FalconMambaConfig configuration class: FalconMambaModel (FalconMambaConfig model)
- FastSpeech2ConformerConfig configuration class: FastSpeech2ConformerModel (FastSpeech2ConformerConfig model)
- FastSpeech2ConformerWithHifiGanConfig configuration class: FastSpeech2ConformerWithHifiGan (FastSpeech2ConformerWithHifiGanConfig model)
- FastVlmConfig configuration class: FastVlmModel (FastVlmConfig model)
- FlaubertConfig configuration class: FlaubertModel (FlaubertConfig model)
- FlavaConfig configuration class: FlavaModel (FlavaConfig model)
- FlexOlmoConfig configuration class: FlexOlmoModel (FlexOlmoConfig model)
- Florence2Config configuration class: Florence2Model (Florence2Config model)
- FocalNetConfig configuration class: FocalNetModel (FocalNetConfig model)
- FunnelConfig configuration class: FunnelModel or FunnelBaseModel (FunnelConfig model)
- FuyuConfig configuration class: FuyuModel (FuyuConfig model)
- GLPNConfig configuration class: GLPNModel (GLPNConfig model)
- GPT2Config configuration class: GPT2Model (GPT2Config model)
- GPTBigCodeConfig configuration class: GPTBigCodeModel (GPTBigCodeConfig model)
- GPTJConfig configuration class: GPTJModel (GPTJConfig model)
- GPTNeoConfig configuration class: GPTNeoModel (GPTNeoConfig model)
- GPTNeoXConfig configuration class: GPTNeoXModel (GPTNeoXConfig model)
- GPTNeoXJapaneseConfig configuration class: GPTNeoXJapaneseModel (GPTNeoXJapaneseConfig model)
- Gemma2Config configuration class: Gemma2Model (Gemma2Config model)
- Gemma3Config configuration class: Gemma3Model (Gemma3Config model)
- Gemma3TextConfig configuration class: Gemma3TextModel (Gemma3TextConfig model)
- Gemma3nAudioConfig configuration class: Gemma3nAudioEncoder (Gemma3nAudioConfig model)
- Gemma3nConfig configuration class: Gemma3nModel (Gemma3nConfig model)
- Gemma3nTextConfig configuration class: Gemma3nTextModel (Gemma3nTextConfig model)
- Gemma3nVisionConfig configuration class: TimmWrapperModel (Gemma3nVisionConfig model)
- Gemma4AudioConfig configuration class: Gemma4AudioModel (Gemma4AudioConfig model)
- Gemma4Config configuration class: Gemma4Model (Gemma4Config model)
- Gemma4TextConfig configuration class: Gemma4TextModel (Gemma4TextConfig model)
- Gemma4VisionConfig configuration class: Gemma4VisionModel (Gemma4VisionConfig model)
- GemmaConfig configuration class: GemmaModel (GemmaConfig model)
- GitConfig configuration class: GitModel (GitConfig model)
- Glm46VConfig configuration class: Glm46VModel (Glm46VConfig model)
- Glm4Config configuration class: Glm4Model (Glm4Config model)
- Glm4MoeConfig configuration class: Glm4MoeModel (Glm4MoeConfig model)
- Glm4MoeLiteConfig configuration class: Glm4MoeLiteModel (Glm4MoeLiteConfig model)
- Glm4vConfig configuration class: Glm4vModel (Glm4vConfig model)
- Glm4vMoeConfig configuration class: Glm4vMoeModel (Glm4vMoeConfig model)
- Glm4vMoeTextConfig configuration class: Glm4vMoeTextModel (Glm4vMoeTextConfig model)
- Glm4vMoeVisionConfig configuration class: Glm4vMoeVisionModel (Glm4vMoeVisionConfig model)
- Glm4vTextConfig configuration class: Glm4vTextModel (Glm4vTextConfig model)
- Glm4vVisionConfig configuration class: Glm4vVisionModel (Glm4vVisionConfig model)
- GlmAsrConfig configuration class: GlmAsrForConditionalGeneration (GlmAsrConfig model)
- GlmAsrEncoderConfig configuration class: GlmAsrEncoder (GlmAsrEncoderConfig model)
- GlmConfig configuration class: GlmModel (GlmConfig model)
- GlmImageConfig configuration class: GlmImageModel (GlmImageConfig model)
- GlmImageTextConfig configuration class: GlmImageTextModel (GlmImageTextConfig model)
- GlmImageVQVAEConfig configuration class: GlmImageVQVAE (GlmImageVQVAEConfig model)
- GlmImageVisionConfig configuration class: GlmImageVisionModel (GlmImageVisionConfig model)
- GlmMoeDsaConfig configuration class: GlmMoeDsaModel (GlmMoeDsaConfig model)
- GlmOcrConfig configuration class: GlmOcrModel (GlmOcrConfig model)
- GlmOcrTextConfig configuration class: GlmOcrTextModel (GlmOcrTextConfig model)
- GlmOcrVisionConfig configuration class: GlmOcrVisionModel (GlmOcrVisionConfig model)
- GotOcr2Config configuration class: GotOcr2Model (GotOcr2Config model)
- GptOssConfig configuration class: GptOssModel (GptOssConfig model)
- GraniteConfig configuration class: GraniteModel (GraniteConfig model)
- GraniteMoeConfig configuration class: GraniteMoeModel (GraniteMoeConfig model)
- GraniteMoeHybridConfig configuration class: GraniteMoeHybridModel (GraniteMoeHybridConfig model)
- GraniteMoeSharedConfig configuration class: GraniteMoeSharedModel (GraniteMoeSharedConfig model)
- GroundingDinoConfig configuration class: GroundingDinoModel (GroundingDinoConfig model)
- GroupViTConfig configuration class: GroupViTModel (GroupViTConfig model)
- HGNetV2Config configuration class: HGNetV2Backbone (HGNetV2Config model)
- HeliumConfig configuration class: HeliumModel (HeliumConfig model)
- HieraConfig configuration class: HieraModel (HieraConfig model)
- HiggsAudioV2Config configuration class: HiggsAudioV2ForConditionalGeneration (HiggsAudioV2Config model)
- HiggsAudioV2TokenizerConfig configuration class: HiggsAudioV2TokenizerModel (HiggsAudioV2TokenizerConfig model)
- HubertConfig configuration class: HubertModel (HubertConfig model)
- HunYuanDenseV1Config configuration class: HunYuanDenseV1Model (HunYuanDenseV1Config model)
- HunYuanMoEV1Config configuration class: HunYuanMoEV1Model (HunYuanMoEV1Config model)
- IBertConfig configuration class: IBertModel (IBertConfig model)
- IJepaConfig configuration class: IJepaModel (IJepaConfig model)
- Idefics2Config configuration class: Idefics2Model (Idefics2Config model)
- Idefics3Config configuration class: Idefics3Model (Idefics3Config model)
- Idefics3VisionConfig configuration class: Idefics3VisionTransformer (Idefics3VisionConfig model)
- IdeficsConfig configuration class: IdeficsModel (IdeficsConfig model)
- ImageGPTConfig configuration class: ImageGPTModel (ImageGPTConfig model)
- InformerConfig configuration class: InformerModel (InformerConfig model)
- InstructBlipConfig configuration class: InstructBlipModel (InstructBlipConfig model)
- InstructBlipVideoConfig configuration class: InstructBlipVideoModel (InstructBlipVideoConfig model)
- InternVLConfig configuration class: InternVLModel (InternVLConfig model)
- InternVLVisionConfig configuration class: InternVLVisionModel (InternVLVisionConfig model)
- Jais2Config configuration class: Jais2Model (Jais2Config model)
- JambaConfig configuration class: JambaModel (JambaConfig model)
- JanusConfig configuration class: JanusModel (JanusConfig model)
- JetMoeConfig configuration class: JetMoeModel (JetMoeConfig model)
- JinaEmbeddingsV3Config configuration class: JinaEmbeddingsV3Model (JinaEmbeddingsV3Config model)
- Kosmos2Config configuration class: Kosmos2Model (Kosmos2Config model)
- Kosmos2_5Config configuration class: Kosmos2_5Model (Kosmos2_5Config model)
- KyutaiSpeechToTextConfig configuration class: KyutaiSpeechToTextModel (KyutaiSpeechToTextConfig model)
- LEDConfig configuration class: LEDModel (LEDConfig model)
- LasrCTCConfig configuration class: LasrForCTC (LasrCTCConfig model)
- LasrEncoderConfig configuration class: LasrEncoder (LasrEncoderConfig model)
- LayoutLMConfig configuration class: LayoutLMModel (LayoutLMConfig model)
- LayoutLMv2Config configuration class: LayoutLMv2Model (LayoutLMv2Config model)
- LayoutLMv3Config configuration class: LayoutLMv3Model (LayoutLMv3Config model)
- LevitConfig configuration class: LevitModel (LevitConfig model)
- Lfm2Config configuration class: Lfm2Model (Lfm2Config model)
- Lfm2MoeConfig configuration class: Lfm2MoeModel (Lfm2MoeConfig model)
- Lfm2VlConfig configuration class: Lfm2VlModel (Lfm2VlConfig model)
- LightGlueConfig configuration class: LightGlueForKeypointMatching (LightGlueConfig model)
- LightOnOcrConfig configuration class: LightOnOcrModel (LightOnOcrConfig model)
- LiltConfig configuration class: LiltModel (LiltConfig model)
- Llama4Config configuration class: Llama4ForConditionalGeneration (Llama4Config model)
- Llama4TextConfig configuration class: Llama4TextModel (Llama4TextConfig model)
- LlamaConfig configuration class: LlamaModel (LlamaConfig model)
- LlavaConfig configuration class: LlavaModel (LlavaConfig model)
- LlavaNextConfig configuration class: LlavaNextModel (LlavaNextConfig model)
- LlavaNextVideoConfig configuration class: LlavaNextVideoModel (LlavaNextVideoConfig model)
- LlavaOnevisionConfig configuration class: LlavaOnevisionModel (LlavaOnevisionConfig model)
- LongT5Config configuration class: LongT5Model (LongT5Config model)
- LongcatFlashConfig configuration class: LongcatFlashModel (LongcatFlashConfig model)
- LongformerConfig configuration class: LongformerModel (LongformerConfig model)
- LukeConfig configuration class: LukeModel (LukeConfig model)
- LwDetrConfig configuration class: LwDetrModel (LwDetrConfig model)
- LxmertConfig configuration class: LxmertModel (LxmertConfig model)
- M2M100Config configuration class: M2M100Model (M2M100Config model)
- MBartConfig configuration class: MBartModel (MBartConfig model)
- MLCDVisionConfig configuration class: MLCDVisionModel (MLCDVisionConfig model)
- MMGroundingDinoConfig configuration class: MMGroundingDinoModel (MMGroundingDinoConfig model)
- MPNetConfig configuration class: MPNetModel (MPNetConfig model)
- MT5Config configuration class: MT5Model (MT5Config model)
- Mamba2Config configuration class: Mamba2Model (Mamba2Config model)
- MambaConfig configuration class: MambaModel (MambaConfig model)
- MarianConfig configuration class: MarianModel (MarianConfig model)
- MarkupLMConfig configuration class: MarkupLMModel (MarkupLMConfig model)
- Mask2FormerConfig configuration class: Mask2FormerModel (Mask2FormerConfig model)
- MaskFormerConfig configuration class: MaskFormerModel (MaskFormerConfig model)
- MaskFormerSwinConfig configuration class: MaskFormerSwinModel (MaskFormerSwinConfig model)
- MegatronBertConfig configuration class: MegatronBertModel (MegatronBertConfig model)
- MetaClip2Config configuration class: MetaClip2Model (MetaClip2Config model)
- MgpstrConfig configuration class: MgpstrForSceneTextRecognition (MgpstrConfig model)
- MimiConfig configuration class: MimiModel (MimiConfig model)
- MiniMaxConfig configuration class: MiniMaxModel (MiniMaxConfig model)
- MiniMaxM2Config configuration class: MiniMaxM2Model (MiniMaxM2Config model)
- Ministral3Config configuration class: Ministral3Model (Ministral3Config model)
- MinistralConfig configuration class: MinistralModel (MinistralConfig model)
- Mistral3Config configuration class: Mistral3Model (Mistral3Config model)
- Mistral4Config configuration class: Mistral4Model (Mistral4Config model)
- MistralConfig configuration class: MistralModel (MistralConfig model)
- MixtralConfig configuration class: MixtralModel (MixtralConfig model)
- MllamaConfig configuration class: MllamaModel (MllamaConfig model)
- MobileBertConfig configuration class: MobileBertModel (MobileBertConfig model)
- MobileNetV1Config configuration class: MobileNetV1Model (MobileNetV1Config model)
- MobileNetV2Config configuration class: MobileNetV2Model (MobileNetV2Config model)
- MobileViTConfig configuration class: MobileViTModel (MobileViTConfig model)
- MobileViTV2Config configuration class: MobileViTV2Model (MobileViTV2Config model)
- ModernBertConfig configuration class: ModernBertModel (ModernBertConfig model)
- ModernBertDecoderConfig configuration class: ModernBertDecoderModel (ModernBertDecoderConfig model)
- ModernVBertConfig configuration class: ModernVBertModel (ModernVBertConfig model)
- MoonshineConfig configuration class: MoonshineModel (MoonshineConfig model)
- MoonshineStreamingConfig configuration class: MoonshineStreamingModel (MoonshineStreamingConfig model)
- MoshiConfig configuration class: MoshiModel (MoshiConfig model)
- MptConfig configuration class: MptModel (MptConfig model)
- MraConfig configuration class: MraModel (MraConfig model)
- MusicFlamingoConfig configuration class: MusicFlamingoForConditionalGeneration (MusicFlamingoConfig model)
- MusicgenConfig configuration class: MusicgenModel (MusicgenConfig model)
- MusicgenMelodyConfig configuration class: MusicgenMelodyModel (MusicgenMelodyConfig model)
- MvpConfig configuration class: MvpModel (MvpConfig model)
- NanoChatConfig configuration class: NanoChatModel (NanoChatConfig model)
- NemotronConfig configuration class: NemotronModel (NemotronConfig model)
- NemotronHConfig configuration class: NemotronHModel (NemotronHConfig model)
- NllbMoeConfig configuration class: NllbMoeModel (NllbMoeConfig model)
- NomicBertConfig configuration class: NomicBertModel (NomicBertConfig model)
- NystromformerConfig configuration class: NystromformerModel (NystromformerConfig model)
- OPTConfig configuration class: OPTModel (OPTConfig model)
- Olmo2Config configuration class: Olmo2Model (Olmo2Config model)
- Olmo3Config configuration class: Olmo3Model (Olmo3Config model)
- OlmoConfig configuration class: OlmoModel (OlmoConfig model)
- OlmoHybridConfig configuration class: OlmoHybridModel (OlmoHybridConfig model)
- OlmoeConfig configuration class: OlmoeModel (OlmoeConfig model)
- OmDetTurboConfig configuration class: OmDetTurboForObjectDetection (OmDetTurboConfig model)
- OneFormerConfig configuration class: OneFormerModel (OneFormerConfig model)
- OpenAIGPTConfig configuration class: OpenAIGPTModel (OpenAIGPTConfig model)
- Ovis2Config configuration class: Ovis2Model (Ovis2Config model)
- OwlViTConfig configuration class: OwlViTModel (OwlViTConfig model)
- Owlv2Config configuration class: Owlv2Model (Owlv2Config model)
- PI0Config configuration class: PI0Model (PI0Config model)
- PLBartConfig configuration class: PLBartModel (PLBartConfig model)
- PPDocLayoutV3Config configuration class: PPDocLayoutV3Model (PPDocLayoutV3Config model)
- PPOCRV5MobileRecConfig configuration class: PPOCRV5MobileRecModel (PPOCRV5MobileRecConfig model)
- PPOCRV5ServerRecConfig configuration class: PPOCRV5ServerRecModel (PPOCRV5ServerRecConfig model)
- PaliGemmaConfig configuration class: PaliGemmaModel (PaliGemmaConfig model)
- ParakeetCTCConfig configuration class: ParakeetForCTC (ParakeetCTCConfig model)
- ParakeetEncoderConfig configuration class: ParakeetEncoder (ParakeetEncoderConfig model)
- PatchTSMixerConfig configuration class: PatchTSMixerModel (PatchTSMixerConfig model)
- PatchTSTConfig configuration class: PatchTSTModel (PatchTSTConfig model)
- PeAudioConfig configuration class: PeAudioModel (PeAudioConfig model)
- PeAudioEncoderConfig configuration class: PeAudioEncoder (PeAudioEncoderConfig model)
- PeAudioVideoConfig configuration class: PeAudioVideoModel (PeAudioVideoConfig model)
- PeAudioVideoEncoderConfig configuration class: PeAudioVideoEncoder (PeAudioVideoEncoderConfig model)
- PeVideoConfig configuration class: PeVideoModel (PeVideoConfig model)
- PeVideoEncoderConfig configuration class: PeVideoEncoder (PeVideoEncoderConfig model)
- PegasusConfig configuration class: PegasusModel (PegasusConfig model)
- PegasusXConfig configuration class: PegasusXModel (PegasusXConfig model)
- PerceiverConfig configuration class: PerceiverModel (PerceiverConfig model)
- PerceptionLMConfig configuration class: PerceptionLMModel (PerceptionLMConfig model)
- PersimmonConfig configuration class: PersimmonModel (PersimmonConfig model)
- Phi3Config configuration class: Phi3Model (Phi3Config model)
- Phi4MultimodalConfig configuration class: Phi4MultimodalModel (Phi4MultimodalConfig model)
- PhiConfig configuration class: PhiModel (PhiConfig model)
- PhimoeConfig configuration class: PhimoeModel (PhimoeConfig model)
- PixioConfig configuration class: PixioModel (PixioConfig model)
- PixtralVisionConfig configuration class: PixtralVisionModel (PixtralVisionConfig model)
- PoolFormerConfig configuration class: PoolFormerModel (PoolFormerConfig model)
- ProphetNetConfig configuration class: ProphetNetModel (ProphetNetConfig model)
- PvtConfig configuration class: PvtModel (PvtConfig model)
- PvtV2Config configuration class: PvtV2Model (PvtV2Config model)
- Qwen2AudioEncoderConfig configuration class: Qwen2AudioEncoder (Qwen2AudioEncoderConfig model)
- Qwen2Config configuration class: Qwen2Model (Qwen2Config model)
- Qwen2MoeConfig configuration class: Qwen2MoeModel (Qwen2MoeConfig model)
- Qwen2VLConfig configuration class: Qwen2VLModel (Qwen2VLConfig model)
- Qwen2VLTextConfig configuration class: Qwen2VLTextModel (Qwen2VLTextConfig model)
- Qwen2_5_VLConfig configuration class: Qwen2_5_VLModel (Qwen2_5_VLConfig model)
- Qwen2_5_VLTextConfig configuration class: Qwen2_5_VLTextModel (Qwen2_5_VLTextConfig model)
- Qwen3Config configuration class: Qwen3Model (Qwen3Config model)
- Qwen3MoeConfig configuration class: Qwen3MoeModel (Qwen3MoeConfig model)
- Qwen3NextConfig configuration class: Qwen3NextModel (Qwen3NextConfig model)
- Qwen3VLConfig configuration class: Qwen3VLModel (Qwen3VLConfig model)
- Qwen3VLMoeConfig configuration class: Qwen3VLMoeModel (Qwen3VLMoeConfig model)
- Qwen3VLMoeTextConfig configuration class: Qwen3VLMoeTextModel (Qwen3VLMoeTextConfig model)
- Qwen3VLTextConfig configuration class: Qwen3VLTextModel (Qwen3VLTextConfig model)
- Qwen3_5Config configuration class: Qwen3_5Model (Qwen3_5Config model)
- Qwen3_5MoeConfig configuration class: Qwen3_5MoeModel (Qwen3_5MoeConfig model)
- Qwen3_5MoeTextConfig configuration class: Qwen3_5MoeTextModel (Qwen3_5MoeTextConfig model)
- Qwen3_5TextConfig configuration class: Qwen3_5TextModel (Qwen3_5TextConfig model)
- RTDetrConfig configuration class: RTDetrModel (RTDetrConfig model)
- RTDetrV2Config configuration class: RTDetrV2Model (RTDetrV2Config model)
- RecurrentGemmaConfig configuration class: RecurrentGemmaModel (RecurrentGemmaConfig model)
- ReformerConfig configuration class: ReformerModel (ReformerConfig model)
- RegNetConfig configuration class: RegNetModel (RegNetConfig model)
- RemBertConfig configuration class: RemBertModel (RemBertConfig model)
- ResNetConfig configuration class: ResNetModel (ResNetConfig model)
- RoCBertConfig configuration class: RoCBertModel (RoCBertConfig model)
- RoFormerConfig configuration class: RoFormerModel (RoFormerConfig model)
- RobertaConfig configuration class: RobertaModel (RobertaConfig model)
- RobertaPreLayerNormConfig configuration class: RobertaPreLayerNormModel (RobertaPreLayerNormConfig model)
- RwkvConfig configuration class: RwkvModel (RwkvConfig model)
- SEWConfig configuration class: SEWModel (SEWConfig model)
- SEWDConfig configuration class: SEWDModel (SEWDConfig model)
- Sam2Config configuration class: Sam2Model (Sam2Config model)
- Sam2HieraDetConfig configuration class: Sam2HieraDetModel (Sam2HieraDetConfig model)
- Sam2VideoConfig configuration class: Sam2VideoModel (Sam2VideoConfig model)
- Sam2VisionConfig configuration class: Sam2VisionModel (Sam2VisionConfig model)
- Sam3Config configuration class: Sam3Model (Sam3Config model)
- Sam3LiteTextConfig configuration class: Sam3LiteTextModel (Sam3LiteTextConfig model)
- Sam3LiteTextTextConfig configuration class: Sam3LiteTextTextModel (Sam3LiteTextTextConfig model)
- Sam3TrackerConfig configuration class: Sam3TrackerModel (Sam3TrackerConfig model)
- Sam3TrackerVideoConfig configuration class: Sam3TrackerVideoModel (Sam3TrackerVideoConfig model)
- Sam3ViTConfig configuration class: Sam3ViTModel (Sam3ViTConfig model)
- Sam3VideoConfig configuration class: Sam3VideoModel (Sam3VideoConfig model)
- Sam3VisionConfig configuration class: Sam3VisionModel (Sam3VisionConfig model)
- SamConfig configuration class: SamModel (SamConfig model)
- SamHQConfig configuration class: SamHQModel (SamHQConfig model)
- SamHQVisionConfig configuration class: SamHQVisionModel (SamHQVisionConfig model)
- SamVisionConfig configuration class: SamVisionModel (SamVisionConfig model)
- SeamlessM4TConfig configuration class: SeamlessM4TModel (SeamlessM4TConfig model)
- SeamlessM4Tv2Config configuration class: SeamlessM4Tv2Model (SeamlessM4Tv2Config model)
- SeedOssConfig configuration class: SeedOssModel (SeedOssConfig model)
- SegGptConfig configuration class: SegGptModel (SegGptConfig model)
- SegformerConfig configuration class: SegformerModel (SegformerConfig model)
- Siglip2Config configuration class: Siglip2Model (Siglip2Config model)
- Siglip2VisionConfig configuration class: Siglip2VisionModel (Siglip2VisionConfig model)
- SiglipConfig configuration class: SiglipModel (SiglipConfig model)
- SiglipVisionConfig configuration class: SiglipVisionModel (SiglipVisionConfig model)
- SmolLM3Config configuration class: SmolLM3Model (SmolLM3Config model)
- SmolVLMConfig configuration class: SmolVLMModel (SmolVLMConfig model)
- SmolVLMVisionConfig configuration class: SmolVLMVisionTransformer (SmolVLMVisionConfig model)
- SolarOpenConfig configuration class: SolarOpenModel (SolarOpenConfig model)
- Speech2TextConfig configuration class: Speech2TextModel (Speech2TextConfig model)
- SpeechT5Config configuration class: SpeechT5Model (SpeechT5Config model)
- SplinterConfig configuration class: SplinterModel (SplinterConfig model)
- SqueezeBertConfig configuration class: SqueezeBertModel (SqueezeBertConfig model)
- StableLmConfig configuration class: StableLmModel (StableLmConfig model)
- Starcoder2Config configuration class: Starcoder2Model (Starcoder2Config model)
- SwiftFormerConfig configuration class: SwiftFormerModel (SwiftFormerConfig model)
- Swin2SRConfig configuration class: Swin2SRModel (Swin2SRConfig model)
- SwinConfig configuration class: SwinModel (SwinConfig model)
- Swinv2Config configuration class: Swinv2Model (Swinv2Config model)
- SwitchTransformersConfig configuration class: SwitchTransformersModel (SwitchTransformersConfig model)
- T5Config configuration class: T5Model (T5Config model)
- T5Gemma2Config configuration class: T5Gemma2Model (T5Gemma2Config model)
- T5Gemma2EncoderConfig configuration class: T5Gemma2Encoder (T5Gemma2EncoderConfig model)
- T5GemmaConfig configuration class: T5GemmaModel (T5GemmaConfig model)
- TableTransformerConfig configuration class: TableTransformerModel (TableTransformerConfig model)
- TapasConfig configuration class: TapasModel (TapasConfig model)
- TextNetConfig configuration class: TextNetModel (TextNetConfig model)
- TimeSeriesTransformerConfig configuration class: TimeSeriesTransformerModel (TimeSeriesTransformerConfig model)
- TimesFm2_5Config configuration class: TimesFm2_5Model (TimesFm2_5Config model)
- TimesFmConfig configuration class: TimesFmModel (TimesFmConfig model)
- TimesformerConfig configuration class: TimesformerModel (TimesformerConfig model)
- TimmBackboneConfig configuration class: TimmBackbone (TimmBackboneConfig model)
- TimmWrapperConfig configuration class: TimmWrapperModel (TimmWrapperConfig model)
- TvpConfig configuration class: TvpModel (TvpConfig model)
- UMT5Config configuration class: UMT5Model (UMT5Config model)
- UVDocConfig configuration class: UVDocModel (UVDocConfig model)
- UdopConfig configuration class: UdopModel (UdopConfig model)
- UniSpeechConfig configuration class: UniSpeechModel (UniSpeechConfig model)
- UniSpeechSatConfig configuration class: UniSpeechSatModel (UniSpeechSatConfig model)
- UnivNetConfig configuration class: UnivNetModel (UnivNetConfig model)
- VJEPA2Config configuration class: VJEPA2Model (VJEPA2Config model)
- VaultGemmaConfig configuration class: VaultGemmaModel (VaultGemmaConfig model)
- ViTConfig configuration class: ViTModel (ViTConfig model)
- ViTMAEConfig configuration class: ViTMAEModel (ViTMAEConfig model)
- ViTMSNConfig configuration class: ViTMSNModel (ViTMSNConfig model)
- VibeVoiceAcousticTokenizerConfig configuration class: VibeVoiceAcousticTokenizerModel (VibeVoiceAcousticTokenizerConfig model)
- VibeVoiceAcousticTokenizerDecoderConfig configuration class: VibeVoiceAcousticTokenizerDecoderModel (VibeVoiceAcousticTokenizerDecoderConfig model)
- VibeVoiceAcousticTokenizerEncoderConfig configuration class: VibeVoiceAcousticTokenizerEncoderModel (VibeVoiceAcousticTokenizerEncoderConfig model)
- VibeVoiceAsrConfig configuration class: VibeVoiceAsrForConditionalGeneration (VibeVoiceAsrConfig model)
- VideoLlama3Config configuration class: VideoLlama3Model (VideoLlama3Config model)
- VideoLlama3VisionConfig configuration class: VideoLlama3VisionModel (VideoLlama3VisionConfig model)
- VideoLlavaConfig configuration class: VideoLlavaModel (VideoLlavaConfig model)
- VideoMAEConfig configuration class: VideoMAEModel (VideoMAEConfig model)
- ViltConfig configuration class: ViltModel (ViltConfig model)
- VipLlavaConfig configuration class: VipLlavaModel (VipLlavaConfig model)
- VisionTextDualEncoderConfig configuration class: VisionTextDualEncoderModel (VisionTextDualEncoderConfig model)
- VisualBertConfig configuration class: VisualBertModel (VisualBertConfig model)
- VitDetConfig configuration class: VitDetModel (VitDetConfig model)
- VitsConfig configuration class: VitsModel (VitsConfig model)
- VivitConfig configuration class: VivitModel (VivitConfig model)
- VoxtralConfig configuration class: VoxtralForConditionalGeneration (VoxtralConfig model)
- VoxtralEncoderConfig configuration class: VoxtralEncoder (VoxtralEncoderConfig model)
- VoxtralRealtimeConfig configuration class: VoxtralRealtimeForConditionalGeneration (VoxtralRealtimeConfig model)
- VoxtralRealtimeEncoderConfig configuration class: VoxtralRealtimeEncoder (VoxtralRealtimeEncoderConfig model)
- VoxtralRealtimeTextConfig configuration class: VoxtralRealtimeTextModel (VoxtralRealtimeTextConfig model)
- Wav2Vec2BertConfig configuration class: Wav2Vec2BertModel (Wav2Vec2BertConfig model)
- Wav2Vec2Config configuration class: Wav2Vec2Model (Wav2Vec2Config model)
- Wav2Vec2ConformerConfig configuration class: Wav2Vec2ConformerModel (Wav2Vec2ConformerConfig model)
- WavLMConfig configuration class: WavLMModel (WavLMConfig model)
- WhisperConfig configuration class: WhisperModel (WhisperConfig model)
- XCLIPConfig configuration class: XCLIPModel (XCLIPConfig model)
- XGLMConfig configuration class: XGLMModel (XGLMConfig model)
- XLMConfig configuration class: XLMModel (XLMConfig model)
- XLMRobertaConfig configuration class: XLMRobertaModel (XLMRobertaConfig model)
- XLMRobertaXLConfig configuration class: XLMRobertaXLModel (XLMRobertaXLConfig model)
- XLNetConfig configuration class: XLNetModel (XLNetConfig model)
- XcodecConfig configuration class: XcodecModel (XcodecConfig model)
- XmodConfig configuration class: XmodModel (XmodConfig model)
- YolosConfig configuration class: YolosModel (YolosConfig model)
- YosoConfig configuration class: YosoModel (YosoConfig model)
- YoutuConfig configuration class: YoutuModel (YoutuConfig model)
- Zamba2Config configuration class: Zamba2Model (Zamba2Config model)
- ZambaConfig configuration class: ZambaModel (ZambaConfig model)
- xLSTMConfig configuration class: xLSTMModel (xLSTMConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. Otherwise, the default is the manual "eager" implementation.
Instantiates one of the base model classes of the library from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) —
A state dictionary to use instead of a state dictionary loaded from saved weights file.
This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case though, you should check if using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., do not try to download the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initiate the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be passed directly to the underlying model's __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model's __init__ function.
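As an offline sketch of these parameters (assuming transformers and torch are installed; the tiny config sizes and the temporary directory are stand-ins for a real Hub repo), the round trip below saves a model with save_pretrained() and reloads it through AutoModel.from_pretrained(), including a kwargs override of a configuration attribute:

```python
import tempfile

from transformers import AutoModel, BertConfig, BertModel

# Save a tiny, randomly initialized BERT checkpoint to a local directory,
# producing the same config.json + weights layout as any real checkpoint.
config = BertConfig(hidden_size=32, num_hidden_layers=1, num_attention_heads=2, intermediate_size=64)
with tempfile.TemporaryDirectory() as save_dir:
    BertModel(config).save_pretrained(save_dir)

    # AutoModel reads config.json, sees model_type == "bert", and returns a BertModel.
    # output_attentions matches a configuration attribute, so it overrides the config.
    model = AutoModel.from_pretrained(save_dir, output_attentions=True)

print(type(model).__name__, model.config.output_attentions)  # BertModel True
```

Passing a model id string such as "google-bert/bert-base-cased" instead of the local directory works the same way, except that the config and weights are first downloaded from the Hub (and cached under cache_dir).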
Instantiate one of the base model classes of the library from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- afmoe — AfmoeModel (AfmoeConfig model)
- aimv2 — Aimv2Model (Aimv2Config model)
- aimv2_vision_model — Aimv2VisionModel (Aimv2VisionConfig model)
- albert — AlbertModel (AlbertConfig model)
- align — AlignModel (AlignConfig model)
- altclip — AltCLIPModel (AltCLIPConfig model)
- apertus — ApertusModel (ApertusConfig model)
- arcee — ArceeModel (ArceeConfig model)
- aria — AriaModel (AriaConfig model)
- aria_text — AriaTextModel (AriaTextConfig model)
- audio-spectrogram-transformer — ASTModel (ASTConfig model)
- audioflamingo3 — AudioFlamingo3ForConditionalGeneration (AudioFlamingo3Config model)
- audioflamingo3_encoder — AudioFlamingo3Encoder (AudioFlamingo3EncoderConfig model)
- autoformer — AutoformerModel (AutoformerConfig model)
- aya_vision — AyaVisionModel (AyaVisionConfig model)
- bamba — BambaModel (BambaConfig model)
- bark — BarkModel (BarkConfig model)
- bart — BartModel (BartConfig model)
- beit — BeitModel (BeitConfig model)
- bert — BertModel (BertConfig model)
- bert-generation — BertGenerationEncoder (BertGenerationConfig model)
- big_bird — BigBirdModel (BigBirdConfig model)
- bigbird_pegasus — BigBirdPegasusModel (BigBirdPegasusConfig model)
- biogpt — BioGptModel (BioGptConfig model)
- bit — BitModel (BitConfig model)
- bitnet — BitNetModel (BitNetConfig model)
- blenderbot — BlenderbotModel (BlenderbotConfig model)
- blenderbot-small — BlenderbotSmallModel (BlenderbotSmallConfig model)
- blip — BlipModel (BlipConfig model)
- blip-2 — Blip2Model (Blip2Config model)
- blip_2_qformer — Blip2QFormerModel (Blip2QFormerConfig model)
- bloom — BloomModel (BloomConfig model)
- blt — BltModel (BltConfig model)
- bridgetower — BridgeTowerModel (BridgeTowerConfig model)
- bros — BrosModel (BrosConfig model)
- camembert — CamembertModel (CamembertConfig model)
- canine — CanineModel (CanineConfig model)
- chameleon — ChameleonModel (ChameleonConfig model)
- chinese_clip — ChineseCLIPModel (ChineseCLIPConfig model)
- chinese_clip_vision_model — ChineseCLIPVisionModel (ChineseCLIPVisionConfig model)
- clap — ClapModel (ClapConfig model)
- clip — CLIPModel (CLIPConfig model)
- clip_text_model — CLIPTextModel (CLIPTextConfig model)
- clip_vision_model — CLIPVisionModel (CLIPVisionConfig model)
- clipseg — CLIPSegModel (CLIPSegConfig model)
- clvp — ClvpModelForConditionalGeneration (ClvpConfig model)
- codegen — CodeGenModel (CodeGenConfig model)
- cohere — CohereModel (CohereConfig model)
- cohere2 — Cohere2Model (Cohere2Config model)
- cohere2_vision — Cohere2VisionModel (Cohere2VisionConfig model)
- cohere_asr — CohereAsrModel (CohereAsrConfig model)
- conditional_detr — ConditionalDetrModel (ConditionalDetrConfig model)
- convbert — ConvBertModel (ConvBertConfig model)
- convnext — ConvNextModel (ConvNextConfig model)
- convnextv2 — ConvNextV2Model (ConvNextV2Config model)
- cpmant — CpmAntModel (CpmAntConfig model)
- csm — CsmForConditionalGeneration (CsmConfig model)
- ctrl — CTRLModel (CTRLConfig model)
- cvt — CvtModel (CvtConfig model)
- cwm — CwmModel (CwmConfig model)
- d_fine — DFineModel (DFineConfig model)
- dab-detr — DabDetrModel (DabDetrConfig model)
- dac — DacModel (DacConfig model)
- data2vec-audio — Data2VecAudioModel (Data2VecAudioConfig model)
- data2vec-text — Data2VecTextModel (Data2VecTextConfig model)
- data2vec-vision — Data2VecVisionModel (Data2VecVisionConfig model)
- dbrx — DbrxModel (DbrxConfig model)
- deberta — DebertaModel (DebertaConfig model)
- deberta-v2 — DebertaV2Model (DebertaV2Config model)
- decision_transformer — DecisionTransformerModel (DecisionTransformerConfig model)
- deepseek_v2 — DeepseekV2Model (DeepseekV2Config model)
- deepseek_v3 — DeepseekV3Model (DeepseekV3Config model)
- deepseek_vl — DeepseekVLModel (DeepseekVLConfig model)
- deepseek_vl_hybrid — DeepseekVLHybridModel (DeepseekVLHybridConfig model)
- deformable_detr — DeformableDetrModel (DeformableDetrConfig model)
- deit — DeiTModel (DeiTConfig model)
- depth_pro — DepthProModel (DepthProConfig model)
- detr — DetrModel (DetrConfig model)
- dia — DiaModel (DiaConfig model)
- diffllama — DiffLlamaModel (DiffLlamaConfig model)
- dinat — DinatModel (DinatConfig model)
- dinov2 — Dinov2Model (Dinov2Config model)
- dinov2_with_registers — Dinov2WithRegistersModel (Dinov2WithRegistersConfig model)
- dinov3_convnext — DINOv3ConvNextModel (DINOv3ConvNextConfig model)
- dinov3_vit — DINOv3ViTModel (DINOv3ViTConfig model)
- distilbert — DistilBertModel (DistilBertConfig model)
- doge — DogeModel (DogeConfig model)
- donut-swin — DonutSwinModel (DonutSwinConfig model)
- dots1 — Dots1Model (Dots1Config model)
- dpr — DPRQuestionEncoder (DPRConfig model)
- dpt — DPTModel (DPTConfig model)
- edgetam — EdgeTamModel (EdgeTamConfig model)
- edgetam_video — EdgeTamVideoModel (EdgeTamVideoConfig model)
- edgetam_vision_model — EdgeTamVisionModel (EdgeTamVisionConfig model)
- efficientloftr — EfficientLoFTRModel (EfficientLoFTRConfig model)
- efficientnet — EfficientNetModel (EfficientNetConfig model)
- electra — ElectraModel (ElectraConfig model)
- emu3 — Emu3Model (Emu3Config model)
- encodec — EncodecModel (EncodecConfig model)
- ernie — ErnieModel (ErnieConfig model)
- ernie4_5 — Ernie4_5Model (Ernie4_5Config model)
- ernie4_5_moe — Ernie4_5_MoeModel (Ernie4_5_MoeConfig model)
- ernie4_5_vl_moe — Ernie4_5_VLMoeModel (Ernie4_5_VLMoeConfig model)
- esm — EsmModel (EsmConfig model)
- eurobert — EuroBertModel (EuroBertConfig model)
- evolla — EvollaModel (EvollaConfig model)
- exaone4 — Exaone4Model (Exaone4Config model)
- exaone_moe — ExaoneMoeModel (ExaoneMoeConfig model)
- falcon — FalconModel (FalconConfig model)
- falcon_h1 — FalconH1Model (FalconH1Config model)
- falcon_mamba — FalconMambaModel (FalconMambaConfig model)
- fast_vlm — FastVlmModel (FastVlmConfig model)
- fastspeech2_conformer — FastSpeech2ConformerModel (FastSpeech2ConformerConfig model)
- fastspeech2_conformer_with_hifigan — FastSpeech2ConformerWithHifiGan (FastSpeech2ConformerWithHifiGanConfig model)
- flaubert — FlaubertModel (FlaubertConfig model)
- flava — FlavaModel (FlavaConfig model)
- flex_olmo — FlexOlmoModel (FlexOlmoConfig model)
- florence2 — Florence2Model (Florence2Config model)
- fnet — FNetModel (FNetConfig model)
- focalnet — FocalNetModel (FocalNetConfig model)
- fsmt — FSMTModel (FSMTConfig model)
- funnel — FunnelModel or FunnelBaseModel (FunnelConfig model)
- fuyu — FuyuModel (FuyuConfig model)
- gemma — GemmaModel (GemmaConfig model)
- gemma2 — Gemma2Model (Gemma2Config model)
- gemma3 — Gemma3Model (Gemma3Config model)
- gemma3_text — Gemma3TextModel (Gemma3TextConfig model)
- gemma3n — Gemma3nModel (Gemma3nConfig model)
- gemma3n_audio — Gemma3nAudioEncoder (Gemma3nAudioConfig model)
- gemma3n_text — Gemma3nTextModel (Gemma3nTextConfig model)
- gemma3n_vision — TimmWrapperModel (Gemma3nVisionConfig model)
- gemma4 — Gemma4Model (Gemma4Config model)
- gemma4_audio — Gemma4AudioModel (Gemma4AudioConfig model)
- gemma4_text — Gemma4TextModel (Gemma4TextConfig model)
- gemma4_vision — Gemma4VisionModel (Gemma4VisionConfig model)
- git — GitModel (GitConfig model)
- glm — GlmModel (GlmConfig model)
- glm4 — Glm4Model (Glm4Config model)
- glm46v — Glm46VModel (Glm46VConfig model)
- glm4_moe — Glm4MoeModel (Glm4MoeConfig model)
- glm4_moe_lite — Glm4MoeLiteModel (Glm4MoeLiteConfig model)
- glm4v — Glm4vModel (Glm4vConfig model)
- glm4v_moe — Glm4vMoeModel (Glm4vMoeConfig model)
- glm4v_moe_text — Glm4vMoeTextModel (Glm4vMoeTextConfig model)
- glm4v_moe_vision — Glm4vMoeVisionModel (Glm4vMoeVisionConfig model)
- glm4v_text — Glm4vTextModel (Glm4vTextConfig model)
- glm4v_vision — Glm4vVisionModel (Glm4vVisionConfig model)
- glm_image — GlmImageModel (GlmImageConfig model)
- glm_image_text — GlmImageTextModel (GlmImageTextConfig model)
- glm_image_vision — GlmImageVisionModel (GlmImageVisionConfig model)
- glm_image_vqmodel — GlmImageVQVAE (GlmImageVQVAEConfig model)
- glm_moe_dsa — GlmMoeDsaModel (GlmMoeDsaConfig model)
- glm_ocr — GlmOcrModel (GlmOcrConfig model)
- glm_ocr_text — GlmOcrTextModel (GlmOcrTextConfig model)
- glm_ocr_vision — GlmOcrVisionModel (GlmOcrVisionConfig model)
- glmasr — GlmAsrForConditionalGeneration (GlmAsrConfig model)
- glmasr_encoder — GlmAsrEncoder (GlmAsrEncoderConfig model)
- glpn — GLPNModel (GLPNConfig model)
- got_ocr2 — GotOcr2Model (GotOcr2Config model)
- gpt-sw3 — GPT2Model (GPT2Config model)
- gpt2 — GPT2Model (GPT2Config model)
- gpt_bigcode — GPTBigCodeModel (GPTBigCodeConfig model)
- gpt_neo — GPTNeoModel (GPTNeoConfig model)
- gpt_neox — GPTNeoXModel (GPTNeoXConfig model)
- gpt_neox_japanese — GPTNeoXJapaneseModel (GPTNeoXJapaneseConfig model)
- gpt_oss — GptOssModel (GptOssConfig model)
- gptj — GPTJModel (GPTJConfig model)
- granite — GraniteModel (GraniteConfig model)
- granitemoe — GraniteMoeModel (GraniteMoeConfig model)
- granitemoehybrid — GraniteMoeHybridModel (GraniteMoeHybridConfig model)
- granitemoeshared — GraniteMoeSharedModel (GraniteMoeSharedConfig model)
- grounding-dino — GroundingDinoModel (GroundingDinoConfig model)
- groupvit — GroupViTModel (GroupViTConfig model)
- helium — HeliumModel (HeliumConfig model)
- hgnet_v2 — HGNetV2Backbone (HGNetV2Config model)
- hiera — HieraModel (HieraConfig model)
- higgs_audio_v2 — HiggsAudioV2ForConditionalGeneration (HiggsAudioV2Config model)
- higgs_audio_v2_tokenizer — HiggsAudioV2TokenizerModel (HiggsAudioV2TokenizerConfig model)
- hubert — HubertModel (HubertConfig model)
- hunyuan_v1_dense — HunYuanDenseV1Model (HunYuanDenseV1Config model)
- hunyuan_v1_moe — HunYuanMoEV1Model (HunYuanMoEV1Config model)
- ibert — IBertModel (IBertConfig model)
- idefics — IdeficsModel (IdeficsConfig model)
- idefics2 — Idefics2Model (Idefics2Config model)
- idefics3 — Idefics3Model (Idefics3Config model)
- idefics3_vision — Idefics3VisionTransformer (Idefics3VisionConfig model)
- ijepa — IJepaModel (IJepaConfig model)
- imagegpt — ImageGPTModel (ImageGPTConfig model)
- informer — InformerModel (InformerConfig model)
- instructblip — InstructBlipModel (InstructBlipConfig model)
- instructblipvideo — InstructBlipVideoModel (InstructBlipVideoConfig model)
- internvl — InternVLModel (InternVLConfig model)
- internvl_vision — InternVLVisionModel (InternVLVisionConfig model)
- jais2 — Jais2Model (Jais2Config model)
- jamba — JambaModel (JambaConfig model)
- janus — JanusModel (JanusConfig model)
- jetmoe — JetMoeModel (JetMoeConfig model)
- jina_embeddings_v3 — JinaEmbeddingsV3Model (JinaEmbeddingsV3Config model)
- kosmos-2 — Kosmos2Model (Kosmos2Config model)
- kosmos-2.5 — Kosmos2_5Model (Kosmos2_5Config model)
- kyutai_speech_to_text — KyutaiSpeechToTextModel (KyutaiSpeechToTextConfig model)
- lasr_ctc — LasrForCTC (LasrCTCConfig model)
- lasr_encoder — LasrEncoder (LasrEncoderConfig model)
- layoutlm — LayoutLMModel (LayoutLMConfig model)
- layoutlmv2 — LayoutLMv2Model (LayoutLMv2Config model)
- layoutlmv3 — LayoutLMv3Model (LayoutLMv3Config model)
- led — LEDModel (LEDConfig model)
- levit — LevitModel (LevitConfig model)
- lfm2 — Lfm2Model (Lfm2Config model)
- lfm2_moe — Lfm2MoeModel (Lfm2MoeConfig model)
- lfm2_vl — Lfm2VlModel (Lfm2VlConfig model)
- lightglue — LightGlueForKeypointMatching (LightGlueConfig model)
- lighton_ocr — LightOnOcrModel (LightOnOcrConfig model)
- lilt — LiltModel (LiltConfig model)
- llama — LlamaModel (LlamaConfig model)
- llama4 — Llama4ForConditionalGeneration (Llama4Config model)
- llama4_text — Llama4TextModel (Llama4TextConfig model)
- llava — LlavaModel (LlavaConfig model)
- llava_next — LlavaNextModel (LlavaNextConfig model)
- llava_next_video — LlavaNextVideoModel (LlavaNextVideoConfig model)
- llava_onevision — LlavaOnevisionModel (LlavaOnevisionConfig model)
- longcat_flash — LongcatFlashModel (LongcatFlashConfig model)
- longformer — LongformerModel (LongformerConfig model)
- longt5 — LongT5Model (LongT5Config model)
- luke — LukeModel (LukeConfig model)
- lw_detr — LwDetrModel (LwDetrConfig model)
- lxmert — LxmertModel (LxmertConfig model)
- m2m_100 — M2M100Model (M2M100Config model)
- mamba — MambaModel (MambaConfig model)
- mamba2 — Mamba2Model (Mamba2Config model)
- marian — MarianModel (MarianConfig model)
- markuplm — MarkupLMModel (MarkupLMConfig model)
- mask2former — Mask2FormerModel (Mask2FormerConfig model)
- maskformer — MaskFormerModel (MaskFormerConfig model)
- maskformer-swin — MaskFormerSwinModel (MaskFormerSwinConfig model)
- mbart — MBartModel (MBartConfig model)
- megatron-bert — MegatronBertModel (MegatronBertConfig model)
- metaclip_2 — MetaClip2Model (MetaClip2Config model)
- mgp-str — MgpstrForSceneTextRecognition (MgpstrConfig model)
- mimi — MimiModel (MimiConfig model)
- minimax — MiniMaxModel (MiniMaxConfig model)
- minimax_m2 — MiniMaxM2Model (MiniMaxM2Config model)
- ministral — MinistralModel (MinistralConfig model)
- ministral3 — Ministral3Model (Ministral3Config model)
- mistral — MistralModel (MistralConfig model)
- mistral3 — Mistral3Model (Mistral3Config model)
- mistral4 — Mistral4Model (Mistral4Config model)
- mixtral — MixtralModel (MixtralConfig model)
- mlcd — MLCDVisionModel (MLCDVisionConfig model)
- mlcd_vision_model — MLCDVisionModel (MLCDVisionConfig model)
- mllama — MllamaModel (MllamaConfig model)
- mm-grounding-dino — MMGroundingDinoModel (MMGroundingDinoConfig model)
- mobilebert — MobileBertModel (MobileBertConfig model)
- mobilenet_v1 — MobileNetV1Model (MobileNetV1Config model)
- mobilenet_v2 — MobileNetV2Model (MobileNetV2Config model)
- mobilevit — MobileViTModel (MobileViTConfig model)
- mobilevitv2 — MobileViTV2Model (MobileViTV2Config model)
- modernbert — ModernBertModel (ModernBertConfig model)
- modernbert-decoder — ModernBertDecoderModel (ModernBertDecoderConfig model)
- modernvbert — ModernVBertModel (ModernVBertConfig model)
- moonshine — MoonshineModel (MoonshineConfig model)
- moonshine_streaming — MoonshineStreamingModel (MoonshineStreamingConfig model)
- moshi — MoshiModel (MoshiConfig model)
- mpnet — MPNetModel (MPNetConfig model)
- mpt — MptModel (MptConfig model)
- mra — MraModel (MraConfig model)
- mt5 — MT5Model (MT5Config model)
- musicflamingo — MusicFlamingoForConditionalGeneration (MusicFlamingoConfig model)
- musicgen — MusicgenModel (MusicgenConfig model)
- musicgen_melody — MusicgenMelodyModel (MusicgenMelodyConfig model)
- mvp — MvpModel (MvpConfig model)
- nanochat — NanoChatModel (NanoChatConfig model)
- nemotron — NemotronModel (NemotronConfig model)
- nemotron_h — NemotronHModel (NemotronHConfig model)
- nllb-moe — NllbMoeModel (NllbMoeConfig model)
- nomic_bert — NomicBertModel (NomicBertConfig model)
- nystromformer — NystromformerModel (NystromformerConfig model)
- olmo — OlmoModel (OlmoConfig model)
- olmo2 — Olmo2Model (Olmo2Config model)
- olmo3 — Olmo3Model (Olmo3Config model)
- olmo_hybrid — OlmoHybridModel (OlmoHybridConfig model)
- olmoe — OlmoeModel (OlmoeConfig model)
- omdet-turbo — OmDetTurboForObjectDetection (OmDetTurboConfig model)
- oneformer — OneFormerModel (OneFormerConfig model)
- openai-gpt — OpenAIGPTModel (OpenAIGPTConfig model)
- opt — OPTModel (OPTConfig model)
- ovis2 — Ovis2Model (Ovis2Config model)
- owlv2 — Owlv2Model (Owlv2Config model)
- owlvit — OwlViTModel (OwlViTConfig model)
- paligemma — PaliGemmaModel (PaliGemmaConfig model)
- parakeet_ctc — ParakeetForCTC (ParakeetCTCConfig model)
- parakeet_encoder — ParakeetEncoder (ParakeetEncoderConfig model)
- patchtsmixer — PatchTSMixerModel (PatchTSMixerConfig model)
- patchtst — PatchTSTModel (PatchTSTConfig model)
- pe_audio — PeAudioModel (PeAudioConfig model)
- pe_audio_encoder — PeAudioEncoder (PeAudioEncoderConfig model)
- pe_audio_video — PeAudioVideoModel (PeAudioVideoConfig model)
- pe_audio_video_encoder — PeAudioVideoEncoder (PeAudioVideoEncoderConfig model)
- pe_video — PeVideoModel (PeVideoConfig model)
- pe_video_encoder — PeVideoEncoder (PeVideoEncoderConfig model)
- pegasus — PegasusModel (PegasusConfig model)
- pegasus_x — PegasusXModel (PegasusXConfig model)
- perceiver — PerceiverModel (PerceiverConfig model)
- perception_lm — PerceptionLMModel (PerceptionLMConfig model)
- persimmon — PersimmonModel (PersimmonConfig model)
- phi — PhiModel (PhiConfig model)
- phi3 — Phi3Model (Phi3Config model)
- phi4_multimodal — Phi4MultimodalModel (Phi4MultimodalConfig model)
- phimoe — PhimoeModel (PhimoeConfig model)
- pi0 — PI0Model (PI0Config model)
- pixio — PixioModel (PixioConfig model)
- pixtral — PixtralVisionModel (PixtralVisionConfig model)
- plbart — PLBartModel (PLBartConfig model)
- poolformer — PoolFormerModel (PoolFormerConfig model)
- pp_doclayout_v3 — PPDocLayoutV3Model (PPDocLayoutV3Config model)
- pp_ocrv5_mobile_rec — PPOCRV5MobileRecModel (PPOCRV5MobileRecConfig model)
- pp_ocrv5_server_rec — PPOCRV5ServerRecModel (PPOCRV5ServerRecConfig model)
- prophetnet — ProphetNetModel (ProphetNetConfig model)
- pvt — PvtModel (PvtConfig model)
- pvt_v2 — PvtV2Model (PvtV2Config model)
- qwen2 — Qwen2Model (Qwen2Config model)
- qwen2_5_vl — Qwen2_5_VLModel (Qwen2_5_VLConfig model)
- qwen2_5_vl_text — Qwen2_5_VLTextModel (Qwen2_5_VLTextConfig model)
- qwen2_audio_encoder — Qwen2AudioEncoder (Qwen2AudioEncoderConfig model)
- qwen2_moe — Qwen2MoeModel (Qwen2MoeConfig model)
- qwen2_vl — Qwen2VLModel (Qwen2VLConfig model)
- qwen2_vl_text — Qwen2VLTextModel (Qwen2VLTextConfig model)
- qwen3 — Qwen3Model (Qwen3Config model)
- qwen3_5 — Qwen3_5Model (Qwen3_5Config model)
- qwen3_5_moe — Qwen3_5MoeModel (Qwen3_5MoeConfig model)
- qwen3_5_moe_text — Qwen3_5MoeTextModel (Qwen3_5MoeTextConfig model)
- qwen3_5_text — Qwen3_5TextModel (Qwen3_5TextConfig model)
- qwen3_moe — Qwen3MoeModel (Qwen3MoeConfig model)
- qwen3_next — Qwen3NextModel (Qwen3NextConfig model)
- qwen3_vl — Qwen3VLModel (Qwen3VLConfig model)
- qwen3_vl_moe — Qwen3VLMoeModel (Qwen3VLMoeConfig model)
- qwen3_vl_moe_text — Qwen3VLMoeTextModel (Qwen3VLMoeTextConfig model)
- qwen3_vl_text — Qwen3VLTextModel (Qwen3VLTextConfig model)
- recurrent_gemma — RecurrentGemmaModel (RecurrentGemmaConfig model)
- reformer — ReformerModel (ReformerConfig model)
- regnet — RegNetModel (RegNetConfig model)
- rembert — RemBertModel (RemBertConfig model)
- resnet — ResNetModel (ResNetConfig model)
- roberta — RobertaModel (RobertaConfig model)
- roberta-prelayernorm — RobertaPreLayerNormModel (RobertaPreLayerNormConfig model)
- roc_bert — RoCBertModel (RoCBertConfig model)
- roformer — RoFormerModel (RoFormerConfig model)
- rt_detr — RTDetrModel (RTDetrConfig model)
- rt_detr_v2 — RTDetrV2Model (RTDetrV2Config model)
- rwkv — RwkvModel (RwkvConfig model)
- sam — SamModel (SamConfig model)
- sam2 — Sam2Model (Sam2Config model)
- sam2_hiera_det_model — Sam2HieraDetModel (Sam2HieraDetConfig model)
- sam2_video — Sam2VideoModel (Sam2VideoConfig model)
- sam2_vision_model — Sam2VisionModel (Sam2VisionConfig model)
- sam3 — Sam3Model (Sam3Config model)
- sam3_lite_text — Sam3LiteTextModel (Sam3LiteTextConfig model)
- sam3_lite_text_text_model — Sam3LiteTextTextModel (Sam3LiteTextTextConfig model)
- sam3_tracker — Sam3TrackerModel (Sam3TrackerConfig model)
- sam3_tracker_video — Sam3TrackerVideoModel (Sam3TrackerVideoConfig model)
- sam3_video — Sam3VideoModel (Sam3VideoConfig model)
- sam3_vision_model — Sam3VisionModel (Sam3VisionConfig model)
- sam3_vit_model — Sam3ViTModel (Sam3ViTConfig model)
- sam_hq — SamHQModel (SamHQConfig model)
- sam_hq_vision_model — SamHQVisionModel (SamHQVisionConfig model)
- sam_vision_model — SamVisionModel (SamVisionConfig model)
- seamless_m4t — SeamlessM4TModel (SeamlessM4TConfig model)
- seamless_m4t_v2 — SeamlessM4Tv2Model (SeamlessM4Tv2Config model)
- seed_oss — SeedOssModel (SeedOssConfig model)
- segformer — SegformerModel (SegformerConfig model)
- seggpt — SegGptModel (SegGptConfig model)
- sew — SEWModel (SEWConfig model)
- sew-d — SEWDModel (SEWDConfig model)
- siglip — SiglipModel (SiglipConfig model)
- siglip2 — Siglip2Model (Siglip2Config model)
- siglip2_vision_model — Siglip2VisionModel (Siglip2VisionConfig model)
- siglip_vision_model — SiglipVisionModel (SiglipVisionConfig model)
- smollm3 — SmolLM3Model (SmolLM3Config model)
- smolvlm — SmolVLMModel (SmolVLMConfig model)
- smolvlm_vision — SmolVLMVisionTransformer (SmolVLMVisionConfig model)
- solar_open — SolarOpenModel (SolarOpenConfig model)
- speech_to_text — Speech2TextModel (Speech2TextConfig model)
- speecht5 — SpeechT5Model (SpeechT5Config model)
- splinter — SplinterModel (SplinterConfig model)
- squeezebert — SqueezeBertModel (SqueezeBertConfig model)
- stablelm — StableLmModel (StableLmConfig model)
- starcoder2 — Starcoder2Model (Starcoder2Config model)
- swiftformer — SwiftFormerModel (SwiftFormerConfig model)
- swin — SwinModel (SwinConfig model)
- swin2sr — Swin2SRModel (Swin2SRConfig model)
- swinv2 — Swinv2Model (Swinv2Config model)
- switch_transformers — SwitchTransformersModel (SwitchTransformersConfig model)
- t5 — T5Model (T5Config model)
- t5gemma — T5GemmaModel (T5GemmaConfig model)
- t5gemma2 — T5Gemma2Model (T5Gemma2Config model)
- t5gemma2_encoder — T5Gemma2Encoder (T5Gemma2EncoderConfig model)
- table-transformer — TableTransformerModel (TableTransformerConfig model)
- tapas — TapasModel (TapasConfig model)
- textnet — TextNetModel (TextNetConfig model)
- time_series_transformer — TimeSeriesTransformerModel (TimeSeriesTransformerConfig model)
- timesfm — TimesFmModel (TimesFmConfig model)
- timesfm2_5 — TimesFm2_5Model (TimesFm2_5Config model)
- timesformer — TimesformerModel (TimesformerConfig model)
- timm_backbone — TimmBackbone (TimmBackboneConfig model)
- timm_wrapper — TimmWrapperModel (TimmWrapperConfig model)
- tvp — TvpModel (TvpConfig model)
- udop — UdopModel (UdopConfig model)
- umt5 — UMT5Model (UMT5Config model)
- unispeech — UniSpeechModel (UniSpeechConfig model)
- unispeech-sat — UniSpeechSatModel (UniSpeechSatConfig model)
- univnet — UnivNetModel (UnivNetConfig model)
- uvdoc — UVDocModel (UVDocConfig model)
- vaultgemma — VaultGemmaModel (VaultGemmaConfig model)
- vibevoice_acoustic_tokenizer — VibeVoiceAcousticTokenizerModel (VibeVoiceAcousticTokenizerConfig model)
- vibevoice_acoustic_tokenizer_decoder — VibeVoiceAcousticTokenizerDecoderModel (VibeVoiceAcousticTokenizerDecoderConfig model)
- vibevoice_acoustic_tokenizer_encoder — VibeVoiceAcousticTokenizerEncoderModel (VibeVoiceAcousticTokenizerEncoderConfig model)
- vibevoice_asr — VibeVoiceAsrForConditionalGeneration (VibeVoiceAsrConfig model)
- video_llama_3 — VideoLlama3Model (VideoLlama3Config model)
- video_llama_3_vision — VideoLlama3VisionModel (VideoLlama3VisionConfig model)
- video_llava — VideoLlavaModel (VideoLlavaConfig model)
- videomae — VideoMAEModel (VideoMAEConfig model)
- vilt — ViltModel (ViltConfig model)
- vipllava — VipLlavaModel (VipLlavaConfig model)
- vision-text-dual-encoder — VisionTextDualEncoderModel (VisionTextDualEncoderConfig model)
- visual_bert — VisualBertModel (VisualBertConfig model)
- vit — ViTModel (ViTConfig model)
- vit_mae — ViTMAEModel (ViTMAEConfig model)
- vit_msn — ViTMSNModel (ViTMSNConfig model)
- vitdet — VitDetModel (VitDetConfig model)
- vits — VitsModel (VitsConfig model)
- vivit — VivitModel (VivitConfig model)
- vjepa2 — VJEPA2Model (VJEPA2Config model)
- voxtral — VoxtralForConditionalGeneration (VoxtralConfig model)
- voxtral_encoder — VoxtralEncoder (VoxtralEncoderConfig model)
- voxtral_realtime — VoxtralRealtimeForConditionalGeneration (VoxtralRealtimeConfig model)
- voxtral_realtime_encoder — VoxtralRealtimeEncoder (VoxtralRealtimeEncoderConfig model)
- voxtral_realtime_text — VoxtralRealtimeTextModel (VoxtralRealtimeTextConfig model)
- wav2vec2 — Wav2Vec2Model (Wav2Vec2Config model)
- wav2vec2-bert — Wav2Vec2BertModel (Wav2Vec2BertConfig model)
- wav2vec2-conformer — Wav2Vec2ConformerModel (Wav2Vec2ConformerConfig model)
- wavlm — WavLMModel (WavLMConfig model)
- whisper — WhisperModel (WhisperConfig model)
- xclip — XCLIPModel (XCLIPConfig model)
- xcodec — XcodecModel (XcodecConfig model)
- xglm — XGLMModel (XGLMConfig model)
- xlm — XLMModel (XLMConfig model)
- xlm-roberta — XLMRobertaModel (XLMRobertaConfig model)
- xlm-roberta-xl — XLMRobertaXLModel (XLMRobertaXLConfig model)
- xlnet — XLNetModel (XLNetConfig model)
- xlstm — xLSTMModel (xLSTMConfig model)
- xmod — XmodModel (XmodConfig model)
- yolos — YolosModel (YolosConfig model)
- yoso — YosoModel (YosoConfig model)
- youtu — YoutuModel (YoutuConfig model)
- zamba — ZambaModel (ZambaConfig model)
- zamba2 — Zamba2Model (Zamba2Config model)
The model is set in evaluation mode by default using model.eval() (so, for instance, dropout modules are deactivated). To train the model, you should first set it back in training mode with model.train().
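As a minimal sketch of this mode toggle (using a tiny, randomly-initialized model built from a config so nothing is downloaded; the small BertConfig sizes are illustrative assumptions, not recommended values):

```python
from transformers import AutoModel, BertConfig

# Tiny, randomly-initialized BERT (illustrative sizes; no weights downloaded).
config = BertConfig(
    hidden_size=32, num_hidden_layers=1, num_attention_heads=2,
    intermediate_size=64, vocab_size=128,
)
model = AutoModel.from_config(config)

model.eval()   # evaluation mode: dropout modules are deactivated
assert not model.training

model.train()  # back to training mode before fine-tuning
assert model.training
```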
Examples:
>>> from transformers import AutoConfig, AutoModel
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModel.from_pretrained("google-bert/bert-base-cased")
>>> # Update configuration during loading
>>> model = AutoModel.from_pretrained("google-bert/bert-base-cased", output_attentions=True)
>>> model.config.output_attentions
True
Generic pretraining classes
The following auto classes are available for instantiating a model with a pretraining head.
AutoModelForPreTraining
This is a generic model class that will be instantiated as one of the model classes of the library (with a pretraining head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- AlbertConfig configuration class: AlbertForPreTraining (AlbertConfig model)
- AudioFlamingo3Config configuration class: AudioFlamingo3ForConditionalGeneration (AudioFlamingo3Config model)
- BartConfig configuration class: BartForConditionalGeneration (BartConfig model)
- BertConfig configuration class: BertForPreTraining (BertConfig model)
- BigBirdConfig configuration class: BigBirdForPreTraining (BigBirdConfig model)
- BloomConfig configuration class: BloomForCausalLM (BloomConfig model)
- CTRLConfig configuration class: CTRLLMHeadModel (CTRLConfig model)
- CamembertConfig configuration class: CamembertForMaskedLM (CamembertConfig model)
- ColModernVBertConfig configuration class: ColModernVBertForRetrieval (ColModernVBertConfig model)
- ColPaliConfig configuration class: ColPaliForRetrieval (ColPaliConfig model)
- ColQwen2Config configuration class: ColQwen2ForRetrieval (ColQwen2Config model)
- Data2VecTextConfig configuration class: Data2VecTextForMaskedLM (Data2VecTextConfig model)
- DebertaConfig configuration class: DebertaForMaskedLM (DebertaConfig model)
- DebertaV2Config configuration class: DebertaV2ForMaskedLM (DebertaV2Config model)
- DistilBertConfig configuration class: DistilBertForMaskedLM (DistilBertConfig model)
- ElectraConfig configuration class: ElectraForPreTraining (ElectraConfig model)
- ErnieConfig configuration class: ErnieForPreTraining (ErnieConfig model)
- EvollaConfig configuration class: EvollaForProteinText2Text (EvollaConfig model)
- Exaone4Config configuration class: Exaone4ForCausalLM (Exaone4Config model)
- ExaoneMoeConfig configuration class: ExaoneMoeForCausalLM (ExaoneMoeConfig model)
- FNetConfig configuration class: FNetForPreTraining (FNetConfig model)
- FSMTConfig configuration class: FSMTForConditionalGeneration (FSMTConfig model)
- FalconMambaConfig configuration class: FalconMambaForCausalLM (FalconMambaConfig model)
- FlaubertConfig configuration class: FlaubertWithLMHeadModel (FlaubertConfig model)
- FlavaConfig configuration class: FlavaForPreTraining (FlavaConfig model)
- Florence2Config configuration class: Florence2ForConditionalGeneration (Florence2Config model)
- FunnelConfig configuration class: FunnelForPreTraining (FunnelConfig model)
- GPT2Config configuration class: GPT2LMHeadModel (GPT2Config model)
- GPTBigCodeConfig configuration class: GPTBigCodeForCausalLM (GPTBigCodeConfig model)
- Gemma3Config configuration class: Gemma3ForConditionalGeneration (Gemma3Config model)
- Gemma4Config configuration class: Gemma4ForConditionalGeneration (Gemma4Config model)
- GlmAsrConfig configuration class: GlmAsrForConditionalGeneration (GlmAsrConfig model)
- HieraConfig configuration class: HieraForPreTraining (HieraConfig model)
- IBertConfig configuration class: IBertForMaskedLM (IBertConfig model)
- Idefics2Config configuration class: Idefics2ForConditionalGeneration (Idefics2Config model)
- Idefics3Config configuration class: Idefics3ForConditionalGeneration (Idefics3Config model)
- IdeficsConfig configuration class: IdeficsForVisionText2Text (IdeficsConfig model)
- JanusConfig configuration class: JanusForConditionalGeneration (JanusConfig model)
- LayoutLMConfig configuration class: LayoutLMForMaskedLM (LayoutLMConfig model)
- LlavaConfig configuration class: LlavaForConditionalGeneration (LlavaConfig model)
- LlavaNextConfig configuration class: LlavaNextForConditionalGeneration (LlavaNextConfig model)
- LlavaNextVideoConfig configuration class: LlavaNextVideoForConditionalGeneration (LlavaNextVideoConfig model)
- LlavaOnevisionConfig configuration class: LlavaOnevisionForConditionalGeneration (LlavaOnevisionConfig model)
- LongformerConfig configuration class: LongformerForMaskedLM (LongformerConfig model)
- LukeConfig configuration class: LukeForMaskedLM (LukeConfig model)
- LxmertConfig configuration class: LxmertForPreTraining (LxmertConfig model)
- MPNetConfig configuration class: MPNetForMaskedLM (MPNetConfig model)
- Mamba2Config configuration class: Mamba2ForCausalLM (Mamba2Config model)
- MambaConfig configuration class: MambaForCausalLM (MambaConfig model)
- MegatronBertConfig configuration class: MegatronBertForPreTraining (MegatronBertConfig model)
- Mistral3Config configuration class: Mistral3ForConditionalGeneration (Mistral3Config model)
- Mistral4Config configuration class: Mistral4ForCausalLM (Mistral4Config model)
- MllamaConfig configuration class: MllamaForConditionalGeneration (MllamaConfig model)
- MobileBertConfig configuration class: MobileBertForPreTraining (MobileBertConfig model)
- MptConfig configuration class: MptForCausalLM (MptConfig model)
- MraConfig configuration class: MraForMaskedLM (MraConfig model)
- MusicFlamingoConfig configuration class: MusicFlamingoForConditionalGeneration (MusicFlamingoConfig model)
- MvpConfig configuration class: MvpForConditionalGeneration (MvpConfig model)
- NanoChatConfig configuration class: NanoChatForCausalLM (NanoChatConfig model)
- NllbMoeConfig configuration class: NllbMoeForConditionalGeneration (NllbMoeConfig model)
- OpenAIGPTConfig configuration class: OpenAIGPTLMHeadModel (OpenAIGPTConfig model)
- PaliGemmaConfig configuration class: PaliGemmaForConditionalGeneration (PaliGemmaConfig model)
- Qwen2AudioConfig configuration class: Qwen2AudioForConditionalGeneration (Qwen2AudioConfig model)
- RoCBertConfig configuration class: RoCBertForPreTraining (RoCBertConfig model)
- RobertaConfig configuration class: RobertaForMaskedLM (RobertaConfig model)
- RobertaPreLayerNormConfig configuration class: RobertaPreLayerNormForMaskedLM (RobertaPreLayerNormConfig model)
- RwkvConfig configuration class: RwkvForCausalLM (RwkvConfig model)
- SplinterConfig configuration class: SplinterForPreTraining (SplinterConfig model)
- SqueezeBertConfig configuration class: SqueezeBertForMaskedLM (SqueezeBertConfig model)
- SwitchTransformersConfig configuration class: SwitchTransformersForConditionalGeneration (SwitchTransformersConfig model)
- T5Config configuration class: T5ForConditionalGeneration (T5Config model)
- T5Gemma2Config configuration class: T5Gemma2ForConditionalGeneration (T5Gemma2Config model)
- T5GemmaConfig configuration class: T5GemmaForConditionalGeneration (T5GemmaConfig model)
- TapasConfig configuration class: TapasForMaskedLM (TapasConfig model)
- UniSpeechConfig configuration class: UniSpeechForPreTraining (UniSpeechConfig model)
- UniSpeechSatConfig configuration class: UniSpeechSatForPreTraining (UniSpeechSatConfig model)
- ViTMAEConfig configuration class: ViTMAEForPreTraining (ViTMAEConfig model)
- VibeVoiceAsrConfig configuration class: VibeVoiceAsrForConditionalGeneration (VibeVoiceAsrConfig model)
- VideoLlavaConfig configuration class: VideoLlavaForConditionalGeneration (VideoLlavaConfig model)
- VideoMAEConfig configuration class: VideoMAEForPreTraining (VideoMAEConfig model)
- VipLlavaConfig configuration class: VipLlavaForConditionalGeneration (VipLlavaConfig model)
- VisualBertConfig configuration class: VisualBertForPreTraining (VisualBertConfig model)
- VoxtralConfig configuration class: VoxtralForConditionalGeneration (VoxtralConfig model)
- VoxtralRealtimeConfig configuration class: VoxtralRealtimeForConditionalGeneration (VoxtralRealtimeConfig model)
- Wav2Vec2Config configuration class: Wav2Vec2ForPreTraining (Wav2Vec2Config model)
- Wav2Vec2ConformerConfig configuration class: Wav2Vec2ConformerForPreTraining (Wav2Vec2ConformerConfig model)
- XLMConfig configuration class: XLMWithLMHeadModel (XLMConfig model)
- XLMRobertaConfig configuration class: XLMRobertaForMaskedLM (XLMRobertaConfig model)
- XLMRobertaXLConfig configuration class: XLMRobertaXLForMaskedLM (XLMRobertaXLConfig model)
- XLNetConfig configuration class: XLNetLMHeadModel (XLNetConfig model)
- XmodConfig configuration class: XmodForMaskedLM (XmodConfig model)
- xLSTMConfig configuration class: xLSTMForCausalLM (xLSTMConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. Otherwise, the default is the manual "eager" implementation.
Instantiates one of the model classes of the library (with a pretraining head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args, **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In that case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (e.g., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will be first passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with a pretraining head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- albert — AlbertForPreTraining (AlbertConfig model)
- audioflamingo3 — AudioFlamingo3ForConditionalGeneration (AudioFlamingo3Config model)
- bart — BartForConditionalGeneration (BartConfig model)
- bert — BertForPreTraining (BertConfig model)
- big_bird — BigBirdForPreTraining (BigBirdConfig model)
- bloom — BloomForCausalLM (BloomConfig model)
- camembert — CamembertForMaskedLM (CamembertConfig model)
- colmodernvbert — ColModernVBertForRetrieval (ColModernVBertConfig model)
- colpali — ColPaliForRetrieval (ColPaliConfig model)
- colqwen2 — ColQwen2ForRetrieval (ColQwen2Config model)
- ctrl — CTRLLMHeadModel (CTRLConfig model)
- data2vec-text — Data2VecTextForMaskedLM (Data2VecTextConfig model)
- deberta — DebertaForMaskedLM (DebertaConfig model)
- deberta-v2 — DebertaV2ForMaskedLM (DebertaV2Config model)
- distilbert — DistilBertForMaskedLM (DistilBertConfig model)
- electra — ElectraForPreTraining (ElectraConfig model)
- ernie — ErnieForPreTraining (ErnieConfig model)
- evolla — EvollaForProteinText2Text (EvollaConfig model)
- exaone4 — Exaone4ForCausalLM (Exaone4Config model)
- exaone_moe — ExaoneMoeForCausalLM (ExaoneMoeConfig model)
- falcon_mamba — FalconMambaForCausalLM (FalconMambaConfig model)
- flaubert — FlaubertWithLMHeadModel (FlaubertConfig model)
- flava — FlavaForPreTraining (FlavaConfig model)
- florence2 — Florence2ForConditionalGeneration (Florence2Config model)
- fnet — FNetForPreTraining (FNetConfig model)
- fsmt — FSMTForConditionalGeneration (FSMTConfig model)
- funnel — FunnelForPreTraining (FunnelConfig model)
- gemma3 — Gemma3ForConditionalGeneration (Gemma3Config model)
- gemma4 — Gemma4ForConditionalGeneration (Gemma4Config model)
- glmasr — GlmAsrForConditionalGeneration (GlmAsrConfig model)
- gpt-sw3 — GPT2LMHeadModel (GPT2Config model)
- gpt2 — GPT2LMHeadModel (GPT2Config model)
- gpt_bigcode — GPTBigCodeForCausalLM (GPTBigCodeConfig model)
- hiera — HieraForPreTraining (HieraConfig model)
- ibert — IBertForMaskedLM (IBertConfig model)
- idefics — IdeficsForVisionText2Text (IdeficsConfig model)
- idefics2 — Idefics2ForConditionalGeneration (Idefics2Config model)
- idefics3 — Idefics3ForConditionalGeneration (Idefics3Config model)
- janus — JanusForConditionalGeneration (JanusConfig model)
- layoutlm — LayoutLMForMaskedLM (LayoutLMConfig model)
- llava — LlavaForConditionalGeneration (LlavaConfig model)
- llava_next — LlavaNextForConditionalGeneration (LlavaNextConfig model)
- llava_next_video — LlavaNextVideoForConditionalGeneration (LlavaNextVideoConfig model)
- llava_onevision — LlavaOnevisionForConditionalGeneration (LlavaOnevisionConfig model)
- longformer — LongformerForMaskedLM (LongformerConfig model)
- luke — LukeForMaskedLM (LukeConfig model)
- lxmert — LxmertForPreTraining (LxmertConfig model)
- mamba — MambaForCausalLM (MambaConfig model)
- mamba2 — Mamba2ForCausalLM (Mamba2Config model)
- megatron-bert — MegatronBertForPreTraining (MegatronBertConfig model)
- mistral3 — Mistral3ForConditionalGeneration (Mistral3Config model)
- mistral4 — Mistral4ForCausalLM (Mistral4Config model)
- mllama — MllamaForConditionalGeneration (MllamaConfig model)
- mobilebert — MobileBertForPreTraining (MobileBertConfig model)
- mpnet — MPNetForMaskedLM (MPNetConfig model)
- mpt — MptForCausalLM (MptConfig model)
- mra — MraForMaskedLM (MraConfig model)
- musicflamingo — MusicFlamingoForConditionalGeneration (MusicFlamingoConfig model)
- mvp — MvpForConditionalGeneration (MvpConfig model)
- nanochat — NanoChatForCausalLM (NanoChatConfig model)
- nllb-moe — NllbMoeForConditionalGeneration (NllbMoeConfig model)
- openai-gpt — OpenAIGPTLMHeadModel (OpenAIGPTConfig model)
- paligemma — PaliGemmaForConditionalGeneration (PaliGemmaConfig model)
- qwen2_audio — Qwen2AudioForConditionalGeneration (Qwen2AudioConfig model)
- roberta — RobertaForMaskedLM (RobertaConfig model)
- roberta-prelayernorm — RobertaPreLayerNormForMaskedLM (RobertaPreLayerNormConfig model)
- roc_bert — RoCBertForPreTraining (RoCBertConfig model)
- rwkv — RwkvForCausalLM (RwkvConfig model)
- splinter — SplinterForPreTraining (SplinterConfig model)
- squeezebert — SqueezeBertForMaskedLM (SqueezeBertConfig model)
- switch_transformers — SwitchTransformersForConditionalGeneration (SwitchTransformersConfig model)
- t5 — T5ForConditionalGeneration (T5Config model)
- t5gemma — T5GemmaForConditionalGeneration (T5GemmaConfig model)
- t5gemma2 — T5Gemma2ForConditionalGeneration (T5Gemma2Config model)
- tapas — TapasForMaskedLM (TapasConfig model)
- unispeech — UniSpeechForPreTraining (UniSpeechConfig model)
- unispeech-sat — UniSpeechSatForPreTraining (UniSpeechSatConfig model)
- vibevoice_asr — VibeVoiceAsrForConditionalGeneration (VibeVoiceAsrConfig model)
- video_llava — VideoLlavaForConditionalGeneration (VideoLlavaConfig model)
- videomae — VideoMAEForPreTraining (VideoMAEConfig model)
- vipllava — VipLlavaForConditionalGeneration (VipLlavaConfig model)
- visual_bert — VisualBertForPreTraining (VisualBertConfig model)
- vit_mae — ViTMAEForPreTraining (ViTMAEConfig model)
- voxtral — VoxtralForConditionalGeneration (VoxtralConfig model)
- voxtral_realtime — VoxtralRealtimeForConditionalGeneration (VoxtralRealtimeConfig model)
- wav2vec2 — Wav2Vec2ForPreTraining (Wav2Vec2Config model)
- wav2vec2-conformer — Wav2Vec2ConformerForPreTraining (Wav2Vec2ConformerConfig model)
- xlm — XLMWithLMHeadModel (XLMConfig model)
- xlm-roberta — XLMRobertaForMaskedLM (XLMRobertaConfig model)
- xlm-roberta-xl — XLMRobertaXLForMaskedLM (XLMRobertaXLConfig model)
- xlnet — XLNetLMHeadModel (XLNetConfig model)
- xlstm — xLSTMForCausalLM (xLSTMConfig model)
- xmod — XmodForMaskedLM (XmodConfig model)
The model is set in evaluation mode by default using model.eval() (so, for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForPreTraining
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForPreTraining.from_pretrained("google-bert/bert-base-cased")
>>> # Update configuration during loading
>>> model = AutoModelForPreTraining.from_pretrained("google-bert/bert-base-cased", output_attentions=True)
>>> model.config.output_attentions
True
Natural Language Processing
The following auto classes are available for the following natural language processing tasks.
AutoModelForCausalLM
This is a generic model class that will be instantiated as one of the model classes of the library (with a causal language modeling head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- AfmoeConfig configuration class: AfmoeForCausalLM (AfmoeConfig model)
- ApertusConfig configuration class: ApertusForCausalLM (ApertusConfig model)
- ArceeConfig configuration class: ArceeForCausalLM (ArceeConfig model)
- AriaTextConfig configuration class: AriaTextForCausalLM (AriaTextConfig model)
- BambaConfig configuration class: BambaForCausalLM (BambaConfig model)
- BartConfig configuration class: BartForCausalLM (BartConfig model)
- BertConfig configuration class: BertLMHeadModel (BertConfig model)
- BertGenerationConfig configuration class: BertGenerationDecoder (BertGenerationConfig model)
- BigBirdConfig configuration class: BigBirdForCausalLM (BigBirdConfig model)
- BigBirdPegasusConfig configuration class: BigBirdPegasusForCausalLM (BigBirdPegasusConfig model)
- BioGptConfig configuration class: BioGptForCausalLM (BioGptConfig model)
- BitNetConfig configuration class: BitNetForCausalLM (BitNetConfig model)
- BlenderbotConfig configuration class: BlenderbotForCausalLM (BlenderbotConfig model)
- BlenderbotSmallConfig configuration class: BlenderbotSmallForCausalLM (BlenderbotSmallConfig model)
- BloomConfig configuration class: BloomForCausalLM (BloomConfig model)
- BltConfig configuration class: BltForCausalLM (BltConfig model)
- CTRLConfig configuration class: CTRLLMHeadModel (CTRLConfig model)
- CamembertConfig configuration class: CamembertForCausalLM (CamembertConfig model)
- CodeGenConfig configuration class: CodeGenForCausalLM (CodeGenConfig model)
- Cohere2Config configuration class: Cohere2ForCausalLM (Cohere2Config model)
- CohereConfig configuration class: CohereForCausalLM (CohereConfig model)
- CpmAntConfig configuration class: CpmAntForCausalLM (CpmAntConfig model)
- CwmConfig configuration class: CwmForCausalLM (CwmConfig model)
- Data2VecTextConfig configuration class: Data2VecTextForCausalLM (Data2VecTextConfig model)
- DbrxConfig configuration class: DbrxForCausalLM (DbrxConfig model)
- DeepseekV2Config configuration class: DeepseekV2ForCausalLM (DeepseekV2Config model)
- DeepseekV3Config configuration class: DeepseekV3ForCausalLM (DeepseekV3Config model)
- DiffLlamaConfig configuration class: DiffLlamaForCausalLM (DiffLlamaConfig model)
- DogeConfig configuration class: DogeForCausalLM (DogeConfig model)
- Dots1Config configuration class: Dots1ForCausalLM (Dots1Config model)
- ElectraConfig configuration class: ElectraForCausalLM (ElectraConfig model)
- Emu3Config configuration class: Emu3ForCausalLM (Emu3Config model)
- Ernie4_5Config configuration class: Ernie4_5ForCausalLM (Ernie4_5Config model)
- Ernie4_5_MoeConfig configuration class: Ernie4_5_MoeForCausalLM (Ernie4_5_MoeConfig model)
- ErnieConfig configuration class: ErnieForCausalLM (ErnieConfig model)
- Exaone4Config configuration class: Exaone4ForCausalLM (Exaone4Config model)
- ExaoneMoeConfig configuration class: ExaoneMoeForCausalLM (ExaoneMoeConfig model)
- FalconConfig configuration class: FalconForCausalLM (FalconConfig model)
- FalconH1Config configuration class: FalconH1ForCausalLM (FalconH1Config model)
- FalconMambaConfig configuration class: FalconMambaForCausalLM (FalconMambaConfig model)
- FlexOlmoConfig configuration class: FlexOlmoForCausalLM (FlexOlmoConfig model)
- FuyuConfig configuration class: FuyuForCausalLM (FuyuConfig model)
- GPT2Config configuration class: GPT2LMHeadModel (GPT2Config model)
- GPTBigCodeConfig configuration class: GPTBigCodeForCausalLM (GPTBigCodeConfig model)
- GPTJConfig configuration class: GPTJForCausalLM (GPTJConfig model)
- GPTNeoConfig configuration class: GPTNeoForCausalLM (GPTNeoConfig model)
- GPTNeoXConfig configuration class: GPTNeoXForCausalLM (GPTNeoXConfig model)
- GPTNeoXJapaneseConfig configuration class: GPTNeoXJapaneseForCausalLM (GPTNeoXJapaneseConfig model)
- Gemma2Config configuration class: Gemma2ForCausalLM (Gemma2Config model)
- Gemma3Config configuration class: Gemma3ForConditionalGeneration (Gemma3Config model)
- Gemma3TextConfig configuration class: Gemma3ForCausalLM (Gemma3TextConfig model)
- Gemma3nConfig configuration class: Gemma3nForConditionalGeneration (Gemma3nConfig model)
- Gemma3nTextConfig configuration class: Gemma3nForCausalLM (Gemma3nTextConfig model)
- Gemma4Config configuration class: Gemma4ForConditionalGeneration (Gemma4Config model)
- Gemma4TextConfig configuration class: Gemma4ForCausalLM (Gemma4TextConfig model)
- GemmaConfig configuration class: GemmaForCausalLM (GemmaConfig model)
- GitConfig configuration class: GitForCausalLM (GitConfig model)
- Glm4Config configuration class: Glm4ForCausalLM (Glm4Config model)
- Glm4MoeConfig configuration class: Glm4MoeForCausalLM (Glm4MoeConfig model)
- Glm4MoeLiteConfig configuration class: Glm4MoeLiteForCausalLM (Glm4MoeLiteConfig model)
- GlmConfig configuration class: GlmForCausalLM (GlmConfig model)
- GlmMoeDsaConfig configuration class: GlmMoeDsaForCausalLM (GlmMoeDsaConfig model)
- GotOcr2Config configuration class: GotOcr2ForConditionalGeneration (GotOcr2Config model)
- GptOssConfig configuration class: GptOssForCausalLM (GptOssConfig model)
- GraniteConfig configuration class: GraniteForCausalLM (GraniteConfig model)
- GraniteMoeConfig configuration class: GraniteMoeForCausalLM (GraniteMoeConfig model)
- GraniteMoeHybridConfig configuration class: GraniteMoeHybridForCausalLM (GraniteMoeHybridConfig model)
- GraniteMoeSharedConfig configuration class: GraniteMoeSharedForCausalLM (GraniteMoeSharedConfig model)
- HeliumConfig configuration class: HeliumForCausalLM (HeliumConfig model)
- HunYuanDenseV1Config configuration class: HunYuanDenseV1ForCausalLM (HunYuanDenseV1Config model)
- HunYuanMoEV1Config configuration class: HunYuanMoEV1ForCausalLM (HunYuanMoEV1Config model)
- Jais2Config configuration class: Jais2ForCausalLM (Jais2Config model)
- JambaConfig configuration class: JambaForCausalLM (JambaConfig model)
- JetMoeConfig configuration class: JetMoeForCausalLM (JetMoeConfig model)
- Lfm2Config configuration class: Lfm2ForCausalLM (Lfm2Config model)
- Lfm2MoeConfig configuration class: Lfm2MoeForCausalLM (Lfm2MoeConfig model)
- Llama4Config configuration class: Llama4ForCausalLM (Llama4Config model)
- Llama4TextConfig configuration class: Llama4ForCausalLM (Llama4TextConfig model)
- LlamaConfig configuration class: LlamaForCausalLM (LlamaConfig model)
- LongcatFlashConfig configuration class: LongcatFlashForCausalLM (LongcatFlashConfig model)
- MBartConfig configuration class: MBartForCausalLM (MBartConfig model)
- Mamba2Config configuration class: Mamba2ForCausalLM (Mamba2Config model)
- MambaConfig configuration class: MambaForCausalLM (MambaConfig model)
- MarianConfig configuration class: MarianForCausalLM (MarianConfig model)
- MegatronBertConfig configuration class: MegatronBertForCausalLM (MegatronBertConfig model)
- MiniMaxConfig configuration class: MiniMaxForCausalLM (MiniMaxConfig model)
- MiniMaxM2Config configuration class: MiniMaxM2ForCausalLM (MiniMaxM2Config model)
- Ministral3Config configuration class: Ministral3ForCausalLM (Ministral3Config model)
- MinistralConfig configuration class: MinistralForCausalLM (MinistralConfig model)
- MistralConfig configuration class: MistralForCausalLM (MistralConfig model)
- MixtralConfig configuration class: MixtralForCausalLM (MixtralConfig model)
- MllamaConfig configuration class: MllamaForCausalLM (MllamaConfig model)
- ModernBertDecoderConfig configuration class: ModernBertDecoderForCausalLM (ModernBertDecoderConfig model)
- MoshiConfig configuration class: MoshiForCausalLM (MoshiConfig model)
- MptConfig configuration class: MptForCausalLM (MptConfig model)
- MusicgenConfig configuration class: MusicgenForCausalLM (MusicgenConfig model)
- MusicgenMelodyConfig configuration class: MusicgenMelodyForCausalLM (MusicgenMelodyConfig model)
- MvpConfig configuration class: MvpForCausalLM (MvpConfig model)
- NanoChatConfig configuration class: NanoChatForCausalLM (NanoChatConfig model)
- NemotronConfig configuration class: NemotronForCausalLM (NemotronConfig model)
- NemotronHConfig configuration class: NemotronHForCausalLM (NemotronHConfig model)
- OPTConfig configuration class: OPTForCausalLM (OPTConfig model)
- Olmo2Config configuration class: Olmo2ForCausalLM (Olmo2Config model)
- Olmo3Config configuration class: Olmo3ForCausalLM (Olmo3Config model)
- OlmoConfig configuration class: OlmoForCausalLM (OlmoConfig model)
- OlmoHybridConfig configuration class: OlmoHybridForCausalLM (OlmoHybridConfig model)
- OlmoeConfig configuration class: OlmoeForCausalLM (OlmoeConfig model)
- OpenAIGPTConfig configuration class: OpenAIGPTLMHeadModel (OpenAIGPTConfig model)
- PLBartConfig configuration class: PLBartForCausalLM (PLBartConfig model)
- PegasusConfig configuration class: PegasusForCausalLM (PegasusConfig model)
- PersimmonConfig configuration class: PersimmonForCausalLM (PersimmonConfig model)
- Phi3Config configuration class: Phi3ForCausalLM (Phi3Config model)
- Phi4MultimodalConfig configuration class: Phi4MultimodalForCausalLM (Phi4MultimodalConfig model)
- PhiConfig configuration class: PhiForCausalLM (PhiConfig model)
- PhimoeConfig configuration class: PhimoeForCausalLM (PhimoeConfig model)
- ProphetNetConfig configuration class: ProphetNetForCausalLM (ProphetNetConfig model)
- Qwen2Config configuration class: Qwen2ForCausalLM (Qwen2Config model)
- Qwen2MoeConfig configuration class: Qwen2MoeForCausalLM (Qwen2MoeConfig model)
- Qwen3Config configuration class: Qwen3ForCausalLM (Qwen3Config model)
- Qwen3MoeConfig configuration class: Qwen3MoeForCausalLM (Qwen3MoeConfig model)
- Qwen3NextConfig configuration class: Qwen3NextForCausalLM (Qwen3NextConfig model)
- Qwen3_5Config configuration class: Qwen3_5ForCausalLM (Qwen3_5Config model)
- Qwen3_5MoeConfig configuration class: Qwen3_5MoeForCausalLM (Qwen3_5MoeConfig model)
- Qwen3_5MoeTextConfig configuration class: Qwen3_5MoeForCausalLM (Qwen3_5MoeTextConfig model)
- Qwen3_5TextConfig configuration class: Qwen3_5ForCausalLM (Qwen3_5TextConfig model)
- RecurrentGemmaConfig configuration class: RecurrentGemmaForCausalLM (RecurrentGemmaConfig model)
- ReformerConfig configuration class: ReformerModelWithLMHead (ReformerConfig model)
- RemBertConfig configuration class: RemBertForCausalLM (RemBertConfig model)
- RoCBertConfig configuration class: RoCBertForCausalLM (RoCBertConfig model)
- RoFormerConfig configuration class: RoFormerForCausalLM (RoFormerConfig model)
- RobertaConfig configuration class: RobertaForCausalLM (RobertaConfig model)
- RobertaPreLayerNormConfig configuration class: RobertaPreLayerNormForCausalLM (RobertaPreLayerNormConfig model)
- RwkvConfig configuration class: RwkvForCausalLM (RwkvConfig model)
- SeedOssConfig configuration class: SeedOssForCausalLM (SeedOssConfig model)
- SmolLM3Config configuration class: SmolLM3ForCausalLM (SmolLM3Config model)
- SolarOpenConfig configuration class: SolarOpenForCausalLM (SolarOpenConfig model)
- StableLmConfig configuration class: StableLmForCausalLM (StableLmConfig model)
- Starcoder2Config configuration class: Starcoder2ForCausalLM (Starcoder2Config model)
- TrOCRConfig configuration class: TrOCRForCausalLM (TrOCRConfig model)
- VaultGemmaConfig configuration class: VaultGemmaForCausalLM (VaultGemmaConfig model)
- WhisperConfig configuration class: WhisperForCausalLM (WhisperConfig model)
- XGLMConfig configuration class: XGLMForCausalLM (XGLMConfig model)
- XLMConfig configuration class: XLMWithLMHeadModel (XLMConfig model)
- XLMRobertaConfig configuration class: XLMRobertaForCausalLM (XLMRobertaConfig model)
- XLMRobertaXLConfig configuration class: XLMRobertaXLForCausalLM (XLMRobertaXLConfig model)
- XLNetConfig configuration class: XLNetLMHeadModel (XLNetConfig model)
- XmodConfig configuration class: XmodForCausalLM (XmodConfig model)
- YoutuConfig configuration class: YoutuForCausalLM (YoutuConfig model)
- Zamba2Config configuration class: Zamba2ForCausalLM (Zamba2Config model)
- ZambaConfig configuration class: ZambaForCausalLM (ZambaConfig model)
- xLSTMConfig configuration class: xLSTMForCausalLM (xLSTMConfig model)
- attn_implementation (
str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
Instantiates one of the model classes of the library (with a causal language modeling head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
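As a quick sketch of the distinction above, from_config() builds a randomly initialized model of the matching architecture without downloading any weights. The tiny layer and head sizes below are arbitrary illustration values, not recommended settings:

```python
from transformers import AutoConfig, AutoModelForCausalLM

# Build a deliberately tiny GPT-2 config in memory; the sizes are
# arbitrary illustration values chosen only to keep the model small.
config = AutoConfig.for_model(
    "gpt2", n_layer=2, n_head=2, n_embd=64, vocab_size=100
)

# from_config() selects the architecture from the config class and returns
# a randomly initialized model -- no weights are downloaded or loaded.
model = AutoModelForCausalLM.from_config(config)
print(type(model).__name__)  # GPT2LMHeadModel
```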
from_pretrained
< source >( *model_args, **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) —
  Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) —
Configuration for the model to use instead of an automatically loaded configuration. Configuration can
be automatically loaded when:
- The model is a model provided by the library (loaded with the model id string of a pretrained model).
- The model was saved using save_pretrained() and is reloaded by supplying the save directory.
- The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) —
A state dictionary to use instead of the state dictionary loaded from the saved weights file.
This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (e.g., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, code_revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) —
  Can be used to update the configuration object (after it is loaded) and initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model's __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model's __init__ function.
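The local-directory loading path described in the parameters above can be exercised end to end without touching the Hub; a minimal sketch (the tiny config values are arbitrary illustration sizes):

```python
import tempfile

from transformers import AutoConfig, AutoModelForCausalLM

# A tiny, randomly initialized model (arbitrary illustration sizes).
config = AutoConfig.for_model(
    "gpt2", n_layer=2, n_head=2, n_embd=64, vocab_size=100
)
model = AutoModelForCausalLM.from_config(config)

with tempfile.TemporaryDirectory() as tmp:
    # save_pretrained() writes config.json plus the weight files ...
    model.save_pretrained(tmp)
    # ... so the directory can be passed back as pretrained_model_name_or_path.
    reloaded = AutoModelForCausalLM.from_pretrained(tmp)
    print(type(reloaded).__name__)  # GPT2LMHeadModel
```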
Instantiate one of the model classes of the library (with a causal language modeling head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- afmoe — AfmoeForCausalLM (AfmoeConfig model)
- apertus — ApertusForCausalLM (ApertusConfig model)
- arcee — ArceeForCausalLM (ArceeConfig model)
- aria_text — AriaTextForCausalLM (AriaTextConfig model)
- bamba — BambaForCausalLM (BambaConfig model)
- bart — BartForCausalLM (BartConfig model)
- bert — BertLMHeadModel (BertConfig model)
- bert-generation — BertGenerationDecoder (BertGenerationConfig model)
- big_bird — BigBirdForCausalLM (BigBirdConfig model)
- bigbird_pegasus — BigBirdPegasusForCausalLM (BigBirdPegasusConfig model)
- biogpt — BioGptForCausalLM (BioGptConfig model)
- bitnet — BitNetForCausalLM (BitNetConfig model)
- blenderbot — BlenderbotForCausalLM (BlenderbotConfig model)
- blenderbot-small — BlenderbotSmallForCausalLM (BlenderbotSmallConfig model)
- bloom — BloomForCausalLM (BloomConfig model)
- blt — BltForCausalLM (BltConfig model)
- camembert — CamembertForCausalLM (CamembertConfig model)
- codegen — CodeGenForCausalLM (CodeGenConfig model)
- cohere — CohereForCausalLM (CohereConfig model)
- cohere2 — Cohere2ForCausalLM (Cohere2Config model)
- cpmant — CpmAntForCausalLM (CpmAntConfig model)
- ctrl — CTRLLMHeadModel (CTRLConfig model)
- cwm — CwmForCausalLM (CwmConfig model)
- data2vec-text — Data2VecTextForCausalLM (Data2VecTextConfig model)
- dbrx — DbrxForCausalLM (DbrxConfig model)
- deepseek_v2 — DeepseekV2ForCausalLM (DeepseekV2Config model)
- deepseek_v3 — DeepseekV3ForCausalLM (DeepseekV3Config model)
- diffllama — DiffLlamaForCausalLM (DiffLlamaConfig model)
- doge — DogeForCausalLM (DogeConfig model)
- dots1 — Dots1ForCausalLM (Dots1Config model)
- electra — ElectraForCausalLM (ElectraConfig model)
- emu3 — Emu3ForCausalLM (Emu3Config model)
- ernie — ErnieForCausalLM (ErnieConfig model)
- ernie4_5 — Ernie4_5ForCausalLM (Ernie4_5Config model)
- ernie4_5_moe — Ernie4_5_MoeForCausalLM (Ernie4_5_MoeConfig model)
- exaone4 — Exaone4ForCausalLM (Exaone4Config model)
- exaone_moe — ExaoneMoeForCausalLM (ExaoneMoeConfig model)
- falcon — FalconForCausalLM (FalconConfig model)
- falcon_h1 — FalconH1ForCausalLM (FalconH1Config model)
- falcon_mamba — FalconMambaForCausalLM (FalconMambaConfig model)
- flex_olmo — FlexOlmoForCausalLM (FlexOlmoConfig model)
- fuyu — FuyuForCausalLM (FuyuConfig model)
- gemma — GemmaForCausalLM (GemmaConfig model)
- gemma2 — Gemma2ForCausalLM (Gemma2Config model)
- gemma3 — Gemma3ForConditionalGeneration (Gemma3Config model)
- gemma3_text — Gemma3ForCausalLM (Gemma3TextConfig model)
- gemma3n — Gemma3nForConditionalGeneration (Gemma3nConfig model)
- gemma3n_text — Gemma3nForCausalLM (Gemma3nTextConfig model)
- gemma4 — Gemma4ForConditionalGeneration (Gemma4Config model)
- gemma4_text — Gemma4ForCausalLM (Gemma4TextConfig model)
- git — GitForCausalLM (GitConfig model)
- glm — GlmForCausalLM (GlmConfig model)
- glm4 — Glm4ForCausalLM (Glm4Config model)
- glm4_moe — Glm4MoeForCausalLM (Glm4MoeConfig model)
- glm4_moe_lite — Glm4MoeLiteForCausalLM (Glm4MoeLiteConfig model)
- glm_moe_dsa — GlmMoeDsaForCausalLM (GlmMoeDsaConfig model)
- got_ocr2 — GotOcr2ForConditionalGeneration (GotOcr2Config model)
- gpt-sw3 — GPT2LMHeadModel (GPT2Config model)
- gpt2 — GPT2LMHeadModel (GPT2Config model)
- gpt_bigcode — GPTBigCodeForCausalLM (GPTBigCodeConfig model)
- gpt_neo — GPTNeoForCausalLM (GPTNeoConfig model)
- gpt_neox — GPTNeoXForCausalLM (GPTNeoXConfig model)
- gpt_neox_japanese — GPTNeoXJapaneseForCausalLM (GPTNeoXJapaneseConfig model)
- gpt_oss — GptOssForCausalLM (GptOssConfig model)
- gptj — GPTJForCausalLM (GPTJConfig model)
- granite — GraniteForCausalLM (GraniteConfig model)
- granitemoe — GraniteMoeForCausalLM (GraniteMoeConfig model)
- granitemoehybrid — GraniteMoeHybridForCausalLM (GraniteMoeHybridConfig model)
- granitemoeshared — GraniteMoeSharedForCausalLM (GraniteMoeSharedConfig model)
- helium — HeliumForCausalLM (HeliumConfig model)
- hunyuan_v1_dense — HunYuanDenseV1ForCausalLM (HunYuanDenseV1Config model)
- hunyuan_v1_moe — HunYuanMoEV1ForCausalLM (HunYuanMoEV1Config model)
- jais2 — Jais2ForCausalLM (Jais2Config model)
- jamba — JambaForCausalLM (JambaConfig model)
- jetmoe — JetMoeForCausalLM (JetMoeConfig model)
- lfm2 — Lfm2ForCausalLM (Lfm2Config model)
- lfm2_moe — Lfm2MoeForCausalLM (Lfm2MoeConfig model)
- llama — LlamaForCausalLM (LlamaConfig model)
- llama4 — Llama4ForCausalLM (Llama4Config model)
- llama4_text — Llama4ForCausalLM (Llama4TextConfig model)
- longcat_flash — LongcatFlashForCausalLM (LongcatFlashConfig model)
- mamba — MambaForCausalLM (MambaConfig model)
- mamba2 — Mamba2ForCausalLM (Mamba2Config model)
- marian — MarianForCausalLM (MarianConfig model)
- mbart — MBartForCausalLM (MBartConfig model)
- megatron-bert — MegatronBertForCausalLM (MegatronBertConfig model)
- minimax — MiniMaxForCausalLM (MiniMaxConfig model)
- minimax_m2 — MiniMaxM2ForCausalLM (MiniMaxM2Config model)
- ministral — MinistralForCausalLM (MinistralConfig model)
- ministral3 — Ministral3ForCausalLM (Ministral3Config model)
- mistral — MistralForCausalLM (MistralConfig model)
- mixtral — MixtralForCausalLM (MixtralConfig model)
- mllama — MllamaForCausalLM (MllamaConfig model)
- modernbert-decoder — ModernBertDecoderForCausalLM (ModernBertDecoderConfig model)
- moshi — MoshiForCausalLM (MoshiConfig model)
- mpt — MptForCausalLM (MptConfig model)
- musicgen — MusicgenForCausalLM (MusicgenConfig model)
- musicgen_melody — MusicgenMelodyForCausalLM (MusicgenMelodyConfig model)
- mvp — MvpForCausalLM (MvpConfig model)
- nanochat — NanoChatForCausalLM (NanoChatConfig model)
- nemotron — NemotronForCausalLM (NemotronConfig model)
- nemotron_h — NemotronHForCausalLM (NemotronHConfig model)
- olmo — OlmoForCausalLM (OlmoConfig model)
- olmo2 — Olmo2ForCausalLM (Olmo2Config model)
- olmo3 — Olmo3ForCausalLM (Olmo3Config model)
- olmo_hybrid — OlmoHybridForCausalLM (OlmoHybridConfig model)
- olmoe — OlmoeForCausalLM (OlmoeConfig model)
- openai-gpt — OpenAIGPTLMHeadModel (OpenAIGPTConfig model)
- opt — OPTForCausalLM (OPTConfig model)
- pegasus — PegasusForCausalLM (PegasusConfig model)
- persimmon — PersimmonForCausalLM (PersimmonConfig model)
- phi — PhiForCausalLM (PhiConfig model)
- phi3 — Phi3ForCausalLM (Phi3Config model)
- phi4_multimodal — Phi4MultimodalForCausalLM (Phi4MultimodalConfig model)
- phimoe — PhimoeForCausalLM (PhimoeConfig model)
- plbart — PLBartForCausalLM (PLBartConfig model)
- prophetnet — ProphetNetForCausalLM (ProphetNetConfig model)
- qwen2 — Qwen2ForCausalLM (Qwen2Config model)
- qwen2_moe — Qwen2MoeForCausalLM (Qwen2MoeConfig model)
- qwen3 — Qwen3ForCausalLM (Qwen3Config model)
- qwen3_5 — Qwen3_5ForCausalLM (Qwen3_5Config model)
- qwen3_5_moe — Qwen3_5MoeForCausalLM (Qwen3_5MoeConfig model)
- qwen3_5_moe_text — Qwen3_5MoeForCausalLM (Qwen3_5MoeTextConfig model)
- qwen3_5_text — Qwen3_5ForCausalLM (Qwen3_5TextConfig model)
- qwen3_moe — Qwen3MoeForCausalLM (Qwen3MoeConfig model)
- qwen3_next — Qwen3NextForCausalLM (Qwen3NextConfig model)
- recurrent_gemma — RecurrentGemmaForCausalLM (RecurrentGemmaConfig model)
- reformer — ReformerModelWithLMHead (ReformerConfig model)
- rembert — RemBertForCausalLM (RemBertConfig model)
- roberta — RobertaForCausalLM (RobertaConfig model)
- roberta-prelayernorm — RobertaPreLayerNormForCausalLM (RobertaPreLayerNormConfig model)
- roc_bert — RoCBertForCausalLM (RoCBertConfig model)
- roformer — RoFormerForCausalLM (RoFormerConfig model)
- rwkv — RwkvForCausalLM (RwkvConfig model)
- seed_oss — SeedOssForCausalLM (SeedOssConfig model)
- smollm3 — SmolLM3ForCausalLM (SmolLM3Config model)
- solar_open — SolarOpenForCausalLM (SolarOpenConfig model)
- stablelm — StableLmForCausalLM (StableLmConfig model)
- starcoder2 — Starcoder2ForCausalLM (Starcoder2Config model)
- trocr — TrOCRForCausalLM (TrOCRConfig model)
- vaultgemma — VaultGemmaForCausalLM (VaultGemmaConfig model)
- whisper — WhisperForCausalLM (WhisperConfig model)
- xglm — XGLMForCausalLM (XGLMConfig model)
- xlm — XLMWithLMHeadModel (XLMConfig model)
- xlm-roberta — XLMRobertaForCausalLM (XLMRobertaConfig model)
- xlm-roberta-xl — XLMRobertaXLForCausalLM (XLMRobertaXLConfig model)
- xlnet — XLNetLMHeadModel (XLNetConfig model)
- xlstm — xLSTMForCausalLM (xLSTMConfig model)
- xmod — XmodForCausalLM (XmodConfig model)
- youtu — YoutuForCausalLM (YoutuConfig model)
- zamba — ZambaForCausalLM (ZambaConfig model)
- zamba2 — Zamba2ForCausalLM (Zamba2Config model)
The model is set in evaluation mode by default using model.eval() (so, for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForCausalLM
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForCausalLM.from_pretrained("google-bert/bert-base-cased")
>>> # Update configuration during loading
>>> model = AutoModelForCausalLM.from_pretrained("google-bert/bert-base-cased", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForMaskedLM
This is a generic model class that will be instantiated as one of the model classes of the library (with a masked language modeling head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- AlbertConfig configuration class: AlbertForMaskedLM (AlbertConfig model)
- BartConfig configuration class: BartForConditionalGeneration (BartConfig model)
- BertConfig configuration class: BertForMaskedLM (BertConfig model)
- BigBirdConfig configuration class: BigBirdForMaskedLM (BigBirdConfig model)
- CamembertConfig configuration class: CamembertForMaskedLM (CamembertConfig model)
- ConvBertConfig configuration class: ConvBertForMaskedLM (ConvBertConfig model)
- Data2VecTextConfig configuration class: Data2VecTextForMaskedLM (Data2VecTextConfig model)
- DebertaConfig configuration class: DebertaForMaskedLM (DebertaConfig model)
- DebertaV2Config configuration class: DebertaV2ForMaskedLM (DebertaV2Config model)
- DistilBertConfig configuration class: DistilBertForMaskedLM (DistilBertConfig model)
- ElectraConfig configuration class: ElectraForMaskedLM (ElectraConfig model)
- ErnieConfig configuration class: ErnieForMaskedLM (ErnieConfig model)
- EsmConfig configuration class: EsmForMaskedLM (EsmConfig model)
- EuroBertConfig configuration class: EuroBertForMaskedLM (EuroBertConfig model)
- FNetConfig configuration class: FNetForMaskedLM (FNetConfig model)
- FlaubertConfig configuration class: FlaubertWithLMHeadModel (FlaubertConfig model)
- FunnelConfig configuration class: FunnelForMaskedLM (FunnelConfig model)
- IBertConfig configuration class: IBertForMaskedLM (IBertConfig model)
- JinaEmbeddingsV3Config configuration class: JinaEmbeddingsV3ForMaskedLM (JinaEmbeddingsV3Config model)
- LayoutLMConfig configuration class: LayoutLMForMaskedLM (LayoutLMConfig model)
- LongformerConfig configuration class: LongformerForMaskedLM (LongformerConfig model)
- LukeConfig configuration class: LukeForMaskedLM (LukeConfig model)
- MBartConfig configuration class: MBartForConditionalGeneration (MBartConfig model)
- MPNetConfig configuration class: MPNetForMaskedLM (MPNetConfig model)
- MegatronBertConfig configuration class: MegatronBertForMaskedLM (MegatronBertConfig model)
- MobileBertConfig configuration class: MobileBertForMaskedLM (MobileBertConfig model)
- ModernBertConfig configuration class: ModernBertForMaskedLM (ModernBertConfig model)
- ModernVBertConfig configuration class: ModernVBertForMaskedLM (ModernVBertConfig model)
- MraConfig configuration class: MraForMaskedLM (MraConfig model)
- MvpConfig configuration class: MvpForConditionalGeneration (MvpConfig model)
- NomicBertConfig configuration class: NomicBertForMaskedLM (NomicBertConfig model)
- NystromformerConfig configuration class: NystromformerForMaskedLM (NystromformerConfig model)
- PerceiverConfig configuration class: PerceiverForMaskedLM (PerceiverConfig model)
- ReformerConfig configuration class: ReformerForMaskedLM (ReformerConfig model)
- RemBertConfig configuration class: RemBertForMaskedLM (RemBertConfig model)
- RoCBertConfig configuration class: RoCBertForMaskedLM (RoCBertConfig model)
- RoFormerConfig configuration class: RoFormerForMaskedLM (RoFormerConfig model)
- RobertaConfig configuration class: RobertaForMaskedLM (RobertaConfig model)
- RobertaPreLayerNormConfig configuration class: RobertaPreLayerNormForMaskedLM (RobertaPreLayerNormConfig model)
- SqueezeBertConfig configuration class: SqueezeBertForMaskedLM (SqueezeBertConfig model)
- TapasConfig configuration class: TapasForMaskedLM (TapasConfig model)
- XLMConfig configuration class: XLMWithLMHeadModel (XLMConfig model)
- XLMRobertaConfig configuration class: XLMRobertaForMaskedLM (XLMRobertaConfig model)
- XLMRobertaXLConfig configuration class: XLMRobertaXLForMaskedLM (XLMRobertaXLConfig model)
- XmodConfig configuration class: XmodForMaskedLM (XmodConfig model)
- YosoConfig configuration class: YosoForMaskedLM (YosoConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
Instantiates one of the model classes of the library (with a masked language modeling head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
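A minimal sketch of this pattern, building a configuration from scratch rather than downloading one (the tiny hyperparameter values are arbitrary, chosen only to keep the example fast):

```python
from transformers import AutoModelForMaskedLM, BertConfig

# Build a configuration locally; no weights are downloaded.
config = BertConfig(
    vocab_size=100,
    hidden_size=32,
    num_hidden_layers=2,
    num_attention_heads=2,
    intermediate_size=64,
)

# from_config dispatches on the configuration class, so this returns
# a BertForMaskedLM whose weights are randomly initialized.
model = AutoModelForMaskedLM.from_config(config)
print(type(model).__name__)  # BertForMaskedLM
```

Because only the configuration is used, the resulting model is untrained; call from_pretrained() instead when you need pretrained weights.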
from_pretrained
< source >( *model_args, **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In that case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (e.g., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will be first passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with a masked language modeling head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- albert — AlbertForMaskedLM (AlbertConfig model)
- bart — BartForConditionalGeneration (BartConfig model)
- bert — BertForMaskedLM (BertConfig model)
- big_bird — BigBirdForMaskedLM (BigBirdConfig model)
- camembert — CamembertForMaskedLM (CamembertConfig model)
- convbert — ConvBertForMaskedLM (ConvBertConfig model)
- data2vec-text — Data2VecTextForMaskedLM (Data2VecTextConfig model)
- deberta — DebertaForMaskedLM (DebertaConfig model)
- deberta-v2 — DebertaV2ForMaskedLM (DebertaV2Config model)
- distilbert — DistilBertForMaskedLM (DistilBertConfig model)
- electra — ElectraForMaskedLM (ElectraConfig model)
- ernie — ErnieForMaskedLM (ErnieConfig model)
- esm — EsmForMaskedLM (EsmConfig model)
- eurobert — EuroBertForMaskedLM (EuroBertConfig model)
- flaubert — FlaubertWithLMHeadModel (FlaubertConfig model)
- fnet — FNetForMaskedLM (FNetConfig model)
- funnel — FunnelForMaskedLM (FunnelConfig model)
- ibert — IBertForMaskedLM (IBertConfig model)
- jina_embeddings_v3 — JinaEmbeddingsV3ForMaskedLM (JinaEmbeddingsV3Config model)
- layoutlm — LayoutLMForMaskedLM (LayoutLMConfig model)
- longformer — LongformerForMaskedLM (LongformerConfig model)
- luke — LukeForMaskedLM (LukeConfig model)
- mbart — MBartForConditionalGeneration (MBartConfig model)
- megatron-bert — MegatronBertForMaskedLM (MegatronBertConfig model)
- mobilebert — MobileBertForMaskedLM (MobileBertConfig model)
- modernbert — ModernBertForMaskedLM (ModernBertConfig model)
- modernvbert — ModernVBertForMaskedLM (ModernVBertConfig model)
- mpnet — MPNetForMaskedLM (MPNetConfig model)
- mra — MraForMaskedLM (MraConfig model)
- mvp — MvpForConditionalGeneration (MvpConfig model)
- nomic_bert — NomicBertForMaskedLM (NomicBertConfig model)
- nystromformer — NystromformerForMaskedLM (NystromformerConfig model)
- perceiver — PerceiverForMaskedLM (PerceiverConfig model)
- reformer — ReformerForMaskedLM (ReformerConfig model)
- rembert — RemBertForMaskedLM (RemBertConfig model)
- roberta — RobertaForMaskedLM (RobertaConfig model)
- roberta-prelayernorm — RobertaPreLayerNormForMaskedLM (RobertaPreLayerNormConfig model)
- roc_bert — RoCBertForMaskedLM (RoCBertConfig model)
- roformer — RoFormerForMaskedLM (RoFormerConfig model)
- squeezebert — SqueezeBertForMaskedLM (SqueezeBertConfig model)
- tapas — TapasForMaskedLM (TapasConfig model)
- xlm — XLMWithLMHeadModel (XLMConfig model)
- xlm-roberta — XLMRobertaForMaskedLM (XLMRobertaConfig model)
- xlm-roberta-xl — XLMRobertaXLForMaskedLM (XLMRobertaXLConfig model)
- xmod — XmodForMaskedLM (XmodConfig model)
- yoso — YosoForMaskedLM (YosoConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForMaskedLM
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForMaskedLM.from_pretrained("google-bert/bert-base-cased")
>>> # Update configuration during loading
>>> model = AutoModelForMaskedLM.from_pretrained("google-bert/bert-base-cased", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForMaskGeneration
AutoModelForSeq2SeqLM
This is a generic model class that will be instantiated as one of the model classes of the library (with a sequence-to-sequence language modeling head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- AudioFlamingo3Config configuration class: AudioFlamingo3ForConditionalGeneration (AudioFlamingo3Config model)
- BartConfig configuration class: BartForConditionalGeneration (BartConfig model)
- BigBirdPegasusConfig configuration class: BigBirdPegasusForConditionalGeneration (BigBirdPegasusConfig model)
- BlenderbotConfig configuration class: BlenderbotForConditionalGeneration (BlenderbotConfig model)
- BlenderbotSmallConfig configuration class: BlenderbotSmallForConditionalGeneration (BlenderbotSmallConfig model)
- EncoderDecoderConfig configuration class: EncoderDecoderModel (EncoderDecoderConfig model)
- FSMTConfig configuration class: FSMTForConditionalGeneration (FSMTConfig model)
- GlmAsrConfig configuration class: GlmAsrForConditionalGeneration (GlmAsrConfig model)
- GraniteSpeechConfig configuration class: GraniteSpeechForConditionalGeneration (GraniteSpeechConfig model)
- LEDConfig configuration class: LEDForConditionalGeneration (LEDConfig model)
- LongT5Config configuration class: LongT5ForConditionalGeneration (LongT5Config model)
- M2M100Config configuration class: M2M100ForConditionalGeneration (M2M100Config model)
- MBartConfig configuration class: MBartForConditionalGeneration (MBartConfig model)
- MT5Config configuration class: MT5ForConditionalGeneration (MT5Config model)
- MarianConfig configuration class: MarianMTModel (MarianConfig model)
- MusicFlamingoConfig configuration class: MusicFlamingoForConditionalGeneration (MusicFlamingoConfig model)
- MvpConfig configuration class: MvpForConditionalGeneration (MvpConfig model)
- NllbMoeConfig configuration class: NllbMoeForConditionalGeneration (NllbMoeConfig model)
- PLBartConfig configuration class: PLBartForConditionalGeneration (PLBartConfig model)
- PegasusConfig configuration class: PegasusForConditionalGeneration (PegasusConfig model)
- PegasusXConfig configuration class: PegasusXForConditionalGeneration (PegasusXConfig model)
- ProphetNetConfig configuration class: ProphetNetForConditionalGeneration (ProphetNetConfig model)
- Qwen2AudioConfig configuration class: Qwen2AudioForConditionalGeneration (Qwen2AudioConfig model)
- SeamlessM4TConfig configuration class: SeamlessM4TForTextToText (SeamlessM4TConfig model)
- SeamlessM4Tv2Config configuration class: SeamlessM4Tv2ForTextToText (SeamlessM4Tv2Config model)
- SwitchTransformersConfig configuration class: SwitchTransformersForConditionalGeneration (SwitchTransformersConfig model)
- T5Config configuration class: T5ForConditionalGeneration (T5Config model)
- T5Gemma2Config configuration class: T5Gemma2ForConditionalGeneration (T5Gemma2Config model)
- T5GemmaConfig configuration class: T5GemmaForConditionalGeneration (T5GemmaConfig model)
- UMT5Config configuration class: UMT5ForConditionalGeneration (UMT5Config model)
- VibeVoiceAsrConfig configuration class: VibeVoiceAsrForConditionalGeneration (VibeVoiceAsrConfig model)
- VoxtralConfig configuration class: VoxtralForConditionalGeneration (VoxtralConfig model)
- VoxtralRealtimeConfig configuration class: VoxtralRealtimeForConditionalGeneration (VoxtralRealtimeConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
Instantiates one of the model classes of the library (with a sequence-to-sequence language modeling head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
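The same pattern works for sequence-to-sequence heads; a minimal sketch with a deliberately tiny, locally built T5Config (the hyperparameter values are arbitrary):

```python
from transformers import AutoModelForSeq2SeqLM, T5Config

# A tiny T5 configuration built locally; nothing is downloaded.
config = T5Config(
    vocab_size=100,
    d_model=32,
    d_kv=8,
    d_ff=64,
    num_layers=2,
    num_heads=4,
)

# Dispatches on T5Config and returns a randomly initialized
# T5ForConditionalGeneration.
model = AutoModelForSeq2SeqLM.from_config(config)
print(type(model).__name__)  # T5ForConditionalGeneration
```

As documented above, from_config also accepts attn_implementation, e.g. AutoModelForSeq2SeqLM.from_config(config, attn_implementation="eager").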
from_pretrained
< source >( *model_args, **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In that case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (e.g., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will be first passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with a sequence-to-sequence language modeling head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- audioflamingo3 — AudioFlamingo3ForConditionalGeneration (AudioFlamingo3Config model)
- bart — BartForConditionalGeneration (BartConfig model)
- bigbird_pegasus — BigBirdPegasusForConditionalGeneration (BigBirdPegasusConfig model)
- blenderbot — BlenderbotForConditionalGeneration (BlenderbotConfig model)
- blenderbot-small — BlenderbotSmallForConditionalGeneration (BlenderbotSmallConfig model)
- encoder-decoder — EncoderDecoderModel (EncoderDecoderConfig model)
- fsmt — FSMTForConditionalGeneration (FSMTConfig model)
- glmasr — GlmAsrForConditionalGeneration (GlmAsrConfig model)
- granite_speech — GraniteSpeechForConditionalGeneration (GraniteSpeechConfig model)
- led — LEDForConditionalGeneration (LEDConfig model)
- longt5 — LongT5ForConditionalGeneration (LongT5Config model)
- m2m_100 — M2M100ForConditionalGeneration (M2M100Config model)
- marian — MarianMTModel (MarianConfig model)
- mbart — MBartForConditionalGeneration (MBartConfig model)
- mt5 — MT5ForConditionalGeneration (MT5Config model)
- musicflamingo — MusicFlamingoForConditionalGeneration (MusicFlamingoConfig model)
- mvp — MvpForConditionalGeneration (MvpConfig model)
- nllb-moe — NllbMoeForConditionalGeneration (NllbMoeConfig model)
- pegasus — PegasusForConditionalGeneration (PegasusConfig model)
- pegasus_x — PegasusXForConditionalGeneration (PegasusXConfig model)
- plbart — PLBartForConditionalGeneration (PLBartConfig model)
- prophetnet — ProphetNetForConditionalGeneration (ProphetNetConfig model)
- qwen2_audio — Qwen2AudioForConditionalGeneration (Qwen2AudioConfig model)
- seamless_m4t — SeamlessM4TForTextToText (SeamlessM4TConfig model)
- seamless_m4t_v2 — SeamlessM4Tv2ForTextToText (SeamlessM4Tv2Config model)
- switch_transformers — SwitchTransformersForConditionalGeneration (SwitchTransformersConfig model)
- t5 — T5ForConditionalGeneration (T5Config model)
- t5gemma — T5GemmaForConditionalGeneration (T5GemmaConfig model)
- t5gemma2 — T5Gemma2ForConditionalGeneration (T5Gemma2Config model)
- umt5 — UMT5ForConditionalGeneration (UMT5Config model)
- vibevoice_asr — VibeVoiceAsrForConditionalGeneration (VibeVoiceAsrConfig model)
- voxtral — VoxtralForConditionalGeneration (VoxtralConfig model)
- voxtral_realtime — VoxtralRealtimeForConditionalGeneration (VoxtralRealtimeConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForSeq2SeqLM
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForSeq2SeqLM.from_pretrained("google-t5/t5-base")
>>> # Update configuration during loading
>>> model = AutoModelForSeq2SeqLM.from_pretrained("google-t5/t5-base", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForSequenceClassification
This is a generic model class that will be instantiated as one of the model classes of the library (with a sequence classification head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- AlbertConfig configuration class: AlbertForSequenceClassification (AlbertConfig model)
- ArceeConfig configuration class: ArceeForSequenceClassification (ArceeConfig model)
- BartConfig configuration class: BartForSequenceClassification (BartConfig model)
- BertConfig configuration class: BertForSequenceClassification (BertConfig model)
- BigBirdConfig configuration class: BigBirdForSequenceClassification (BigBirdConfig model)
- BigBirdPegasusConfig configuration class: BigBirdPegasusForSequenceClassification (BigBirdPegasusConfig model)
- BioGptConfig configuration class: BioGptForSequenceClassification (BioGptConfig model)
- BloomConfig configuration class: BloomForSequenceClassification (BloomConfig model)
- CTRLConfig configuration class: CTRLForSequenceClassification (CTRLConfig model)
- CamembertConfig configuration class: CamembertForSequenceClassification (CamembertConfig model)
- CanineConfig configuration class: CanineForSequenceClassification (CanineConfig model)
- ConvBertConfig configuration class: ConvBertForSequenceClassification (ConvBertConfig model)
- Data2VecTextConfig configuration class: Data2VecTextForSequenceClassification (Data2VecTextConfig model)
- DebertaConfig configuration class: DebertaForSequenceClassification (DebertaConfig model)
- DebertaV2Config configuration class: DebertaV2ForSequenceClassification (DebertaV2Config model)
- DeepseekV2Config configuration class: DeepseekV2ForSequenceClassification (DeepseekV2Config model)
- DeepseekV3Config configuration class: DeepseekV3ForSequenceClassification (DeepseekV3Config model)
- DiffLlamaConfig configuration class: DiffLlamaForSequenceClassification (DiffLlamaConfig model)
- DistilBertConfig configuration class: DistilBertForSequenceClassification (DistilBertConfig model)
- DogeConfig configuration class: DogeForSequenceClassification (DogeConfig model)
- ElectraConfig configuration class: ElectraForSequenceClassification (ElectraConfig model)
- ErnieConfig configuration class: ErnieForSequenceClassification (ErnieConfig model)
- EsmConfig configuration class: EsmForSequenceClassification (EsmConfig model)
- EuroBertConfig configuration class: EuroBertForSequenceClassification (EuroBertConfig model)
- Exaone4Config configuration class: Exaone4ForSequenceClassification (Exaone4Config model)
- FNetConfig configuration class: FNetForSequenceClassification (FNetConfig model)
- FalconConfig configuration class: FalconForSequenceClassification (FalconConfig model)
- FlaubertConfig configuration class: FlaubertForSequenceClassification (FlaubertConfig model)
- FunnelConfig configuration class: FunnelForSequenceClassification (FunnelConfig model)
- GPT2Config configuration class: GPT2ForSequenceClassification (GPT2Config model)
- GPTBigCodeConfig configuration class: GPTBigCodeForSequenceClassification (GPTBigCodeConfig model)
- GPTJConfig configuration class: GPTJForSequenceClassification (GPTJConfig model)
- GPTNeoConfig configuration class: GPTNeoForSequenceClassification (GPTNeoConfig model)
- GPTNeoXConfig configuration class: GPTNeoXForSequenceClassification (GPTNeoXConfig model)
- Gemma2Config configuration class: Gemma2ForSequenceClassification (Gemma2Config model)
- Gemma3Config configuration class: Gemma3ForSequenceClassification (Gemma3Config model)
- Gemma3TextConfig configuration class: Gemma3TextForSequenceClassification (Gemma3TextConfig model)
- GemmaConfig configuration class: GemmaForSequenceClassification (GemmaConfig model)
- Glm4Config configuration class: Glm4ForSequenceClassification (Glm4Config model)
- GlmConfig configuration class: GlmForSequenceClassification (GlmConfig model)
- GptOssConfig configuration class: GptOssForSequenceClassification (GptOssConfig model)
- HeliumConfig configuration class: HeliumForSequenceClassification (HeliumConfig model)
- HunYuanDenseV1Config configuration class: HunYuanDenseV1ForSequenceClassification (HunYuanDenseV1Config model)
- HunYuanMoEV1Config configuration class: HunYuanMoEV1ForSequenceClassification (HunYuanMoEV1Config model)
- IBertConfig configuration class: IBertForSequenceClassification (IBertConfig model)
- JambaConfig configuration class: JambaForSequenceClassification (JambaConfig model)
- JetMoeConfig configuration class: JetMoeForSequenceClassification (JetMoeConfig model)
- JinaEmbeddingsV3Config configuration class: JinaEmbeddingsV3ForSequenceClassification (JinaEmbeddingsV3Config model)
- LayoutLMConfig configuration class: LayoutLMForSequenceClassification (LayoutLMConfig model)
- LayoutLMv2Config configuration class: LayoutLMv2ForSequenceClassification (LayoutLMv2Config model)
- LayoutLMv3Config configuration class: LayoutLMv3ForSequenceClassification (LayoutLMv3Config model)
- LiltConfig configuration class: LiltForSequenceClassification (LiltConfig model)
- LlamaConfig configuration class: LlamaForSequenceClassification (LlamaConfig model)
- LongformerConfig configuration class: LongformerForSequenceClassification (LongformerConfig model)
- LukeConfig configuration class: LukeForSequenceClassification (LukeConfig model)
- MBartConfig configuration class: MBartForSequenceClassification (MBartConfig model)
- MPNetConfig configuration class: MPNetForSequenceClassification (MPNetConfig model)
- MT5Config configuration class: MT5ForSequenceClassification (MT5Config model)
- MarkupLMConfig configuration class: MarkupLMForSequenceClassification (MarkupLMConfig model)
- MegatronBertConfig configuration class: MegatronBertForSequenceClassification (MegatronBertConfig model)
- MiniMaxConfig configuration class: MiniMaxForSequenceClassification (MiniMaxConfig model)
- Ministral3Config configuration class: Ministral3ForSequenceClassification (Ministral3Config model)
- MinistralConfig configuration class: MinistralForSequenceClassification (MinistralConfig model)
- Mistral4Config configuration class: Mistral4ForSequenceClassification (Mistral4Config model)
- MistralConfig configuration class: MistralForSequenceClassification (MistralConfig model)
- MixtralConfig configuration class: MixtralForSequenceClassification (MixtralConfig model)
- MobileBertConfig configuration class: MobileBertForSequenceClassification (MobileBertConfig model)
- ModernBertConfig configuration class: ModernBertForSequenceClassification (ModernBertConfig model)
- ModernBertDecoderConfig configuration class: ModernBertDecoderForSequenceClassification (ModernBertDecoderConfig model)
- ModernVBertConfig configuration class: ModernVBertForSequenceClassification (ModernVBertConfig model)
- MptConfig configuration class: MptForSequenceClassification (MptConfig model)
- MraConfig configuration class: MraForSequenceClassification (MraConfig model)
- MvpConfig configuration class: MvpForSequenceClassification (MvpConfig model)
- NemotronConfig configuration class: NemotronForSequenceClassification (NemotronConfig model)
- NomicBertConfig configuration class: NomicBertForSequenceClassification (NomicBertConfig model)
- NystromformerConfig configuration class: NystromformerForSequenceClassification (NystromformerConfig model)
- OPTConfig configuration class: OPTForSequenceClassification (OPTConfig model)
- OpenAIGPTConfig configuration class: OpenAIGPTForSequenceClassification (OpenAIGPTConfig model)
- PLBartConfig configuration class: PLBartForSequenceClassification (PLBartConfig model)
- PerceiverConfig configuration class: PerceiverForSequenceClassification (PerceiverConfig model)
- PersimmonConfig configuration class: PersimmonForSequenceClassification (PersimmonConfig model)
- Phi3Config configuration class: Phi3ForSequenceClassification (Phi3Config model)
- PhiConfig configuration class: PhiForSequenceClassification (PhiConfig model)
- PhimoeConfig configuration class: PhimoeForSequenceClassification (PhimoeConfig model)
- Qwen2Config configuration class: Qwen2ForSequenceClassification (Qwen2Config model)
- Qwen2MoeConfig configuration class: Qwen2MoeForSequenceClassification (Qwen2MoeConfig model)
- Qwen3Config configuration class: Qwen3ForSequenceClassification (Qwen3Config model)
- Qwen3MoeConfig configuration class: Qwen3MoeForSequenceClassification (Qwen3MoeConfig model)
- Qwen3NextConfig configuration class: Qwen3NextForSequenceClassification (Qwen3NextConfig model)
- Qwen3_5Config configuration class: Qwen3_5ForSequenceClassification (Qwen3_5Config model)
- Qwen3_5TextConfig configuration class: Qwen3_5ForSequenceClassification (Qwen3_5TextConfig model)
- ReformerConfig configuration class: ReformerForSequenceClassification (ReformerConfig model)
- RemBertConfig configuration class: RemBertForSequenceClassification (RemBertConfig model)
- RoCBertConfig configuration class: RoCBertForSequenceClassification (RoCBertConfig model)
- RoFormerConfig configuration class: RoFormerForSequenceClassification (RoFormerConfig model)
- RobertaConfig configuration class: RobertaForSequenceClassification (RobertaConfig model)
- RobertaPreLayerNormConfig configuration class: RobertaPreLayerNormForSequenceClassification (RobertaPreLayerNormConfig model)
- SeedOssConfig configuration class: SeedOssForSequenceClassification (SeedOssConfig model)
- SmolLM3Config configuration class: SmolLM3ForSequenceClassification (SmolLM3Config model)
- SqueezeBertConfig configuration class: SqueezeBertForSequenceClassification (SqueezeBertConfig model)
- StableLmConfig configuration class: StableLmForSequenceClassification (StableLmConfig model)
- Starcoder2Config configuration class: Starcoder2ForSequenceClassification (Starcoder2Config model)
- T5Config configuration class: T5ForSequenceClassification (T5Config model)
- T5Gemma2Config configuration class: T5Gemma2ForSequenceClassification (T5Gemma2Config model)
- T5GemmaConfig configuration class: T5GemmaForSequenceClassification (T5GemmaConfig model)
- TapasConfig configuration class: TapasForSequenceClassification (TapasConfig model)
- UMT5Config configuration class: UMT5ForSequenceClassification (UMT5Config model)
- XLMConfig configuration class: XLMForSequenceClassification (XLMConfig model)
- XLMRobertaConfig configuration class: XLMRobertaForSequenceClassification (XLMRobertaConfig model)
- XLMRobertaXLConfig configuration class: XLMRobertaXLForSequenceClassification (XLMRobertaXLConfig model)
- XLNetConfig configuration class: XLNetForSequenceClassification (XLNetConfig model)
- XmodConfig configuration class: XmodForSequenceClassification (XmodConfig model)
- YosoConfig configuration class: YosoForSequenceClassification (YosoConfig model)
- Zamba2Config configuration class: Zamba2ForSequenceClassification (Zamba2Config model)
- ZambaConfig configuration class: ZambaForSequenceClassification (ZambaConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
Instantiates one of the model classes of the library (with a sequence classification head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
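As a sketch of the behavior described above (the tiny BERT configuration below is hypothetical, chosen only to keep the example fast; any registered configuration class resolves the same way):

```python
from transformers import BertConfig, AutoModelForSequenceClassification

# Build a small configuration locally; from_config creates the architecture
# only -- no weights are downloaded, the parameters are randomly initialized.
config = BertConfig(
    hidden_size=64,
    num_hidden_layers=2,
    num_attention_heads=2,
    intermediate_size=128,
    num_labels=3,
)
model = AutoModelForSequenceClassification.from_config(config)
print(type(model).__name__)     # BertForSequenceClassification
print(model.config.num_labels)  # 3
```

Because BertConfig maps to BertForSequenceClassification, the auto class resolves the concrete model class from the configuration type alone.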
from_pretrained
< source >( pretrained_model_name_or_path *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In that case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., do not try to download the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, code_revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with a sequence classification head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- albert — AlbertForSequenceClassification (AlbertConfig model)
- arcee — ArceeForSequenceClassification (ArceeConfig model)
- bart — BartForSequenceClassification (BartConfig model)
- bert — BertForSequenceClassification (BertConfig model)
- big_bird — BigBirdForSequenceClassification (BigBirdConfig model)
- bigbird_pegasus — BigBirdPegasusForSequenceClassification (BigBirdPegasusConfig model)
- biogpt — BioGptForSequenceClassification (BioGptConfig model)
- bloom — BloomForSequenceClassification (BloomConfig model)
- camembert — CamembertForSequenceClassification (CamembertConfig model)
- canine — CanineForSequenceClassification (CanineConfig model)
- convbert — ConvBertForSequenceClassification (ConvBertConfig model)
- ctrl — CTRLForSequenceClassification (CTRLConfig model)
- data2vec-text — Data2VecTextForSequenceClassification (Data2VecTextConfig model)
- deberta — DebertaForSequenceClassification (DebertaConfig model)
- deberta-v2 — DebertaV2ForSequenceClassification (DebertaV2Config model)
- deepseek_v2 — DeepseekV2ForSequenceClassification (DeepseekV2Config model)
- deepseek_v3 — DeepseekV3ForSequenceClassification (DeepseekV3Config model)
- diffllama — DiffLlamaForSequenceClassification (DiffLlamaConfig model)
- distilbert — DistilBertForSequenceClassification (DistilBertConfig model)
- doge — DogeForSequenceClassification (DogeConfig model)
- electra — ElectraForSequenceClassification (ElectraConfig model)
- ernie — ErnieForSequenceClassification (ErnieConfig model)
- esm — EsmForSequenceClassification (EsmConfig model)
- eurobert — EuroBertForSequenceClassification (EuroBertConfig model)
- exaone4 — Exaone4ForSequenceClassification (Exaone4Config model)
- falcon — FalconForSequenceClassification (FalconConfig model)
- flaubert — FlaubertForSequenceClassification (FlaubertConfig model)
- fnet — FNetForSequenceClassification (FNetConfig model)
- funnel — FunnelForSequenceClassification (FunnelConfig model)
- gemma — GemmaForSequenceClassification (GemmaConfig model)
- gemma2 — Gemma2ForSequenceClassification (Gemma2Config model)
- gemma3 — Gemma3ForSequenceClassification (Gemma3Config model)
- gemma3_text — Gemma3TextForSequenceClassification (Gemma3TextConfig model)
- glm — GlmForSequenceClassification (GlmConfig model)
- glm4 — Glm4ForSequenceClassification (Glm4Config model)
- gpt-sw3 — GPT2ForSequenceClassification (GPT2Config model)
- gpt2 — GPT2ForSequenceClassification (GPT2Config model)
- gpt_bigcode — GPTBigCodeForSequenceClassification (GPTBigCodeConfig model)
- gpt_neo — GPTNeoForSequenceClassification (GPTNeoConfig model)
- gpt_neox — GPTNeoXForSequenceClassification (GPTNeoXConfig model)
- gpt_oss — GptOssForSequenceClassification (GptOssConfig model)
- gptj — GPTJForSequenceClassification (GPTJConfig model)
- helium — HeliumForSequenceClassification (HeliumConfig model)
- hunyuan_v1_dense — HunYuanDenseV1ForSequenceClassification (HunYuanDenseV1Config model)
- hunyuan_v1_moe — HunYuanMoEV1ForSequenceClassification (HunYuanMoEV1Config model)
- ibert — IBertForSequenceClassification (IBertConfig model)
- jamba — JambaForSequenceClassification (JambaConfig model)
- jetmoe — JetMoeForSequenceClassification (JetMoeConfig model)
- jina_embeddings_v3 — JinaEmbeddingsV3ForSequenceClassification (JinaEmbeddingsV3Config model)
- layoutlm — LayoutLMForSequenceClassification (LayoutLMConfig model)
- layoutlmv2 — LayoutLMv2ForSequenceClassification (LayoutLMv2Config model)
- layoutlmv3 — LayoutLMv3ForSequenceClassification (LayoutLMv3Config model)
- lilt — LiltForSequenceClassification (LiltConfig model)
- llama — LlamaForSequenceClassification (LlamaConfig model)
- longformer — LongformerForSequenceClassification (LongformerConfig model)
- luke — LukeForSequenceClassification (LukeConfig model)
- markuplm — MarkupLMForSequenceClassification (MarkupLMConfig model)
- mbart — MBartForSequenceClassification (MBartConfig model)
- megatron-bert — MegatronBertForSequenceClassification (MegatronBertConfig model)
- minimax — MiniMaxForSequenceClassification (MiniMaxConfig model)
- ministral — MinistralForSequenceClassification (MinistralConfig model)
- ministral3 — Ministral3ForSequenceClassification (Ministral3Config model)
- mistral — MistralForSequenceClassification (MistralConfig model)
- mistral4 — Mistral4ForSequenceClassification (Mistral4Config model)
- mixtral — MixtralForSequenceClassification (MixtralConfig model)
- mobilebert — MobileBertForSequenceClassification (MobileBertConfig model)
- modernbert — ModernBertForSequenceClassification (ModernBertConfig model)
- modernbert-decoder — ModernBertDecoderForSequenceClassification (ModernBertDecoderConfig model)
- modernvbert — ModernVBertForSequenceClassification (ModernVBertConfig model)
- mpnet — MPNetForSequenceClassification (MPNetConfig model)
- mpt — MptForSequenceClassification (MptConfig model)
- mra — MraForSequenceClassification (MraConfig model)
- mt5 — MT5ForSequenceClassification (MT5Config model)
- mvp — MvpForSequenceClassification (MvpConfig model)
- nemotron — NemotronForSequenceClassification (NemotronConfig model)
- nomic_bert — NomicBertForSequenceClassification (NomicBertConfig model)
- nystromformer — NystromformerForSequenceClassification (NystromformerConfig model)
- openai-gpt — OpenAIGPTForSequenceClassification (OpenAIGPTConfig model)
- opt — OPTForSequenceClassification (OPTConfig model)
- perceiver — PerceiverForSequenceClassification (PerceiverConfig model)
- persimmon — PersimmonForSequenceClassification (PersimmonConfig model)
- phi — PhiForSequenceClassification (PhiConfig model)
- phi3 — Phi3ForSequenceClassification (Phi3Config model)
- phimoe — PhimoeForSequenceClassification (PhimoeConfig model)
- plbart — PLBartForSequenceClassification (PLBartConfig model)
- qwen2 — Qwen2ForSequenceClassification (Qwen2Config model)
- qwen2_moe — Qwen2MoeForSequenceClassification (Qwen2MoeConfig model)
- qwen3 — Qwen3ForSequenceClassification (Qwen3Config model)
- qwen3_5 — Qwen3_5ForSequenceClassification (Qwen3_5Config model)
- qwen3_5_text — Qwen3_5ForSequenceClassification (Qwen3_5TextConfig model)
- qwen3_moe — Qwen3MoeForSequenceClassification (Qwen3MoeConfig model)
- qwen3_next — Qwen3NextForSequenceClassification (Qwen3NextConfig model)
- reformer — ReformerForSequenceClassification (ReformerConfig model)
- rembert — RemBertForSequenceClassification (RemBertConfig model)
- roberta — RobertaForSequenceClassification (RobertaConfig model)
- roberta-prelayernorm — RobertaPreLayerNormForSequenceClassification (RobertaPreLayerNormConfig model)
- roc_bert — RoCBertForSequenceClassification (RoCBertConfig model)
- roformer — RoFormerForSequenceClassification (RoFormerConfig model)
- seed_oss — SeedOssForSequenceClassification (SeedOssConfig model)
- smollm3 — SmolLM3ForSequenceClassification (SmolLM3Config model)
- squeezebert — SqueezeBertForSequenceClassification (SqueezeBertConfig model)
- stablelm — StableLmForSequenceClassification (StableLmConfig model)
- starcoder2 — Starcoder2ForSequenceClassification (Starcoder2Config model)
- t5 — T5ForSequenceClassification (T5Config model)
- t5gemma — T5GemmaForSequenceClassification (T5GemmaConfig model)
- t5gemma2 — T5Gemma2ForSequenceClassification (T5Gemma2Config model)
- tapas — TapasForSequenceClassification (TapasConfig model)
- umt5 — UMT5ForSequenceClassification (UMT5Config model)
- xlm — XLMForSequenceClassification (XLMConfig model)
- xlm-roberta — XLMRobertaForSequenceClassification (XLMRobertaConfig model)
- xlm-roberta-xl — XLMRobertaXLForSequenceClassification (XLMRobertaXLConfig model)
- xlnet — XLNetForSequenceClassification (XLNetConfig model)
- xmod — XmodForSequenceClassification (XmodConfig model)
- yoso — YosoForSequenceClassification (YosoConfig model)
- zamba — ZambaForSequenceClassification (ZambaConfig model)
- zamba2 — Zamba2ForSequenceClassification (Zamba2Config model)
The model is set in evaluation mode by default using model.eval() (so, for instance, dropout modules are deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForSequenceClassification
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForSequenceClassification.from_pretrained("google-bert/bert-base-cased")
>>> # Update configuration during loading
>>> model = AutoModelForSequenceClassification.from_pretrained("google-bert/bert-base-cased", output_attentions=True)
>>> model.config.output_attentions
True

AutoModelForMultipleChoice
This is a generic model class that will be instantiated as one of the model classes of the library (with a multiple choice head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( config **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- AlbertConfig configuration class: AlbertForMultipleChoice (AlbertConfig model)
- BertConfig configuration class: BertForMultipleChoice (BertConfig model)
- BigBirdConfig configuration class: BigBirdForMultipleChoice (BigBirdConfig model)
- CamembertConfig configuration class: CamembertForMultipleChoice (CamembertConfig model)
- CanineConfig configuration class: CanineForMultipleChoice (CanineConfig model)
- ConvBertConfig configuration class: ConvBertForMultipleChoice (ConvBertConfig model)
- Data2VecTextConfig configuration class: Data2VecTextForMultipleChoice (Data2VecTextConfig model)
- DebertaV2Config configuration class: DebertaV2ForMultipleChoice (DebertaV2Config model)
- DistilBertConfig configuration class: DistilBertForMultipleChoice (DistilBertConfig model)
- ElectraConfig configuration class: ElectraForMultipleChoice (ElectraConfig model)
- ErnieConfig configuration class: ErnieForMultipleChoice (ErnieConfig model)
- FNetConfig configuration class: FNetForMultipleChoice (FNetConfig model)
- FlaubertConfig configuration class: FlaubertForMultipleChoice (FlaubertConfig model)
- FunnelConfig configuration class: FunnelForMultipleChoice (FunnelConfig model)
- IBertConfig configuration class: IBertForMultipleChoice (IBertConfig model)
- LongformerConfig configuration class: LongformerForMultipleChoice (LongformerConfig model)
- LukeConfig configuration class: LukeForMultipleChoice (LukeConfig model)
- MPNetConfig configuration class: MPNetForMultipleChoice (MPNetConfig model)
- MegatronBertConfig configuration class: MegatronBertForMultipleChoice (MegatronBertConfig model)
- MobileBertConfig configuration class: MobileBertForMultipleChoice (MobileBertConfig model)
- ModernBertConfig configuration class: ModernBertForMultipleChoice (ModernBertConfig model)
- MraConfig configuration class: MraForMultipleChoice (MraConfig model)
- NystromformerConfig configuration class: NystromformerForMultipleChoice (NystromformerConfig model)
- RemBertConfig configuration class: RemBertForMultipleChoice (RemBertConfig model)
- RoCBertConfig configuration class: RoCBertForMultipleChoice (RoCBertConfig model)
- RoFormerConfig configuration class: RoFormerForMultipleChoice (RoFormerConfig model)
- RobertaConfig configuration class: RobertaForMultipleChoice (RobertaConfig model)
- RobertaPreLayerNormConfig configuration class: RobertaPreLayerNormForMultipleChoice (RobertaPreLayerNormConfig model)
- SqueezeBertConfig configuration class: SqueezeBertForMultipleChoice (SqueezeBertConfig model)
- XLMConfig configuration class: XLMForMultipleChoice (XLMConfig model)
- XLMRobertaConfig configuration class: XLMRobertaForMultipleChoice (XLMRobertaConfig model)
- XLMRobertaXLConfig configuration class: XLMRobertaXLForMultipleChoice (XLMRobertaXLConfig model)
- XLNetConfig configuration class: XLNetForMultipleChoice (XLNetConfig model)
- XmodConfig configuration class: XmodForMultipleChoice (XmodConfig model)
- YosoConfig configuration class: YosoForMultipleChoice (YosoConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, SDPA is used for torch>=2.1.1 when available; otherwise the default is the manual "eager" implementation.
Instantiates one of the model classes of the library (with a multiple choice head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( pretrained_model_name_or_path *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In that case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., do not try to download the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, code_revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with a multiple choice head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- albert — AlbertForMultipleChoice (AlbertConfig model)
- bert — BertForMultipleChoice (BertConfig model)
- big_bird — BigBirdForMultipleChoice (BigBirdConfig model)
- camembert — CamembertForMultipleChoice (CamembertConfig model)
- canine — CanineForMultipleChoice (CanineConfig model)
- convbert — ConvBertForMultipleChoice (ConvBertConfig model)
- data2vec-text — Data2VecTextForMultipleChoice (Data2VecTextConfig model)
- deberta-v2 — DebertaV2ForMultipleChoice (DebertaV2Config model)
- distilbert — DistilBertForMultipleChoice (DistilBertConfig model)
- electra — ElectraForMultipleChoice (ElectraConfig model)
- ernie — ErnieForMultipleChoice (ErnieConfig model)
- flaubert — FlaubertForMultipleChoice (FlaubertConfig model)
- fnet — FNetForMultipleChoice (FNetConfig model)
- funnel — FunnelForMultipleChoice (FunnelConfig model)
- ibert — IBertForMultipleChoice (IBertConfig model)
- longformer — LongformerForMultipleChoice (LongformerConfig model)
- luke — LukeForMultipleChoice (LukeConfig model)
- megatron-bert — MegatronBertForMultipleChoice (MegatronBertConfig model)
- mobilebert — MobileBertForMultipleChoice (MobileBertConfig model)
- modernbert — ModernBertForMultipleChoice (ModernBertConfig model)
- mpnet — MPNetForMultipleChoice (MPNetConfig model)
- mra — MraForMultipleChoice (MraConfig model)
- nystromformer — NystromformerForMultipleChoice (NystromformerConfig model)
- rembert — RemBertForMultipleChoice (RemBertConfig model)
- roberta — RobertaForMultipleChoice (RobertaConfig model)
- roberta-prelayernorm — RobertaPreLayerNormForMultipleChoice (RobertaPreLayerNormConfig model)
- roc_bert — RoCBertForMultipleChoice (RoCBertConfig model)
- roformer — RoFormerForMultipleChoice (RoFormerConfig model)
- squeezebert — SqueezeBertForMultipleChoice (SqueezeBertConfig model)
- xlm — XLMForMultipleChoice (XLMConfig model)
- xlm-roberta — XLMRobertaForMultipleChoice (XLMRobertaConfig model)
- xlm-roberta-xl — XLMRobertaXLForMultipleChoice (XLMRobertaXLConfig model)
- xlnet — XLNetForMultipleChoice (XLNetConfig model)
- xmod — XmodForMultipleChoice (XmodConfig model)
- yoso — YosoForMultipleChoice (YosoConfig model)
The model is set in evaluation mode by default using model.eval() (so, for instance, dropout modules are deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForMultipleChoice
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForMultipleChoice.from_pretrained("google-bert/bert-base-cased")
>>> # Update configuration during loading
>>> model = AutoModelForMultipleChoice.from_pretrained("google-bert/bert-base-cased", output_attentions=True)
>>> model.config.output_attentions
True

AutoModelForNextSentencePrediction
This is a generic model class that will be instantiated as one of the model classes of the library (with a next sentence prediction head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( config **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- BertConfig configuration class: BertForNextSentencePrediction (BertConfig model)
- ErnieConfig configuration class: ErnieForNextSentencePrediction (ErnieConfig model)
- FNetConfig configuration class: FNetForNextSentencePrediction (FNetConfig model)
- MegatronBertConfig configuration class: MegatronBertForNextSentencePrediction (MegatronBertConfig model)
- MobileBertConfig configuration class: MobileBertForNextSentencePrediction (MobileBertConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, SDPA is used for torch>=2.1.1 when available; otherwise the default is the manual "eager" implementation.
Instantiates one of the model classes of the library (with a next sentence prediction head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
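The note above can be observed directly: two from_config instantiations of the same (hypothetical, deliberately tiny) configuration produce independently initialized weights.

```python
import torch
from transformers import BertConfig, AutoModelForNextSentencePrediction

config = BertConfig(
    hidden_size=64,
    num_hidden_layers=2,
    num_attention_heads=2,
    intermediate_size=128,
)
# from_config builds the architecture only; parameters are randomly
# initialized, so two instantiations do not share weights.
a = AutoModelForNextSentencePrediction.from_config(config)
b = AutoModelForNextSentencePrediction.from_config(config)
identical = all(torch.equal(p, q) for p, q in zip(a.parameters(), b.parameters()))
print(identical)  # False: use from_pretrained() to obtain trained weights
```

This is why loading a model for inference should go through from_pretrained(), which restores the saved weights in addition to the configuration.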
from_pretrained
< source >( pretrained_model_name_or_path *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In that case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., do not try to download the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, code_revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with a next sentence prediction head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- bert — BertForNextSentencePrediction (BertConfig model)
- ernie — ErnieForNextSentencePrediction (ErnieConfig model)
- fnet — FNetForNextSentencePrediction (FNetConfig model)
- megatron-bert — MegatronBertForNextSentencePrediction (MegatronBertConfig model)
- mobilebert — MobileBertForNextSentencePrediction (MobileBertConfig model)
The model is set in evaluation mode by default using model.eval() (so, for instance, dropout modules are deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForNextSentencePrediction
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForNextSentencePrediction.from_pretrained("google-bert/bert-base-cased")
>>> # Update configuration during loading
>>> model = AutoModelForNextSentencePrediction.from_pretrained("google-bert/bert-base-cased", output_attentions=True)
>>> model.config.output_attentions
True

AutoModelForTokenClassification
This is a generic model class that will be instantiated as one of the model classes of the library (with a token classification head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- AlbertConfig configuration class: AlbertForTokenClassification (AlbertConfig model)
- ApertusConfig configuration class: ApertusForTokenClassification (ApertusConfig model)
- ArceeConfig configuration class: ArceeForTokenClassification (ArceeConfig model)
- BertConfig configuration class: BertForTokenClassification (BertConfig model)
- BigBirdConfig configuration class: BigBirdForTokenClassification (BigBirdConfig model)
- BioGptConfig configuration class: BioGptForTokenClassification (BioGptConfig model)
- BloomConfig configuration class: BloomForTokenClassification (BloomConfig model)
- BrosConfig configuration class: BrosForTokenClassification (BrosConfig model)
- CamembertConfig configuration class: CamembertForTokenClassification (CamembertConfig model)
- CanineConfig configuration class: CanineForTokenClassification (CanineConfig model)
- ConvBertConfig configuration class: ConvBertForTokenClassification (ConvBertConfig model)
- Data2VecTextConfig configuration class: Data2VecTextForTokenClassification (Data2VecTextConfig model)
- DebertaConfig configuration class: DebertaForTokenClassification (DebertaConfig model)
- DebertaV2Config configuration class: DebertaV2ForTokenClassification (DebertaV2Config model)
- DeepseekV3Config configuration class: DeepseekV3ForTokenClassification (DeepseekV3Config model)
- DiffLlamaConfig configuration class: DiffLlamaForTokenClassification (DiffLlamaConfig model)
- DistilBertConfig configuration class: DistilBertForTokenClassification (DistilBertConfig model)
- ElectraConfig configuration class: ElectraForTokenClassification (ElectraConfig model)
- ErnieConfig configuration class: ErnieForTokenClassification (ErnieConfig model)
- EsmConfig configuration class: EsmForTokenClassification (EsmConfig model)
- EuroBertConfig configuration class: EuroBertForTokenClassification (EuroBertConfig model)
- Exaone4Config configuration class: Exaone4ForTokenClassification (Exaone4Config model)
- FNetConfig configuration class: FNetForTokenClassification (FNetConfig model)
- FalconConfig configuration class: FalconForTokenClassification (FalconConfig model)
- FlaubertConfig configuration class: FlaubertForTokenClassification (FlaubertConfig model)
- FunnelConfig configuration class: FunnelForTokenClassification (FunnelConfig model)
- GPT2Config configuration class: GPT2ForTokenClassification (GPT2Config model)
- GPTBigCodeConfig configuration class: GPTBigCodeForTokenClassification (GPTBigCodeConfig model)
- GPTNeoConfig configuration class: GPTNeoForTokenClassification (GPTNeoConfig model)
- GPTNeoXConfig configuration class: GPTNeoXForTokenClassification (GPTNeoXConfig model)
- Gemma2Config configuration class: Gemma2ForTokenClassification (Gemma2Config model)
- GemmaConfig configuration class: GemmaForTokenClassification (GemmaConfig model)
- Glm4Config configuration class: Glm4ForTokenClassification (Glm4Config model)
- GlmConfig configuration class: GlmForTokenClassification (GlmConfig model)
- GptOssConfig configuration class: GptOssForTokenClassification (GptOssConfig model)
- HeliumConfig configuration class: HeliumForTokenClassification (HeliumConfig model)
- IBertConfig configuration class: IBertForTokenClassification (IBertConfig model)
- JinaEmbeddingsV3Config configuration class: JinaEmbeddingsV3ForTokenClassification (JinaEmbeddingsV3Config model)
- LayoutLMConfig configuration class: LayoutLMForTokenClassification (LayoutLMConfig model)
- LayoutLMv2Config configuration class: LayoutLMv2ForTokenClassification (LayoutLMv2Config model)
- LayoutLMv3Config configuration class: LayoutLMv3ForTokenClassification (LayoutLMv3Config model)
- LiltConfig configuration class: LiltForTokenClassification (LiltConfig model)
- LlamaConfig configuration class: LlamaForTokenClassification (LlamaConfig model)
- LongformerConfig configuration class: LongformerForTokenClassification (LongformerConfig model)
- LukeConfig configuration class: LukeForTokenClassification (LukeConfig model)
- MPNetConfig configuration class: MPNetForTokenClassification (MPNetConfig model)
- MT5Config configuration class: MT5ForTokenClassification (MT5Config model)
- MarkupLMConfig configuration class: MarkupLMForTokenClassification (MarkupLMConfig model)
- MegatronBertConfig configuration class: MegatronBertForTokenClassification (MegatronBertConfig model)
- MiniMaxConfig configuration class: MiniMaxForTokenClassification (MiniMaxConfig model)
- Ministral3Config configuration class: Ministral3ForTokenClassification (Ministral3Config model)
- MinistralConfig configuration class: MinistralForTokenClassification (MinistralConfig model)
- Mistral4Config configuration class: Mistral4ForTokenClassification (Mistral4Config model)
- MistralConfig configuration class: MistralForTokenClassification (MistralConfig model)
- MixtralConfig configuration class: MixtralForTokenClassification (MixtralConfig model)
- MobileBertConfig configuration class: MobileBertForTokenClassification (MobileBertConfig model)
- ModernBertConfig configuration class: ModernBertForTokenClassification (ModernBertConfig model)
- ModernVBertConfig configuration class: ModernVBertForTokenClassification (ModernVBertConfig model)
- MptConfig configuration class: MptForTokenClassification (MptConfig model)
- MraConfig configuration class: MraForTokenClassification (MraConfig model)
- NemotronConfig configuration class: NemotronForTokenClassification (NemotronConfig model)
- NomicBertConfig configuration class: NomicBertForTokenClassification (NomicBertConfig model)
- NystromformerConfig configuration class: NystromformerForTokenClassification (NystromformerConfig model)
- PersimmonConfig configuration class: PersimmonForTokenClassification (PersimmonConfig model)
- Phi3Config configuration class: Phi3ForTokenClassification (Phi3Config model)
- PhiConfig configuration class: PhiForTokenClassification (PhiConfig model)
- Qwen2Config configuration class: Qwen2ForTokenClassification (Qwen2Config model)
- Qwen2MoeConfig configuration class: Qwen2MoeForTokenClassification (Qwen2MoeConfig model)
- Qwen3Config configuration class: Qwen3ForTokenClassification (Qwen3Config model)
- Qwen3MoeConfig configuration class: Qwen3MoeForTokenClassification (Qwen3MoeConfig model)
- Qwen3NextConfig configuration class: Qwen3NextForTokenClassification (Qwen3NextConfig model)
- RemBertConfig configuration class: RemBertForTokenClassification (RemBertConfig model)
- RoCBertConfig configuration class: RoCBertForTokenClassification (RoCBertConfig model)
- RoFormerConfig configuration class: RoFormerForTokenClassification (RoFormerConfig model)
- RobertaConfig configuration class: RobertaForTokenClassification (RobertaConfig model)
- RobertaPreLayerNormConfig configuration class: RobertaPreLayerNormForTokenClassification (RobertaPreLayerNormConfig model)
- SeedOssConfig configuration class: SeedOssForTokenClassification (SeedOssConfig model)
- SmolLM3Config configuration class: SmolLM3ForTokenClassification (SmolLM3Config model)
- SqueezeBertConfig configuration class: SqueezeBertForTokenClassification (SqueezeBertConfig model)
- StableLmConfig configuration class: StableLmForTokenClassification (StableLmConfig model)
- Starcoder2Config configuration class: Starcoder2ForTokenClassification (Starcoder2Config model)
- T5Config configuration class: T5ForTokenClassification (T5Config model)
- T5Gemma2Config configuration class: T5Gemma2ForTokenClassification (T5Gemma2Config model)
- T5GemmaConfig configuration class: T5GemmaForTokenClassification (T5GemmaConfig model)
- UMT5Config configuration class: UMT5ForTokenClassification (UMT5Config model)
- XLMConfig configuration class: XLMForTokenClassification (XLMConfig model)
- XLMRobertaConfig configuration class: XLMRobertaForTokenClassification (XLMRobertaConfig model)
- XLMRobertaXLConfig configuration class: XLMRobertaXLForTokenClassification (XLMRobertaXLConfig model)
- XLNetConfig configuration class: XLNetForTokenClassification (XLNetConfig model)
- XmodConfig configuration class: XmodForTokenClassification (XmodConfig model)
- YosoConfig configuration class: YosoForTokenClassification (YosoConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. Otherwise, the default is the manual "eager" implementation.
Instantiates one of the model classes of the library (with a token classification head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
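The difference between config-only instantiation and weight loading can be sketched without the library at all. In this framework-free illustration (ToyConfig and ToyModel are invented names, not transformers classes), from_config sizes fresh, randomly initialized weights from the configuration alone, while from_pretrained additionally copies saved weights in:

```python
import random

class ToyConfig:
    def __init__(self, hidden_size=4):
        self.hidden_size = hidden_size

class ToyModel:
    """Weights are a flat list; their shape comes from the config alone."""
    def __init__(self, config):
        self.config = config
        # from_config path: architecture from config, weights random
        self.weights = [random.gauss(0.0, 0.02) for _ in range(config.hidden_size)]

    @classmethod
    def from_config(cls, config):
        return cls(config)  # no weights are loaded

    @classmethod
    def from_pretrained(cls, config, state_dict):
        model = cls(config)
        model.weights = list(state_dict["weights"])  # overwrite with saved weights
        return model

config = ToyConfig(hidden_size=3)
fresh = ToyModel.from_config(config)            # randomly initialized
loaded = ToyModel.from_pretrained(config, {"weights": [0.1, 0.2, 0.3]})
assert loaded.weights == [0.1, 0.2, 0.3]
```

A model built via from_config therefore has the right architecture but meaningless weights until it is trained or a state dict is loaded.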
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from a saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case though, you should first check whether using save_pretrained() and from_pretrained() would not be a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (e.g., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it is loaded) and instantiate the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will be first passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
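The kwargs routing described in the parameter list can be sketched in plain Python. This is a simplified illustration of the idea (split_kwargs and ToyConfig are invented names, not the actual loading code):

```python
class ToyConfig:
    """Stand-in config with a couple of attributes."""
    def __init__(self):
        self.output_attentions = False
        self.hidden_size = 8

def split_kwargs(config, **kwargs):
    """Route kwargs: keys matching config attributes override the config;
    everything else is left over for the model's __init__."""
    model_kwargs = {}
    for key, value in kwargs.items():
        if hasattr(config, key):
            setattr(config, key, value)   # override the configuration attribute
        else:
            model_kwargs[key] = value     # pass through to the model
    return config, model_kwargs

config, model_kwargs = split_kwargs(
    ToyConfig(), output_attentions=True, some_model_arg=123
)
assert config.output_attentions is True
assert model_kwargs == {"some_model_arg": 123}
```

This is why, when you let the config be loaded automatically, the same kwargs dict can both tweak the configuration and feed the model constructor.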
Instantiate one of the model classes of the library (with a token classification head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- albert — AlbertForTokenClassification (AlbertConfig model)
- apertus — ApertusForTokenClassification (ApertusConfig model)
- arcee — ArceeForTokenClassification (ArceeConfig model)
- bert — BertForTokenClassification (BertConfig model)
- big_bird — BigBirdForTokenClassification (BigBirdConfig model)
- biogpt — BioGptForTokenClassification (BioGptConfig model)
- bloom — BloomForTokenClassification (BloomConfig model)
- bros — BrosForTokenClassification (BrosConfig model)
- camembert — CamembertForTokenClassification (CamembertConfig model)
- canine — CanineForTokenClassification (CanineConfig model)
- convbert — ConvBertForTokenClassification (ConvBertConfig model)
- data2vec-text — Data2VecTextForTokenClassification (Data2VecTextConfig model)
- deberta — DebertaForTokenClassification (DebertaConfig model)
- deberta-v2 — DebertaV2ForTokenClassification (DebertaV2Config model)
- deepseek_v3 — DeepseekV3ForTokenClassification (DeepseekV3Config model)
- diffllama — DiffLlamaForTokenClassification (DiffLlamaConfig model)
- distilbert — DistilBertForTokenClassification (DistilBertConfig model)
- electra — ElectraForTokenClassification (ElectraConfig model)
- ernie — ErnieForTokenClassification (ErnieConfig model)
- esm — EsmForTokenClassification (EsmConfig model)
- eurobert — EuroBertForTokenClassification (EuroBertConfig model)
- exaone4 — Exaone4ForTokenClassification (Exaone4Config model)
- falcon — FalconForTokenClassification (FalconConfig model)
- flaubert — FlaubertForTokenClassification (FlaubertConfig model)
- fnet — FNetForTokenClassification (FNetConfig model)
- funnel — FunnelForTokenClassification (FunnelConfig model)
- gemma — GemmaForTokenClassification (GemmaConfig model)
- gemma2 — Gemma2ForTokenClassification (Gemma2Config model)
- glm — GlmForTokenClassification (GlmConfig model)
- glm4 — Glm4ForTokenClassification (Glm4Config model)
- gpt-sw3 — GPT2ForTokenClassification (GPT2Config model)
- gpt2 — GPT2ForTokenClassification (GPT2Config model)
- gpt_bigcode — GPTBigCodeForTokenClassification (GPTBigCodeConfig model)
- gpt_neo — GPTNeoForTokenClassification (GPTNeoConfig model)
- gpt_neox — GPTNeoXForTokenClassification (GPTNeoXConfig model)
- gpt_oss — GptOssForTokenClassification (GptOssConfig model)
- helium — HeliumForTokenClassification (HeliumConfig model)
- ibert — IBertForTokenClassification (IBertConfig model)
- jina_embeddings_v3 — JinaEmbeddingsV3ForTokenClassification (JinaEmbeddingsV3Config model)
- layoutlm — LayoutLMForTokenClassification (LayoutLMConfig model)
- layoutlmv2 — LayoutLMv2ForTokenClassification (LayoutLMv2Config model)
- layoutlmv3 — LayoutLMv3ForTokenClassification (LayoutLMv3Config model)
- lilt — LiltForTokenClassification (LiltConfig model)
- llama — LlamaForTokenClassification (LlamaConfig model)
- longformer — LongformerForTokenClassification (LongformerConfig model)
- luke — LukeForTokenClassification (LukeConfig model)
- markuplm — MarkupLMForTokenClassification (MarkupLMConfig model)
- megatron-bert — MegatronBertForTokenClassification (MegatronBertConfig model)
- minimax — MiniMaxForTokenClassification (MiniMaxConfig model)
- ministral — MinistralForTokenClassification (MinistralConfig model)
- ministral3 — Ministral3ForTokenClassification (Ministral3Config model)
- mistral — MistralForTokenClassification (MistralConfig model)
- mistral4 — Mistral4ForTokenClassification (Mistral4Config model)
- mixtral — MixtralForTokenClassification (MixtralConfig model)
- mobilebert — MobileBertForTokenClassification (MobileBertConfig model)
- modernbert — ModernBertForTokenClassification (ModernBertConfig model)
- modernvbert — ModernVBertForTokenClassification (ModernVBertConfig model)
- mpnet — MPNetForTokenClassification (MPNetConfig model)
- mpt — MptForTokenClassification (MptConfig model)
- mra — MraForTokenClassification (MraConfig model)
- mt5 — MT5ForTokenClassification (MT5Config model)
- nemotron — NemotronForTokenClassification (NemotronConfig model)
- nomic_bert — NomicBertForTokenClassification (NomicBertConfig model)
- nystromformer — NystromformerForTokenClassification (NystromformerConfig model)
- persimmon — PersimmonForTokenClassification (PersimmonConfig model)
- phi — PhiForTokenClassification (PhiConfig model)
- phi3 — Phi3ForTokenClassification (Phi3Config model)
- qwen2 — Qwen2ForTokenClassification (Qwen2Config model)
- qwen2_moe — Qwen2MoeForTokenClassification (Qwen2MoeConfig model)
- qwen3 — Qwen3ForTokenClassification (Qwen3Config model)
- qwen3_moe — Qwen3MoeForTokenClassification (Qwen3MoeConfig model)
- qwen3_next — Qwen3NextForTokenClassification (Qwen3NextConfig model)
- rembert — RemBertForTokenClassification (RemBertConfig model)
- roberta — RobertaForTokenClassification (RobertaConfig model)
- roberta-prelayernorm — RobertaPreLayerNormForTokenClassification (RobertaPreLayerNormConfig model)
- roc_bert — RoCBertForTokenClassification (RoCBertConfig model)
- roformer — RoFormerForTokenClassification (RoFormerConfig model)
- seed_oss — SeedOssForTokenClassification (SeedOssConfig model)
- smollm3 — SmolLM3ForTokenClassification (SmolLM3Config model)
- squeezebert — SqueezeBertForTokenClassification (SqueezeBertConfig model)
- stablelm — StableLmForTokenClassification (StableLmConfig model)
- starcoder2 — Starcoder2ForTokenClassification (Starcoder2Config model)
- t5 — T5ForTokenClassification (T5Config model)
- t5gemma — T5GemmaForTokenClassification (T5GemmaConfig model)
- t5gemma2 — T5Gemma2ForTokenClassification (T5Gemma2Config model)
- umt5 — UMT5ForTokenClassification (UMT5Config model)
- xlm — XLMForTokenClassification (XLMConfig model)
- xlm-roberta — XLMRobertaForTokenClassification (XLMRobertaConfig model)
- xlm-roberta-xl — XLMRobertaXLForTokenClassification (XLMRobertaXLConfig model)
- xlnet — XLNetForTokenClassification (XLNetConfig model)
- xmod — XmodForTokenClassification (XmodConfig model)
- yoso — YosoForTokenClassification (YosoConfig model)
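The selection logic above (look up the config's model_type, fall back to pattern matching on the name or path) can be sketched as follows. The mapping below is a tiny illustrative subset, and resolve_model_class is an invented helper, not the real dispatch code:

```python
# Illustrative subset of the model_type -> class mapping above.
# "distilbert" is listed before "bert" so the longer pattern wins
# during substring fallback matching.
TOKEN_CLASSIFICATION_MAPPING = {
    "distilbert": "DistilBertForTokenClassification",
    "roberta": "RobertaForTokenClassification",
    "bert": "BertForTokenClassification",
}

def resolve_model_class(model_type, name_or_path):
    """Prefer the config's model_type; fall back to substring
    matching on the pretrained name/path when it is missing."""
    if model_type is not None:
        return TOKEN_CLASSIFICATION_MAPPING[model_type]
    for key, cls_name in TOKEN_CLASSIFICATION_MAPPING.items():
        if key in name_or_path:
            return cls_name
    raise ValueError(f"Could not infer a model class from {name_or_path!r}")

assert resolve_model_class("bert", "anything") == "BertForTokenClassification"
assert resolve_model_class(None, "my-org/distilbert-ner") == "DistilBertForTokenClassification"
```

The fallback explains why a config with a correct model_type is more reliable than relying on the checkpoint name alone.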
The model is set in evaluation mode by default using model.eval() (so, for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForTokenClassification
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForTokenClassification.from_pretrained("google-bert/bert-base-cased")
>>> # Update configuration during loading
>>> model = AutoModelForTokenClassification.from_pretrained("google-bert/bert-base-cased", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForQuestionAnswering
This is a generic model class that will be instantiated as one of the model classes of the library (with a question answering head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- AlbertConfig configuration class: AlbertForQuestionAnswering (AlbertConfig model)
- ArceeConfig configuration class: ArceeForQuestionAnswering (ArceeConfig model)
- BartConfig configuration class: BartForQuestionAnswering (BartConfig model)
- BertConfig configuration class: BertForQuestionAnswering (BertConfig model)
- BigBirdConfig configuration class: BigBirdForQuestionAnswering (BigBirdConfig model)
- BigBirdPegasusConfig configuration class: BigBirdPegasusForQuestionAnswering (BigBirdPegasusConfig model)
- BloomConfig configuration class: BloomForQuestionAnswering (BloomConfig model)
- CamembertConfig configuration class: CamembertForQuestionAnswering (CamembertConfig model)
- CanineConfig configuration class: CanineForQuestionAnswering (CanineConfig model)
- ConvBertConfig configuration class: ConvBertForQuestionAnswering (ConvBertConfig model)
- Data2VecTextConfig configuration class: Data2VecTextForQuestionAnswering (Data2VecTextConfig model)
- DebertaConfig configuration class: DebertaForQuestionAnswering (DebertaConfig model)
- DebertaV2Config configuration class: DebertaV2ForQuestionAnswering (DebertaV2Config model)
- DiffLlamaConfig configuration class: DiffLlamaForQuestionAnswering (DiffLlamaConfig model)
- DistilBertConfig configuration class: DistilBertForQuestionAnswering (DistilBertConfig model)
- ElectraConfig configuration class: ElectraForQuestionAnswering (ElectraConfig model)
- ErnieConfig configuration class: ErnieForQuestionAnswering (ErnieConfig model)
- Exaone4Config configuration class: Exaone4ForQuestionAnswering (Exaone4Config model)
- FNetConfig configuration class: FNetForQuestionAnswering (FNetConfig model)
- FalconConfig configuration class: FalconForQuestionAnswering (FalconConfig model)
- FlaubertConfig configuration class: FlaubertForQuestionAnsweringSimple (FlaubertConfig model)
- FunnelConfig configuration class: FunnelForQuestionAnswering (FunnelConfig model)
- GPT2Config configuration class: GPT2ForQuestionAnswering (GPT2Config model)
- GPTJConfig configuration class: GPTJForQuestionAnswering (GPTJConfig model)
- GPTNeoConfig configuration class: GPTNeoForQuestionAnswering (GPTNeoConfig model)
- GPTNeoXConfig configuration class: GPTNeoXForQuestionAnswering (GPTNeoXConfig model)
- IBertConfig configuration class: IBertForQuestionAnswering (IBertConfig model)
- JinaEmbeddingsV3Config configuration class: JinaEmbeddingsV3ForQuestionAnswering (JinaEmbeddingsV3Config model)
- LEDConfig configuration class: LEDForQuestionAnswering (LEDConfig model)
- LayoutLMv2Config configuration class: LayoutLMv2ForQuestionAnswering (LayoutLMv2Config model)
- LayoutLMv3Config configuration class: LayoutLMv3ForQuestionAnswering (LayoutLMv3Config model)
- LiltConfig configuration class: LiltForQuestionAnswering (LiltConfig model)
- LlamaConfig configuration class: LlamaForQuestionAnswering (LlamaConfig model)
- LongformerConfig configuration class: LongformerForQuestionAnswering (LongformerConfig model)
- LukeConfig configuration class: LukeForQuestionAnswering (LukeConfig model)
- LxmertConfig configuration class: LxmertForQuestionAnswering (LxmertConfig model)
- MBartConfig configuration class: MBartForQuestionAnswering (MBartConfig model)
- MPNetConfig configuration class: MPNetForQuestionAnswering (MPNetConfig model)
- MT5Config configuration class: MT5ForQuestionAnswering (MT5Config model)
- MarkupLMConfig configuration class: MarkupLMForQuestionAnswering (MarkupLMConfig model)
- MegatronBertConfig configuration class: MegatronBertForQuestionAnswering (MegatronBertConfig model)
- MiniMaxConfig configuration class: MiniMaxForQuestionAnswering (MiniMaxConfig model)
- Ministral3Config configuration class: Ministral3ForQuestionAnswering (Ministral3Config model)
- MinistralConfig configuration class: MinistralForQuestionAnswering (MinistralConfig model)
- MistralConfig configuration class: MistralForQuestionAnswering (MistralConfig model)
- MixtralConfig configuration class: MixtralForQuestionAnswering (MixtralConfig model)
- MobileBertConfig configuration class: MobileBertForQuestionAnswering (MobileBertConfig model)
- ModernBertConfig configuration class: ModernBertForQuestionAnswering (ModernBertConfig model)
- MptConfig configuration class: MptForQuestionAnswering (MptConfig model)
- MraConfig configuration class: MraForQuestionAnswering (MraConfig model)
- MvpConfig configuration class: MvpForQuestionAnswering (MvpConfig model)
- NemotronConfig configuration class: NemotronForQuestionAnswering (NemotronConfig model)
- NystromformerConfig configuration class: NystromformerForQuestionAnswering (NystromformerConfig model)
- OPTConfig configuration class: OPTForQuestionAnswering (OPTConfig model)
- Qwen2Config configuration class: Qwen2ForQuestionAnswering (Qwen2Config model)
- Qwen2MoeConfig configuration class: Qwen2MoeForQuestionAnswering (Qwen2MoeConfig model)
- Qwen3Config configuration class: Qwen3ForQuestionAnswering (Qwen3Config model)
- Qwen3MoeConfig configuration class: Qwen3MoeForQuestionAnswering (Qwen3MoeConfig model)
- Qwen3NextConfig configuration class: Qwen3NextForQuestionAnswering (Qwen3NextConfig model)
- ReformerConfig configuration class: ReformerForQuestionAnswering (ReformerConfig model)
- RemBertConfig configuration class: RemBertForQuestionAnswering (RemBertConfig model)
- RoCBertConfig configuration class: RoCBertForQuestionAnswering (RoCBertConfig model)
- RoFormerConfig configuration class: RoFormerForQuestionAnswering (RoFormerConfig model)
- RobertaConfig configuration class: RobertaForQuestionAnswering (RobertaConfig model)
- RobertaPreLayerNormConfig configuration class: RobertaPreLayerNormForQuestionAnswering (RobertaPreLayerNormConfig model)
- SeedOssConfig configuration class: SeedOssForQuestionAnswering (SeedOssConfig model)
- SmolLM3Config configuration class: SmolLM3ForQuestionAnswering (SmolLM3Config model)
- SplinterConfig configuration class: SplinterForQuestionAnswering (SplinterConfig model)
- SqueezeBertConfig configuration class: SqueezeBertForQuestionAnswering (SqueezeBertConfig model)
- T5Config configuration class: T5ForQuestionAnswering (T5Config model)
- UMT5Config configuration class: UMT5ForQuestionAnswering (UMT5Config model)
- XLMConfig configuration class: XLMForQuestionAnsweringSimple (XLMConfig model)
- XLMRobertaConfig configuration class: XLMRobertaForQuestionAnswering (XLMRobertaConfig model)
- XLMRobertaXLConfig configuration class: XLMRobertaXLForQuestionAnswering (XLMRobertaXLConfig model)
- XLNetConfig configuration class: XLNetForQuestionAnsweringSimple (XLNetConfig model)
- XmodConfig configuration class: XmodForQuestionAnswering (XmodConfig model)
- YosoConfig configuration class: YosoForQuestionAnswering (YosoConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. Otherwise, the default is the manual "eager" implementation.
Instantiates one of the model classes of the library (with a question answering head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
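Question-answering heads of this kind produce start and end logits over the input tokens. A minimal sketch of turning those logits into an answer span (simplified greedy decoding, ignoring batching; extract_span is an invented helper for illustration):

```python
def extract_span(start_logits, end_logits, tokens):
    """Greedy span decoding: best start position, then best end
    position at or after it, so the span is never inverted."""
    start = max(range(len(start_logits)), key=lambda i: start_logits[i])
    end = max(range(start, len(end_logits)), key=lambda i: end_logits[i])
    return " ".join(tokens[start:end + 1])

tokens = ["the", "capital", "is", "paris", "today"]
start_logits = [0.1, 0.2, 0.1, 2.5, 0.3]
end_logits   = [0.0, 0.1, 0.2, 2.0, 0.4]
assert extract_span(start_logits, end_logits, tokens) == "paris"
```

Production decoding typically scores all valid (start, end) pairs up to a maximum answer length instead of this greedy pass, but the head's output format is the same.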
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from a saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case though, you should first check whether using save_pretrained() and from_pretrained() would not be a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (e.g., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it is loaded) and instantiate the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will be first passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
- If a configuration is provided with
Instantiate one of the model classes of the library (with a question answering head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- albert — AlbertForQuestionAnswering (AlbertConfig model)
- arcee — ArceeForQuestionAnswering (ArceeConfig model)
- bart — BartForQuestionAnswering (BartConfig model)
- bert — BertForQuestionAnswering (BertConfig model)
- big_bird — BigBirdForQuestionAnswering (BigBirdConfig model)
- bigbird_pegasus — BigBirdPegasusForQuestionAnswering (BigBirdPegasusConfig model)
- bloom — BloomForQuestionAnswering (BloomConfig model)
- camembert — CamembertForQuestionAnswering (CamembertConfig model)
- canine — CanineForQuestionAnswering (CanineConfig model)
- convbert — ConvBertForQuestionAnswering (ConvBertConfig model)
- data2vec-text — Data2VecTextForQuestionAnswering (Data2VecTextConfig model)
- deberta — DebertaForQuestionAnswering (DebertaConfig model)
- deberta-v2 — DebertaV2ForQuestionAnswering (DebertaV2Config model)
- diffllama — DiffLlamaForQuestionAnswering (DiffLlamaConfig model)
- distilbert — DistilBertForQuestionAnswering (DistilBertConfig model)
- electra — ElectraForQuestionAnswering (ElectraConfig model)
- ernie — ErnieForQuestionAnswering (ErnieConfig model)
- exaone4 — Exaone4ForQuestionAnswering (Exaone4Config model)
- falcon — FalconForQuestionAnswering (FalconConfig model)
- flaubert — FlaubertForQuestionAnsweringSimple (FlaubertConfig model)
- fnet — FNetForQuestionAnswering (FNetConfig model)
- funnel — FunnelForQuestionAnswering (FunnelConfig model)
- gpt2 — GPT2ForQuestionAnswering (GPT2Config model)
- gpt_neo — GPTNeoForQuestionAnswering (GPTNeoConfig model)
- gpt_neox — GPTNeoXForQuestionAnswering (GPTNeoXConfig model)
- gptj — GPTJForQuestionAnswering (GPTJConfig model)
- ibert — IBertForQuestionAnswering (IBertConfig model)
- jina_embeddings_v3 — JinaEmbeddingsV3ForQuestionAnswering (JinaEmbeddingsV3Config model)
- layoutlmv2 — LayoutLMv2ForQuestionAnswering (LayoutLMv2Config model)
- layoutlmv3 — LayoutLMv3ForQuestionAnswering (LayoutLMv3Config model)
- led — LEDForQuestionAnswering (LEDConfig model)
- lilt — LiltForQuestionAnswering (LiltConfig model)
- llama — LlamaForQuestionAnswering (LlamaConfig model)
- longformer — LongformerForQuestionAnswering (LongformerConfig model)
- luke — LukeForQuestionAnswering (LukeConfig model)
- lxmert — LxmertForQuestionAnswering (LxmertConfig model)
- markuplm — MarkupLMForQuestionAnswering (MarkupLMConfig model)
- mbart — MBartForQuestionAnswering (MBartConfig model)
- megatron-bert — MegatronBertForQuestionAnswering (MegatronBertConfig model)
- minimax — MiniMaxForQuestionAnswering (MiniMaxConfig model)
- ministral — MinistralForQuestionAnswering (MinistralConfig model)
- ministral3 — Ministral3ForQuestionAnswering (Ministral3Config model)
- mistral — MistralForQuestionAnswering (MistralConfig model)
- mixtral — MixtralForQuestionAnswering (MixtralConfig model)
- mobilebert — MobileBertForQuestionAnswering (MobileBertConfig model)
- modernbert — ModernBertForQuestionAnswering (ModernBertConfig model)
- mpnet — MPNetForQuestionAnswering (MPNetConfig model)
- mpt — MptForQuestionAnswering (MptConfig model)
- mra — MraForQuestionAnswering (MraConfig model)
- mt5 — MT5ForQuestionAnswering (MT5Config model)
- mvp — MvpForQuestionAnswering (MvpConfig model)
- nemotron — NemotronForQuestionAnswering (NemotronConfig model)
- nystromformer — NystromformerForQuestionAnswering (NystromformerConfig model)
- opt — OPTForQuestionAnswering (OPTConfig model)
- qwen2 — Qwen2ForQuestionAnswering (Qwen2Config model)
- qwen2_moe — Qwen2MoeForQuestionAnswering (Qwen2MoeConfig model)
- qwen3 — Qwen3ForQuestionAnswering (Qwen3Config model)
- qwen3_moe — Qwen3MoeForQuestionAnswering (Qwen3MoeConfig model)
- qwen3_next — Qwen3NextForQuestionAnswering (Qwen3NextConfig model)
- reformer — ReformerForQuestionAnswering (ReformerConfig model)
- rembert — RemBertForQuestionAnswering (RemBertConfig model)
- roberta — RobertaForQuestionAnswering (RobertaConfig model)
- roberta-prelayernorm — RobertaPreLayerNormForQuestionAnswering (RobertaPreLayerNormConfig model)
- roc_bert — RoCBertForQuestionAnswering (RoCBertConfig model)
- roformer — RoFormerForQuestionAnswering (RoFormerConfig model)
- seed_oss — SeedOssForQuestionAnswering (SeedOssConfig model)
- smollm3 — SmolLM3ForQuestionAnswering (SmolLM3Config model)
- splinter — SplinterForQuestionAnswering (SplinterConfig model)
- squeezebert — SqueezeBertForQuestionAnswering (SqueezeBertConfig model)
- t5 — T5ForQuestionAnswering (T5Config model)
- umt5 — UMT5ForQuestionAnswering (UMT5Config model)
- xlm — XLMForQuestionAnsweringSimple (XLMConfig model)
- xlm-roberta — XLMRobertaForQuestionAnswering (XLMRobertaConfig model)
- xlm-roberta-xl — XLMRobertaXLForQuestionAnswering (XLMRobertaXLConfig model)
- xlnet — XLNetForQuestionAnsweringSimple (XLNetConfig model)
- xmod — XmodForQuestionAnswering (XmodConfig model)
- yoso — YosoForQuestionAnswering (YosoConfig model)
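The selection logic above can be pictured as a dictionary lookup on model_type, with substring matching on the model name or path as the fallback. The following is a simplified pure-Python sketch of that mechanism, not the actual transformers implementation, and the mapping lists only a few of the entries above:

```python
# Simplified sketch of auto-class resolution: explicit model_type lookup first,
# then pattern matching on the pretrained model name/path as a fallback.
QA_MAPPING = {
    "bert": "BertForQuestionAnswering",
    "roberta": "RobertaForQuestionAnswering",
    "distilbert": "DistilBertForQuestionAnswering",
}

def resolve_qa_class(name_or_path, model_type=None):
    if model_type is not None:
        # the model_type from the config object takes precedence
        return QA_MAPPING[model_type]
    # fallback: the longest pattern that occurs in the name/path wins,
    # so "distilbert-..." resolves to distilbert rather than bert
    matches = [t for t in QA_MAPPING if t in name_or_path.lower()]
    if not matches:
        raise ValueError(f"Could not infer model type from {name_or_path!r}")
    return QA_MAPPING[max(matches, key=len)]

assert resolve_qa_class("google-bert/bert-base-cased") == "BertForQuestionAnswering"
assert resolve_qa_class("org/distilbert-squad") == "DistilBertForQuestionAnswering"
```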
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
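The eval/train contract described above can be sketched with a toy module. This is a pure-Python mimic of the training-flag behavior, not the real torch.nn.Module API:

```python
# Toy mimic of the train/eval contract: eval() flips a `training` flag
# that layers like dropout consult before applying stochastic behavior.
class TinyModule:
    def __init__(self):
        self.training = True  # modules start in training mode

    def train(self):
        self.training = True
        return self

    def eval(self):
        self.training = False
        return self

    def dropout_active(self):
        # dropout only applies while training, as in the note above
        return self.training

m = TinyModule().eval()          # from_pretrained() calls eval() for you
assert m.dropout_active() is False
m.train()                        # switch back before fine-tuning
assert m.dropout_active() is True
```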
Examples:
>>> from transformers import AutoConfig, AutoModelForQuestionAnswering
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForQuestionAnswering.from_pretrained("google-bert/bert-base-cased")
>>> # Update configuration during loading
>>> model = AutoModelForQuestionAnswering.from_pretrained("google-bert/bert-base-cased", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForTextEncoding
Computer vision
The following auto classes are available for the following computer vision tasks.
AutoModelForDepthEstimation
This is a generic model class that will be instantiated as one of the model classes of the library (with a depth estimation head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- CHMv2Config configuration class: CHMv2ForDepthEstimation (CHMv2Config model)
- DPTConfig configuration class: DPTForDepthEstimation (DPTConfig model)
- DepthAnythingConfig configuration class: DepthAnythingForDepthEstimation (DepthAnythingConfig model)
- DepthProConfig configuration class: DepthProForDepthEstimation (DepthProConfig model)
- GLPNConfig configuration class: GLPNForDepthEstimation (GLPNConfig model)
- PromptDepthAnythingConfig configuration class: PromptDepthAnythingForDepthEstimation (PromptDepthAnythingConfig model)
- ZoeDepthConfig configuration class: ZoeDepthForDepthEstimation (ZoeDepthConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
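The "eager" and "sdpa" implementations mentioned above compute the same function; SDPA is a fused kernel for the manual softmax(QK^T / sqrt(d))V computation. A small sanity-check sketch of that equivalence (assuming PyTorch >= 2.0 is installed, with no attention mask or dropout):

```python
import math
import torch
import torch.nn.functional as F

torch.manual_seed(0)
# (batch, num_heads, seq_len, head_dim)
q = torch.randn(1, 2, 5, 8)
k = torch.randn(1, 2, 5, 8)
v = torch.randn(1, 2, 5, 8)

# "eager": the manual softmax(QK^T / sqrt(d)) V computation
scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
eager_out = torch.softmax(scores, dim=-1) @ v

# "sdpa": the fused PyTorch kernel the parameter description refers to
sdpa_out = F.scaled_dot_product_attention(q, k, v)

assert torch.allclose(eager_out, sdpa_out, atol=1e-5)
```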
Instantiates one of the model classes of the library (with a depth estimation head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
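The distinction the note draws between from_config and from_pretrained can be sketched with a toy class. This is a pure-Python mimic of the contract, not the real transformers API:

```python
import random

class TinyModel:
    # Toy mimic: from_config builds the architecture with fresh random
    # weights; from_pretrained additionally restores stored weights.
    def __init__(self, hidden_size):
        self.weight = [random.random() for _ in range(hidden_size)]

    @classmethod
    def from_config(cls, config):
        # only the architecture (here: the size) comes from the config
        return cls(config["hidden_size"])

    @classmethod
    def from_pretrained(cls, checkpoint):
        # build from the stored config, then load the saved weights
        model = cls(checkpoint["config"]["hidden_size"])
        model.weight = checkpoint["weight"]
        return model

cfg = {"hidden_size": 4}
fresh = TinyModel.from_config(cfg)                      # random init, nothing loaded
ckpt = {"config": cfg, "weight": [0.1, 0.2, 0.3, 0.4]}
loaded = TinyModel.from_pretrained(ckpt)                # weights restored
assert len(fresh.weight) == 4 and loaded.weight == ckpt["weight"]
```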
from_pretrained
< source >( *model_args, **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (e.g., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will be first passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with a depth estimation head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- chmv2 — CHMv2ForDepthEstimation (CHMv2Config model)
- depth_anything — DepthAnythingForDepthEstimation (DepthAnythingConfig model)
- depth_pro — DepthProForDepthEstimation (DepthProConfig model)
- dpt — DPTForDepthEstimation (DPTConfig model)
- glpn — GLPNForDepthEstimation (GLPNConfig model)
- prompt_depth_anything — PromptDepthAnythingForDepthEstimation (PromptDepthAnythingConfig model)
- zoedepth — ZoeDepthForDepthEstimation (ZoeDepthConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForDepthEstimation
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForDepthEstimation.from_pretrained("Intel/dpt-large")
>>> # Update configuration during loading
>>> model = AutoModelForDepthEstimation.from_pretrained("Intel/dpt-large", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForTextRecognition
This is a generic model class that will be instantiated as one of the model classes of the library (with a text recognition head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- PPOCRV5MobileRecConfig configuration class: PPOCRV5MobileRecForTextRecognition (PPOCRV5MobileRecConfig model)
- PPOCRV5ServerRecConfig configuration class: PPOCRV5ServerRecForTextRecognition (PPOCRV5ServerRecConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
Instantiates one of the model classes of the library (with a text recognition head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args, **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (e.g., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will be first passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with a text recognition head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- pp_ocrv5_mobile_rec — PPOCRV5MobileRecForTextRecognition (PPOCRV5MobileRecConfig model)
- pp_ocrv5_server_rec — PPOCRV5ServerRecForTextRecognition (PPOCRV5ServerRecConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForTextRecognition
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForTextRecognition.from_pretrained("google-bert/bert-base-cased")
>>> # Update configuration during loading
>>> model = AutoModelForTextRecognition.from_pretrained("google-bert/bert-base-cased", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForTableRecognition
This is a generic model class that will be instantiated as one of the model classes of the library (with a table recognition head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- SLANeXtConfig configuration class: SLANeXtForTableRecognition (SLANeXtConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
Instantiates one of the model classes of the library (with a table recognition head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args, **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (e.g., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will be first passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with a table recognition head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- slanext — SLANeXtForTableRecognition (SLANeXtConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForTableRecognition
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForTableRecognition.from_pretrained("google-bert/bert-base-cased")
>>> # Update configuration during loading
>>> model = AutoModelForTableRecognition.from_pretrained("google-bert/bert-base-cased", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForImageClassification
This is a generic model class that will be instantiated as one of the model classes of the library (with an image classification head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- BeitConfig configuration class: BeitForImageClassification (BeitConfig model)
- BitConfig configuration class: BitForImageClassification (BitConfig model)
- CLIPConfig configuration class: CLIPForImageClassification (CLIPConfig model)
- ConvNextConfig configuration class: ConvNextForImageClassification (ConvNextConfig model)
- ConvNextV2Config configuration class: ConvNextV2ForImageClassification (ConvNextV2Config model)
- CvtConfig configuration class: CvtForImageClassification (CvtConfig model)
- Data2VecVisionConfig configuration class: Data2VecVisionForImageClassification (Data2VecVisionConfig model)
- DeiTConfig configuration class: DeiTForImageClassification or DeiTForImageClassificationWithTeacher (DeiTConfig model)
- DinatConfig configuration class: DinatForImageClassification (DinatConfig model)
- Dinov2Config configuration class: Dinov2ForImageClassification (Dinov2Config model)
- Dinov2WithRegistersConfig configuration class: Dinov2WithRegistersForImageClassification (Dinov2WithRegistersConfig model)
- DonutSwinConfig configuration class: DonutSwinForImageClassification (DonutSwinConfig model)
- EfficientNetConfig configuration class: EfficientNetForImageClassification (EfficientNetConfig model)
- FocalNetConfig configuration class: FocalNetForImageClassification (FocalNetConfig model)
- HGNetV2Config configuration class: HGNetV2ForImageClassification (HGNetV2Config model)
- HieraConfig configuration class: HieraForImageClassification (HieraConfig model)
- IJepaConfig configuration class: IJepaForImageClassification (IJepaConfig model)
- ImageGPTConfig configuration class: ImageGPTForImageClassification (ImageGPTConfig model)
- LevitConfig configuration class: LevitForImageClassification or LevitForImageClassificationWithTeacher (LevitConfig model)
- MetaClip2Config configuration class: MetaClip2ForImageClassification (MetaClip2Config model)
- MobileNetV1Config configuration class: MobileNetV1ForImageClassification (MobileNetV1Config model)
- MobileNetV2Config configuration class: MobileNetV2ForImageClassification (MobileNetV2Config model)
- MobileViTConfig configuration class: MobileViTForImageClassification (MobileViTConfig model)
- MobileViTV2Config configuration class: MobileViTV2ForImageClassification (MobileViTV2Config model)
- PPLCNetConfig configuration class: PPLCNetForImageClassification (PPLCNetConfig model)
- PerceiverConfig configuration class: PerceiverForImageClassificationLearned or PerceiverForImageClassificationFourier or PerceiverForImageClassificationConvProcessing (PerceiverConfig model)
- PoolFormerConfig configuration class: PoolFormerForImageClassification (PoolFormerConfig model)
- PvtConfig configuration class: PvtForImageClassification (PvtConfig model)
- PvtV2Config configuration class: PvtV2ForImageClassification (PvtV2Config model)
- RegNetConfig configuration class: RegNetForImageClassification (RegNetConfig model)
- ResNetConfig configuration class: ResNetForImageClassification (ResNetConfig model)
- SegformerConfig configuration class: SegformerForImageClassification (SegformerConfig model)
- ShieldGemma2Config configuration class: ShieldGemma2ForImageClassification (ShieldGemma2Config model)
- Siglip2Config configuration class: Siglip2ForImageClassification (Siglip2Config model)
- SiglipConfig configuration class: SiglipForImageClassification (SiglipConfig model)
- SwiftFormerConfig configuration class: SwiftFormerForImageClassification (SwiftFormerConfig model)
- SwinConfig configuration class: SwinForImageClassification (SwinConfig model)
- Swinv2Config configuration class: Swinv2ForImageClassification (Swinv2Config model)
- TextNetConfig configuration class: TextNetForImageClassification (TextNetConfig model)
- TimmWrapperConfig configuration class: TimmWrapperForImageClassification (TimmWrapperConfig model)
- ViTConfig configuration class: ViTForImageClassification (ViTConfig model)
- ViTMSNConfig configuration class: ViTMSNForImageClassification (ViTMSNConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
Instantiates one of the model classes of the library (with an image classification head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args, **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be passed directly to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with an image classification head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- beit — BeitForImageClassification (BeitConfig model)
- bit — BitForImageClassification (BitConfig model)
- clip — CLIPForImageClassification (CLIPConfig model)
- convnext — ConvNextForImageClassification (ConvNextConfig model)
- convnextv2 — ConvNextV2ForImageClassification (ConvNextV2Config model)
- cvt — CvtForImageClassification (CvtConfig model)
- data2vec-vision — Data2VecVisionForImageClassification (Data2VecVisionConfig model)
- deit — DeiTForImageClassification or DeiTForImageClassificationWithTeacher (DeiTConfig model)
- dinat — DinatForImageClassification (DinatConfig model)
- dinov2 — Dinov2ForImageClassification (Dinov2Config model)
- dinov2_with_registers — Dinov2WithRegistersForImageClassification (Dinov2WithRegistersConfig model)
- donut-swin — DonutSwinForImageClassification (DonutSwinConfig model)
- efficientnet — EfficientNetForImageClassification (EfficientNetConfig model)
- focalnet — FocalNetForImageClassification (FocalNetConfig model)
- hgnet_v2 — HGNetV2ForImageClassification (HGNetV2Config model)
- hiera — HieraForImageClassification (HieraConfig model)
- ijepa — IJepaForImageClassification (IJepaConfig model)
- imagegpt — ImageGPTForImageClassification (ImageGPTConfig model)
- levit — LevitForImageClassification or LevitForImageClassificationWithTeacher (LevitConfig model)
- metaclip_2 — MetaClip2ForImageClassification (MetaClip2Config model)
- mobilenet_v1 — MobileNetV1ForImageClassification (MobileNetV1Config model)
- mobilenet_v2 — MobileNetV2ForImageClassification (MobileNetV2Config model)
- mobilevit — MobileViTForImageClassification (MobileViTConfig model)
- mobilevitv2 — MobileViTV2ForImageClassification (MobileViTV2Config model)
- perceiver — PerceiverForImageClassificationLearned or PerceiverForImageClassificationFourier or PerceiverForImageClassificationConvProcessing (PerceiverConfig model)
- poolformer — PoolFormerForImageClassification (PoolFormerConfig model)
- pp_lcnet — PPLCNetForImageClassification (PPLCNetConfig model)
- pvt — PvtForImageClassification (PvtConfig model)
- pvt_v2 — PvtV2ForImageClassification (PvtV2Config model)
- regnet — RegNetForImageClassification (RegNetConfig model)
- resnet — ResNetForImageClassification (ResNetConfig model)
- segformer — SegformerForImageClassification (SegformerConfig model)
- shieldgemma2 — ShieldGemma2ForImageClassification (ShieldGemma2Config model)
- siglip — SiglipForImageClassification (SiglipConfig model)
- siglip2 — Siglip2ForImageClassification (Siglip2Config model)
- swiftformer — SwiftFormerForImageClassification (SwiftFormerConfig model)
- swin — SwinForImageClassification (SwinConfig model)
- swinv2 — Swinv2ForImageClassification (Swinv2Config model)
- textnet — TextNetForImageClassification (TextNetConfig model)
- timm_wrapper — TimmWrapperForImageClassification (TimmWrapperConfig model)
- vit — ViTForImageClassification (ViTConfig model)
- vit_msn — ViTMSNForImageClassification (ViTMSNConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
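The model_type-based fallback described above can be sketched as a lookup into the checkpoint's config.json. This is a simplified illustration, not the real dispatch code, and the mapping below contains only a few of the entries listed above:

```python
import json

# Simplified sketch: from_pretrained() reads config.json from the checkpoint
# and keys into a task-specific mapping by its "model_type" field.
MODEL_FOR_IMAGE_CLASSIFICATION = {
    "vit": "ViTForImageClassification",
    "swin": "SwinForImageClassification",
    "resnet": "ResNetForImageClassification",
}

config_json = '{"model_type": "vit", "image_size": 224, "num_labels": 1000}'
model_type = json.loads(config_json)["model_type"]
selected = MODEL_FOR_IMAGE_CLASSIFICATION[model_type]
print(selected)  # prints: ViTForImageClassification
```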
Examples:
>>> from transformers import AutoConfig, AutoModelForImageClassification
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForImageClassification.from_pretrained("google/vit-base-patch16-224")
>>> # Update configuration during loading
>>> model = AutoModelForImageClassification.from_pretrained("google/vit-base-patch16-224", output_attentions=True)
>>> model.config.output_attentions
True

AutoModelForVideoClassification
This is a generic model class that will be instantiated as one of the model classes of the library (with a video classification head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- TimesformerConfig configuration class: TimesformerForVideoClassification (TimesformerConfig model)
- VJEPA2Config configuration class: VJEPA2ForVideoClassification (VJEPA2Config model)
- VideoMAEConfig configuration class: VideoMAEForVideoClassification (VideoMAEConfig model)
- VivitConfig configuration class: VivitForVideoClassification (VivitConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, SDPA will be used for torch>=2.1.1 when available; otherwise the default is the manual "eager" implementation.
Instantiates one of the model classes of the library (with a video classification head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be passed directly to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with a video classification head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- timesformer — TimesformerForVideoClassification (TimesformerConfig model)
- videomae — VideoMAEForVideoClassification (VideoMAEConfig model)
- vivit — VivitForVideoClassification (VivitConfig model)
- vjepa2 — VJEPA2ForVideoClassification (VJEPA2Config model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
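The eval/train behavior above can be sketched without loading any real weights. This is an illustrative toy class mimicking the training flag, not the actual PreTrainedModel implementation:

```python
class TinyModelSketch:
    """Toy stand-in for a model with a training-mode flag (illustrative only)."""
    def __init__(self):
        self.training = True  # freshly constructed modules start in train mode

    def eval(self):
        self.training = False  # disables dropout-like behavior
        return self

    def train(self):
        self.training = True   # re-enables it for fine-tuning
        return self

# from_pretrained() effectively calls .eval() on the loaded model by default.
m = TinyModelSketch().eval()
assert m.training is False

# Before fine-tuning, switch back to training mode.
m.train()
assert m.training is True
```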
Examples:
>>> from transformers import AutoConfig, AutoModelForVideoClassification
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForVideoClassification.from_pretrained("MCG-NJU/videomae-base-finetuned-kinetics")
>>> # Update configuration during loading
>>> model = AutoModelForVideoClassification.from_pretrained("MCG-NJU/videomae-base-finetuned-kinetics", output_attentions=True)
>>> model.config.output_attentions
True

AutoModelForKeypointDetection
AutoModelForKeypointMatching
AutoModelForMaskedImageModeling
This is a generic model class that will be instantiated as one of the model classes of the library (with a masked image modeling head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- DeiTConfig configuration class: DeiTForMaskedImageModeling (DeiTConfig model)
- FocalNetConfig configuration class: FocalNetForMaskedImageModeling (FocalNetConfig model)
- SwinConfig configuration class: SwinForMaskedImageModeling (SwinConfig model)
- Swinv2Config configuration class: Swinv2ForMaskedImageModeling (Swinv2Config model)
- ViTConfig configuration class: ViTForMaskedImageModeling (ViTConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, SDPA will be used for torch>=2.1.1 when available; otherwise the default is the manual "eager" implementation.
Instantiates one of the model classes of the library (with a masked image modeling head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be passed directly to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with a masked image modeling head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- deit — DeiTForMaskedImageModeling (DeiTConfig model)
- focalnet — FocalNetForMaskedImageModeling (FocalNetConfig model)
- swin — SwinForMaskedImageModeling (SwinConfig model)
- swinv2 — Swinv2ForMaskedImageModeling (Swinv2Config model)
- vit — ViTForMaskedImageModeling (ViTConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForMaskedImageModeling
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForMaskedImageModeling.from_pretrained("microsoft/swin-base-simmim-window6-192")
>>> # Update configuration during loading
>>> model = AutoModelForMaskedImageModeling.from_pretrained("microsoft/swin-base-simmim-window6-192", output_attentions=True)
>>> model.config.output_attentions
True

AutoModelForObjectDetection
This is a generic model class that will be instantiated as one of the model classes of the library (with an object detection head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- ConditionalDetrConfig configuration class: ConditionalDetrForObjectDetection (ConditionalDetrConfig model)
- DFineConfig configuration class: DFineForObjectDetection (DFineConfig model)
- DabDetrConfig configuration class: DabDetrForObjectDetection (DabDetrConfig model)
- DeformableDetrConfig configuration class: DeformableDetrForObjectDetection (DeformableDetrConfig model)
- DetrConfig configuration class: DetrForObjectDetection (DetrConfig model)
- LwDetrConfig configuration class: LwDetrForObjectDetection (LwDetrConfig model)
- PPDocLayoutV2Config configuration class: PPDocLayoutV2ForObjectDetection (PPDocLayoutV2Config model)
- PPDocLayoutV3Config configuration class: PPDocLayoutV3ForObjectDetection (PPDocLayoutV3Config model)
- PPOCRV5MobileDetConfig configuration class: PPOCRV5MobileDetForObjectDetection (PPOCRV5MobileDetConfig model)
- PPOCRV5ServerDetConfig configuration class: PPOCRV5ServerDetForObjectDetection (PPOCRV5ServerDetConfig model)
- RTDetrConfig configuration class: RTDetrForObjectDetection (RTDetrConfig model)
- RTDetrV2Config configuration class: RTDetrV2ForObjectDetection (RTDetrV2Config model)
- TableTransformerConfig configuration class: TableTransformerForObjectDetection (TableTransformerConfig model)
- YolosConfig configuration class: YolosForObjectDetection (YolosConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, SDPA will be used for torch>=2.1.1 when available; otherwise the default is the manual "eager" implementation.
Instantiates one of the model classes of the library (with an object detection head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id. Since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be passed directly to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with an object detection head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- conditional_detr — ConditionalDetrForObjectDetection (ConditionalDetrConfig model)
- d_fine — DFineForObjectDetection (DFineConfig model)
- dab-detr — DabDetrForObjectDetection (DabDetrConfig model)
- deformable_detr — DeformableDetrForObjectDetection (DeformableDetrConfig model)
- detr — DetrForObjectDetection (DetrConfig model)
- lw_detr — LwDetrForObjectDetection (LwDetrConfig model)
- pp_doclayout_v2 — PPDocLayoutV2ForObjectDetection (PPDocLayoutV2Config model)
- pp_doclayout_v3 — PPDocLayoutV3ForObjectDetection (PPDocLayoutV3Config model)
- pp_ocrv5_mobile_det — PPOCRV5MobileDetForObjectDetection (PPOCRV5MobileDetConfig model)
- pp_ocrv5_server_det — PPOCRV5ServerDetForObjectDetection (PPOCRV5ServerDetConfig model)
- rt_detr — RTDetrForObjectDetection (RTDetrConfig model)
- rt_detr_v2 — RTDetrV2ForObjectDetection (RTDetrV2Config model)
- table-transformer — TableTransformerForObjectDetection (TableTransformerConfig model)
- yolos — YolosForObjectDetection (YolosConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForObjectDetection
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForObjectDetection.from_pretrained("facebook/detr-resnet-50")
>>> # Update configuration during loading
>>> model = AutoModelForObjectDetection.from_pretrained("facebook/detr-resnet-50", output_attentions=True)
>>> model.config.output_attentions
True

AutoModelForImageSegmentation
This is a generic model class that will be instantiated as one of the model classes of the library (with an image segmentation head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- DetrConfig configuration class: DetrForSegmentation (DetrConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, SDPA will be used for torch>=2.1.1 when available; otherwise the default is the manual "eager" implementation.
Instantiates one of the model classes of the library (with an image segmentation head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (e.g., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it is loaded) and initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model's __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will be first passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model's __init__ function.
Instantiate one of the model classes of the library (with an image segmentation head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- detr — DetrForSegmentation (DetrConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForImageSegmentation
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForImageSegmentation.from_pretrained("facebook/detr-resnet-50-panoptic")
>>> # Update configuration during loading
>>> model = AutoModelForImageSegmentation.from_pretrained("facebook/detr-resnet-50-panoptic", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForImageToImage
AutoModelForSemanticSegmentation
This is a generic model class that will be instantiated as one of the model classes of the library (with a semantic segmentation head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- BeitConfig configuration class: BeitForSemanticSegmentation (BeitConfig model)
- DPTConfig configuration class: DPTForSemanticSegmentation (DPTConfig model)
- Data2VecVisionConfig configuration class: Data2VecVisionForSemanticSegmentation (Data2VecVisionConfig model)
- MobileNetV2Config configuration class: MobileNetV2ForSemanticSegmentation (MobileNetV2Config model)
- MobileViTConfig configuration class: MobileViTForSemanticSegmentation (MobileViTConfig model)
- MobileViTV2Config configuration class: MobileViTV2ForSemanticSegmentation (MobileViTV2Config model)
- SegformerConfig configuration class: SegformerForSemanticSegmentation (SegformerConfig model)
- UperNetConfig configuration class: UperNetForSemanticSegmentation (UperNetConfig model)
- attn_implementation (
str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
Instantiates one of the model classes of the library (with a semantic segmentation head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
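As a sketch of this difference, the snippet below builds a randomly initialized SegformerForSemanticSegmentation from a locally constructed SegformerConfig; nothing is downloaded, and the num_labels value is arbitrary:

```python
from transformers import AutoModelForSemanticSegmentation, SegformerConfig

# Build a configuration locally; nothing is fetched from the Hub.
config = SegformerConfig(num_labels=2)  # num_labels chosen arbitrarily for this sketch

# from_config dispatches on the configuration class:
# SegformerConfig -> SegformerForSemanticSegmentation.
model = AutoModelForSemanticSegmentation.from_config(config)
print(type(model).__name__)  # SegformerForSemanticSegmentation

# The weights are random; call from_pretrained() to load trained weights instead.
```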
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (e.g., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it is loaded) and initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model's __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will be first passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model's __init__ function.
Instantiate one of the model classes of the library (with a semantic segmentation head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- beit — BeitForSemanticSegmentation (BeitConfig model)
- data2vec-vision — Data2VecVisionForSemanticSegmentation (Data2VecVisionConfig model)
- dpt — DPTForSemanticSegmentation (DPTConfig model)
- mobilenet_v2 — MobileNetV2ForSemanticSegmentation (MobileNetV2Config model)
- mobilevit — MobileViTForSemanticSegmentation (MobileViTConfig model)
- mobilevitv2 — MobileViTV2ForSemanticSegmentation (MobileViTV2Config model)
- segformer — SegformerForSemanticSegmentation (SegformerConfig model)
- upernet — UperNetForSemanticSegmentation (UperNetConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
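A fully offline sketch of the kwargs behavior described above: a small, randomly initialized model saved to a temporary directory stands in for a real checkpoint, and a config attribute is overridden at load time:

```python
import tempfile

from transformers import AutoModelForSemanticSegmentation, SegformerConfig

# Stand-in "checkpoint": a small randomly initialized Segformer saved locally.
model = AutoModelForSemanticSegmentation.from_config(SegformerConfig(num_labels=2))

with tempfile.TemporaryDirectory() as tmp:
    model.save_pretrained(tmp)
    # No `config` argument is passed, so `output_attentions=True` is routed to the
    # configuration class first and overrides the loaded config attribute.
    reloaded = AutoModelForSemanticSegmentation.from_pretrained(tmp, output_attentions=True)

print(reloaded.config.output_attentions)  # True
```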
Examples:
>>> from transformers import AutoConfig, AutoModelForSemanticSegmentation
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForSemanticSegmentation.from_pretrained("nvidia/segformer-b0-finetuned-ade-512-512")
>>> # Update configuration during loading
>>> model = AutoModelForSemanticSegmentation.from_pretrained("nvidia/segformer-b0-finetuned-ade-512-512", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForInstanceSegmentation
This is a generic model class that will be instantiated as one of the model classes of the library (with an instance segmentation head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- MaskFormerConfig configuration class: MaskFormerForInstanceSegmentation (MaskFormerConfig model)
- attn_implementation (
str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
Instantiates one of the model classes of the library (with an instance segmentation head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (e.g., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it is loaded) and initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model's __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will be first passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model's __init__ function.
Instantiate one of the model classes of the library (with an instance segmentation head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- maskformer — MaskFormerForInstanceSegmentation (MaskFormerConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForInstanceSegmentation
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForInstanceSegmentation.from_pretrained("facebook/maskformer-swin-base-coco")
>>> # Update configuration during loading
>>> model = AutoModelForInstanceSegmentation.from_pretrained("facebook/maskformer-swin-base-coco", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForUniversalSegmentation
This is a generic model class that will be instantiated as one of the model classes of the library (with a universal image segmentation head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- DetrConfig configuration class: DetrForSegmentation (DetrConfig model)
- EomtConfig configuration class: EomtForUniversalSegmentation (EomtConfig model)
- EomtDinov3Config configuration class: EomtDinov3ForUniversalSegmentation (EomtDinov3Config model)
- Mask2FormerConfig configuration class: Mask2FormerForUniversalSegmentation (Mask2FormerConfig model)
- MaskFormerConfig configuration class: MaskFormerForInstanceSegmentation (MaskFormerConfig model)
- OneFormerConfig configuration class: OneFormerForUniversalSegmentation (OneFormerConfig model)
- VideomtConfig configuration class: VideomtForUniversalSegmentation (VideomtConfig model)
- attn_implementation (
str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
Instantiates one of the model classes of the library (with a universal image segmentation head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (e.g., not try downloading the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it is loaded) and initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be directly passed to the underlying model's __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will be first passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model's __init__ function.
Instantiate one of the model classes of the library (with a universal image segmentation head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- detr — DetrForSegmentation (DetrConfig model)
- eomt — EomtForUniversalSegmentation (EomtConfig model)
- eomt_dinov3 — EomtDinov3ForUniversalSegmentation (EomtDinov3Config model)
- mask2former — Mask2FormerForUniversalSegmentation (Mask2FormerConfig model)
- maskformer — MaskFormerForInstanceSegmentation (MaskFormerConfig model)
- oneformer — OneFormerForUniversalSegmentation (OneFormerConfig model)
- videomt — VideomtForUniversalSegmentation (VideomtConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForUniversalSegmentation
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForUniversalSegmentation.from_pretrained("facebook/mask2former-swin-base-coco-panoptic")
>>> # Update configuration during loading
>>> model = AutoModelForUniversalSegmentation.from_pretrained("facebook/mask2former-swin-base-coco-panoptic", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForZeroShotImageClassification
This is a generic model class that will be instantiated as one of the model classes of the library (with a zero-shot image classification head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- AlignConfig configuration class: AlignModel (AlignConfig model)
- AltCLIPConfig configuration class: AltCLIPModel (AltCLIPConfig model)
- Blip2Config configuration class: Blip2ForImageTextRetrieval (Blip2Config model)
- BlipConfig configuration class: BlipModel (BlipConfig model)
- CLIPConfig configuration class: CLIPModel (CLIPConfig model)
- CLIPSegConfig configuration class: CLIPSegModel (CLIPSegConfig model)
- ChineseCLIPConfig configuration class: ChineseCLIPModel (ChineseCLIPConfig model)
- MetaClip2Config configuration class: MetaClip2Model (MetaClip2Config model)
- Siglip2Config configuration class: Siglip2Model (Siglip2Config model)
- SiglipConfig configuration class: SiglipModel (SiglipConfig model)
- attn_implementation (
str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
Instantiates one of the model classes of the library (with a zero-shot image classification head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
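For instance, a deliberately tiny CLIP configuration (all sizes below are arbitrary and much smaller than any released checkpoint) resolves to CLIPModel without downloading anything:

```python
from transformers import AutoModelForZeroShotImageClassification, CLIPConfig

# Tiny, arbitrary dimensions so the randomly initialized model is cheap to build.
config = CLIPConfig(
    text_config={"hidden_size": 32, "intermediate_size": 64,
                 "num_hidden_layers": 2, "num_attention_heads": 2},
    vision_config={"hidden_size": 32, "intermediate_size": 64,
                   "num_hidden_layers": 2, "num_attention_heads": 2,
                   "image_size": 32, "patch_size": 8},
    projection_dim=16,
)

# from_config dispatches on the configuration class: CLIPConfig -> CLIPModel.
model = AutoModelForZeroShotImageClassification.from_config(config)
print(type(model).__name__)  # CLIPModel
```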
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (
stroros.PathLike) — Can be either:- A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
- A path to a directory containing model weights saved using
save_pretrained(), e.g.,
./my_model_directory/.
- model_args (additional positional arguments, optional) —
Will be passed along to the underlying model
__init__()method. - config (PreTrainedConfig, optional) —
Configuration for the model to use instead of an automatically loaded configuration. Configuration can
be automatically loaded when:
- The model is a model provided by the library (loaded with the model id string of a pretrained model).
- The model was saved using save_pretrained() and is reloaded by supplying the save directory.
- The model is loaded by supplying a local directory as
pretrained_model_name_or_pathand a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) —
A state dictionary to use instead of a state dictionary loaded from saved weights file.
This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case though, you should check if using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (
str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., not try to download the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be passed directly to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
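The kwargs-splitting behavior described above can be sketched as follows. This is a simplified illustration, not the library's actual code: ToyConfig and split_kwargs are hypothetical stand-ins for the real config class and the internal partitioning logic.

```python
# Simplified sketch (not the library's actual implementation) of how
# from_pretrained splits **kwargs when no explicit config is passed:
# keys matching config attributes update the config, everything else
# is forwarded to the model's __init__.
class ToyConfig:
    def __init__(self):
        self.output_attentions = False
        self.hidden_size = 768

def split_kwargs(config, kwargs):
    config_updates = {k: v for k, v in kwargs.items() if hasattr(config, k)}
    model_kwargs = {k: v for k, v in kwargs.items() if not hasattr(config, k)}
    for key, value in config_updates.items():
        setattr(config, key, value)  # override the config attribute
    return config, model_kwargs

config, extra = split_kwargs(ToyConfig(), {"output_attentions": True, "custom_arg": 1})
print(config.output_attentions)  # True
print(extra)                     # {'custom_arg': 1}
```

This is why passing output_attentions=True with no explicit config updates the loaded configuration, while an unrecognized keyword ends up in the model's __init__.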
Instantiate one of the model classes of the library (with a zero-shot image classification head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- align — AlignModel (AlignConfig model)
- altclip — AltCLIPModel (AltCLIPConfig model)
- blip — BlipModel (BlipConfig model)
- blip-2 — Blip2ForImageTextRetrieval (Blip2Config model)
- chinese_clip — ChineseCLIPModel (ChineseCLIPConfig model)
- clip — CLIPModel (CLIPConfig model)
- clipseg — CLIPSegModel (CLIPSegConfig model)
- metaclip_2 — MetaClip2Model (MetaClip2Config model)
- siglip — SiglipModel (SiglipConfig model)
- siglip2 — Siglip2Model (Siglip2Config model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForZeroShotImageClassification
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForZeroShotImageClassification.from_pretrained("openai/clip-vit-base-patch32")
>>> # Update configuration during loading
>>> model = AutoModelForZeroShotImageClassification.from_pretrained("openai/clip-vit-base-patch32", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForZeroShotObjectDetection
This is a generic model class that will be instantiated as one of the model classes of the library (with a zero-shot object detection head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- GroundingDinoConfig configuration class: GroundingDinoForObjectDetection (GroundingDinoConfig model)
- MMGroundingDinoConfig configuration class: MMGroundingDinoForObjectDetection (MMGroundingDinoConfig model)
- OmDetTurboConfig configuration class: OmDetTurboForObjectDetection (OmDetTurboConfig model)
- OwlViTConfig configuration class: OwlViTForObjectDetection (OwlViTConfig model)
- Owlv2Config configuration class: Owlv2ForObjectDetection (Owlv2Config model)
- attn_implementation (
str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
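The "eager" implementation mentioned above is the plain softmax(QKᵀ/√d)V computation; "sdpa" and the flash-attention backends compute the same result in a single fused kernel. A tiny pure-Python sketch of the eager path, for illustration only:

```python
import math

def eager_attention(q, k, v):
    """Manual ("eager") attention: softmax(Q @ K^T / sqrt(d)) @ V.
    q, k, v are lists of row vectors; sdpa/flash attention fuse this
    same math into one kernel instead of materializing the scores."""
    d = len(q[0])
    out = []
    for q_row in q:
        scores = [sum(qi * ki for qi, ki in zip(q_row, k_row)) / math.sqrt(d)
                  for k_row in k]
        m = max(scores)                      # subtract max for numerical stability
        exps = [math.exp(s - m) for s in scores]
        z = sum(exps)
        weights = [e / z for e in exps]      # softmax over key positions
        out.append([sum(w * v_row[j] for w, v_row in zip(weights, v))
                    for j in range(len(v[0]))])
    return out

q = k = v = [[1.0, 0.0], [0.0, 1.0]]
attn = eager_attention(q, k, v)
```

Each output row is a convex combination of the value rows, so its attention weights sum to 1; the fused backends change only how this is computed, not the result.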
Instantiates one of the model classes of the library (with a zero-shot object detection head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
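The configuration-class dispatch performed by from_config can be sketched as a simple registry lookup. This is a toy illustration: the real mapping lives inside the auto classes, and the classes below are empty stand-ins for the library ones.

```python
# Toy illustration of the configuration-class -> model-class dispatch
# performed by from_config (all classes here are stand-ins).
class GroundingDinoConfig: pass
class OwlViTConfig: pass

class GroundingDinoForObjectDetection:
    def __init__(self, config):
        self.config = config

class OwlViTForObjectDetection:
    def __init__(self, config):
        self.config = config

MODEL_MAPPING = {
    GroundingDinoConfig: GroundingDinoForObjectDetection,
    OwlViTConfig: OwlViTForObjectDetection,
}

def from_config(config):
    try:
        model_class = MODEL_MAPPING[type(config)]
    except KeyError:
        raise ValueError(f"Unrecognized configuration class {type(config).__name__}")
    # The model is built from the config alone: weights are randomly
    # initialized here, which is why from_config does not load weights.
    return model_class(config)

model = from_config(OwlViTConfig())
print(type(model).__name__)  # OwlViTForObjectDetection
```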
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (
str or os.PathLike) — Can be either: - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
- A path to a directory containing model weights saved using
save_pretrained(), e.g.,
./my_model_directory/.
- model_args (additional positional arguments, optional) —
Will be passed along to the underlying model
__init__() method. - config (PreTrainedConfig, optional) —
Configuration for the model to use instead of an automatically loaded configuration. Configuration can
be automatically loaded when:
- The model is a model provided by the library (loaded with the model id string of a pretrained model).
- The model was saved using save_pretrained() and is reloaded by supplying the save directory.
- The model is loaded by supplying a local directory as
pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) —
A state dictionary to use instead of a state dictionary loaded from saved weights file.
This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case though, you should check if using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., not try to download the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be passed directly to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with a zero-shot object detection head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- grounding-dino — GroundingDinoForObjectDetection (GroundingDinoConfig model)
- mm-grounding-dino — MMGroundingDinoForObjectDetection (MMGroundingDinoConfig model)
- omdet-turbo — OmDetTurboForObjectDetection (OmDetTurboConfig model)
- owlv2 — Owlv2ForObjectDetection (Owlv2Config model)
- owlvit — OwlViTForObjectDetection (OwlViTConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForZeroShotObjectDetection
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForZeroShotObjectDetection.from_pretrained("google/owlvit-base-patch32")
>>> # Update configuration during loading
>>> model = AutoModelForZeroShotObjectDetection.from_pretrained("google/owlvit-base-patch32", output_attentions=True)
>>> model.config.output_attentions
True
Audio
The following auto classes are available for the following audio tasks.
AutoModelForAudioClassification
This is a generic model class that will be instantiated as one of the model classes of the library (with an audio classification head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- ASTConfig configuration class: ASTForAudioClassification (ASTConfig model)
- Data2VecAudioConfig configuration class: Data2VecAudioForSequenceClassification (Data2VecAudioConfig model)
- HubertConfig configuration class: HubertForSequenceClassification (HubertConfig model)
- SEWConfig configuration class: SEWForSequenceClassification (SEWConfig model)
- SEWDConfig configuration class: SEWDForSequenceClassification (SEWDConfig model)
- UniSpeechConfig configuration class: UniSpeechForSequenceClassification (UniSpeechConfig model)
- UniSpeechSatConfig configuration class: UniSpeechSatForSequenceClassification (UniSpeechSatConfig model)
- Wav2Vec2BertConfig configuration class: Wav2Vec2BertForSequenceClassification (Wav2Vec2BertConfig model)
- Wav2Vec2Config configuration class: Wav2Vec2ForSequenceClassification (Wav2Vec2Config model)
- Wav2Vec2ConformerConfig configuration class: Wav2Vec2ConformerForSequenceClassification (Wav2Vec2ConformerConfig model)
- WavLMConfig configuration class: WavLMForSequenceClassification (WavLMConfig model)
- WhisperConfig configuration class: WhisperForAudioClassification (WhisperConfig model)
- attn_implementation (
str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
Instantiates one of the model classes of the library (with an audio classification head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (
str or os.PathLike) — Can be either: - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
- A path to a directory containing model weights saved using
save_pretrained(), e.g.,
./my_model_directory/.
- model_args (additional positional arguments, optional) —
Will be passed along to the underlying model
__init__() method. - config (PreTrainedConfig, optional) —
Configuration for the model to use instead of an automatically loaded configuration. Configuration can
be automatically loaded when:
- The model is a model provided by the library (loaded with the model id string of a pretrained model).
- The model was saved using save_pretrained() and is reloaded by supplying the save directory.
- The model is loaded by supplying a local directory as
pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) —
A state dictionary to use instead of a state dictionary loaded from saved weights file.
This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case though, you should check if using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., not try to download the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be passed directly to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with an audio classification head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- audio-spectrogram-transformer — ASTForAudioClassification (ASTConfig model)
- data2vec-audio — Data2VecAudioForSequenceClassification (Data2VecAudioConfig model)
- hubert — HubertForSequenceClassification (HubertConfig model)
- sew — SEWForSequenceClassification (SEWConfig model)
- sew-d — SEWDForSequenceClassification (SEWDConfig model)
- unispeech — UniSpeechForSequenceClassification (UniSpeechConfig model)
- unispeech-sat — UniSpeechSatForSequenceClassification (UniSpeechSatConfig model)
- wav2vec2 — Wav2Vec2ForSequenceClassification (Wav2Vec2Config model)
- wav2vec2-bert — Wav2Vec2BertForSequenceClassification (Wav2Vec2BertConfig model)
- wav2vec2-conformer — Wav2Vec2ConformerForSequenceClassification (Wav2Vec2ConformerConfig model)
- wavlm — WavLMForSequenceClassification (WavLMConfig model)
- whisper — WhisperForAudioClassification (WhisperConfig model)
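The model_type lookup with its pattern-matching fallback can be sketched like this. It is a simplified illustration: resolve_class is a hypothetical helper, not a library API, and the mapping below is a small excerpt of the real one.

```python
# Sketch of how the class is picked: first by config.model_type, then by
# falling back to pattern matching on the checkpoint name or path.
# (resolve_class is a hypothetical helper, not a transformers API.)
MODEL_TYPE_TO_CLASS = {
    "wav2vec2": "Wav2Vec2ForSequenceClassification",
    "whisper": "WhisperForAudioClassification",
    "hubert": "HubertForSequenceClassification",
}

def resolve_class(model_type, name_or_path):
    if model_type in MODEL_TYPE_TO_CLASS:
        return MODEL_TYPE_TO_CLASS[model_type]
    # Fallback when model_type is missing: the longest model-type key
    # found inside the repo name or path wins.
    matches = [key for key in MODEL_TYPE_TO_CLASS if key in name_or_path]
    if not matches:
        raise ValueError(f"Could not infer model type from {name_or_path!r}")
    return MODEL_TYPE_TO_CLASS[max(matches, key=len)]

print(resolve_class(None, "facebook/wav2vec2-base"))  # Wav2Vec2ForSequenceClassification
print(resolve_class("whisper", "some/local/dir"))     # WhisperForAudioClassification
```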
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForAudioClassification
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForAudioClassification.from_pretrained("facebook/wav2vec2-base")
>>> # Update configuration during loading
>>> model = AutoModelForAudioClassification.from_pretrained("facebook/wav2vec2-base", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForAudioFrameClassification
This is a generic model class that will be instantiated as one of the model classes of the library (with an audio frame (token) classification head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- Data2VecAudioConfig configuration class: Data2VecAudioForAudioFrameClassification (Data2VecAudioConfig model)
- UniSpeechSatConfig configuration class: UniSpeechSatForAudioFrameClassification (UniSpeechSatConfig model)
- Wav2Vec2BertConfig configuration class: Wav2Vec2BertForAudioFrameClassification (Wav2Vec2BertConfig model)
- Wav2Vec2Config configuration class: Wav2Vec2ForAudioFrameClassification (Wav2Vec2Config model)
- Wav2Vec2ConformerConfig configuration class: Wav2Vec2ConformerForAudioFrameClassification (Wav2Vec2ConformerConfig model)
- WavLMConfig configuration class: WavLMForAudioFrameClassification (WavLMConfig model)
- attn_implementation (
str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
Instantiates one of the model classes of the library (with an audio frame (token) classification head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (
str or os.PathLike) — Can be either: - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
- A path to a directory containing model weights saved using
save_pretrained(), e.g.,
./my_model_directory/.
- model_args (additional positional arguments, optional) —
Will be passed along to the underlying model
__init__() method. - config (PreTrainedConfig, optional) —
Configuration for the model to use instead of an automatically loaded configuration. Configuration can
be automatically loaded when:
- The model is a model provided by the library (loaded with the model id string of a pretrained model).
- The model was saved using save_pretrained() and is reloaded by supplying the save directory.
- The model is loaded by supplying a local directory as
pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) —
A state dictionary to use instead of a state dictionary loaded from saved weights file.
This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case though, you should check if using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., not try to download the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be passed directly to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with an audio frame (token) classification head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- data2vec-audio — Data2VecAudioForAudioFrameClassification (Data2VecAudioConfig model)
- unispeech-sat — UniSpeechSatForAudioFrameClassification (UniSpeechSatConfig model)
- wav2vec2 — Wav2Vec2ForAudioFrameClassification (Wav2Vec2Config model)
- wav2vec2-bert — Wav2Vec2BertForAudioFrameClassification (Wav2Vec2BertConfig model)
- wav2vec2-conformer — Wav2Vec2ConformerForAudioFrameClassification (Wav2Vec2ConformerConfig model)
- wavlm — WavLMForAudioFrameClassification (WavLMConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForAudioFrameClassification
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForAudioFrameClassification.from_pretrained("facebook/wav2vec2-base")
>>> # Update configuration during loading
>>> model = AutoModelForAudioFrameClassification.from_pretrained("facebook/wav2vec2-base", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForCTC
This is a generic model class that will be instantiated as one of the model classes of the library (with a connectionist temporal classification head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
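A CTC head emits one token prediction per audio frame; decoding then collapses consecutive repeats and removes the blank token. A minimal greedy-decoding sketch (the blank id of 0 and the toy alphabet are assumptions for illustration):

```python
# Minimal greedy CTC decoding sketch: collapse consecutive repeats,
# then drop the blank token (blank id 0 assumed here for illustration).
def ctc_greedy_decode(frame_ids, blank_id=0):
    decoded, previous = [], None
    for token_id in frame_ids:
        if token_id != previous and token_id != blank_id:
            decoded.append(token_id)
        previous = token_id
    return decoded

# Frames spelling "hello": h h _ e e l _ l o  (1=h, 2=e, 3=l, 4=o, 0=blank).
# The blank between the two l's is what preserves the repeated letter.
print(ctc_greedy_decode([1, 1, 0, 2, 2, 3, 0, 3, 4]))  # [1, 2, 3, 3, 4]
```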
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- Data2VecAudioConfig configuration class: Data2VecAudioForCTC (Data2VecAudioConfig model)
- HubertConfig configuration class: HubertForCTC (HubertConfig model)
- LasrCTCConfig configuration class: LasrForCTC (LasrCTCConfig model)
- ParakeetCTCConfig configuration class: ParakeetForCTC (ParakeetCTCConfig model)
- SEWConfig configuration class: SEWForCTC (SEWConfig model)
- SEWDConfig configuration class: SEWDForCTC (SEWDConfig model)
- UniSpeechConfig configuration class: UniSpeechForCTC (UniSpeechConfig model)
- UniSpeechSatConfig configuration class: UniSpeechSatForCTC (UniSpeechSatConfig model)
- Wav2Vec2BertConfig configuration class: Wav2Vec2BertForCTC (Wav2Vec2BertConfig model)
- Wav2Vec2Config configuration class: Wav2Vec2ForCTC (Wav2Vec2Config model)
- Wav2Vec2ConformerConfig configuration class: Wav2Vec2ConformerForCTC (Wav2Vec2ConformerConfig model)
- WavLMConfig configuration class: WavLMForCTC (WavLMConfig model)
- attn_implementation (
str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual "eager" implementation.
Instantiates one of the model classes of the library (with a connectionist temporal classification head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (
str or os.PathLike) — Can be either: - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
- A path to a directory containing model weights saved using
save_pretrained(), e.g.,
./my_model_directory/.
- model_args (additional positional arguments, optional) —
Will be passed along to the underlying model
__init__() method. - config (PreTrainedConfig, optional) —
Configuration for the model to use instead of an automatically loaded configuration. Configuration can
be automatically loaded when:
- The model is a model provided by the library (loaded with the model id string of a pretrained model).
- The model was saved using save_pretrained() and is reloaded by supplying the save directory.
- The model is loaded by supplying a local directory as
pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) —
A state dictionary to use instead of a state dictionary loaded from saved weights file.
This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case though, you should check if using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., not try to download the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id; since we use a git-based system for storing models and other artifacts on huggingface.co, revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be passed directly to the underlying model’s __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s __init__ function.
Instantiate one of the model classes of the library (with a connectionist temporal classification head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- data2vec-audio — Data2VecAudioForCTC (Data2VecAudioConfig model)
- hubert — HubertForCTC (HubertConfig model)
- lasr_ctc — LasrForCTC (LasrCTCConfig model)
- parakeet_ctc — ParakeetForCTC (ParakeetCTCConfig model)
- sew — SEWForCTC (SEWConfig model)
- sew-d — SEWDForCTC (SEWDConfig model)
- unispeech — UniSpeechForCTC (UniSpeechConfig model)
- unispeech-sat — UniSpeechSatForCTC (UniSpeechSatConfig model)
- wav2vec2 — Wav2Vec2ForCTC (Wav2Vec2Config model)
- wav2vec2-bert — Wav2Vec2BertForCTC (Wav2Vec2BertConfig model)
- wav2vec2-conformer — Wav2Vec2ConformerForCTC (Wav2Vec2ConformerConfig model)
- wavlm — WavLMForCTC (WavLMConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForCTC
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForCTC.from_pretrained("facebook/wav2vec2-base-960h")
>>> # Update configuration during loading
>>> model = AutoModelForCTC.from_pretrained("facebook/wav2vec2-base-960h", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForSpeechSeq2Seq
This is a generic model class that will be instantiated as one of the model classes of the library (with a sequence-to-sequence speech-to-text modeling head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- CohereAsrConfig configuration class: CohereAsrForConditionalGeneration (CohereAsrConfig model)
- DiaConfig configuration class: DiaForConditionalGeneration (DiaConfig model)
- GraniteSpeechConfig configuration class: GraniteSpeechForConditionalGeneration (GraniteSpeechConfig model)
- KyutaiSpeechToTextConfig configuration class: KyutaiSpeechToTextForConditionalGeneration (KyutaiSpeechToTextConfig model)
- MoonshineConfig configuration class: MoonshineForConditionalGeneration (MoonshineConfig model)
- MoonshineStreamingConfig configuration class: MoonshineStreamingForConditionalGeneration (MoonshineStreamingConfig model)
- Pop2PianoConfig configuration class: Pop2PianoForConditionalGeneration (Pop2PianoConfig model)
- SeamlessM4TConfig configuration class: SeamlessM4TForSpeechToText (SeamlessM4TConfig model)
- SeamlessM4Tv2Config configuration class: SeamlessM4Tv2ForSpeechToText (SeamlessM4Tv2Config model)
- Speech2TextConfig configuration class: Speech2TextForConditionalGeneration (Speech2TextConfig model)
- SpeechEncoderDecoderConfig configuration class: SpeechEncoderDecoderModel (SpeechEncoderDecoderConfig model)
- SpeechT5Config configuration class: SpeechT5ForSpeechToText (SpeechT5Config model)
- VibeVoiceAsrConfig configuration class: VibeVoiceAsrForConditionalGeneration (VibeVoiceAsrConfig model)
- VoxtralConfig configuration class: VoxtralForConditionalGeneration (VoxtralConfig model)
- VoxtralRealtimeConfig configuration class: VoxtralRealtimeForConditionalGeneration (VoxtralRealtimeConfig model)
- WhisperConfig configuration class: WhisperForConditionalGeneration (WhisperConfig model)
- attn_implementation (
str, optional) — The attention implementation to use in the model (if relevant). Can be any of"eager"(manual implementation of the attention),"sdpa"(usingF.scaled_dot_product_attention),"flash_attention_2"(using Dao-AILab/flash-attention), or"flash_attention_3"(using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual"eager"implementation.
Instantiates one of the model classes of the library (with a sequence-to-sequence speech-to-text modeling head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
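For instance, the sketch below builds a deliberately tiny WhisperConfig (the sizes are arbitrary, chosen only to keep instantiation fast, and do not match any released checkpoint) and lets the auto class resolve it to WhisperForConditionalGeneration with randomly initialized weights:

```python
from transformers import AutoModelForSpeechSeq2Seq, WhisperConfig

# Hypothetical, shrunken dimensions; not the values of any pretrained model.
config = WhisperConfig(
    d_model=64,
    encoder_layers=2,
    decoder_layers=2,
    encoder_attention_heads=2,
    decoder_attention_heads=2,
    encoder_ffn_dim=128,
    decoder_ffn_dim=128,
)

# The auto class dispatches on the configuration class, not on any weights.
model = AutoModelForSpeechSeq2Seq.from_config(config)
print(type(model).__name__)  # WhisperForConditionalGeneration
```

Because no weights are loaded, the model is only useful as a starting point for training or for testing shapes and code paths.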
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (
stroros.PathLike) — Can be either:- A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
- A path to a directory containing model weights saved using
save_pretrained(), e.g.,
./my_model_directory/.
- model_args (additional positional arguments, optional) —
Will be passed along to the underlying model
__init__()method. - config (PreTrainedConfig, optional) —
Configuration for the model to use instead of an automatically loaded configuration. Configuration can
be automatically loaded when:
- The model is a model provided by the library (loaded with the model id string of a pretrained model).
- The model was saved using save_pretrained() and is reloaded by supplying the save directory.
- The model is loaded by supplying a local directory as
pretrained_model_name_or_pathand a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) —
A state dictionary to use instead of a state dictionary loaded from the saved weights file.
This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() would be a simpler option.
- cache_dir (
stroros.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used. - force_download (
bool, optional, defaults toFalse) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist. - proxies (
dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g.,{'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request. - output_loading_info(
bool, optional, defaults toFalse) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages. - local_files_only(
bool, optional, defaults toFalse) — Whether or not to only look at local files (e.g., not try downloading the model). - revision (
str, optional, defaults to"main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, sorevisioncan be any identifier allowed by git. - trust_remote_code (
bool, optional, defaults toFalse) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set toTruefor repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine. - code_revision (
str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git. - kwargs (additional keyword arguments, optional) —
Can be used to update the configuration object (after it is loaded) and initialize the model (e.g.,
output_attentions=True). Behaves differently depending on whether aconfigis provided or automatically loaded:- If a configuration is provided with
config,**kwargswill be directly passed to the underlying model’s__init__method (we assume all relevant updates to the configuration have already been done) - If a configuration is not provided,
kwargswill be first passed to the configuration class initialization function (from_pretrained()). Each key ofkwargsthat corresponds to a configuration attribute will be used to override said attribute with the suppliedkwargsvalue. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s__init__function.
- If a configuration is provided with
Instantiate one of the model classes of the library (with a sequence-to-sequence speech-to-text modeling head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- cohere_asr — CohereAsrForConditionalGeneration (CohereAsrConfig model)
- dia — DiaForConditionalGeneration (DiaConfig model)
- granite_speech — GraniteSpeechForConditionalGeneration (GraniteSpeechConfig model)
- kyutai_speech_to_text — KyutaiSpeechToTextForConditionalGeneration (KyutaiSpeechToTextConfig model)
- moonshine — MoonshineForConditionalGeneration (MoonshineConfig model)
- moonshine_streaming — MoonshineStreamingForConditionalGeneration (MoonshineStreamingConfig model)
- pop2piano — Pop2PianoForConditionalGeneration (Pop2PianoConfig model)
- seamless_m4t — SeamlessM4TForSpeechToText (SeamlessM4TConfig model)
- seamless_m4t_v2 — SeamlessM4Tv2ForSpeechToText (SeamlessM4Tv2Config model)
- speech-encoder-decoder — SpeechEncoderDecoderModel (SpeechEncoderDecoderConfig model)
- speech_to_text — Speech2TextForConditionalGeneration (Speech2TextConfig model)
- speecht5 — SpeechT5ForSpeechToText (SpeechT5Config model)
- vibevoice_asr — VibeVoiceAsrForConditionalGeneration (VibeVoiceAsrConfig model)
- voxtral — VoxtralForConditionalGeneration (VoxtralConfig model)
- voxtral_realtime — VoxtralRealtimeForConditionalGeneration (VoxtralRealtimeConfig model)
- whisper — WhisperForConditionalGeneration (WhisperConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForSpeechSeq2Seq
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForSpeechSeq2Seq.from_pretrained("openai/whisper-tiny")
>>> # Update configuration during loading
>>> model = AutoModelForSpeechSeq2Seq.from_pretrained("openai/whisper-tiny", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForAudioXVector
This is a generic model class that will be instantiated as one of the model classes of the library (with an x-vector head for audio retrieval) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- Data2VecAudioConfig configuration class: Data2VecAudioForXVector (Data2VecAudioConfig model)
- UniSpeechSatConfig configuration class: UniSpeechSatForXVector (UniSpeechSatConfig model)
- Wav2Vec2BertConfig configuration class: Wav2Vec2BertForXVector (Wav2Vec2BertConfig model)
- Wav2Vec2Config configuration class: Wav2Vec2ForXVector (Wav2Vec2Config model)
- Wav2Vec2ConformerConfig configuration class: Wav2Vec2ConformerForXVector (Wav2Vec2ConformerConfig model)
- WavLMConfig configuration class: WavLMForXVector (WavLMConfig model)
- attn_implementation (
str, optional) — The attention implementation to use in the model (if relevant). Can be any of"eager"(manual implementation of the attention),"sdpa"(usingF.scaled_dot_product_attention),"flash_attention_2"(using Dao-AILab/flash-attention), or"flash_attention_3"(using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual"eager"implementation.
Instantiates one of the model classes of the library (with an x-vector head for audio retrieval) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
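As a minimal sketch, a WavLMConfig resolves through this auto class to WavLMForXVector; the reduced layer sizes below are illustrative only, and the resulting weights are random:

```python
from transformers import AutoModelForAudioXVector, WavLMConfig

# Hypothetical small transformer stack to keep instantiation cheap.
config = WavLMConfig(
    hidden_size=32,
    num_hidden_layers=2,
    num_attention_heads=2,
    intermediate_size=64,
)
model = AutoModelForAudioXVector.from_config(config)
print(type(model).__name__)  # WavLMForXVector
```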
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (
stroros.PathLike) — Can be either:- A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
- A path to a directory containing model weights saved using
save_pretrained(), e.g.,
./my_model_directory/.
- model_args (additional positional arguments, optional) —
Will be passed along to the underlying model
__init__()method. - config (PreTrainedConfig, optional) —
Configuration for the model to use instead of an automatically loaded configuration. Configuration can
be automatically loaded when:
- The model is a model provided by the library (loaded with the model id string of a pretrained model).
- The model was saved using save_pretrained() and is reloaded by supplying the save directory.
- The model is loaded by supplying a local directory as
pretrained_model_name_or_pathand a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) —
A state dictionary to use instead of a state dictionary loaded from the saved weights file.
This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() would be a simpler option.
- cache_dir (
stroros.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used. - force_download (
bool, optional, defaults toFalse) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist. - proxies (
dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g.,{'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request. - output_loading_info(
bool, optional, defaults toFalse) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages. - local_files_only(
bool, optional, defaults toFalse) — Whether or not to only look at local files (e.g., not try downloading the model). - revision (
str, optional, defaults to"main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, sorevisioncan be any identifier allowed by git. - trust_remote_code (
bool, optional, defaults toFalse) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set toTruefor repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine. - code_revision (
str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git. - kwargs (additional keyword arguments, optional) —
Can be used to update the configuration object (after it is loaded) and initialize the model (e.g.,
output_attentions=True). Behaves differently depending on whether aconfigis provided or automatically loaded:- If a configuration is provided with
config,**kwargswill be directly passed to the underlying model’s__init__method (we assume all relevant updates to the configuration have already been done) - If a configuration is not provided,
kwargswill be first passed to the configuration class initialization function (from_pretrained()). Each key ofkwargsthat corresponds to a configuration attribute will be used to override said attribute with the suppliedkwargsvalue. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s__init__function.
- If a configuration is provided with
Instantiate one of the model classes of the library (with an x-vector head for audio retrieval) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- data2vec-audio — Data2VecAudioForXVector (Data2VecAudioConfig model)
- unispeech-sat — UniSpeechSatForXVector (UniSpeechSatConfig model)
- wav2vec2 — Wav2Vec2ForXVector (Wav2Vec2Config model)
- wav2vec2-bert — Wav2Vec2BertForXVector (Wav2Vec2BertConfig model)
- wav2vec2-conformer — Wav2Vec2ConformerForXVector (Wav2Vec2ConformerConfig model)
- wavlm — WavLMForXVector (WavLMConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForAudioXVector
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForAudioXVector.from_pretrained("microsoft/wavlm-base-plus-sv")
>>> # Update configuration during loading
>>> model = AutoModelForAudioXVector.from_pretrained("microsoft/wavlm-base-plus-sv", output_attentions=True)
>>> model.config.output_attentions
True
AutoModelForTextToSpectrogram
AutoModelForTextToWaveform
AutoModelForAudioTokenization
This is a generic model class that will be instantiated as one of the model classes of the library (with an audio tokenization head based on codebooks) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- DacConfig configuration class: DacModel (DacConfig model)
- HiggsAudioV2TokenizerConfig configuration class: HiggsAudioV2TokenizerModel (HiggsAudioV2TokenizerConfig model)
- VibeVoiceAcousticTokenizerConfig configuration class: VibeVoiceAcousticTokenizerModel (VibeVoiceAcousticTokenizerConfig model)
- attn_implementation (
str, optional) — The attention implementation to use in the model (if relevant). Can be any of"eager"(manual implementation of the attention),"sdpa"(usingF.scaled_dot_product_attention),"flash_attention_2"(using Dao-AILab/flash-attention), or"flash_attention_3"(using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. The default is otherwise the manual"eager"implementation.
Instantiates one of the model classes of the library (with an audio tokenization head based on codebooks) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (
stroros.PathLike) — Can be either:- A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
- A path to a directory containing model weights saved using
save_pretrained(), e.g.,
./my_model_directory/.
- model_args (additional positional arguments, optional) —
Will be passed along to the underlying model
__init__()method. - config (PreTrainedConfig, optional) —
Configuration for the model to use instead of an automatically loaded configuration. Configuration can
be automatically loaded when:
- The model is a model provided by the library (loaded with the model id string of a pretrained model).
- The model was saved using save_pretrained() and is reloaded by supplying the save directory.
- The model is loaded by supplying a local directory as
pretrained_model_name_or_pathand a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) —
A state dictionary to use instead of a state dictionary loaded from the saved weights file.
This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() would be a simpler option.
- cache_dir (
stroros.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used. - force_download (
bool, optional, defaults toFalse) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist. - proxies (
dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g.,{'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request. - output_loading_info(
bool, optional, defaults toFalse) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages. - local_files_only(
bool, optional, defaults toFalse) — Whether or not to only look at local files (e.g., not try downloading the model). - revision (
str, optional, defaults to"main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, sorevisioncan be any identifier allowed by git. - trust_remote_code (
bool, optional, defaults toFalse) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set toTruefor repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine. - code_revision (
str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git. - kwargs (additional keyword arguments, optional) —
Can be used to update the configuration object (after it is loaded) and initialize the model (e.g.,
output_attentions=True). Behaves differently depending on whether aconfigis provided or automatically loaded:- If a configuration is provided with
config,**kwargswill be directly passed to the underlying model’s__init__method (we assume all relevant updates to the configuration have already been done) - If a configuration is not provided,
kwargswill be first passed to the configuration class initialization function (from_pretrained()). Each key ofkwargsthat corresponds to a configuration attribute will be used to override said attribute with the suppliedkwargsvalue. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s__init__function.
- If a configuration is provided with
Instantiate one of the model classes of the library (with an audio tokenization head based on codebooks) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- dac — DacModel (DacConfig model)
- higgs_audio_v2_tokenizer — HiggsAudioV2TokenizerModel (HiggsAudioV2TokenizerConfig model)
- vibevoice_acoustic_tokenizer — VibeVoiceAcousticTokenizerModel (VibeVoiceAcousticTokenizerConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForAudioTokenization
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForAudioTokenization.from_pretrained("descript/dac_16khz")
>>> # Update configuration during loading
>>> model = AutoModelForAudioTokenization.from_pretrained("descript/dac_16khz", output_attentions=True)
>>> model.config.output_attentions
True
Multimodal
The following auto classes are available for the following multimodal tasks.
AutoModelForMultimodalLM
This is a generic model class that will be instantiated as one of the model classes of the library (with a multimodal generation head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- AriaConfig configuration class: AriaForConditionalGeneration (AriaConfig model)
- AyaVisionConfig configuration class: AyaVisionForConditionalGeneration (AyaVisionConfig model)
- Blip2Config configuration class: Blip2ForConditionalGeneration (Blip2Config model)
- BlipConfig configuration class: BlipForConditionalGeneration (BlipConfig model)
- ChameleonConfig configuration class: ChameleonForConditionalGeneration (ChameleonConfig model)
- Cohere2VisionConfig configuration class: Cohere2VisionForConditionalGeneration (Cohere2VisionConfig model)
- DeepseekVLConfig configuration class: DeepseekVLForConditionalGeneration (DeepseekVLConfig model)
- DeepseekVLHybridConfig configuration class: DeepseekVLHybridForConditionalGeneration (DeepseekVLHybridConfig model)
- Emu3Config configuration class: Emu3ForConditionalGeneration (Emu3Config model)
- Ernie4_5_VLMoeConfig configuration class: Ernie4_5_VLMoeForConditionalGeneration (Ernie4_5_VLMoeConfig model)
- EvollaConfig configuration class: EvollaForProteinText2Text (EvollaConfig model)
- FastVlmConfig configuration class: FastVlmForConditionalGeneration (FastVlmConfig model)
- Florence2Config configuration class: Florence2ForConditionalGeneration (Florence2Config model)
- FuyuConfig configuration class: FuyuForCausalLM (FuyuConfig model)
- Gemma3Config configuration class: Gemma3ForConditionalGeneration (Gemma3Config model)
- Gemma3nConfig configuration class: Gemma3nForConditionalGeneration (Gemma3nConfig model)
- Gemma4Config configuration class: Gemma4ForConditionalGeneration (Gemma4Config model)
- GitConfig configuration class: GitForCausalLM (GitConfig model)
- Glm46VConfig configuration class: Glm46VForConditionalGeneration (Glm46VConfig model)
- Glm4vConfig configuration class: Glm4vForConditionalGeneration (Glm4vConfig model)
- Glm4vMoeConfig configuration class: Glm4vMoeForConditionalGeneration (Glm4vMoeConfig model)
- GlmAsrConfig configuration class: GlmAsrForConditionalGeneration (GlmAsrConfig model)
- GlmOcrConfig configuration class: GlmOcrForConditionalGeneration (GlmOcrConfig model)
- GotOcr2Config configuration class: GotOcr2ForConditionalGeneration (GotOcr2Config model)
- GraniteSpeechConfig configuration class: GraniteSpeechForConditionalGeneration (GraniteSpeechConfig model)
- Idefics2Config configuration class: Idefics2ForConditionalGeneration (Idefics2Config model)
- Idefics3Config configuration class: Idefics3ForConditionalGeneration (Idefics3Config model)
- IdeficsConfig configuration class: IdeficsForVisionText2Text (IdeficsConfig model)
- InstructBlipConfig configuration class: InstructBlipForConditionalGeneration (InstructBlipConfig model)
- InstructBlipVideoConfig configuration class: InstructBlipVideoForConditionalGeneration (InstructBlipVideoConfig model)
- InternVLConfig configuration class: InternVLForConditionalGeneration (InternVLConfig model)
- JanusConfig configuration class: JanusForConditionalGeneration (JanusConfig model)
- Kosmos2Config configuration class: Kosmos2ForConditionalGeneration (Kosmos2Config model)
- Kosmos2_5Config configuration class: Kosmos2_5ForConditionalGeneration (Kosmos2_5Config model)
- KyutaiSpeechToTextConfig configuration class: KyutaiSpeechToTextForConditionalGeneration (KyutaiSpeechToTextConfig model)
- Lfm2VlConfig configuration class: Lfm2VlForConditionalGeneration (Lfm2VlConfig model)
- LightOnOcrConfig configuration class: LightOnOcrForConditionalGeneration (LightOnOcrConfig model)
- Llama4Config configuration class: Llama4ForConditionalGeneration (Llama4Config model)
- LlavaConfig configuration class: LlavaForConditionalGeneration (LlavaConfig model)
- LlavaNextConfig configuration class: LlavaNextForConditionalGeneration (LlavaNextConfig model)
- LlavaNextVideoConfig configuration class: LlavaNextVideoForConditionalGeneration (LlavaNextVideoConfig model)
- LlavaOnevisionConfig configuration class: LlavaOnevisionForConditionalGeneration (LlavaOnevisionConfig model)
- Mistral3Config configuration class: Mistral3ForConditionalGeneration (Mistral3Config model)
- Mistral4Config configuration class: Mistral4ForCausalLM (Mistral4Config model)
- MllamaConfig configuration class: MllamaForConditionalGeneration (MllamaConfig model)
- Ovis2Config configuration class: Ovis2ForConditionalGeneration (Ovis2Config model)
- PI0Config configuration class: PI0ForConditionalGeneration (PI0Config model)
- PPChart2TableConfig configuration class: GotOcr2ForConditionalGeneration (PPChart2TableConfig model)
- PaddleOCRVLConfig configuration class: PaddleOCRVLForConditionalGeneration (PaddleOCRVLConfig model)
- PaliGemmaConfig configuration class: PaliGemmaForConditionalGeneration (PaliGemmaConfig model)
- PerceptionLMConfig configuration class: PerceptionLMForConditionalGeneration (PerceptionLMConfig model)
- Phi4MultimodalConfig configuration class: Phi4MultimodalForCausalLM (Phi4MultimodalConfig model)
- Pix2StructConfig configuration class: Pix2StructForConditionalGeneration (Pix2StructConfig model)
- Qwen2AudioConfig configuration class: Qwen2AudioForConditionalGeneration (Qwen2AudioConfig model)
- Qwen2VLConfig configuration class: Qwen2VLForConditionalGeneration (Qwen2VLConfig model)
- Qwen2_5OmniConfig configuration class: Qwen2_5OmniForConditionalGeneration (Qwen2_5OmniConfig model)
- Qwen2_5_VLConfig configuration class: Qwen2_5_VLForConditionalGeneration (Qwen2_5_VLConfig model)
- Qwen3OmniMoeConfig configuration class: Qwen3OmniMoeForConditionalGeneration (Qwen3OmniMoeConfig model)
- Qwen3VLConfig configuration class: Qwen3VLForConditionalGeneration (Qwen3VLConfig model)
- Qwen3VLMoeConfig configuration class: Qwen3VLMoeForConditionalGeneration (Qwen3VLMoeConfig model)
- Qwen3_5Config configuration class: Qwen3_5ForConditionalGeneration (Qwen3_5Config model)
- Qwen3_5MoeConfig configuration class: Qwen3_5MoeForConditionalGeneration (Qwen3_5MoeConfig model)
- ShieldGemma2Config configuration class: Gemma3ForConditionalGeneration (ShieldGemma2Config model)
- SmolVLMConfig configuration class: SmolVLMForConditionalGeneration (SmolVLMConfig model)
- T5Gemma2Config configuration class: T5Gemma2ForConditionalGeneration (T5Gemma2Config model)
- UdopConfig configuration class: UdopForConditionalGeneration (UdopConfig model)
- VibeVoiceAsrConfig configuration class: VibeVoiceAsrForConditionalGeneration (VibeVoiceAsrConfig model)
- VideoLlama3Config configuration class: VideoLlama3ForConditionalGeneration (VideoLlama3Config model)
- VideoLlavaConfig configuration class: VideoLlavaForConditionalGeneration (VideoLlavaConfig model)
- VipLlavaConfig configuration class: VipLlavaForConditionalGeneration (VipLlavaConfig model)
- VisionEncoderDecoderConfig configuration class: VisionEncoderDecoderModel (VisionEncoderDecoderConfig model)
- VoxtralConfig configuration class: VoxtralForConditionalGeneration (VoxtralConfig model)
- VoxtralRealtimeConfig configuration class: VoxtralRealtimeForConditionalGeneration (VoxtralRealtimeConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, SDPA is used if available for torch>=2.1.1; otherwise, the manual "eager" implementation is used.
Instantiates one of the model classes of the library (with a multimodal generation head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args, **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., do not try to download the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so code_revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be passed directly to the underlying model's __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model's __init__ function.
Instantiate one of the model classes of the library (with a multimodal generation head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- aria — AriaForConditionalGeneration (AriaConfig model)
- aya_vision — AyaVisionForConditionalGeneration (AyaVisionConfig model)
- blip — BlipForConditionalGeneration (BlipConfig model)
- blip-2 — Blip2ForConditionalGeneration (Blip2Config model)
- chameleon — ChameleonForConditionalGeneration (ChameleonConfig model)
- cohere2_vision — Cohere2VisionForConditionalGeneration (Cohere2VisionConfig model)
- deepseek_vl — DeepseekVLForConditionalGeneration (DeepseekVLConfig model)
- deepseek_vl_hybrid — DeepseekVLHybridForConditionalGeneration (DeepseekVLHybridConfig model)
- emu3 — Emu3ForConditionalGeneration (Emu3Config model)
- ernie4_5_vl_moe — Ernie4_5_VLMoeForConditionalGeneration (Ernie4_5_VLMoeConfig model)
- evolla — EvollaForProteinText2Text (EvollaConfig model)
- fast_vlm — FastVlmForConditionalGeneration (FastVlmConfig model)
- florence2 — Florence2ForConditionalGeneration (Florence2Config model)
- fuyu — FuyuForCausalLM (FuyuConfig model)
- gemma3 — Gemma3ForConditionalGeneration (Gemma3Config model)
- gemma3n — Gemma3nForConditionalGeneration (Gemma3nConfig model)
- gemma4 — Gemma4ForConditionalGeneration (Gemma4Config model)
- git — GitForCausalLM (GitConfig model)
- glm46v — Glm46VForConditionalGeneration (Glm46VConfig model)
- glm4v — Glm4vForConditionalGeneration (Glm4vConfig model)
- glm4v_moe — Glm4vMoeForConditionalGeneration (Glm4vMoeConfig model)
- glm_ocr — GlmOcrForConditionalGeneration (GlmOcrConfig model)
- glmasr — GlmAsrForConditionalGeneration (GlmAsrConfig model)
- got_ocr2 — GotOcr2ForConditionalGeneration (GotOcr2Config model)
- granite_speech — GraniteSpeechForConditionalGeneration (GraniteSpeechConfig model)
- idefics — IdeficsForVisionText2Text (IdeficsConfig model)
- idefics2 — Idefics2ForConditionalGeneration (Idefics2Config model)
- idefics3 — Idefics3ForConditionalGeneration (Idefics3Config model)
- instructblip — InstructBlipForConditionalGeneration (InstructBlipConfig model)
- instructblipvideo — InstructBlipVideoForConditionalGeneration (InstructBlipVideoConfig model)
- internvl — InternVLForConditionalGeneration (InternVLConfig model)
- janus — JanusForConditionalGeneration (JanusConfig model)
- kosmos-2 — Kosmos2ForConditionalGeneration (Kosmos2Config model)
- kosmos-2.5 — Kosmos2_5ForConditionalGeneration (Kosmos2_5Config model)
- kyutai_speech_to_text — KyutaiSpeechToTextForConditionalGeneration (KyutaiSpeechToTextConfig model)
- lfm2_vl — Lfm2VlForConditionalGeneration (Lfm2VlConfig model)
- lighton_ocr — LightOnOcrForConditionalGeneration (LightOnOcrConfig model)
- llama4 — Llama4ForConditionalGeneration (Llama4Config model)
- llava — LlavaForConditionalGeneration (LlavaConfig model)
- llava_next — LlavaNextForConditionalGeneration (LlavaNextConfig model)
- llava_next_video — LlavaNextVideoForConditionalGeneration (LlavaNextVideoConfig model)
- llava_onevision — LlavaOnevisionForConditionalGeneration (LlavaOnevisionConfig model)
- mistral3 — Mistral3ForConditionalGeneration (Mistral3Config model)
- mistral4 — Mistral4ForCausalLM (Mistral4Config model)
- mllama — MllamaForConditionalGeneration (MllamaConfig model)
- ovis2 — Ovis2ForConditionalGeneration (Ovis2Config model)
- paddleocr_vl — PaddleOCRVLForConditionalGeneration (PaddleOCRVLConfig model)
- paligemma — PaliGemmaForConditionalGeneration (PaliGemmaConfig model)
- perception_lm — PerceptionLMForConditionalGeneration (PerceptionLMConfig model)
- phi4_multimodal — Phi4MultimodalForCausalLM (Phi4MultimodalConfig model)
- pi0 — PI0ForConditionalGeneration (PI0Config model)
- pix2struct — Pix2StructForConditionalGeneration (Pix2StructConfig model)
- pp_chart2table — GotOcr2ForConditionalGeneration (PPChart2TableConfig model)
- qwen2_5_omni — Qwen2_5OmniForConditionalGeneration (Qwen2_5OmniConfig model)
- qwen2_5_vl — Qwen2_5_VLForConditionalGeneration (Qwen2_5_VLConfig model)
- qwen2_audio — Qwen2AudioForConditionalGeneration (Qwen2AudioConfig model)
- qwen2_vl — Qwen2VLForConditionalGeneration (Qwen2VLConfig model)
- qwen3_5 — Qwen3_5ForConditionalGeneration (Qwen3_5Config model)
- qwen3_5_moe — Qwen3_5MoeForConditionalGeneration (Qwen3_5MoeConfig model)
- qwen3_omni_moe — Qwen3OmniMoeForConditionalGeneration (Qwen3OmniMoeConfig model)
- qwen3_vl — Qwen3VLForConditionalGeneration (Qwen3VLConfig model)
- qwen3_vl_moe — Qwen3VLMoeForConditionalGeneration (Qwen3VLMoeConfig model)
- shieldgemma2 — Gemma3ForConditionalGeneration (ShieldGemma2Config model)
- smolvlm — SmolVLMForConditionalGeneration (SmolVLMConfig model)
- t5gemma2 — T5Gemma2ForConditionalGeneration (T5Gemma2Config model)
- udop — UdopForConditionalGeneration (UdopConfig model)
- vibevoice_asr — VibeVoiceAsrForConditionalGeneration (VibeVoiceAsrConfig model)
- video_llama_3 — VideoLlama3ForConditionalGeneration (VideoLlama3Config model)
- video_llava — VideoLlavaForConditionalGeneration (VideoLlavaConfig model)
- vipllava — VipLlavaForConditionalGeneration (VipLlavaConfig model)
- vision-encoder-decoder — VisionEncoderDecoderModel (VisionEncoderDecoderConfig model)
- voxtral — VoxtralForConditionalGeneration (VoxtralConfig model)
- voxtral_realtime — VoxtralRealtimeForConditionalGeneration (VoxtralRealtimeConfig model)
The model is set in evaluation mode by default using model.eval() (so, for instance, dropout modules are deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForMultimodalLM
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForMultimodalLM.from_pretrained("llava-hf/llava-1.5-7b-hf")
>>> # Update configuration during loading
>>> model = AutoModelForMultimodalLM.from_pretrained("llava-hf/llava-1.5-7b-hf", output_attentions=True)
>>> model.config.output_attentions
True

AutoModelForTableQuestionAnswering
This is a generic model class that will be instantiated as one of the model classes of the library (with a table question answering head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- TapasConfig configuration class: TapasForQuestionAnswering (TapasConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, SDPA is used if available for torch>=2.1.1; otherwise, the manual "eager" implementation is used.
Instantiates one of the model classes of the library (with a table question answering head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args, **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., do not try to download the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so code_revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be passed directly to the underlying model's __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model's __init__ function.
Instantiate one of the model classes of the library (with a table question answering head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- tapas — TapasForQuestionAnswering (TapasConfig model)
The model is set in evaluation mode by default using model.eval() (so, for instance, dropout modules are deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForTableQuestionAnswering
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForTableQuestionAnswering.from_pretrained("google/tapas-base-finetuned-wtq")
>>> # Update configuration during loading
>>> model = AutoModelForTableQuestionAnswering.from_pretrained("google/tapas-base-finetuned-wtq", output_attentions=True)
>>> model.config.output_attentions
True

AutoModelForDocumentQuestionAnswering
This is a generic model class that will be instantiated as one of the model classes of the library (with a document question answering head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- LayoutLMConfig configuration class: LayoutLMForQuestionAnswering (LayoutLMConfig model)
- LayoutLMv2Config configuration class: LayoutLMv2ForQuestionAnswering (LayoutLMv2Config model)
- LayoutLMv3Config configuration class: LayoutLMv3ForQuestionAnswering (LayoutLMv3Config model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, SDPA is used if available for torch>=2.1.1; otherwise, the manual "eager" implementation is used.
Instantiates one of the model classes of the library (with a document question answering head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
Examples:
>>> from transformers import AutoConfig, AutoModelForDocumentQuestionAnswering
>>> # Download configuration from huggingface.co and cache.
>>> config = AutoConfig.from_pretrained("impira/layoutlm-document-qa", revision="52e01b3")
>>> model = AutoModelForDocumentQuestionAnswering.from_config(config)

from_pretrained
< source >( *model_args, **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., do not try to download the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so code_revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be passed directly to the underlying model's __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model's __init__ function.
Instantiate one of the model classes of the library (with a document question answering head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- layoutlm — LayoutLMForQuestionAnswering (LayoutLMConfig model)
- layoutlmv2 — LayoutLMv2ForQuestionAnswering (LayoutLMv2Config model)
- layoutlmv3 — LayoutLMv3ForQuestionAnswering (LayoutLMv3Config model)
The model is set in evaluation mode by default using model.eval() (so, for instance, dropout modules are deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForDocumentQuestionAnswering
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForDocumentQuestionAnswering.from_pretrained("impira/layoutlm-document-qa", revision="52e01b3")
>>> # Update configuration during loading
>>> model = AutoModelForDocumentQuestionAnswering.from_pretrained("impira/layoutlm-document-qa", revision="52e01b3", output_attentions=True)
>>> model.config.output_attentions
True

AutoModelForVisualQuestionAnswering
This is a generic model class that will be instantiated as one of the model classes of the library (with a visual question answering head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- Blip2Config configuration class: Blip2ForConditionalGeneration (Blip2Config model)
- BlipConfig configuration class: BlipForQuestionAnswering (BlipConfig model)
- ViltConfig configuration class: ViltForQuestionAnswering (ViltConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, SDPA is used if available for torch>=2.1.1; otherwise, the manual "eager" implementation is used.
Instantiates one of the model classes of the library (with a visual question answering head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args, **kwargs )
Parameters
- pretrained_model_name_or_path (str or os.PathLike) — Can be either:
  - A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
  - A path to a directory containing model weights saved using save_pretrained(), e.g., ./my_model_directory/.
- model_args (additional positional arguments, optional) — Will be passed along to the underlying model __init__() method.
- config (PreTrainedConfig, optional) — Configuration for the model to use instead of an automatically loaded configuration. Configuration can be automatically loaded when:
  - The model is a model provided by the library (loaded with the model id string of a pretrained model).
  - The model was saved using save_pretrained() and is reloaded by supplying the save directory.
  - The model is loaded by supplying a local directory as pretrained_model_name_or_path and a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) — A state dictionary to use instead of a state dictionary loaded from the saved weights file. This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case, though, you should check whether using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (str or os.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used.
- force_download (bool, optional, defaults to False) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist.
- proxies (dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g., {'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request.
- output_loading_info (bool, optional, defaults to False) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages.
- local_files_only (bool, optional, defaults to False) — Whether or not to only look at local files (i.e., do not try to download the model).
- revision (str, optional, defaults to "main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so revision can be any identifier allowed by git.
- trust_remote_code (bool, optional, defaults to False) — Whether or not to allow custom models defined on the Hub in their own modeling files. This option should only be set to True for repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine.
- code_revision (str, optional, defaults to "main") — The specific revision to use for the code on the Hub, if the code lives in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, so code_revision can be any identifier allowed by git.
- kwargs (additional keyword arguments, optional) — Can be used to update the configuration object (after it has been loaded) and to initialize the model (e.g., output_attentions=True). Behaves differently depending on whether a config is provided or automatically loaded:
  - If a configuration is provided with config, **kwargs will be passed directly to the underlying model's __init__ method (we assume all relevant updates to the configuration have already been done).
  - If a configuration is not provided, kwargs will first be passed to the configuration class initialization function (from_pretrained()). Each key of kwargs that corresponds to a configuration attribute will be used to override said attribute with the supplied kwargs value. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model's __init__ function.
Instantiate one of the model classes of the library (with a visual question answering head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- blip — BlipForQuestionAnswering (BlipConfig model)
- blip-2 — Blip2ForConditionalGeneration (Blip2Config model)
- vilt — ViltForQuestionAnswering (ViltConfig model)
The model is set in evaluation mode by default using model.eval() (so, for instance, dropout modules are deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForVisualQuestionAnswering
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForVisualQuestionAnswering.from_pretrained("dandelin/vilt-b32-finetuned-vqa")
>>> # Update configuration during loading
>>> model = AutoModelForVisualQuestionAnswering.from_pretrained("dandelin/vilt-b32-finetuned-vqa", output_attentions=True)
>>> model.config.output_attentions
True

AutoModelForImageTextToText
This is a generic model class that will be instantiated as one of the model classes of the library (with an image-text-to-text modeling head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- AriaConfig configuration class: AriaForConditionalGeneration (AriaConfig model)
- AyaVisionConfig configuration class: AyaVisionForConditionalGeneration (AyaVisionConfig model)
- Blip2Config configuration class: Blip2ForConditionalGeneration (Blip2Config model)
- BlipConfig configuration class: BlipForConditionalGeneration (BlipConfig model)
- ChameleonConfig configuration class: ChameleonForConditionalGeneration (ChameleonConfig model)
- Cohere2VisionConfig configuration class: Cohere2VisionForConditionalGeneration (Cohere2VisionConfig model)
- DeepseekVLConfig configuration class: DeepseekVLForConditionalGeneration (DeepseekVLConfig model)
- DeepseekVLHybridConfig configuration class: DeepseekVLHybridForConditionalGeneration (DeepseekVLHybridConfig model)
- Emu3Config configuration class: Emu3ForConditionalGeneration (Emu3Config model)
- Ernie4_5_VLMoeConfig configuration class: Ernie4_5_VLMoeForConditionalGeneration (Ernie4_5_VLMoeConfig model)
- EvollaConfig configuration class: EvollaForProteinText2Text (EvollaConfig model)
- FastVlmConfig configuration class: FastVlmForConditionalGeneration (FastVlmConfig model)
- Florence2Config configuration class: Florence2ForConditionalGeneration (Florence2Config model)
- FuyuConfig configuration class: FuyuForCausalLM (FuyuConfig model)
- Gemma3Config configuration class: Gemma3ForConditionalGeneration (Gemma3Config model)
- Gemma3nConfig configuration class: Gemma3nForConditionalGeneration (Gemma3nConfig model)
- Gemma4Config configuration class: Gemma4ForConditionalGeneration (Gemma4Config model)
- GitConfig configuration class: GitForCausalLM (GitConfig model)
- Glm46VConfig configuration class: Glm46VForConditionalGeneration (Glm46VConfig model)
- Glm4vConfig configuration class: Glm4vForConditionalGeneration (Glm4vConfig model)
- Glm4vMoeConfig configuration class: Glm4vMoeForConditionalGeneration (Glm4vMoeConfig model)
- GlmOcrConfig configuration class: GlmOcrForConditionalGeneration (GlmOcrConfig model)
- GotOcr2Config configuration class: GotOcr2ForConditionalGeneration (GotOcr2Config model)
- Idefics2Config configuration class: Idefics2ForConditionalGeneration (Idefics2Config model)
- Idefics3Config configuration class: Idefics3ForConditionalGeneration (Idefics3Config model)
- IdeficsConfig configuration class: IdeficsForVisionText2Text (IdeficsConfig model)
- InstructBlipConfig configuration class: InstructBlipForConditionalGeneration (InstructBlipConfig model)
- InstructBlipVideoConfig configuration class: InstructBlipVideoForConditionalGeneration (InstructBlipVideoConfig model)
- InternVLConfig configuration class: InternVLForConditionalGeneration (InternVLConfig model)
- JanusConfig configuration class: JanusForConditionalGeneration (JanusConfig model)
- Kosmos2Config configuration class: Kosmos2ForConditionalGeneration (Kosmos2Config model)
- Kosmos2_5Config configuration class: Kosmos2_5ForConditionalGeneration (Kosmos2_5Config model)
- Lfm2VlConfig configuration class: Lfm2VlForConditionalGeneration (Lfm2VlConfig model)
- LightOnOcrConfig configuration class: LightOnOcrForConditionalGeneration (LightOnOcrConfig model)
- Llama4Config configuration class: Llama4ForConditionalGeneration (Llama4Config model)
- LlavaConfig configuration class: LlavaForConditionalGeneration (LlavaConfig model)
- LlavaNextConfig configuration class: LlavaNextForConditionalGeneration (LlavaNextConfig model)
- LlavaNextVideoConfig configuration class: LlavaNextVideoForConditionalGeneration (LlavaNextVideoConfig model)
- LlavaOnevisionConfig configuration class: LlavaOnevisionForConditionalGeneration (LlavaOnevisionConfig model)
- Mistral3Config configuration class: Mistral3ForConditionalGeneration (Mistral3Config model)
- Mistral4Config configuration class: Mistral4ForCausalLM (Mistral4Config model)
- MllamaConfig configuration class: MllamaForConditionalGeneration (MllamaConfig model)
- Ovis2Config configuration class: Ovis2ForConditionalGeneration (Ovis2Config model)
- PI0Config configuration class: PI0ForConditionalGeneration (PI0Config model)
- PPChart2TableConfig configuration class: GotOcr2ForConditionalGeneration (PPChart2TableConfig model)
- PaddleOCRVLConfig configuration class: PaddleOCRVLForConditionalGeneration (PaddleOCRVLConfig model)
- PaliGemmaConfig configuration class: PaliGemmaForConditionalGeneration (PaliGemmaConfig model)
- PerceptionLMConfig configuration class: PerceptionLMForConditionalGeneration (PerceptionLMConfig model)
- Pix2StructConfig configuration class: Pix2StructForConditionalGeneration (Pix2StructConfig model)
- Qwen2VLConfig configuration class: Qwen2VLForConditionalGeneration (Qwen2VLConfig model)
- Qwen2_5_VLConfig configuration class: Qwen2_5_VLForConditionalGeneration (Qwen2_5_VLConfig model)
- Qwen3VLConfig configuration class: Qwen3VLForConditionalGeneration (Qwen3VLConfig model)
- Qwen3VLMoeConfig configuration class: Qwen3VLMoeForConditionalGeneration (Qwen3VLMoeConfig model)
- Qwen3_5Config configuration class: Qwen3_5ForConditionalGeneration (Qwen3_5Config model)
- Qwen3_5MoeConfig configuration class: Qwen3_5MoeForConditionalGeneration (Qwen3_5MoeConfig model)
- ShieldGemma2Config configuration class: Gemma3ForConditionalGeneration (ShieldGemma2Config model)
- SmolVLMConfig configuration class: SmolVLMForConditionalGeneration (SmolVLMConfig model)
- T5Gemma2Config configuration class: T5Gemma2ForConditionalGeneration (T5Gemma2Config model)
- UdopConfig configuration class: UdopForConditionalGeneration (UdopConfig model)
- VideoLlama3Config configuration class: VideoLlama3ForConditionalGeneration (VideoLlama3Config model)
- VideoLlavaConfig configuration class: VideoLlavaForConditionalGeneration (VideoLlavaConfig model)
- VipLlavaConfig configuration class: VipLlavaForConditionalGeneration (VipLlavaConfig model)
- VisionEncoderDecoderConfig configuration class: VisionEncoderDecoderModel (VisionEncoderDecoderConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. Otherwise, the default is the manual "eager" implementation.
Instantiates one of the model classes of the library (with an image-text-to-text modeling head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
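The configuration-class dispatch listed above boils down to a mapping lookup from config class to model class. A minimal sketch (simplified toy mappings; the real implementation uses transformers' lazy auto mappings):

```python
# Simplified sketch of config-class -> model-class dispatch (not the real
# transformers implementation, which uses lazy mappings).
class NewModelConfig:
    model_type = "new-model"

class NewModel:
    def __init__(self, config):
        self.config = config

CONFIG_MAPPING = {}   # model_type string -> config class
MODEL_MAPPING = {}    # config class -> model class

def register(model_type, config_cls, model_cls):
    CONFIG_MAPPING[model_type] = config_cls
    MODEL_MAPPING[config_cls] = model_cls

register("new-model", NewModelConfig, NewModel)

def from_config(config):
    # Select the model class from the configuration's class, as from_config does.
    return MODEL_MAPPING[type(config)](config)

model = from_config(NewModelConfig())
# model is a NewModel instance built from the config, with no weights loaded
```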
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (
stroros.PathLike) — Can be either:- A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
- A path to a directory containing model weights saved using
save_pretrained(), e.g.,
./my_model_directory/.
- model_args (additional positional arguments, optional) —
Will be passed along to the underlying model
__init__()method. - config (PreTrainedConfig, optional) —
Configuration for the model to use instead of an automatically loaded configuration. Configuration can
be automatically loaded when:
- The model is a model provided by the library (loaded with the model id string of a pretrained model).
- The model was saved using save_pretrained() and is reloaded by supplying the save directory.
- The model is loaded by supplying a local directory as
pretrained_model_name_or_pathand a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) —
A state dictionary to use instead of a state dictionary loaded from saved weights file.
This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case though, you should check if using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (
stroros.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used. - force_download (
bool, optional, defaults toFalse) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist. - proxies (
dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g.,{'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request. - output_loading_info(
bool, optional, defaults toFalse) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages. - local_files_only(
bool, optional, defaults toFalse) — Whether or not to only look at local files (e.g., not try downloading the model). - revision (
str, optional, defaults to"main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, sorevisioncan be any identifier allowed by git. - trust_remote_code (
bool, optional, defaults toFalse) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set toTruefor repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine. - code_revision (
str, optional, defaults to"main") — The specific revision to use for the code on the Hub, if the code leaves in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, sorevisioncan be any identifier allowed by git. - kwargs (additional keyword arguments, optional) —
Can be used to update the configuration object (after it being loaded) and initiate the model (e.g.,
output_attentions=True). Behaves differently depending on whether aconfigis provided or automatically loaded:- If a configuration is provided with
config,**kwargswill be directly passed to the underlying model’s__init__method (we assume all relevant updates to the configuration have already been done) - If a configuration is not provided,
kwargswill be first passed to the configuration class initialization function (from_pretrained()). Each key ofkwargsthat corresponds to a configuration attribute will be used to override said attribute with the suppliedkwargsvalue. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s__init__function.
- If a configuration is provided with
Instantiate one of the model classes of the library (with an image-text-to-text modeling head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- aria — AriaForConditionalGeneration (AriaConfig model)
- aya_vision — AyaVisionForConditionalGeneration (AyaVisionConfig model)
- blip — BlipForConditionalGeneration (BlipConfig model)
- blip-2 — Blip2ForConditionalGeneration (Blip2Config model)
- chameleon — ChameleonForConditionalGeneration (ChameleonConfig model)
- cohere2_vision — Cohere2VisionForConditionalGeneration (Cohere2VisionConfig model)
- deepseek_vl — DeepseekVLForConditionalGeneration (DeepseekVLConfig model)
- deepseek_vl_hybrid — DeepseekVLHybridForConditionalGeneration (DeepseekVLHybridConfig model)
- emu3 — Emu3ForConditionalGeneration (Emu3Config model)
- ernie4_5_vl_moe — Ernie4_5_VLMoeForConditionalGeneration (Ernie4_5_VLMoeConfig model)
- evolla — EvollaForProteinText2Text (EvollaConfig model)
- fast_vlm — FastVlmForConditionalGeneration (FastVlmConfig model)
- florence2 — Florence2ForConditionalGeneration (Florence2Config model)
- fuyu — FuyuForCausalLM (FuyuConfig model)
- gemma3 — Gemma3ForConditionalGeneration (Gemma3Config model)
- gemma3n — Gemma3nForConditionalGeneration (Gemma3nConfig model)
- gemma4 — Gemma4ForConditionalGeneration (Gemma4Config model)
- git — GitForCausalLM (GitConfig model)
- glm46v — Glm46VForConditionalGeneration (Glm46VConfig model)
- glm4v — Glm4vForConditionalGeneration (Glm4vConfig model)
- glm4v_moe — Glm4vMoeForConditionalGeneration (Glm4vMoeConfig model)
- glm_ocr — GlmOcrForConditionalGeneration (GlmOcrConfig model)
- got_ocr2 — GotOcr2ForConditionalGeneration (GotOcr2Config model)
- idefics — IdeficsForVisionText2Text (IdeficsConfig model)
- idefics2 — Idefics2ForConditionalGeneration (Idefics2Config model)
- idefics3 — Idefics3ForConditionalGeneration (Idefics3Config model)
- instructblip — InstructBlipForConditionalGeneration (InstructBlipConfig model)
- instructblipvideo — InstructBlipVideoForConditionalGeneration (InstructBlipVideoConfig model)
- internvl — InternVLForConditionalGeneration (InternVLConfig model)
- janus — JanusForConditionalGeneration (JanusConfig model)
- kosmos-2 — Kosmos2ForConditionalGeneration (Kosmos2Config model)
- kosmos-2.5 — Kosmos2_5ForConditionalGeneration (Kosmos2_5Config model)
- lfm2_vl — Lfm2VlForConditionalGeneration (Lfm2VlConfig model)
- lighton_ocr — LightOnOcrForConditionalGeneration (LightOnOcrConfig model)
- llama4 — Llama4ForConditionalGeneration (Llama4Config model)
- llava — LlavaForConditionalGeneration (LlavaConfig model)
- llava_next — LlavaNextForConditionalGeneration (LlavaNextConfig model)
- llava_next_video — LlavaNextVideoForConditionalGeneration (LlavaNextVideoConfig model)
- llava_onevision — LlavaOnevisionForConditionalGeneration (LlavaOnevisionConfig model)
- mistral3 — Mistral3ForConditionalGeneration (Mistral3Config model)
- mistral4 — Mistral4ForCausalLM (Mistral4Config model)
- mllama — MllamaForConditionalGeneration (MllamaConfig model)
- ovis2 — Ovis2ForConditionalGeneration (Ovis2Config model)
- paddleocr_vl — PaddleOCRVLForConditionalGeneration (PaddleOCRVLConfig model)
- paligemma — PaliGemmaForConditionalGeneration (PaliGemmaConfig model)
- perception_lm — PerceptionLMForConditionalGeneration (PerceptionLMConfig model)
- pi0 — PI0ForConditionalGeneration (PI0Config model)
- pix2struct — Pix2StructForConditionalGeneration (Pix2StructConfig model)
- pp_chart2table — GotOcr2ForConditionalGeneration (PPChart2TableConfig model)
- qwen2_5_vl — Qwen2_5_VLForConditionalGeneration (Qwen2_5_VLConfig model)
- qwen2_vl — Qwen2VLForConditionalGeneration (Qwen2VLConfig model)
- qwen3_5 — Qwen3_5ForConditionalGeneration (Qwen3_5Config model)
- qwen3_5_moe — Qwen3_5MoeForConditionalGeneration (Qwen3_5MoeConfig model)
- qwen3_vl — Qwen3VLForConditionalGeneration (Qwen3VLConfig model)
- qwen3_vl_moe — Qwen3VLMoeForConditionalGeneration (Qwen3VLMoeConfig model)
- shieldgemma2 — Gemma3ForConditionalGeneration (ShieldGemma2Config model)
- smolvlm — SmolVLMForConditionalGeneration (SmolVLMConfig model)
- t5gemma2 — T5Gemma2ForConditionalGeneration (T5Gemma2Config model)
- udop — UdopForConditionalGeneration (UdopConfig model)
- video_llama_3 — VideoLlama3ForConditionalGeneration (VideoLlama3Config model)
- video_llava — VideoLlavaForConditionalGeneration (VideoLlavaConfig model)
- vipllava — VipLlavaForConditionalGeneration (VipLlavaConfig model)
- vision-encoder-decoder — VisionEncoderDecoderModel (VisionEncoderDecoderConfig model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
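When the config carries no model_type, the name-based fallback amounts to matching the keys listed above against pretrained_model_name_or_path. A rough sketch over a small subset of the keys (illustrative only; transformers checks its full mapping):

```python
# Illustrative fallback: match known model_type keys against the name/path.
# A small subset of the keys above, for demonstration.
KNOWN_TYPES = ["llava_next_video", "llava_next", "llava", "blip-2", "blip"]

def guess_model_type(name_or_path):
    # Check longer keys first so "llava_next" is not mistaken for "llava".
    for key in sorted(KNOWN_TYPES, key=len, reverse=True):
        if key in name_or_path:
            return key
    return None

guess_model_type("llava-hf/llava-1.5-7b-hf")    # returns "llava"
guess_model_type("my-org/llava_next-finetune")  # returns "llava_next"
```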
Examples:
>>> from transformers import AutoConfig, AutoModelForImageTextToText
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForImageTextToText.from_pretrained("llava-hf/llava-1.5-7b-hf")
>>> # Update configuration during loading
>>> model = AutoModelForImageTextToText.from_pretrained("llava-hf/llava-1.5-7b-hf", output_attentions=True)
>>> model.config.output_attentions
True

Time Series
AutoModelForTimeSeriesPrediction
This is a generic model class that will be instantiated as one of the model classes of the library (with a time-series prediction head) when created with the from_pretrained() class method or the from_config() class method.
This class cannot be instantiated directly using __init__() (throws an error).
from_config
< source >( **kwargs )
Parameters
- config (PreTrainedConfig) —
The model class to instantiate is selected based on the configuration class:
- TimesFm2_5Config configuration class: TimesFm2_5ModelForPrediction (TimesFm2_5Config model)
- TimesFmConfig configuration class: TimesFmModelForPrediction (TimesFmConfig model)
- attn_implementation (str, optional) — The attention implementation to use in the model (if relevant). Can be any of "eager" (manual implementation of the attention), "sdpa" (using F.scaled_dot_product_attention), "flash_attention_2" (using Dao-AILab/flash-attention), or "flash_attention_3" (using Dao-AILab/flash-attention/hopper). By default, if available, SDPA will be used for torch>=2.1.1. Otherwise, the default is the manual "eager" implementation.
Instantiates one of the model classes of the library (with a time-series prediction head) from a configuration.
Note: Loading a model from its configuration file does not load the model weights. It only affects the model’s configuration. Use from_pretrained() to load the model weights.
from_pretrained
< source >( *model_args **kwargs )
Parameters
- pretrained_model_name_or_path (
stroros.PathLike) — Can be either:- A string, the model id of a pretrained model hosted inside a model repo on huggingface.co.
- A path to a directory containing model weights saved using
save_pretrained(), e.g.,
./my_model_directory/.
- model_args (additional positional arguments, optional) —
Will be passed along to the underlying model
__init__()method. - config (PreTrainedConfig, optional) —
Configuration for the model to use instead of an automatically loaded configuration. Configuration can
be automatically loaded when:
- The model is a model provided by the library (loaded with the model id string of a pretrained model).
- The model was saved using save_pretrained() and is reloaded by supplying the save directory.
- The model is loaded by supplying a local directory as
pretrained_model_name_or_pathand a configuration JSON file named config.json is found in the directory.
- state_dict (dict[str, torch.Tensor], optional) —
A state dictionary to use instead of a state dictionary loaded from saved weights file.
This option can be used if you want to create a model from a pretrained configuration but load your own weights. In this case though, you should check if using save_pretrained() and from_pretrained() is not a simpler option.
- cache_dir (
stroros.PathLike, optional) — Path to a directory in which a downloaded pretrained model configuration should be cached if the standard cache should not be used. - force_download (
bool, optional, defaults toFalse) — Whether or not to force the (re-)download of the model weights and configuration files, overriding the cached versions if they exist. - proxies (
dict[str, str], optional) — A dictionary of proxy servers to use by protocol or endpoint, e.g.,{'http': 'foo.bar:3128', 'http://hostname': 'foo.bar:4012'}. The proxies are used on each request. - output_loading_info(
bool, optional, defaults toFalse) — Whether or not to also return a dictionary containing missing keys, unexpected keys and error messages. - local_files_only(
bool, optional, defaults toFalse) — Whether or not to only look at local files (e.g., not try downloading the model). - revision (
str, optional, defaults to"main") — The specific model version to use. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, sorevisioncan be any identifier allowed by git. - trust_remote_code (
bool, optional, defaults toFalse) — Whether or not to allow for custom models defined on the Hub in their own modeling files. This option should only be set toTruefor repositories you trust and in which you have read the code, as it will execute code present on the Hub on your local machine. - code_revision (
str, optional, defaults to"main") — The specific revision to use for the code on the Hub, if the code leaves in a different repository than the rest of the model. It can be a branch name, a tag name, or a commit id, since we use a git-based system for storing models and other artifacts on huggingface.co, sorevisioncan be any identifier allowed by git. - kwargs (additional keyword arguments, optional) —
Can be used to update the configuration object (after it being loaded) and initiate the model (e.g.,
output_attentions=True). Behaves differently depending on whether aconfigis provided or automatically loaded:- If a configuration is provided with
config,**kwargswill be directly passed to the underlying model’s__init__method (we assume all relevant updates to the configuration have already been done) - If a configuration is not provided,
kwargswill be first passed to the configuration class initialization function (from_pretrained()). Each key ofkwargsthat corresponds to a configuration attribute will be used to override said attribute with the suppliedkwargsvalue. Remaining keys that do not correspond to any configuration attribute will be passed to the underlying model’s__init__function.
- If a configuration is provided with
Instantiate one of the model classes of the library (with a time-series prediction head) from a pretrained model.
The model class to instantiate is selected based on the model_type property of the config object (either
passed as an argument or loaded from pretrained_model_name_or_path if possible), or when it’s missing, by
falling back to using pattern matching on pretrained_model_name_or_path:
- timesfm — TimesFmModelForPrediction (TimesFmConfig model)
- timesfm2_5 — TimesFm2_5ModelForPrediction (TimesFm2_5Config model)
The model is set in evaluation mode by default using model.eval() (so for instance, dropout modules are
deactivated). To train the model, you should first set it back in training mode with model.train().
Examples:
>>> from transformers import AutoConfig, AutoModelForTimeSeriesPrediction
>>> # Download model and configuration from huggingface.co and cache.
>>> model = AutoModelForTimeSeriesPrediction.from_pretrained("google/timesfm-2.0-500m-pytorch")
>>> # Update configuration during loading
>>> model = AutoModelForTimeSeriesPrediction.from_pretrained("google/timesfm-2.0-500m-pytorch", output_attentions=True)
>>> model.config.output_attentions
True