AlbertForMaskedLM,4
AllenaiLongformerBase,4
BartForCausalLM,4
BertForMaskedLM,16
BigBird,32
BlenderbotForCausalLM,32
DebertaV2ForMaskedLM,16
DistilBertForMaskedLM,128
DistillGPT2,16
ElectraForCausalLM,8
GoogleFnet,16
GPT2ForSequenceClassification,4
LayoutLMForMaskedLM,16
M2M100ForConditionalGeneration,16
MBartForCausalLM,4
MegatronBertForCausalLM,4
MobileBertForMaskedLM,64
MT5ForConditionalGeneration,16
OPTForCausalLM,2
PegasusForCausalLM,32
PLBartForCausalLM,8
RobertaForCausalLM,16
T5ForConditionalGeneration,4
T5Small,1
TrOCRForCausalLM,32
XGLMForCausalLM,8
XLNetLMHeadModel,8
YituTechConvBert,16