Optimum RBLN¶
Optimum RBLN serves as a bridge connecting the HuggingFace transformers
/diffusers
libraries to RBLN NPUs, i.e. ATOM (RBLN-CA02
) and ATOM+ (RBLN-CA12
). It offers a set of tools that enable easy model compilation and inference for both single and multi-NPU (Rebellions Scalable Design) configurations across a range of downstream tasks. The following table presents the comprehensive lineup of models currently supported by Optimum RBLN.
Transformers¶
Single NPU¶
- BertForQuestionAnswering | | | BERT-large | - BertForMaskedLM
- BertForQuestionAnswering | | | DistilBERT-base | DistilBertForQuestionAnswering | | | SecureBERT | RobertaForMaskedLM | | | RoBERTa | RobertaForSequenceClassification | | | BGE-M3 | XLMRobertaModel | | | BGE-Reranker-V2-M3 | XLMRobertaForSequenceClassification | | | BGE-Reranker-Base | XLMRobertaForSequenceClassification | | | BGE-Reranker-Large | XLMRobertaForSequenceClassification | | | Ko-Reranker | XLMRobertaForSequenceClassification | | | Whisper-tiny | WhisperForConditionalGeneration | | | Whisper-base | WhisperForConditionalGeneration | | | Whisper-small | WhisperForConditionalGeneration | | | Whisper-medium | WhisperForConditionalGeneration | | | Whisper-large-v3 | WhisperForConditionalGeneration | | | Whisper-large-v3-turbo | WhisperForConditionalGeneration | | | Wav2Vec2 | Wav2Vec2ForCTC | | | Audio-Spectogram-Transformer | ASTForAudioClassification | | | DPT-large | DPTForDepthEstimation | | | ViT-large | ViTForImageClassification | | | ResNet50 | ResNetForImageClassification | |
Multi-NPU (RSD)¶
Note
Rebellions Scalable Design (RSD) is only available on ATOM+ (RBLN-CA12
). You can check the type of your current RBLN NPU using the rbln-stat
command.
Diffusers¶
Note
Models marked with a superscript, †
, require more than one ATOM due to their large weight size exceeding the capacity of a single ATOM. This necessitates dividing the model's modules across multiple ATOMs. For detailed information regarding the specific module distribution, please refer to the model code.
- StableDiffusionPipeline
- StableDiffusionImg2ImgPipeline
- StableDiffusionInpaintPipeline
- StableDiffusionPipeline
- StableDiffusion3Pipeline
- StableDiffusion3Img2ImgPipeline
- StableDiffusion3InpaintPipeline
- StableDiffusionXLPipeline
- StableDiffusionXLImg2ImgPipeline
- StableDiffusionXLInpaintPipeline
- StableDiffusionXLPipeline
- StableDiffusionXLPipeline
- StableDiffusionXLImg2ImgPipeline
- StableDiffusionControlNetPipeline
- StableDiffusionControlNetImg2ImgPipeline
- StableDiffusionXLControlNetPipeline
- StableDiffusionXLControlNetImg2ImgPipeline
- KandinskyV22InpaintCombinedPipeline
- KandinskyV22PriorPipeline
- KandinskyV22InpaintPipeline