
Optimum RBLN

Optimum RBLN serves as a bridge connecting the HuggingFace transformers/diffusers libraries to RBLN NPUs, i.e., ATOM (RBLN-CA02) and ATOM+ (RBLN-CA12). It offers a set of tools that enable easy model compilation and inference for both single-NPU and multi-NPU (Rebellions Scalable Design) configurations across a range of downstream tasks. The following tables present the comprehensive lineup of models currently supported by Optimum RBLN.
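As a rough sketch of the compilation workflow (the `RBLN`-prefixed class name follows optimum-rbln's naming convention; the checkpoint, the `rbln_max_seq_len` value, and the output directory below are illustrative assumptions, not requirements):

```python
from optimum.rbln import RBLNGPT2LMHeadModel

# Compile the Hugging Face checkpoint for a single RBLN NPU.
# export=True triggers compilation; rbln_max_seq_len sets the static
# sequence length used by the compiled graph (value here is illustrative).
model = RBLNGPT2LMHeadModel.from_pretrained(
    "gpt2",
    export=True,
    rbln_max_seq_len=1024,
)

# Save the compiled artifacts so later runs can skip recompilation.
model.save_pretrained("gpt2-rbln")
```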

Transformers

Single NPU

| Model | Model Architecture | Task |
| :---: | :-----------------------------: | :-----: |
| Phi-2 | PhiForCausalLM | |
| Gemma-2b | GemmaForCausalLM | |
| GPT2 | GPT2LMHeadModel | |
| GPT2-medium | GPT2LMHeadModel | |
| GPT2-large | GPT2LMHeadModel | |
| GPT2-xl | GPT2LMHeadModel | |
| T5-small | T5ForConditionalGeneration | |
| T5-base | T5ForConditionalGeneration | |
| T5-large | T5ForConditionalGeneration | |
| T5-3b | T5ForConditionalGeneration | |
| BART-base | BartForConditionalGeneration | |
| BART-large | BartForConditionalGeneration | |
| KoBART-base | BartForConditionalGeneration | |
| E5-base-4K | BertModel | |
| LaBSE | BertModel | |
| KR-SBERT-V40K-klueNLI-augSTS | BertModel | |
| BERT-base | BertForMaskedLM<br>BertForQuestionAnswering | |
| BERT-large | BertForMaskedLM<br>BertForQuestionAnswering | |
| DistilBERT-base | DistilBertForQuestionAnswering | |
| SecureBERT | RobertaForMaskedLM | |
| RoBERTa | RobertaForSequenceClassification | |
| BGE-M3 | XLMRobertaModel | |
| BGE-Reranker-V2-M3 | XLMRobertaForSequenceClassification | |
| BGE-Reranker-Base | XLMRobertaForSequenceClassification | |
| BGE-Reranker-Large | XLMRobertaForSequenceClassification | |
| Ko-Reranker | XLMRobertaForSequenceClassification | |
| Whisper-tiny | WhisperForConditionalGeneration | |
| Whisper-base | WhisperForConditionalGeneration | |
| Whisper-small | WhisperForConditionalGeneration | |
| Whisper-medium | WhisperForConditionalGeneration | |
| Whisper-large-v3 | WhisperForConditionalGeneration | |
| Whisper-large-v3-turbo | WhisperForConditionalGeneration | |
| Wav2Vec2 | Wav2Vec2ForCTC | |
| Audio-Spectrogram-Transformer | ASTForAudioClassification | |
| DPT-large | DPTForDepthEstimation | |
| ViT-large | ViTForImageClassification | |
| ResNet50 | ResNetForImageClassification | |
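Once a single-NPU model has been compiled, inference goes through the familiar transformers API. A minimal sketch that reuses the `gpt2-rbln` directory saved in the example above (the prompt and generation arguments are illustrative):

```python
from transformers import AutoTokenizer
from optimum.rbln import RBLNGPT2LMHeadModel

# Reload the compiled artifacts; export=False skips recompilation
# when loading an already-compiled directory.
model = RBLNGPT2LMHeadModel.from_pretrained("gpt2-rbln", export=False)
tokenizer = AutoTokenizer.from_pretrained("gpt2")

inputs = tokenizer("Rebellions ATOM is", return_tensors="pt")
# generate() runs on the RBLN NPU through the standard transformers interface.
output_ids = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```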

Multi-NPU (RSD)

Note

Rebellions Scalable Design (RSD) is only available on ATOM+ (RBLN-CA12). You can check the type of your current RBLN NPU using the `rbln-stat` command.

| Model | Model Architecture | Recommended # of NPUs | Task |
| :---: | :-----------------------------: | :-----: | :-----: |
| DeepSeek-R1-Distill-Llama-8b | LlamaForCausalLM | 8 | |
| DeepSeek-R1-Distill-Llama-70b | LlamaForCausalLM | 16 | |
| DeepSeek-R1-Distill-Qwen-1.5b | Qwen2ForCausalLM | 8 | |
| DeepSeek-R1-Distill-Qwen-7b | Qwen2ForCausalLM | 8 | |
| DeepSeek-R1-Distill-Qwen-14b | Qwen2ForCausalLM | 8 | |
| DeepSeek-R1-Distill-Qwen-32b | Qwen2ForCausalLM | 16 | |
| Llama3.3-70b | LlamaForCausalLM | 16 | |
| Llama3.2-3b | LlamaForCausalLM | 8 | |
| Llama3.1-70b | LlamaForCausalLM | 16 | |
| Llama3.1-8b | LlamaForCausalLM | 8 | |
| Llama3-8b | LlamaForCausalLM | 4 | |
| Llama3-8b + LoRA | LlamaForCausalLM | 4 | |
| Llama2-7b | LlamaForCausalLM | 4 | |
| Llama2-13b | LlamaForCausalLM | 8 | |
| Gemma-7b | GemmaForCausalLM | 4 | |
| Mistral-7b | MistralForCausalLM | 4 | |
| Qwen2-7b | Qwen2ForCausalLM | 4 | |
| Qwen2.5-7b | Qwen2ForCausalLM | 4 | |
| Qwen2.5-14b | Qwen2ForCausalLM | 8 | |
| Salamandra-7b | LlamaForCausalLM | 4 | |
| KONI-Llama3.1-8b | LlamaForCausalLM | 8 | |
| EXAONE-3.0-7.8b | ExaoneForCausalLM | 4 | |
| EXAONE-3.5-2.4b | ExaoneForCausalLM | 4 | |
| EXAONE-3.5-7.8b | ExaoneForCausalLM | 8 | |
| Mi:dm-7b | MidmLMHeadModel | 4 | |
| SOLAR-10.7b | LlamaForCausalLM | 8 | |
| EEVE-Korean-10.8b | LlamaForCausalLM | 8 | |
| Llava-v1.6-mistral-7b | LlavaNextForConditionalGeneration | 4 | |
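For RSD models, the tensor parallel degree passed at compile time should match the recommended number of NPUs in the table above. A minimal sketch for Llama3.1-8b (the Hugging Face model ID, batch size, sequence length, and output directory are illustrative assumptions):

```python
from optimum.rbln import RBLNLlamaForCausalLM

# Compile Llama3.1-8b for Rebellions Scalable Design (RSD).
# rbln_tensor_parallel_size matches the 8 NPUs recommended in the table.
model = RBLNLlamaForCausalLM.from_pretrained(
    "meta-llama/Llama-3.1-8B-Instruct",  # illustrative model ID
    export=True,
    rbln_batch_size=1,                   # illustrative compile-time settings
    rbln_max_seq_len=8192,
    rbln_tensor_parallel_size=8,
)
model.save_pretrained("llama3.1-8b-rbln")
```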

Diffusers

Note

Models marked with a superscript require more than one ATOM because their weights exceed the memory capacity of a single ATOM, so the model's modules must be divided across multiple ATOMs. For detailed information regarding the specific module distribution, please refer to the model code.

| Model | Model Architecture | Task |
| :---: | :-----------------------------: | :-----: |
| Stable Diffusion | StableDiffusionPipeline<br>StableDiffusionImg2ImgPipeline<br>StableDiffusionInpaintPipeline | |
| Stable Diffusion + LoRA | StableDiffusionPipeline | |
| Stable Diffusion V3 | StableDiffusion3Pipeline<br>StableDiffusion3Img2ImgPipeline<br>StableDiffusion3InpaintPipeline | |
| Stable Diffusion XL | StableDiffusionXLPipeline<br>StableDiffusionXLImg2ImgPipeline<br>StableDiffusionXLInpaintPipeline | |
| Stable Diffusion XL + multi-LoRA | StableDiffusionXLPipeline | |
| SDXL-turbo | StableDiffusionXLPipeline<br>StableDiffusionXLImg2ImgPipeline | |
| Stable Diffusion + ControlNet | StableDiffusionControlNetPipeline<br>StableDiffusionControlNetImg2ImgPipeline | |
| Stable Diffusion XL + ControlNet | StableDiffusionXLControlNetPipeline<br>StableDiffusionXLControlNetImg2ImgPipeline | |
| Kandinsky V2.2 | KandinskyV22InpaintCombinedPipeline<br>KandinskyV22PriorPipeline<br>KandinskyV22InpaintPipeline | |
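Compiled diffusion pipelines keep the usual diffusers calling convention. A minimal sketch for the basic Stable Diffusion text-to-image pipeline (the model ID, prompt, and file names are illustrative assumptions):

```python
from optimum.rbln import RBLNStableDiffusionPipeline

# Compile the Stable Diffusion pipeline (text encoder, UNet, VAE) for RBLN NPUs.
pipe = RBLNStableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",  # illustrative model ID
    export=True,
)
pipe.save_pretrained("sd-v1-5-rbln")

# Reload the compiled pipeline and generate an image with the usual diffusers API.
pipe = RBLNStableDiffusionPipeline.from_pretrained("sd-v1-5-rbln", export=False)
image = pipe("A photo of a robot reading a book").images[0]
image.save("result.png")
```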