Release v1.1.1: Patch release · huggingface/optimum

Habana

Installation details added for Optimum-Habana which provides optimized transformers integration for Intel's Habana Gaudi Processor (HPU).

Add the possibility to specify the execution provider in ORTModel.
Add IncludeFullyConnectedNodes class to find the nodes composing the fully connected layers in order to (only) target the latter for quantization to limit the accuracy drop.
Update QuantizationPreprocessor so that the intersection of the two sets representing the nodes to quantize and the nodes to exclude from quantization to be an empty set.
Rename Seq2SeqORTTrainer to ORTSeq2SeqTrainer for clarity and to keep consistency.
Add ORTOptimizer support for ELECTRA models.
Fix the loading of pretrained ORTConfig which contains optimization and quantization config.