Skip to content

v1.1.1: Patch release

Compare
Choose a tag to compare
@JingyaHuang JingyaHuang released this 26 Apr 15:47
· 1055 commits to main since this release

Habana

ONNX Runtime

  • Add the possibility to specify the execution provider in ORTModel.
  • Add IncludeFullyConnectedNodes class to find the nodes composing the fully connected layers in order to (only) target the latter for quantization to limit the accuracy drop.
  • Update QuantizationPreprocessor so that the intersection of the two sets representing the nodes to quantize and the nodes to exclude from quantization to be an empty set.
  • Rename Seq2SeqORTTrainer to ORTSeq2SeqTrainer for clarity and to keep consistency.
  • Add ORTOptimizer support for ELECTRA models.
  • Fix the loading of pretrained ORTConfig which contains optimization and quantization config.