
Improve T5 encoder tests with more prompts and static context length #976

Draft
sogartar wants to merge 2 commits into main from t5-improve-test-numerics
Conversation

sogartar
Contributor

The set of prompts is not large enough for statistically sound testing of
the T5 encoder; the same is true for the other text encoders.
With the expanded prompt set, the bf16 numerical difference between eager
execution and IREE vanished. IREE even turns out to be more accurate.

In the tests, the tokenizer padding has been changed to always produce
max-length token sequences. This is in line with how T5 is used in the Flux
pipeline. The T5 encoder export has been extended with an option to export
with a static token sequence length.
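
As a rough illustration of the padding change, here is a minimal sketch using a Hugging Face T5 tokenizer. The model name, prompt list, and sequence length of 512 are hypothetical placeholders, not necessarily what the tests use:

```python
from transformers import T5Tokenizer

# Hypothetical model id and max length, for illustration only.
tokenizer = T5Tokenizer.from_pretrained("google/t5-v1_1-xxl")
prompts = ["a photo of a cat", "an astronaut riding a horse on mars"]

# Pad every prompt to the same maximum length, so eager runs, IREE runs,
# and a static-length export all see token sequences of identical shape.
batch = tokenizer(
    prompts,
    padding="max_length",  # always pad to max_length, not just to the longest prompt
    max_length=512,
    truncation=True,
    return_tensors="pt",
)
print(batch["input_ids"].shape)  # -> torch.Size([2, 512])
```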

The tests were refactored to share tolerance values for f32 and bf16.
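
The shared tolerances could follow a pattern like the sketch below; the table name, helper, and tolerance numbers are hypothetical, not the values used in the actual tests:

```python
import torch

# Hypothetical per-dtype tolerances shared by the eager-vs-IREE comparisons.
TOLERANCES = {
    torch.float32: {"atol": 1e-5, "rtol": 1e-5},
    torch.bfloat16: {"atol": 1e-2, "rtol": 1e-2},
}

def assert_encoder_outputs_close(actual, expected, dtype):
    # Both the f32 and bf16 tests look up the same table instead of
    # hard-coding tolerance values in each test.
    torch.testing.assert_close(actual, expected, **TOLERANCES[dtype])
```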

@sogartar
Contributor Author

This PR is on top of #967, which must be merged first.

@sogartar sogartar force-pushed the t5-improve-test-numerics branch from 92c078e to 7226258 on February 17, 2025 at 23:42
We don't want the stack to depend on the conversion tool from the
llama.cpp repo. Also, the conversion to GGUF does not convert all tensors
to bf16; it leaves some in f32. We would like to control that ourselves if
needed.

This change makes any previously generated IRPA files obsolete.
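
The kind of control referred to above could look roughly like this sketch; it is purely illustrative, and the predicate for which tensors stay in f32 is a made-up example rather than what sharktank actually does:

```python
import torch

def convert_to_bf16(state_dict, keep_f32=lambda name: "norm" in name):
    # Convert floating-point tensors to bf16, except the ones the caller
    # chooses to keep in f32 (here, as an example, normalization weights).
    return {
        name: t if (not t.is_floating_point() or keep_f32(name)) else t.to(torch.bfloat16)
        for name, t in state_dict.items()
    }
```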
@sogartar sogartar force-pushed the t5-improve-test-numerics branch from 7226258 to 753414f on February 17, 2025 at 23:44