Faster validation with tree-based constraint checking #16

laurmaedje · 2025-07-30T13:46:18Z

This PR implements a new, more efficient design for constraint validation.

Previously, each cache entry was stored with a list of tracked calls that needed to be validated. When looking for a cache hit, all entries with the same key hash were iterated and their constraints checked. This could lead to $O(n^2)$ runtime when performing cache lookups in cases where many entries shared the same key hash.
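For illustration, the previous flat layout can be modelled roughly as below. This is a simplified sketch, not comemo's actual code: `FlatCache`, `Entry`, `Call`, and the `ask` closure are hypothetical names, and tracked calls and their results are reduced to plain integers. The lookup also reports how many tracked calls it had to check, which makes the cost of many entries under one key hash visible.

```rust
use std::collections::HashMap;

/// A recorded tracked call: which method was called and what it returned.
/// (Hypothetical simplification: methods and results are plain integers.)
#[derive(Clone, Copy)]
struct Call {
    method: u32,
    result: u64,
}

/// One cache entry: the calls to re-validate plus the memoized output.
struct Entry {
    constraint: Vec<Call>,
    output: String,
}

/// Flat cache: every entry under a key hash carries its own list of calls.
#[derive(Default)]
struct FlatCache {
    entries: HashMap<u64, Vec<Entry>>,
}

impl FlatCache {
    fn insert(&mut self, key: u64, constraint: Vec<Call>, output: &str) {
        self.entries.entry(key).or_default().push(Entry {
            constraint,
            output: output.to_string(),
        });
    }

    /// Scans all entries stored under `key`. `ask` evaluates a tracked
    /// method on the current input. Returns the hit (if any) together with
    /// the number of tracked calls that were checked along the way.
    fn lookup(&self, key: u64, ask: impl Fn(u32) -> u64) -> (Option<&str>, usize) {
        let mut checked = 0;
        for entry in self.entries.get(&key).into_iter().flatten() {
            let mut valid = true;
            for call in &entry.constraint {
                checked += 1;
                if ask(call.method) != call.result {
                    valid = false;
                    break;
                }
            }
            if valid {
                return (Some(entry.output.as_str()), checked);
            }
        }
        (None, checked)
    }
}
```

In this model, a lookup that misses or hits late re-checks the shared leading calls once per entry. If every one of n misses is followed by an insertion under the same key hash, the total work grows quadratically in n.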

The new design makes use of the fact that tracked functions are typically called in a deterministic order. Instead of iterating over all cache entries, it walks through a call tree to find a matching cache entry while validating the minimum possible number of tracked calls.
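The idea of walking a call tree can be sketched as follows. This is a minimal model under stated assumptions, not the data structure added in this PR: the `Node` type, its fields, and the `ask` closure are hypothetical, and tracked calls and results are again plain integers. Each node records which tracked method is asked next, and each edge is labelled with the result that method returned, so recorded call sequences that share a prefix share the nodes for that prefix.

```rust
use std::collections::HashMap;

/// A node in a hypothetical call tree (not comemo's actual type).
#[derive(Default)]
struct Node {
    /// The tracked method asked at this node, if another call follows.
    method: Option<u32>,
    /// Child nodes, keyed by the result the method returned.
    children: HashMap<u64, Node>,
    /// The memoized output, if a recorded call sequence ends here.
    output: Option<String>,
}

impl Node {
    /// Records a sequence of `(method, result)` calls ending in `output`.
    /// Assumes a deterministic call order: sequences that agree on all
    /// earlier results ask the same method next.
    fn insert(&mut self, calls: &[(u32, u64)], output: &str) {
        match calls.split_first() {
            None => self.output = Some(output.to_string()),
            Some((&(method, result), rest)) => {
                debug_assert!(self.method.map_or(true, |m| m == method));
                self.method = Some(method);
                self.children.entry(result).or_default().insert(rest, output);
            }
        }
    }

    /// Walks from the root, asking one tracked method per level and
    /// following the edge that matches its current result. Returns the hit
    /// (if any) together with the number of tracked calls that were checked.
    fn lookup(&self, ask: impl Fn(u32) -> u64) -> (Option<&str>, usize) {
        let mut node = self;
        let mut checked = 0;
        loop {
            if let Some(output) = &node.output {
                return (Some(output.as_str()), checked);
            }
            let Some(method) = node.method else {
                return (None, checked);
            };
            checked += 1;
            match node.children.get(&ask(method)) {
                Some(child) => node = child,
                None => return (None, checked),
            }
        }
    }
}
```

In this sketch, a lookup checks at most one tracked call per tree level, regardless of how many entries were recorded, because a shared prefix of calls is validated once instead of once per entry.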

The speedup this brings varies a lot depending on how the cache is populated by the consumer, but in Typst this can bring double-digit speedups in incremental compiles and huge speedups in previously pathological cases that ran into quadratic cache validation runtime.

To make sure the new implementation is robust, I added fuzz tests for both the tree data structure and the memoization itself.

(Breaking change) I opted to drop support for mutable methods with return values and mixes of mutable and immutable methods in tracked blocks. Making sure this feature still works correctly would have been significant additional work that would be somewhat in vain as Typst does not use the feature anymore and I'm not aware of anyone else using it.

(Breaking change) The new design requires memoized functions to adhere to a new definition of determinism which I call reorderably deterministic. It is explained in the docs of the #[memoize] attribute. In practice, it's typically fulfilled by deterministic functions. If it's not fulfilled, comemo will panic in debug mode. Meanwhile, in release mode, memoized functions will still yield correct results, but caching may prove ineffective.

(Breaking change) The Validate trait was removed. The new design for manual constraint handling and validation is centered around the newly public Constraint type. Note that manual constraint handling is not relevant for average usage, but it's useful in rare cases, for example in Typst's introspection system to detect layout convergence.

laurmaedje force-pushed the faster-validation branch from 9a13d56 to e4a6f89 Compare July 30, 2025 13:53

laurmaedje mentioned this pull request Jul 30, 2025

Faster constraint checking in comemo (bumps comemo & krilla) typst/typst#6683

Merged

Faster validation with tree-based constraint checking

017b4d6

laurmaedje force-pushed the faster-validation branch from e4a6f89 to 017b4d6 Compare July 30, 2025 15:01

laurmaedje added 2 commits July 30, 2025 21:03

Test for nested mutable tracking

6c41328

Move struct

cda3b63

laurmaedje merged commit ec8f9b3 into main Jul 31, 2025
2 checks passed

laurmaedje deleted the faster-validation branch July 31, 2025 08:16

laurmaedje restored the faster-validation branch July 31, 2025 08:16

laurmaedje deleted the faster-validation branch July 31, 2025 21:37
