Offline Visualization of Inference Traces#47
Open
archbilesherman wants to merge 5 commits into
Open
Conversation
Author
|
Updated the PR description with current backend functionality, test coverage, and remaining integration work. The reader now passes local tests and supports working Zarr archive loading, but still needs validation against a real standardized archive from the archive-writing pipeline. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
This PR implements the backend reader layer for issue #41, which focuses on lightweight offline visualization of VizFold/OpenFold inference traces. The goal is to let visualization tools inspect saved inference traces without rerunning inference or depending on a heavy live backend.
The main contribution is a working
ArchiveReaderfor standardized Zarr-style trace archives, alongside the existingLegacyTxtReaderpath for older text-based trace dumps.What the code does
LegacyTxtReaderThe
LegacyTxtReadersupports the older VizFold text-dump format. This is useful because some existing trace outputs are still stored as plain text files rather than standardized archives. Keeping this reader lets the offline visualization stack continue to support current/legacy outputs while the archive format from issue #39 stabilizes.ArchiveReaderThe new
ArchiveReadersupports Zarr-based inference trace archives. It can:.zarrarchivestop_kfiltering to attention connectionsThis is meant to provide a stable backend interface for the frontend visualization work. The frontend should be able to call the reader interface without needing to know the exact internal archive layout.
Why both readers exist
There are currently two reader paths because they support two different stages of the project:
LegacyTxtReadersupports existing text-based outputs.ArchiveReadersupports the newer standardized Zarr archive direction from issue Design a Standardized Archive Format for Inference Traces #39.This lets issue #41 support existing data while also moving toward the scalable archive format that will be needed for larger protein traces.
How to run locally