Offline Visualization of Inference Traces by archbilesherman · Pull Request #47 · AI2Science/vizfold-foundation

archbilesherman · 2026-03-05T23:34:36Z

Summary

This PR implements the backend reader layer for issue #41, which focuses on lightweight offline visualization of VizFold/OpenFold inference traces. The goal is to let visualization tools inspect saved inference traces without rerunning inference or depending on a heavy live backend.

The main contribution is a working ArchiveReader for standardized Zarr-style trace archives, alongside the existing LegacyTxtReader path for older text-based trace dumps.

What the code does

`LegacyTxtReader`

The LegacyTxtReader supports the older VizFold text-dump format. This matters because some existing trace outputs are still stored as plain text files rather than standardized archives. Keeping this reader lets the offline visualization stack keep working with those legacy outputs while the archive format from issue #39 stabilizes.

`ArchiveReader`

The new ArchiveReader supports Zarr-based inference trace archives. It can:

- open `.zarr` archives
- open zipped Zarr archives by extracting them to a temporary directory
- read metadata from archive attributes / metadata groups
- discover available attention types
- list available layers
- list available heads
- selectively load one attention head
- load all attention heads for a layer
- apply `top_k` filtering to attention connections
- load single representations
- load pair representations
- load structure data
- convert stored coordinates into basic PDB text as a lightweight visualization fallback
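
Two of the operations listed above, `top_k` filtering and the PDB text fallback, can be sketched in plain Python. This is only an illustration and not the code in this PR: the function names `top_k_edges` and `ca_trace_to_pdb` are hypothetical, the attention head is assumed to be a nested list of weights, the filtering is assumed to be global over the whole head rather than per row, and the structure is assumed to hold one C-alpha coordinate per residue with unknown residue names.

```python
import heapq


def top_k_edges(attn, k):
    """Return the k strongest (query, key, weight) connections of one head.

    Assumption: `attn` is a nested list where attn[i][j] is the weight
    from query position i to key position j.
    """
    edges = ((i, j, w) for i, row in enumerate(attn) for j, w in enumerate(row))
    return heapq.nlargest(k, edges, key=lambda e: e[2])


def ca_trace_to_pdb(coords, res_name="UNK", chain="A"):
    """Write one CA ATOM record per residue using fixed-width PDB columns.

    Assumption: `coords` is a sequence of (x, y, z) tuples in angstroms,
    one per residue; residue names are not known, so "UNK" is used.
    """
    lines = []
    for serial, (x, y, z) in enumerate(coords, start=1):
        lines.append(
            f"ATOM  {serial:5d}  CA  {res_name:>3s} {chain:1s}{serial:4d}    "
            f"{x:8.3f}{y:8.3f}{z:8.3f}{1.0:6.2f}{0.0:6.2f}          {'C':>2s}"
        )
    lines.append("END")
    return "\n".join(lines) + "\n"
```

A CA-only trace like this is enough for a viewer to draw a backbone, which fits the "lightweight fallback" role described above; a full-atom writer would need residue and atom names from the archive.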

This is meant to provide a stable backend interface for the frontend visualization work. The frontend should be able to call the reader interface without needing to know the exact internal archive layout.
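
One way to picture such an interface is a `typing.Protocol` that the frontend depends on, with each reader satisfying it structurally. The sketch below is an assumption for illustration only: the names `TraceReader`, `InMemoryReader`, `count_heads`, the method names, and the `"msa_row"` attention type in the usage note are hypothetical and are not taken from this PR.

```python
from __future__ import annotations

from typing import Protocol, runtime_checkable

Edge = tuple[int, int, float]  # (query index, key index, weight)


@runtime_checkable
class TraceReader(Protocol):
    """Hypothetical reader interface; method names are illustrative."""

    def list_attention_types(self) -> list[str]: ...
    def list_layers(self, attention_type: str) -> list[int]: ...
    def list_heads(self, attention_type: str, layer: int) -> list[int]: ...
    def load_attention_head(
        self, attention_type: str, layer: int, head: int, top_k: int | None = None
    ) -> list[Edge]: ...


class InMemoryReader:
    """Toy stand-in used only to exercise the interface."""

    def __init__(self, data: dict[str, dict[int, dict[int, list[Edge]]]]) -> None:
        self._data = data

    def list_attention_types(self) -> list[str]:
        return sorted(self._data)

    def list_layers(self, attention_type: str) -> list[int]:
        return sorted(self._data[attention_type])

    def list_heads(self, attention_type: str, layer: int) -> list[int]:
        return sorted(self._data[attention_type][layer])

    def load_attention_head(self, attention_type, layer, head, top_k=None):
        # Strongest connections first, optionally truncated to top_k.
        edges = sorted(
            self._data[attention_type][layer][head],
            key=lambda e: e[2],
            reverse=True,
        )
        return edges if top_k is None else edges[:top_k]


def count_heads(reader: TraceReader) -> dict[str, int]:
    """Frontend-style helper that touches only the interface, not the layout."""
    return {
        t: sum(len(reader.list_heads(t, layer)) for layer in reader.list_layers(t))
        for t in reader.list_attention_types()
    }
```

For example, `count_heads(InMemoryReader({"msa_row": {0: {0: [], 1: []}}}))` walks types, layers, and heads without knowing how they are stored, which is the property the paragraph above asks of the frontend.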

Why both readers exist

There are currently two reader paths because they support two different stages of the project:

- LegacyTxtReader supports existing text-based outputs.
- ArchiveReader supports the newer standardized Zarr archive direction from issue #39 (Design a Standardized Archive Format for Inference Traces).

This lets issue #41 support existing data while also moving toward the scalable archive format that will be needed for larger protein traces.
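
With two reader paths, a caller has to decide which one applies to a given input. The following is a minimal sketch of that decision, assuming it can be made from the file suffix alone; the helper names `reader_kind` and `materialize_archive` and the suffix rules are hypothetical and may not match how this PR selects a reader. It also shows the zipped-archive case, where the archive is extracted to a temporary directory before reading.

```python
import tempfile
import zipfile
from pathlib import Path


def reader_kind(path: Path) -> str:
    """Return 'archive' for Zarr-style inputs, 'legacy_txt' otherwise.

    Assumed rule: a `.zarr` suffix or a `.zip` file means an archive;
    anything else is treated as a legacy text dump.
    """
    suffixes = [s.lower() for s in path.suffixes]
    if ".zarr" in suffixes or path.suffix.lower() == ".zip":
        return "archive"
    return "legacy_txt"


def materialize_archive(path: Path) -> Path:
    """Return a directory to read from, extracting zipped archives first.

    The caller is responsible for removing the temporary directory.
    """
    if path.suffix.lower() != ".zip":
        return path
    target = Path(tempfile.mkdtemp(prefix="trace_archive_"))
    with zipfile.ZipFile(path) as zf:
        zf.extractall(target)
    return target
```

Suffix-based selection is cheap but brittle; probing the directory for Zarr metadata would be a sturdier check once the archive format from issue #39 is settled.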

How to run locally

python -m pip install streamlit plotly py3Dmol matplotlib pandas zarr numcodecs fsspec
python -m pytest tests/test_archive_reader_contract.py tests/test_legacy_txt_reader.py
python -m streamlit run webui/app.py


archbilesherman · 2026-04-29T02:48:32Z

Updated the PR description with current backend functionality, test coverage, and remaining integration work. The reader now passes local tests and supports working Zarr archive loading, but still needs validation against a real standardized archive from the archive-writing pipeline.

archbilesherman added 4 commits March 5, 2026 17:53

Add offline trace reader scaffolding for issue AI2Science#41

f63517f

ArchiveReader: add schema-aware metadata probing and optional Zarr support

86d2794

Implement working Zarr ArchiveReader with tests. Needs to be tested with real data.

9c819c6

Merged Frontend UI, Integration Layer, and Backend Reader.

06d0361

archbilesherman marked this pull request as ready for review April 29, 2026 02:39

archbilesherman changed the title ~~Add offline inference trace framework for issue #41~~ Offline Visualization Apr 29, 2026

archbilesherman changed the title ~~Offline Visualization~~ Offline Visualization of Inference Traces Apr 29, 2026

Clean up offline visualization reader integration

5a954b7

archbilesherman wants to merge 5 commits into AI2Science:main from archbilesherman:offline-viz