fix: skip latents on A100 search submits #30

Merged: Jammy2211 merged 1 commit into main from fix/skip-latents-on-a100-submit on May 28, 2026

@Jammy2211
Contributor

Summary

The first A100 search submit (#29 → job 322548) ran Nautilus to convergence (log_Z=+31690.50, 65500 evals, 11m40s) but errored in the post-fit latent-variable step:

ValueError: magzero must be passed to the Analysis via kwargs to compute
the 'total_lens_flux_mujy' latent. Disable it in config/latent.yaml or
pass magzero=<value>.

autolens_profiling/config/latent.yaml enables total_lens_flux_mujy (needed by the latent/imaging/ per-latent profiling scripts), but the AnalysisImaging used by the search-profiling scripts doesn't carry magzero. As a result, the fit completed, but _runner.py never wrote the metric JSON.

PYAUTO_SKIP_LATENTS=1 bypasses latent computation entirely. Latents aren't part of what the search profile is measuring, and the search itself ran cleanly.
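
The diff is not shown on this page, so the following is only a sketch of one way a Python launcher could set the variable before starting a profiling process. The helper name `env_with_latents_skipped` is hypothetical; only the variable name `PYAUTO_SKIP_LATENTS` and the value `1` come from this PR.

```python
import os


def env_with_latents_skipped(base_env=None):
    """Return a copy of the environment with PYAUTO_SKIP_LATENTS=1 set.

    Hypothetical helper for illustration; it is not part of the PR.
    The input mapping is copied, so the caller's environment is untouched.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["PYAUTO_SKIP_LATENTS"] = "1"
    return env


# Example: build the environment a child process would inherit.
child_env = env_with_latents_skipped()
print(child_env["PYAUTO_SKIP_LATENTS"])
```

The returned mapping can be passed as the `env` argument of `subprocess.run`, which keeps the workaround scoped to the search-profiling job instead of the whole shell session.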

A proper upstream fix (so user code doesn't have to set this env var) will be tracked separately in PyAutoPrompt; this PR is an unblocking workaround for the immediate A100 run.

Test plan

  • Resubmit A100 job; metric JSON + PNG land at results/searches/nautilus/imaging/mge/hst/hpc_a100_fp64.{json,png}.
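
The check in the test plan could be automated along these lines. This is a minimal sketch: the function name `missing_outputs` is hypothetical, and only the output path stem and the two suffixes are taken from the test plan above.

```python
from pathlib import Path


def missing_outputs(stem, suffixes=(".json", ".png")):
    """Return the expected output files (stem + suffix) that do not exist."""
    expected = [Path(str(stem) + suffix) for suffix in suffixes]
    return [path for path in expected if not path.is_file()]


# Path stem from the test plan; both files should exist after a clean run.
stem = "results/searches/nautilus/imaging/mge/hst/hpc_a100_fp64"
for path in missing_outputs(stem):
    print(f"missing: {path}")
```

An empty return value means both the metric JSON and the PNG landed.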

🤖 Generated with Claude Code

The first PR-29 submission of the searches/nautilus/imaging/mge A100 run
completed Nautilus convergence (log_Z=+31690.50, 65500 evals, 11m40s)
but errored in the post-fit latent-variable computation:

  ValueError: magzero must be passed to the Analysis via kwargs to
  compute the 'total_lens_flux_mujy' latent.

autolens_profiling/config/latent.yaml enables this latent for the
latent/imaging/ profiling scripts, but the search-profiling scripts'
AnalysisImaging doesn't carry magzero. PYAUTO_SKIP_LATENTS=1 bypasses
latent computation entirely — latents are not part of the search
profile, and the search itself ran to convergence.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Jammy2211 merged commit aabfa3c into main on May 28, 2026
1 check failed
Jammy2211 deleted the fix/skip-latents-on-a100-submit branch on May 28, 2026 at 10:24
Jammy2211 added a commit that referenced this pull request on May 28, 2026:

PR #30 added PYAUTO_SKIP_LATENTS=1 as a workaround for the magzero-required
µJy latents that crashed the first A100 submit. Upstream PyAutoLens commit
dd3d09b4 ("fix(latent): raw-flux latents + soft-fail magzero-required µJy",
PR #557) replaces the _require_magzero raise with a soft NaN + one warning
per process, and adds three magzero-free raw-flux latents that default-on.

Search profiling now exercises the full first-class latent pipeline as
intended.

Co-authored-by: Jammy2211 <JNightingale2211@gmail.com>
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>