
Improve pipeline validation #4

Merged
galalqassas merged 3 commits into main from feature/pipeline-validation
Mar 28, 2026

Conversation

@galalqassas
Owner

No description provided.

@galalqassas galalqassas self-assigned this Mar 28, 2026
@galalqassas galalqassas changed the title from "Improve Pipeline validation" to "Improve pipeline validation" Mar 28, 2026

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 38ce9c7dd8

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment on lines +123 to +124
candidates.extend([stripped, normalized.replace("-", "_")])
return _dedupe_preserve_order(candidates)

P1: Stop probing non-importable package specifiers

_package_to_import_candidates now includes the raw requirement string (stripped) as an import target, which produces invalid probes for hyphenated packages such as pytest-cov and pytest-timeout (both are always added by extract_dependencies). Those probes can never succeed, so _discover_missing_dependencies keeps reporting them as missing and install_dependencies_with_retry repeatedly re-runs pip installs until retries are exhausted, even when dependencies are already installed. In CI/offline environments this can turn normal runs into repeated dependency-install failures and wasted time.
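One way to avoid the invalid probes is to emit only candidates that are valid Python module names. The sketch below is not the repository's code: `import_candidates` and its requirement-parsing regex are hypothetical stand-ins for `_package_to_import_candidates`, and the local `_dedupe_preserve_order` is assumed to behave as its name in the diff suggests.

```python
from __future__ import annotations

import re


def _dedupe_preserve_order(items: list[str]) -> list[str]:
    # Assumed behavior of the helper named in the diff: drop repeats, keep first-seen order.
    seen: set[str] = set()
    result: list[str] = []
    for item in items:
        if item not in seen:
            seen.add(item)
            result.append(item)
    return result


def import_candidates(requirement: str) -> list[str]:
    """Hypothetical replacement: return only names that could be imported."""
    # Cut the requirement at the first version specifier, extra, marker, or space.
    name = re.split(r"[<>=!~;\[\s]", requirement.strip(), maxsplit=1)[0]
    candidates = [
        name.replace("-", "_"),
        name.lower().replace("-", "_"),
    ]
    # A probe target must be a dotted sequence of valid identifiers;
    # this rejects raw hyphenated specifiers such as "pytest-cov".
    importable = [
        c for c in candidates
        if c and all(part.isidentifier() for part in c.split("."))
    ]
    return _dedupe_preserve_order(importable)


print(import_candidates("pytest-cov"))           # ['pytest_cov']
print(import_candidates("pytest-timeout>=2.0"))  # ['pytest_timeout']
```

Name normalization alone does not cover distributions whose import name differs from the package name (PyYAML, for example, imports as `yaml`); those would still need an explicit mapping.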


)
return failed_results, failed_evaluation, False

replay_results = run_tests(test_file, codebase_path)

P2: Preserve mutation validation in final acceptance replay

Final acceptance replays the saved suite with run_tests(test_file, codebase_path) but does not enable mutation testing, then overwrites the pipeline’s final evaluation with this replay result. If the accepted iteration previously ran mutation testing, the final report now drops to mutation_score = 0 and loses mutation evidence, while the replay check can still pass because the signature comparison ignores mutation fields. This silently weakens final validation and reporting.
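A possible shape for the fix, short of re-running mutation testing during the replay, is to carry the accepted iteration's mutation evidence into the final evaluation once the replay signature matches. Everything in this sketch is an assumption for illustration: the `Evaluation` dataclass, its fields, `signature`, and `finalize` are hypothetical and do not reflect the pipeline's actual types.

```python
from __future__ import annotations

from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Evaluation:
    # Hypothetical evaluation record; the real pipeline's fields may differ.
    passed: int
    failed: int
    mutation_score: float = 0.0
    mutation_ran: bool = False


def signature(ev: Evaluation) -> tuple[int, int]:
    # Mirrors the review's observation: the comparison ignores mutation fields.
    return (ev.passed, ev.failed)


def finalize(accepted: Evaluation, replay: Evaluation) -> tuple[Evaluation, bool]:
    """Return the final evaluation and whether the replay was accepted."""
    if signature(accepted) != signature(replay):
        return replay, False
    if accepted.mutation_ran and not replay.mutation_ran:
        # Keep the mutation evidence from the accepted iteration instead of
        # letting the replay reset it to the default score of 0.
        replay = replace(
            replay,
            mutation_score=accepted.mutation_score,
            mutation_ran=True,
        )
    return replay, True


accepted = Evaluation(passed=10, failed=0, mutation_score=0.8, mutation_ran=True)
final, ok = finalize(accepted, Evaluation(passed=10, failed=0))
print(ok, final.mutation_score)  # True 0.8
```

The alternative the review title points at, enabling mutation testing in the replay itself, gives stronger evidence at the cost of a longer final run.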


@galalqassas galalqassas force-pushed the feature/pipeline-validation branch from 38ce9c7 to 055c449 March 28, 2026 01:47
@galalqassas galalqassas merged commit 0f4813d into main Mar 28, 2026
7 checks passed