Delete outdated sub-ecosystem data in osv-vulnerabilities bucket #2969

Open
hogo6002 opened this issue Dec 9, 2024 · 1 comment
Labels
cleanup Code hygiene and cleanup

Comments

@hogo6002
Contributor

hogo6002 commented Dec 9, 2024

The osv-vulnerabilities bucket currently stores all vulnerabilities exported by the exporter, including historical entries, because nothing is deleted automatically. We recently modified the exporter to export vulnerabilities only for the main ecosystem (e.g. Debian, Ubuntu) instead of individual sub-ecosystems (e.g. Debian:11, Debian:12). As a result, a large number of legacy sub-ecosystem directories remain in the bucket.

To maintain a cleaner bucket, we should remove these outdated directories and files.

If we also want to remove vulnerability records that no longer exist from each ecosystem, this could resolve issues like #2902.
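As a rough illustration of the record-level cleanup, here is a minimal sketch that compares what is in the bucket against the set of IDs that still exist. It assumes objects are named `<ecosystem>/<ID>.json`; that layout, the function name `stale_record_objects`, and the sample IDs are all hypothetical and not taken from the exporter.

```python
def stale_record_objects(exported_objects, current_ids):
    """Return exported '<ID>.json' objects whose ID is not in current_ids."""
    current = set(current_ids)
    stale = []
    for name in exported_objects:
        filename = name.rsplit("/", 1)[-1]
        if not filename.endswith(".json"):
            continue  # skip non-record objects such as archives
        record_id = filename[: -len(".json")]
        if record_id not in current:
            stale.append(name)
    return stale


# Hypothetical listing and ID set, for illustration only.
exported = ["Debian/A-1.json", "Debian/A-2.json", "Debian/all.zip"]
print(stale_record_objects(exported, ["A-1"]))  # ['Debian/A-2.json']
```

The function only reports candidates; any actual deletion would be a separate, reviewed step.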

@andrewpollock
Contributor

I think we can break this into two pieces of work:

  1. a one-time manual cleanup of all of the directories with colons in their names
  2. engineering work on the exporter to clean up individual records, as described in "exporter: individual records no longer in existence should be removed from the GCS export" (#2902)
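For the first item, a minimal sketch of how the candidate directories might be picked out of an object listing before anything is deleted. The function name `legacy_sub_ecosystem_dirs` and the sample object names are hypothetical; the only rule taken from the discussion above is "top-level directory with a colon in its name".

```python
def legacy_sub_ecosystem_dirs(object_names):
    """Return the sorted top-level directories whose name contains a colon."""
    dirs = set()
    for name in object_names:
        top, sep, _rest = name.partition("/")
        # Only count real directories: there must be a "/" after the prefix.
        if sep and ":" in top:
            dirs.add(top)
    return sorted(dirs)


# Hypothetical object listing, for illustration only.
sample = [
    "Debian/all.zip",
    "Debian:11/all.zip",
    "Debian:11/DSA-0000-1.json",
    "Debian:12/all.zip",
    "Ubuntu/all.zip",
    "ecosystems.txt",
]
print(legacy_sub_ecosystem_dirs(sample))  # ['Debian:11', 'Debian:12']
```

Producing the list first and reviewing it by hand fits a one-time manual cleanup, since main-ecosystem directories such as `Debian` are never selected.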

@andrewpollock andrewpollock self-assigned this Feb 4, 2025