Add additional module metadata to the dataset #14
Comments
This commit adds a builder that can execute arbitrary commands for more custom apps that don't use a standard build system. This allows for more easily integrating projects like the Linux kernel and Chromium that aren't really standard like a typical CMake or Rust project.
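A minimal sketch of what a builder that runs arbitrary commands could look like. The function name `run_custom_build` and the representation of commands as argument lists are hypothetical and are not taken from the project's actual code:

```python
import subprocess
from pathlib import Path


def run_custom_build(commands, build_dir):
    """Run each command in order inside build_dir, stopping at the first failure.

    `commands` is a list of argument lists (hypothetical format), for example
    [["make", "defconfig"], ["make", "-j8"]].
    Returns a list of (command, return_code) pairs for the commands that ran.
    """
    results = []
    for command in commands:
        completed = subprocess.run(
            command,
            cwd=Path(build_dir),  # run inside the project's build directory
            capture_output=True,  # keep stdout/stderr off the console
            text=True,
        )
        results.append((command, completed.returncode))
        if completed.returncode != 0:
            # A failed step usually invalidates the later steps, so stop here.
            break
    return results
```

Passing each command as an argument list avoids going through a shell, which keeps quoting predictable for projects with unusual build steps.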
Hi @boomanaiden154, I'm interested in contributing to this issue and would like some clarification and guidance to make sure I approach the solution correctly. I have a few questions and requests on the following topics: current dataset structure, preferred hashing algorithm, version control details, module identifiers, and resources. Once I have these details, I can propose a detailed implementation plan and start contributing. Looking forward to your response! Best regards,
This isn't really a great starter issue. We don't have good testing infrastructure, so proper testing requires building the whole dataset, which in turn requires distributed computing/HPC resources, and the specifications aren't well fleshed out. If you want something better to hack on, the good first issues in the LLVM monorepo (https://github.com/llvm/llvm-project) will probably be a better bet.
As part of efforts to associate additional data with rows in the dataset (like inputs), we need a way to identify rows in the dataset across different versions. The plan to achieve this is to add the following columns to the dataset:
package_hash
top level field in the build manifest to represent the project versioning information. #25
package_hash field.
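One way a `package_hash` value could be derived from versioning information in the build manifest is sketched below. The issue does not specify a hashing algorithm or the manifest layout, so the use of SHA-256, the function name `compute_package_hash`, and the keys `repository` and `commit` are all assumptions for illustration:

```python
import hashlib
import json


def compute_package_hash(versioning_info):
    """Return a hex SHA-256 digest of the versioning info.

    The dictionary is serialized with sorted keys and fixed separators so that
    the same information always produces the same hash, regardless of key order.
    """
    canonical = json.dumps(versioning_info, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Hypothetical versioning information; the real manifest fields may differ.
example_info = {
    "repository": "https://example.com/some/project.git",
    "commit": "0123456789abcdef",
}
example_hash = compute_package_hash(example_info)
```

Because the hash depends only on the versioning information, a row built from the same package at the same version would keep the same identifier across dataset versions, which is the property the issue asks for.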