Create dev doc outlining move of document ingestion and taxonomy processing into instructlab/instructlab #504

Open
bbrowning opened this issue Jan 27, 2025 · 0 comments
We need to determine exactly what code we want to move out of sdg and into the instructlab repo. Which ingestion and processing code? Which tests? Which documentation?

What is the new set of inputs and outputs, and what method will instructlab call in sdg to kick off the data generation pipeline?

We need a dev doc that we can discuss with the instructlab core maintainers team, so we can get their feedback on the work to be done and ensure they're ready to accept the new scope.

@bbrowning bbrowning added this to the 0.8.0 milestone Jan 27, 2025
@bbrowning bbrowning added the jira label Jan 27, 2025