Skip to content
This repository has been archived by the owner on Dec 16, 2022. It is now read-only.

Adds Huggingface Dataset Reader #5194

Closed
wants to merge 1 commit into from
Closed

Adds Huggingface Dataset Reader #5194

wants to merge 1 commit into from

Conversation

dirkgr
Copy link
Member

@dirkgr dirkgr commented May 10, 2021

Moved over from #5095.

Added a new reader to allow for reading huggingface datasets as instance
Mapped limited datasets.features to allenlp.data.fields

Verified for selective dataset and/or dataset configurations for training split, mentioned in the documentation comments of the reader.

datasets==1.5.0

Joint work with @divijbajaj @annajung @prajaktakini-vmware @agururajvais

Signed-off-by: Abhishek P (VMware) [email protected]

Fixes #4962

Changes proposed in this pull request:
Introduce a new reader that wraps huggingface datasets to provide instances for a split of the dataset with configuration if required

@dirkgr
Copy link
Member Author

dirkgr commented May 10, 2021

GitHub now thinks I am the author, but it's really @pab-vmware. @pab-vmware, if it's important to you that you show up as the author, you can make a new PR that's identical to this one, and we'll keep working there. Then I can also review this properly (since you can't review your own PRs).

@dirkgr
Copy link
Member Author

dirkgr commented May 10, 2021

@pab-vmware fixed #5095, so we don't need this anymore.

@dirkgr dirkgr closed this May 10, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Integration with Huggingface datasets
2 participants