The README suggests that multiple variants would be available, but there's only the 1024 version. https://huggingface.co/datasets/willdepueoai/parameter-golf/tree/main/datasets/datasets Can we get official tokenizers and datasets up to 16k?
The README suggests that multiple variants would be available, but there's only the 1024 version.
https://huggingface.co/datasets/willdepueoai/parameter-golf/tree/main/datasets/datasets
Can we get official tokenizers and datasets up to 16k?