You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Oct 31, 2023. It is now read-only.
When I run the above code to get train examples from translate-train jsons, I get IndexError(list index out of range) while the same code works for the files in mlqa-translate-test.
Do you happen to know why this is happening?
The text was updated successfully, but these errors were encountered:
Yes, not an issue with MLQA :). This is a general question on the provided translated datasets.
Since the datasets are translated from squad and maintain the squad dataset format. I tried the standard squadprocessor but this doesn't seem to work on the Translate-Train datasets but works on Translate-test. The above code works for other multilingual datasets like TydiQA, Xquad.
I will check if this is related to the dataset or the library.
feel free to circle back if there is something up with the data that causes HF to break.
The automatically-translated datasets are a bit noisy, its possible there are some things that are hard for systems to parse and use.
I downloaded the translate-train and transted-test datasets from the links in the Translate-Train and Translate-Test Data section of the readme page.
I am trying to train QA models with translated squad datasets in the translate-train folder.
When I run the above code to get train examples from translate-train jsons, I get IndexError(list index out of range) while the same code works for the files in mlqa-translate-test.
Do you happen to know why this is happening?
The text was updated successfully, but these errors were encountered: