Hi could you add details about how the .mp3 and .txt are named in the dataset and how we can map the text to the videos/timestamps ?
Hi could you add details about how the .mp3 and .txt are named in the dataset and how we can map the text to the videos/timestamps ?