Abstract extraction seems to not run smoothly and it is hard to validate, this issue collects other issues Better testing is required. #714 many abstracts were missing due to a 429 too many requests. #579 Phonetics like ` /rɪˈneɪsəns/` were not extracted properly. This has been fixed. But no test was written.