I'm happy to have a go at adding the [ThermoML archives](https://www.nist.gov/mml/acmd/trc/thermoml/thermoml-archive) as a dataset, if this is useful. Already mentioned on Discord by @marcosfelt, including the useful link to [thermopyl](https://github.com/choderalab/thermopyl) and @marcosfelt's [updated fork](https://github.com/sustainable-processes/thermopyl).