This repository was archived by the owner on Sep 19, 2025. It is now read-only.

Description
I was just reading about the past flag, for caching state, and wondered whether it's possible to use that in CoreML?
Also, is this repo still active, or has CoreML 4 (and the new pytorch->CoreML conversion tools) kind of eclipsed the work being done here? I'm using your GPT2, trained on custom data, and would be curious about experimenting with XLNet, but activity here seems pretty quiet... ?