Open
Description
I've seen #16 and appreciate the valid concerns raised about releasing the model, but the WebText corpus could be a tremendous help to general research if you were able to release it.
Are there plans to do so?
I did wonder if this might simply enable people to recreate the unreleased GPT-2 but presumably this is no trivial matter, needing expertise and time/resources, thus deterring the causal mischief maker!
Anyway, whatever you end up doing, I wanted to thank you for what you have released already which is really interesting 🙂