If anyone wants to fine-tune the 1.5B model, I ported the gpt-2 code to TPUs. Yo...

zitterbewegung · on Nov 5, 2019

Cool this is awesome !

I’m going to try to retrain this with a twitter dataset called sentiment140 ( I have already processed it with gpt2 345M).

MasterScrat · on Nov 6, 2019

Is your fine-tuned model available somewhere?

zitterbewegung · on Nov 6, 2019

I can provide it to you. I have only done 355M. I was trying this for 1.5B but ran into memory issues .

sillysaurusx · on Nov 6, 2019

Sorry about the memory issue! I’ll have a fix up later today. Some info: https://twitter.com/theshawwn/status/1192038627854946304?s=2...

MasterScrat · on Nov 6, 2019

I would be very interested! My email is on my profile.

zitterbewegung · on Nov 6, 2019

Email sent