Puri, R., Kirby, R., Yakovenko, N., & Catanzaro, B. 2018 Sep 24. Large Scale Language Modeling: Converging on 40GB of Text in Four Hours. Proceedings of the International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD). [Online] :