R. Puri, R. Kirby, N. Yakovenko, and B. Catanzaro. "Large Scale Language Modeling: Converging on 40GB of Text in Four Hours", in Proceedings of the XXX International Symposium on Computer Architecture and High Performance Computing, Lyon, France, 2018, pp. 290-297.