====== BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding ====== * https://arxiv.org/abs/1810.04805 * https://github.com/dhlee347/pytorchic-bert * http://docs.likejazz.com/bert/#position-wise-feed-forward-network {{tag>NLP BERT Google 2019}}