BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
https://arxiv.org/abs/1810.04805
https://github.com/dhlee347/pytorchic-bert
http://docs.likejazz.com/bert/#position-wise-feed-forward-network
NLP
,
BERT
,
Google
,
2019