자연어 처리(NLP, Natural Language Processing)
word embedding
NLP — Word Embedding & GloVe:
https://medium.com/@jonathan_hui/nlp-word-embedding-glove-5e7f523999f6
NLG
https://arxiv.org/abs/1906.00500
https://kh-kim.gitbook.io/natural-language-processing-with-pytorch/
pytorch, 한글
한국어 처리
http://konlpy.org/en/latest/
https://github.com/lovit/soynlp
https://www.slideshare.net/lucypark/nltk-gensim
한국어 형태소 분석기
https://iostream.tistory.com/144?utm_source=gaerae.com&utm_campaign=%EA%B0%9C%EB%B0%9C%EC%9E%90%EC%8A%A4%EB%9F%BD%EB%8B%A4&utm_medium=social&fbclid=IwAR0OofMGcJZdRJG2X18_jG3gacJhudAP_MQSFRy9HKxH7X3w7J-4tnG4-FU
https://stanfordnlp.github.io/stanfordnlp/index.html
한글 자소 분리
https://jybaek.tistory.com/790
https://github.com/lqez/awesome-hangul
본문추출
https://pypi.python.org/pypi/newspaper
Q&A
https://quepy.readthedocs.org/en/latest/
Domument classfication
http://vene.ro/blog/word-movers-distance-in-python.html
DL 기반
http://www.wildml.com/
강의자료
http://www.lucypark.kr/courses/2015-dm/text-mining.html
언어 이해
http://googleresearch.blogspot.kr/2016/02/on-personalities-of-dead-authors.html
textrank
https://github.com/davidadamojr/TextRank
https://github.com/theeluwin/textrankr
Predicting movie genres with PyMongo and Doc2Vec.
https://github.com/sindbach/doc2vec_pymongo
Word vector
https://github.com/ThoughtRiver/lmdb-embeddings
BERT
https://github.com/huggingface/pytorch-pretrained-BERT
https://www.lyrn.ai/2019/02/11/xlm-cross-lingual-language-model/
Transformer
https://www.lyrn.ai/2019/01/16/transformer-xl-sota-language-model/
XLM-R
https://ai.facebook.com/blog/-xlm-r-state-of-the-art-cross-lingual-understanding-through-self-supervision/