====== 자연어 처리(NLP, Natural Language Processing) ====== * word embedding * NLP — Word Embedding & GloVe: https://medium.com/@jonathan_hui/nlp-word-embedding-glove-5e7f523999f6 * NLG * https://arxiv.org/abs/1906.00500 * https://kh-kim.gitbook.io/natural-language-processing-with-pytorch/ * pytorch, 한글 * 한국어 처리 * http://konlpy.org/en/latest/ * https://github.com/lovit/soynlp * https://www.slideshare.net/lucypark/nltk-gensim * 한국어 형태소 분석기 * https://iostream.tistory.com/144?utm_source=gaerae.com&utm_campaign=%EA%B0%9C%EB%B0%9C%EC%9E%90%EC%8A%A4%EB%9F%BD%EB%8B%A4&utm_medium=social&fbclid=IwAR0OofMGcJZdRJG2X18_jG3gacJhudAP_MQSFRy9HKxH7X3w7J-4tnG4-FU * https://stanfordnlp.github.io/stanfordnlp/index.html * 한글 자소 분리 * https://jybaek.tistory.com/790 * https://github.com/lqez/awesome-hangul * 본문추출 * https://pypi.python.org/pypi/newspaper * Q&A * https://quepy.readthedocs.org/en/latest/ * Domument classfication * http://vene.ro/blog/word-movers-distance-in-python.html * DL 기반 * http://www.wildml.com/ * 강의자료 * http://www.lucypark.kr/courses/2015-dm/text-mining.html * 언어 이해 * http://googleresearch.blogspot.kr/2016/02/on-personalities-of-dead-authors.html * textrank * https://github.com/davidadamojr/TextRank * https://github.com/theeluwin/textrankr * Predicting movie genres with PyMongo and Doc2Vec. * https://github.com/sindbach/doc2vec_pymongo * Word vector * https://github.com/ThoughtRiver/lmdb-embeddings * BERT * https://github.com/huggingface/pytorch-pretrained-BERT * https://www.lyrn.ai/2019/02/11/xlm-cross-lingual-language-model/ * Transformer * https://www.lyrn.ai/2019/01/16/transformer-xl-sota-language-model/ * XLM-R * https://ai.facebook.com/blog/-xlm-r-state-of-the-art-cross-lingual-understanding-through-self-supervision/