Text extraction
textract: https://textract.readthedocs.io/en/stable/
tika-python: https://github.com/chrismattmann/tika-python