Kuang Hao, Research Computing, NUS Information Technology, on 13 October 2021
Working with large volumes of textual data by hand is tedious and time-consuming, so many companies and organizations use Information Extraction (IE) techniques to automate the process. Information Extraction is the task of automatically extracting structured information from unstructured documents. In most cases, this means processing human-language text by means of natural language processing (NLP). In this article, we introduce common subtasks in information extraction and show how to use open-source tools for those tasks.