A Methodology for Extracting Knowledge about Controlled Vocabularies from Textual Data using FCA-Based Ontology Engineering

Simin Jabbari; Kilian Stoffel

doi:10.1109/BIBM.2018.8621239

2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

A Methodology for Extracting Knowledge about Controlled Vocabularies from Textual Data using FCA-Based Ontology Engineering

Year: 2018, Pages: 1657-1661

DOI Bookmark: 10.1109/BIBM.2018.8621239

Authors

Simin Jabbari, Information Management Institute, University of Neuchâtel, Neuchâtel, 2000, Switzerland
Kilian Stoffel, Information Management Institute, University of Neuchâtel, Neuchâtel, 2000, Switzerland

Abstract

We introduce an end-to-end methodology (from text processing to querying a knowledge graph) for the sake of knowledge extraction from text corpora with a focus on a list of vocabularies of interest. We propose a pipeline that incorporates Natural Language Processing (NLP), Formal Concept Analysis (FCA), and Ontology Engineering techniques to build an ontology from textual data. We then extract the knowledge about controlled vocabularies by querying that knowledge graph, i.e., the engineered ontology. We demonstrate the significance of the proposed methodology by using it for knowledge extraction from a text corpus that consists of 800 news articles and reports about companies and products in the IT and pharmaceutical domain, where the focus is on a given list of 250 controlled vocabularies.

Like what you’re reading?

Already a member?

Get this article FREE with a new membership!

A Compiler to Transfer Controlled Vocabularies and Ontologies Represented in an Object-Oriented Programming Language into Text Mark-Up Languages
13th IEEE International Conference on BioInformatics and BioEngineering
Knowledge Management of Controlled Vocabularies for Semantic Interoperability of Healthcare Applications
2015 International Conference on Healthcare Informatics (ICHI)
Mining Fuzzy Domain Ontology from Textual Databases
2007 IEEE/WIC/ACM International Conference on Web Intelligence
Indexing into controlled vocabularies with XML
Proceedings of the 34th Annual Hawaii International Conference on System Sciences
Indexing into Controlled Vocabularies with XML
Proceedings of the 34th Annual Hawaii International Conference on System Sciences
Using Ontologies and Vocabularies for Dynamic Linking
IEEE Internet Computing
Documenting Context-Based Quality Assessment of Controlled Vocabularies
IEEE Transactions on Emerging Topics in Computing
Modeling semantic business trajectories of territories for multidisciplinary studies through controlled vocabularies
2023 IEEE 39th International Conference on Data Engineering Workshops (ICDEW)
On Visual-Textual-Knowledge Entity Linking
2020 IEEE 14th International Conference on Semantic Computing (ICSC)
Improved Discoverability of Digital Objects in Institutional Repositories Using Controlled Vocabularies
2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL)

A Methodology for Extracting Knowledge about Controlled Vocabularies from Textual Data using FCA-Based Ontology Engineering

Authors

Abstract

Related Articles