My Documents

Christian Chiarcos

Goethe-University Frankfurt am Main, Germany

Annotation interoperability

Limited interoperability between annotations across different annotation schemes both within as well as across languages represents a major hurdle in the development of truly interoperable NLP tools and interoperable corpus querying. I will discuss the problem, motivate and describe the Ontologies of Linguistic Annotation (OLiA) as a component of a possible solution in the context of the Linguistic Linked Open Data (LLOD) cloud and sketch innovative, ontology-based approaches for NLP tasks such as cross-tagset morphosyntactic analysis. The OLiA ontologies represent a repository of annotation terminology for various linguistic phenomena on a great band-width of languages, they have been used to facilitate interoperability and information integration of linguistic annotations in corpora, NLP pipelines, and lexical-semantic resources. In the accompanying hands-on session, participants will be trained to implement new OLiA Annotation Models and their linking to the OLiA Reference Model.


Christian Chiarcos (a short biography)

He is Assistant Professor of Computer Science at Goethe University Frankfurt, Germany, and heading the Applied Computational Linguistics group. In 2010, he received a doctoral degree on Natural Language Generation from the University Potsdam, Germany, he subsequently worked at the Information Sciences Institute of the University of Southern California (ISI/USC), before joining Goethe University in 2013. His research focuses on semantic technologies, including computational semantics as well as the innovative application of Semantic Web standards to NLP problems. Specific interests cover computational discourse semantics (machine reading), NLP and language resource interoperability. As a computational linguist, Christian Chiarcos explored Semantic Web and Linked Data from an NLP perspective and contributed to the emergence of a community at the intersection of both areas: He has been co-founder of the Open Linguistics Working Group of the Open Knowledge Foundation (OWLG), he initiated and co-organized both the Linked Data in Linguistics workshop series and the accompanying development of the Linguistic Linked Open Data cloud.