Named Entities in the QTLeap Corpus of Online Helpdesk Interactions

Authors

  • Andreia Querido Faculdade de Ciências da Universidade de Lisboa
  • Rita de Carvalho
  • João Rodrigues Faculdade de Ciências da Universidade de Lisboa
  • João Silva Faculdade de Ciências da Universidade de Lisboa
  • Steven Neale Faculdade de Ciências da Universidade de Lisboa
  • Rita Valadas Pereira Faculdade de Ciências da Universidade de Lisboa
  • Patrícia Gomes Faculdade de Ciências da Universidade de Lisboa
  • Catarina Correia Faculdade de Ciências da Universidade de Lisboa
  • Diana Amaral Faculdade de Ciências da Universidade de Lisboa
  • António Branco Faculdade de Ciências da Universidade de Lisboa

DOI:

https://doi.org/10.26334/2183-9077/rapln2ano2016a20

Keywords:

annotated corpus, QTLeap Corpus, named entities, annotation task, disambiguation task

Abstract

In this paper we present the annotation of a corpus with named entities that are classified into semantic types and disambiguated by linking them to their corresponding entry in the Portuguese DBpedia. This corpus, QTLeap Corpus, is a multilingual collection of question and answer pairs from a chat-based helpdesk service for Information and Communication Technologies. The resulting annotated corpus is a gold-standard named entity annotated lexical resource that is useful in supporting the training and evaluation of named entity annotation and disambiguation tools for Portuguese.

Downloads

Download data is not yet available.

Published

2016-10-31

How to Cite

Andreia Querido, Rita de Carvalho, João Rodrigues, João Silva, Steven Neale, Rita Valadas Pereira, … António Branco. (2016). Named Entities in the QTLeap Corpus of Online Helpdesk Interactions. Journal of the Portuguese Linguistics Association, (2), 459–474. https://doi.org/10.26334/2183-9077/rapln2ano2016a20