Corpusen etiketatze linguistikoa

  1. Aldezabal Roteta, Izaskun
  2. Aranzabe Urruzola, María Jesús
  3. Díaz de Ilarraza Sánchez, Arantza
  4. Estarrona Ibarloza, Ainara
  5. Ezeiza Ramos, Nerea
  6. Uria Garin, Larraitz
Revista:
Anuario del Seminario de Filología Vasca Julio de Urquijo: International journal of basque linguistics and philology

ISSN: 0582-6152

Año de publicación: 2009

Título del ejemplar: Beñat Oihartzabali gorazarre - Festchrift for Bernard Oyharçabal

Volumen: 43

Número: 1-2

Páginas: 37-50

Tipo: Artículo

Otras publicaciones en: Anuario del Seminario de Filología Vasca Julio de Urquijo: International journal of basque linguistics and philology

Resumen

In this article, we shall comment on the steps that have to be taken to give a linguistic label to a corpus and the difficulties that appear in this process. Our main objective was to highlight the importance of the labelling when preparing a corpus that is useful for linguistic research, and the need to establish criteria and to take the decisions that this entails. We also explain how semi-automatic methods are applied and how the manual revision that guarantees the quality of the corpus is carried out. Once the corpus has been revised and labelled, it will be useful both for carrying out linguistic analyses and for improving or assessing the linguistic tools and resources, and also for channelling automatic study.