Corpusen etiketatze linguistikoa (Linguistic tagging of corpora)

  1. Aldezabal Roteta, Izaskun
  2. Aranzabe Urruzola, María Jesús
  3. Díaz de Ilarraza Sánchez, Arantza
  4. Estarrona Ibarloza, Ainara
  5. Ezeiza Ramos, Nerea
  6. Uria Garin, Larraitz
Journal:
Anuario del Seminario de Filología Vasca Julio de Urquijo: International journal of basque linguistics and philology

ISSN: 0582-6152

Year of publication: 2009

Issue title: Beñat Oihartzabali gorazarre - Festschrift for Bernard Oyharçabal

Volume: 43

Issue: 1-2

Pages: 37-50

Type: Article

Other publications in: Anuario del Seminario de Filología Vasca Julio de Urquijo: International journal of basque linguistics and philology

Abstract

In this article, we describe the steps involved in linguistically tagging a corpus and the difficulties that arise in the process. Our main objective is to highlight the importance of tagging when preparing a corpus that is useful for linguistic research, and the need to establish criteria and make the decisions that this entails. We also explain how semi-automatic methods are applied and how the manual revision that guarantees the quality of the corpus is carried out. Once the corpus has been tagged and revised, it can be used to carry out linguistic analyses, to improve or evaluate linguistic tools and resources, and to support machine learning.