Idiomatikotasunaren karakterizazio automatikoa: izena+aditza (Automatic characterization of idiomaticity: noun+verb)
- Antton Gurrutxaga Hernaiz
- Iñaki Alegria Loinaz
- Xabier Artola Zubillaga
ISSN: 0214-9001
Publication year: 2016
Issue title: 2013-2014 Euskal tesien 10 pasarte
Issue: 1
Pages: 47-68
Type: Article
Other publications: Ekaia: Euskal Herriko Unibertsitateko zientzi eta teknologi aldizkaria
The goal of this research is to develop and experimentally test techniques for automatically extracting phraseological units (PUs) with noun+verb structure in Basque and for characterizing them according to their level of idiomaticity. Idiomaticity is considered the defining feature of the phraseological unit, and we have measured the following components of it: institutionalization (statistical idiosyncrasy), semantic non-compositionality, morphosyntactic fixedness and lexical fixedness. The results show that standard co-occurrence techniques are significantly outperformed by semantic measures and, to a lesser extent, by measures of morphosyntactic flexibility. The results for lexical flexibility are poorer than expected. Finally, we obtain experimental evidence for several predictions of phraseological theory.