Morphosyntactic structure of terms in Basque for automatic terminology extraction
- Heid, Ulrich (ed. lit.)
- Evert, Stefan (ed. lit.)
- Lehmann, Egbert (ed. lit.)
- Rohrer, Christian (ed. lit.)
Editorial: Stuttgart : Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart, 2000
Año de publicación: 2000
Páginas: 373-381
Congreso: EURALEX. International Congress (9. 2000. Stuttgart)
Tipo: Aportación congreso
Resumen
This paper describes the morphosyntactic patterns of technical terms in Basque and presents an architecture for a term-extracting tool. As Basque is a highly inflected agglutinative language, part-of-speech information is not enough to define term patterns. The use of morphological and syntactic information is essential to reduce considerably the number of structures. For example, a noun, an adverbial, a postpositive adjectival, the nominal form of a verb and even a determiner in the genitive case may work as a prepositive adjective; however, they all share the same syntactic function. Therefore, for the termextracting tool to perform properly, the texts must be morphosyntactically analysed and disambiguated. Then a shallow syntactic parser will identify the previously described patterns.