A technological framework for articulatory characterization studies on MRI images

García Arroyo, Jose Luis; Oleagordia Ruiz, Ibon; García Zapirain, Begoña; Méndez Zorrilla, Amaia


Journal:

Estudios de fonética experimental

ISSN: 2385-3573, 1575-5533

Year of publication: 2013

Issue: 22

Type: Article


Abstract

This article presents an innovative technological framework, designed and developed by our research group, for carrying out articulatory characterization studies of the sounds of a language from measurements taken on cine-MRI image sequences. Its core component is DicomPas, a software tool developed in-house, which supports measuring articulatory parameters on MRI image sequences and running ad hoc algorithms on those measurements. The processed data then feed the subsequent knowledge-extraction stage, whether through statistical inference or artificial intelligence techniques. The framework is currently being applied to several studies of Basque and of the Spanish spoken in the Basque Country, drawing on a database with two repositories of images taken in the midsagittal plane, corresponding to 18 different informants.


Data source: Dialnet