FREE ELECTRONIC LIBRARY - Dissertations, online materials

Pages:     | 1 | 2 ||

«Audio-Visual Integration: Generalization Across Talkers A Senior Honors Thesis Presented in Partial Fulfillment of the Requirements for graduation ...»

-- [ Page 3 ] --

The average percent correct responses for the pre-test and post-test for each of the talkers in the V-only condition are displayed in Figure 8. Unlike in the A-only condition, within this modality there was no particular talker who showed a baseline average intelligibility notably higher than the rest. Again improvements were seen from pre-test to post-test with the training talkers, and that this improvement appeared to generalize to the testing talkers.

These results are similar to those found in Figure 9, which shows percent correct responses for the pre-test and post-test for each of the talkers in the A+V condition.

Here we see that LG did have a higher baseline average intelligibility, but the difference was not as great as that seen in the A-only condition. Two important features of these data are that for each of the talkers, training and testing, we see an improvement in performance from pre-test to post-test, indicating that generalization occurred. Also, in this condition the pre-test average intelligibility for all talkers is higher than that in the single modality conditions. Even at the post-test, there was still room for improvement, ruling out a possible ceiling-effect. Thus, ceiling effects do not explain the decrease in integration observed in Figure 6.

Integration of Discrepant Stimuli The responses to discrepant stimuli in the A+V condition were categorized into “auditory” (percent of time subject chose the auditory stimulus as the response), “visual” (percent of time the subject chose a response reflecting the visual place of articulation), or “other” (any other type of response). Figure 10 shows the percent response averaged across listeners for the pre-test and post-test discrepant stimuli for the training talkers, and Figure 11 shows the results for the testing talkers. While an increase in “other” responses is seen, this increase was not statistically significant. ANOVA results revealed there was no main effect of test (pre vs. post), F(1,4)=6.221, p=.067, just missing the.05 alpha limit. There was also no significant main effect of talker (training vs. testing), F(1,4)=.125, p=.74. The fact that the “other” responses increased from pretest to post-test for both the training talkers and the testing talkers shows a decrease on reliance of the individual modalities and a possible increase in audio-visual integration.

To determine whether there had indeed been an increase in integration, the responses in the “other” category were further analyzed. Figures 12 and 13 show the results.

In Figures 12 and 13 “fusion” and “combination” responses indicate McGurk-type integration, whereas the “neither” category represents those responses that do not show integration. For both the training talkers and the testing talkers we see an increase in “fusion” and “combination” responses from pre-test to post-test and a corresponding decrease in “neither” responses. This suggests that training facilitated integration for the discrepant stimuli and this integration process appears to have generalized from the training talkers to the testing talkers. However, ANOVA results revealed that the main effect of test was not statistically significant, F(1,4) = 4.438, p=.103, although the main effect of talker approached significance, F(1,4)=6.831, p=.059.

–  –  –

Overall, the present results indicate that training in the A-only, V-only and A+V conditions with one set of talkers does generalize to a different set of talkers. For both sets of talkers, improvements in all testing modalities were observed from pre-test to post-test. Further, results for discrepant stimuli suggest that audio-visual integration increased from pre-test to post-test, as measured by an increase in McGurk-type fusion and combination responses. In contrast, integration for congruent stimuli, measured as the difference between A+V and the best single modality (A or V), appeared to decrease after training, because the improvement in single-modality conditions was greater than that for the A+V condition. This apparent inconsistency can be attributed to differences in the way integration measured in the present study and argues for further investigation into the utility of different measures of integration.

Grant (2002) critiqued and compared several models for predicting integration efficiency. He focused specifically on two models the pre-labeling model of Braida and the fuzzy logic model of Massaro and argues that the pre-labeling model is superior to the fuzzy logic model. One primary difference of these two models is their assumption about the time course of audio-visual integration. The pre-labeling model assumes that integration occurs early in the cognitive process, prior to a response decision. The fuzzy logic model, in contrast, assumes that integration is a later occurrence, after initial response decisions for each individual modality have been made. Grant applied both models to one data set and found conflicting results; the fuzzy logic model suggested there were no significant signs of inefficient integrators, while the pre-labeling model showed significant differences. Grant argued for the use of the pre-labeling model due to the fact that the fuzzy logic model uses a formula designed to minimize the difference between obtained and predicted scores. This creates a model that attempts to fit obtained A+V scores rather than act as a tool to predict optimal audio-visual speech perception performance. Rather than attempting to fit observed data, the pre-labeling model estimates audio-visual performance based on single-modality information and predicts performance based on the notion that there is no interference across modalities. In situations where this model has been used, the predicted audio-visual scores were always greater than or equal to actual performance whereas the predictions made using the fuzzy-logic model were equally distributed as over-predicting and under-predicting. Grant concluded that the pre-labeling model places a stronger emphasis on individual differences and is therefore a better model for measuring integration efficiency.

Tye-Murray et al. (2007) further analyzed the pre-labeling model. This model, as well as a computationally simpler integration efficiency model, was used to compare integration results for normal hearing and hearing-impaired subjects. Consistent with Grant’s findings, the pre-labeling model predicted higher integration performance than that observed for both hearing-impaired and normal hearing listeners. However, this model found no significant difference between the two groups of listeners, suggesting that while neither group achieved their maximum integration ability, their performances were comparable. The integration efficiency model also did not find a significant difference between the two groups. Unlike the pre-labeling model, the integration efficiency model predicted scores for audio-visual performance that were consistently lower than the actual scores. The integration efficiency model takes into account singlemodality performance for an individual listener. Tye-Murray et al. argued that this is beneficial, because it allows for a deeper investigation into a listener’s skills that can result in the most effective rehabilitation strategy. This model allows insight to a listener’s strengths, weaknesses and integration ability and allows for the formation a rehabilitation strategy that is customized for each hearing-impaired individual.

Recently, Altieri (2008) proposed a different type of model of audio-visual integration, one that employs listener reaction time as an indicator of cognitive processing complexity. While the present study did not collect reaction time data, future work could add this measure to empirical studies to determine its potential usefulness for aural rehabilitation.

Future work could use the present results to compare the measures used in the present study to model-predictive measures (Grant & Seitz, 1998), simple measures of integration efficiency (Tye-Murray et al., 2007), and processing capacity measures (Altieri, 2008) to determine which, if any, of these measures can be used to develop optimized aural rehabilitation strategies for hearing-impaired persons. Nonetheless, these results support the generalizability of training in audio-visual speech perception for aural rehabilitation programs, and argue strongly for inclusion of training in all modalities (auditory, visual, and audio-visual) to achieve maximum benefits.

–  –  –

DiStefano, S. (2010). Can audio-visual integration improve with training, Senior Honors Thesis, The Ohio State University.

Gariety, M. (2009). Effects of training on intelligibility and integration of sine-wave speech. Senior Honors Thesis, The Ohio State University.

Grant, K.W. & Seitz, P.F. (1998). Measures of auditory-visual integration in nonsense syllables and sentences. The Journal of the Acoustical Society of America, 104 (4), 2438-2450.

James, K. (2009). The effects of training on intelligibility of reduced information speech stimuli. Senior Honors Thesis, The Ohio State University.

McGurk, H., & MacDonald, J (1976). Hearing lips and seeing voices. Nature 264, 746Ranta, A. (2010). How does feedback impact training in audio-visual speech perception, Senior Honors Thesis, The Ohio State University.

Richie, C. & Kewley-Port, D. (2008). The effects of auditory-visual vowel identification training on speech recognition under difficult listening conditions. Journal of Speech, Language, and Hearing Research, 51, 1607-1619.

Shannon, R.V., Zeng, F.G., Kamath, V., Wygonski, J., & Ekelid, M. (1995). Speech recognition with primarily temporal cues. Science, 270, 303-304.

Tye-Murray, N., Sommers, M. S., & Spehar B. (2007). Audiovisual integration and lipreading abilities of older adults with normal and impaired hearing. Ear &

–  –  –

Figure 3: Percent correct responses for A-only tests, averaged separately across listeners for training talkers and testing talkers Figure 4: Percent correct responses for V-only tests, averaged separately across listeners for training talkers and testing talkers Figure 5: Percent correct responses for A+V congruent stimuli tests, averaged separately across listeners for training and testing talkers Figure 6: Amount of integration by test, averaged across listeners separately for training talkers and testing talkers Figure 7: Percent correct responses for pre-test and post-test averaged by talker in the

–  –  –

Figure 8: Percent correct responses for pre-test and post-test averaged responses by talker in the V-only condition Figure 9: Percent correct responses for pre-test and post-test averaged by talker in the

–  –  –

Figure 10: Percent response for discrepant stimuli averaged for training talkers across listeners, for pre-test and post-test Figure 11: Percent response for discrepant stimuli averaged for testing talkers across listeners, for pre-test and post-test Figure 12: McGurk-type integration results for pre-test and post-test, averaged across listeners for training talkers Figure 13: McGurk-type integration results for pre-test and post-test, averaged across listeners for testing talkers

Pages:     | 1 | 2 ||

Similar works:

«Organizational Networks as Catalysts for Strategic Sustainable Development Molly H. S. Doyle, Dermot C. Hikisch, Shawn M. Westcott School of Engineering Blekinge Institute of Technology Karlskrona, Sweden 2008 Thesis submitted for completion of Master of Strategic Leadership towards Sustainability, Blekinge Institute of Technology, Karlskrona, Sweden.Abstract: In an increasingly connected and interdependent world, the global sustainability challenge needs to be addressed by organizational...»

«JOB TITLE: CHAPTER PRESIDENT Rev. June 1999 The President is the presiding officer of the Board of Directors and Executive Committee and an ex-officio member of all committees: represents the Board of Directors between its meetings and reports to the Board of Directors allimportant interim actions. This person serves as an identified NASW leader and fills a two-year term. The President works with Chapter Officers, Board and Chapter members to fulfill the mission of the Chapter. All Chapter...»

«“Ironman™” Melbourne 2013 Luke Yeatman The Preamble Yes it was a long day, yes it was hard, yes I got the finisher’s medal, towel and t-shirt but no-one can tell me I am an IRONMAN™ finisher. When the swim course was announced as a one lapper (which turned out being about 1500m) I felt a bit ripped off because I had paid for and was generally prepared for the full distance. Even prior to starting or having my marathon meltdown I knew I would have to do another IRONMAN™, no matter...»

«MINNETONKA INDEPENDENT SCHOOL DISTRICT #276 Service Center 5621 County Road 101 Minnetonka, Minnesota Minutes of January 18, 2007 The School Board of Minnetonka Independent School District #276, met in regular session at 7:15 p.m. on January 18, 2007 in the Community Room at the District Service Center, 5621 County Road 101, Minnetonka, Minnesota. Chairperson Judy Erdahl presided. Other Board members present were: Erin Adams, Pamela Langseth, Calvin Litsey, Cathy Maes, Peggy Stefan, William...»

«Carbon Footprint Game Teacher Resource Pack Primary Years Middle Years NRM Education The NRM Education Program is playing a critical role in contributing to the knowledge, skills and confidence of young people and educators to manage natural resources sustainably. This resource provides information and activities for students to learn about their carbon footprint. Students work out how many tonnes of carbon they would produce based on lifestyle choices, then discuss what changes they might...»

«BACKGROUND Purpose: To present information on the water quality of the Thames River for 2013.Executive Summary: Thames River surface water quality criteria were met or bettered in 11 of 16 Ministry of Environment’s surface water quality objectives. Of the five that did not meet the objectives, E. Coli concentrations were exceeded upstream of the City and downstream at Byron during the disinfection period. The Total Coliform levels leaving the city are higher than those entering during the...»

«Theo D'haen Curriculum vitae Theo D’haen (Antwerp, Belgium, 1950) was educated at the Universities of Antwerp (English, Spanish, BA [1970], MA in Translation Studies [1972] and Conference Interpreting [1973]), Vanderbilt (Nashville, Tennessee Comparative Literature), Brussels (Germanic Philology), Massachusetts (Amherst, Massachusetts MA [1976] and PhD in Comparative Literature [1981]), and the Ecole des Hautes Etudes en Sciences Sociales, Paris (1976-1977). D’haen has worked as a...»

«ArtWorks Internship in the Arts Program Hired to Create. Inspired to Succeed. Application for paid internship *For current 9th 12th graders only. Students must be at least 14 years old, and eligible for employment within the U.S. Application Due: August 29, 2016 ALL of the following materials must be received by 3:00 pm on the deadline date for your application to be accepted for consideration. Application checklist: This completed application form.  Typed and completed responses to...»

«Handbook on the Applied Sciences and Engineering, Vol.2, 2015 ISBN : 978-969-9952-11-1 CONFERENCE PROCEEDINGS BOOK 2nd ISCASE-2015 Dubai 2nd International Scientific Conference on Applied Sciences and Engineering 16-17 February, 2015 Movenpick Ibn Battuta Gate Hotel, Dubai i Handbook on the Applied Sciences and Engineering, Vol.2, 2015 ISBN : 978-969-9952-11-1 Contents Article I.D Title Page No. 2nd ISCASE-432 Facies Associations and Evolution Of Jurassic Carbonate Patform, Northern Atlasic...»

«Bajmócy, Z. – Lengyel, I. (eds) 2009: Regional Competitiveness, Innovation and Environment. JATEPress, Szeged, pp. 222-236. The Social Role and Responsibility of Small and Medium-sized Enterprises – Results of an Empirical Investigation Applying the Social Capital Approach György Málovics An increasing number of projects deal with the social role and responsibilities of small and medium-sized enterprises (SMEs). The special literature on corporate social responsibility (CSR) and most...»

«Overnight Camp Welcome to Camp Quinipet 2016! We are beyond thrilled to have you joining us at camp this summer, and we’ve got some pretty great stuff planned for you. In this packet you should find everything you need to prep for the best summer ever. We can’t wait for all of the fun we’ll have this year. Each day we’ll enjoy swimming, arts and crafts and exploring Camp Quinipet. Remember, we’ll be spending a lot of time outdoors, so plan for the weather when packing! Check out our...»

«Diyala Journal ISSN 1999-8716 of Engineering Printed in Iraq Sciences Vol. 06, No. 04, pp. 1-11, December 2013 SETTLEMENT-TIME BEHAVIOR OF STEEL PILES IN GYPSIFEREOUS SAND A MODEL PROTOTYPE STUDY Waad Abdulsattar Zakaria Lecturer, College of Engineering, Diyala University, Iraq. E-mail: waadzakariya@yahoo.com (Received: 1/2/2011; Accepted:11 /5/2011) ABSTRACT: There are a lot of studies conducted on gypseous soils dealing with the effect of collapsibility on the general behavior of the soil...»

<<  HOME   |    CONTACTS
2016 www.dissertation.xlibx.info - Dissertations, online materials

Materials of this site are available for review, all rights belong to their respective owners.
If you do not agree with the fact that your material is placed on this site, please, email us, we will within 1-2 business days delete him.