Contextual Information Sharing in Natural Language and Gesture Crossmodal Integration for Aged People Assistive Home Care Application Olga Vybornova ([email protected]) Monica Gemo ([email protected]) Ronald Moncarey ([email protected]) Benoit Macq ([email protected]) UCL-TELE, Universite Catholique de Louvain, Batiment Stevin, Place du Levant, 2, B-1348, Louvain-la-Neuve, Belgium Keywords: multimodal assistive interface, domain ontology, user profile, context awareness, semantic representations, multimodal fusion.

Application overview We present a method for knowledge-based context sharing within multimodal high level fusion integrating data obtained from spoken input and visual scene analysis. The goal of our work is to develop a multimodal interface as an “intelligent diary” to proactively assist elderly people living alone at home to perform their daily activities, to prolong their safe and secure personal autonomy, to support their active ageing and social cohesion. To provide natural interaction with the user(s) a system must be able to comprehend the fully coordinated mind-andbody behavior, to handle semantic-level input data fusion, i.e. to combine information arriving simultaneously from different modalities into one or several unified and coherent representations of the user’s intention. Our context-aware user-centered application should accept spontaneous multimodal input – English speech, 3D gestures (pointing, iconic, possibly metaphoric) and user’s physical action. In the near future we plan to add in the research also eye gaze tracking modality to facilitate capturing salient objects in the scene. Thus we have a restricted domain to work with, but we deal with unrestricted natural human behavior – spontaneous spoken input and gesture. At present we are implementing multi-stage crossmodal fusion that is seen promising from the point of view of reference ambiguity resolution before the final fusion. It is exactly crossmodal fusion that helps us cope with problems of speech recognition for elderly people caused by age-related decline of language production ability (for instance, difficulties in retrieving appropriate (familiar) words or tip-of-the-tongue (TOT) states when a person produces one or more incorrect sounds in a word (Burke & Shafto, 2004) because information from other modalities refines the language analysis at the early stage of recognition.

Method Everything that is said or done is meaningful only in a particular context. To accomplish the task of semantic fusion we should take into account the information obtained at least in the following three types of context (Chai, Pan and Zhou, 2005): (i) domain context (meaning personalized

prior knowledge of the domain, semantic frames with predefined action patterns, adaptive user profile, situation modeling, a priori developed and dynamically updated ontology defining subjects, objects, activities and relations between them for a particular person). (ii) conversational context (derived from natural language semantic analysis); (iii) visual context (capturing the user’s gesture/action in the observation scene and allowing eye gaze tracking to enable salience models while activity monitoring). To derive contextual information from spoken input we extract natural language semantic representations (discourse representation structures) (Bos, 2005) and map them onto the restricted domain ontology. This information is then processed together with visual scene input for multimodal reference resolution. The ontology allows contextual information sharing within the domain and serves as a metamodel for Bayesian networks used to analyze and combine the modalities of interest. With the help of nondeterministic weighting of multimodal data streams we obtain robust contextual fusion to recognize the user’s intentions, to predict behavior, to provide reliable interpretation and to reason about the cognitive status of the person.

Acknowledgments This work is supported by the European Commission FP6 Network of Excellence SIMILAR, project # FP6-507609 (http://www.similar.cc).

References Bos J. Towards wide-coverage semantic interpretation. (2005). Proceedings of IWCS-6. Burke D. and Shafto M. (2004). Aging and language production, Current Directions in Psychological Science, 13. Chai J., Pan S. and Zhou M. (2005). MIND: A contextbased multimodal interpretation framework, Kluwer Academic Publishers. Pfleger N. and Alexandersson J. (2006) Towards resolving referring expressions by implicitly activated referents in practical dialogue systems, Proc. of the Workshop on the Semantics and Pragmatics of Dialogue. Pollack M. (2005) Intelligent Technology for an Aging Population: The Use of AI to Assist Elders with Cognitive Impairment, AI Magazine, 26(2):9-242005.

Pre Test Excerpt

goal of our work is to develop a multimodal interface as an. “intelligent diary” to proactively assist elderly people living alone at home to perform their daily activities, to prolong their safe and secure personal autonomy, to support their active ageing and social cohesion. To provide natural interaction with the user(s) a system.

22KB Sizes 1 Downloads 224 Views

Recommend Documents

Pre Test Excerpt
A soft constraints hypothesis (Gray et al., 2006) posits that people strategically .... measures analysis of variance revealed a significant main effect for condition ...

Pre Test Excerpt
Thus, because learners must track possible mappings across learning events, real world word learning is much more difficult than tested in recent research on desirable difficulties in word learning. Research on cross-situational learning has indicate

Pre Test Excerpt
Panels A shows a single frame extracted from a video sequence, while the output of the motion filtering algorithm is shown in Panel B. It is important to note that background noise (i.e. the patterns on the walls and ceilings) have been correctly fil

Pre Test Excerpt
In physics and engineering textbooks, simple line drawings are often used to .... crossed between-subjects design with four conditions: Realistic-Support ...

Pre Test Excerpt
Children (M age = 8 yrs, 9 mos) who solved pretest equations .... additional math equivalence problems alone on the laptop screen for them to practice doing the ...

Pre Test Excerpt
child data from five typologically-different languages. Application .... This made it the first word in the utterance, and the system .... CanCorp (Lee, Wong, Leung,.

Pre Test Excerpt
College students participated in a task which required them to trace or draw various forms ... the drawing and tracing of figure-8s. .... administered via computer.

Pre Test Excerpt
all spatial descriptions (in languages which employ them). ..... cluster, consisting of p67, an owl in a hole in a tree, p02, an apple in an otherwise empty bowl,.

Pre Test Excerpt
well as engagement levels that were self-reported during the ... In general, self-reported interest in the task ... learning environment (classroom, human tutor, high stakes learning) .... comparing the predictive power of three banks of predictors.

Pre Test Excerpt
been used to test for the emergence of lexical competition were cohort competitors ..... Experiment 2 widened the domain of reference for our lexical footprint test.

Pre Test Excerpt
Amsterdam: IOS Press. Graesser, A., Lu, S., Olde, B., Cooper-Pye, E., & Whitten,. S. (2005). Question asking and eye tracking during cognitive disequilibrium: ...

Pre Test Excerpt
Department of Cognitive, Perceptual & Brain Sciences, University College London, Gower Street, London, UK. Punit Shah ([email protected]) and ...

Pre Test Excerpt
1771-1776). Austin, TX: Cognitive Science Society. For Evaluation Only. Copyright (c) by Foxit Software Company, 2004 - 2007. Edited by Foxit PDF Editor ...

Pre Test Pharmacology.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Pre Test ...

Pre Test Pharmacology.pdf
Sign in. Page. 1. /. 305. Loading… Page 1 of 305. Page 1 of 305. Page 2 of 305. zzzPsgiolePfrp. Page 2 of 305. Page 3 of 305. zzzPsgiolePfrp. Page 3 of 305.

Pre test Electrostatica.pdf
Franklin frota un trozo de lana con una barra de goma. dura, dando a la barra una carga negativa. ¿Lo que pasa. es que ? A Los protones se quitan de la varilla.

PRE TEST MATCHING SHEET pdf.pdf
Whoops! There was a problem loading more pages. Whoops! There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. PRE TEST MATCHING SHEET pdf.pdf. PRE TEST MATCHING S

PRE TEST MATCHING SHEET pdf.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. PRE TEST ...Missing:

evs pre test tool.pdf
Connect more apps... Try one of the apps below to open or edit this item. evs pre test tool.pdf. evs pre test tool.pdf. Open. Extract. Open with. Sign In. Main menu.

Excerpt - Peachtree Publishers
Atlanta : Peachtree Publishers, [2018] | Summary: Kalinka, a little yellow bird who loves to be helpful, tries to tidy up the home of her grouchy neighbor, Grakkle, ...

Excerpt - Peachtree Publishers
Atlanta : Peachtree Publishers, [2018] | Summary: Kalinka, a little yellow bird who loves to be helpful, tries to tidy up the home of her grouchy neighbor, Grakkle, ...

Excerpt from Flyboys.pdf
Sign in. Loading… Whoops! There was a problem loading more pages. Retrying... Whoops! There was a problem previewing this document. Retrying.

Pubols Gazette Excerpt Turnover.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Pubols Gazette ...