Language grounding in robots for natural HumanRobot Interaction Aneesh Chauhan and Luís Seabra Lopes IEETA - Instituto de Engenharia Electrónica e Telemática de Aveiro

Abstract

Introduction

Motivated by the need to support language-based communication between robots and their human users, as well as grounded symbolic reasoning, this thesis aims to develop learning architectures that can be used by robotic agents for long-term, incremental and open-ended category acquisition. A social language grounding experiment is designed, where, a human instructor teaches a robotic agent the names of the objects present in a visually shared environment. During the research period, a set of novel learning architectures have been developed for vocabulary acquisition and visual category formation in robots through active interaction with humans. These architectures have been evaluated through systematic experiments and the most recent architecture seems to outperform several previous works with similar goals.

This research aims to develop learning architectures that can be used by robotic agents for long-term, incremental and openended category acquisition. “Situating the problem” Symbol grounding: Meanings of symbols (e.g. words) lie in their association with the entities of the world they refer to [3]. - The agent should support grounded symbolic reasoning. Role of social interactions: Language is a cultural product that is acquired through social interactions (language transfer). -Learning a human language will require the participation of humans as language instructors. -Humans and robots can share a language if they have the same words grounded to same entities. Experimental setup - Agent: A simple agent has been developed, which consists of a computer, with an attached camera and a robotic arm, running appropriate perceptual, learning and interaction procedures. - Scenario: A scenario was designed where a human instructor teaches the names of objects present in a visually shared environment (language transfer). - The agent grounds the object names in sensor-based descriptions (symbol grounding), leading to a shared vocabulary with its instructor.

Fig.1 Experimental setup

Architectures Two approaches identify the agent architectures, based upon reliability of the communicated “word”: 1. Textual input (reliable) [2, 4, 5]. 2. Spoken words (noisy) [1]. Both architectures share the visual perception functions: Object segmentation from the scene and extraction of multiple feature spaces for instance representations

a

a Fig.3 a. Category learning and recognition architecture; b. Object images

- Novel features were designed, where most of these features capture the shaper information (and are scale, rotation and transformation invariant) [4] The architecture supporting spoken words extracts the auditory features (phonemes and mel-frequency cepstral coefficients) [1]. -The agent uses its perceptual input to ground these words and dynamically form/organize visual category descriptions. Category learning and recognition The architecture supports lifelong openended vocabulary acquisition based on online user feedback (teach, ask, correct). “Learning” -Instance-based learning. Categories are simply represented by sets of known instances. New instances are stored in the following situations: - Explicit teach action or Corrective feedback: “Classification ” -One-class classification [5] -Base classifiers: 6 nearest-neighbor (NN) classifiers [2,4] and 10 nearest-cluster (NC) classifiers [2]. A color-based classifier is also included . - 7 classifier combinations, based on majority voting and Dempster-Shafer evidence theory . - A metacognitive component [4] maintains updated success statistics for all the classifiers and, based on these statistics, reconfigures classifier combinations. Experimental evaluations Teaching protocol: An exhaustive and generic protocol was developed to evaluate online vocabulary acquisition. - This protocol is applicable to any online, incremental and open-ended category learning system.

b Fig.2 a. Conceptual frameworks for the agent architecture. One supports reliable (textual) input and the other supports spoken input; b. Stages of object segmentation and visual feature extraction

b

Results The performance of the learning model has been evaluated on vocabulary acquisition, using the teaching protocol.

Conclusions - We have developed a physically embodied robot with language grounding capabilities. - During the course of the research several online, incremental and open-ended learning architectures with many innovations were developed (supporting textual/spoken words) - Overall, our approach seems to outperform several previous works with similar goals -While previous approaches enabled learning of up to 12 categories, the proposed approach enabled learning of 69 categories (69 being the limit where no new categories were available to teach) -… but of course previous works are not directly comparable - The agent is able to learn simple words as well as homonyms equally proficiently. References [1] Chauhan, A. & Seabra Lopes, L. (2011): Using spoken words to guide open-endeded category formation. Cognitive Processing. (in press) [2] Chauhan, A. & Seabra Lopes, L. (2010): Acquiring vocabulary through human robot interaction: a learning architecture for grounding words with multiple meanings. AAAI-FSS-10 on Dialogue with Robots. [3] Harnad, S. (1990): The symbol grounding problem. Physica D 42, 335-346. [4] Seabra Lopes, L. and Chauhan, A. (2008): Open-ended category learning for language acquisition. Conn Sci, 20(4), pp. 277-297. [5] Seabra Lopes, L. and Chauhan, A. (2007): How many words can my robot learn? An approach and experiments with One-Class Learning. Interaction Studies, 8(1), pp. 53-81.

Language grounding in robots for natural Human

using the teaching protocol. Conclusions. - We have developed a physically embodied robot with language grounding capabilities. - During the course of the research several online, incremental and open-ended learning architectures with many innovations were developed (supporting textual/spoken words). - Overall, our ...

1MB Sizes 3 Downloads 231 Views

Recommend Documents

Grounding language in action
using abstract, amodal, and arbitrary symbols (i.e., words) combined by syntactic rules (e.g., Burgess & Lund, 1997;. Chomsky, 1980 ... meaningful to an animal.

Partitivity in natural language
partitivity in Zamparelli's analysis to which I turn presently. Zamparelli's analysis of partitives takes of to be the residue operator. (Re') which is defined as follows:.

Ambiguity Management in Natural Language Generation - CiteSeerX
from these interactions, and an ambiguity ... of part of speech triggers obtained by tagging the text. .... menu commands of an Internet browser, and has used.

Ambiguity Management in Natural Language Generation - CiteSeerX
Ambiguity Management in Natural Language Generation. Francis Chantree. The Open University. Dept. of Maths & Computing,. Walton Hall, Milton Keynes, ...

Blunsom - Natural Language Processing Language Modelling and ...
Download. Connect more apps. ... Blunsom - Natural Language Processing Language Modelling and Machine Translation - DLSS 2017.pdf. Blunsom - Natural ...

Natural Language Watermarking
Watermark Testing. Watermark Selecting. ○ Stylistic concerns. ○ Security concerns. Watermark Embedding. 13:38. The 1st Workshop on Info. Hiding. 16 ...

natural language processing
In AI, more attention has been paid ... the AI area of knowledge representation via the study of ... McTear (http://www.infj.ulst.ac.uk/ cbdg23/dialsite.html).

NATURAL LANGUAGE PROCESSING.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. NATURAL ...

Grounding stress in expiratory activity
folds appear to lack the essential biomechanical properties that would permit modulation of… intensity using adult-like strategies.' So to increase the loudness of ...

Rule-based Approach in Arabic Natural Language ...
structures and entities are neither available nor easily affordable, and 2) for ... Domain rules have ... machine translation, name-entity recognition, and intelligent.

Gradability in Natural Language: Logical and ...
Feb 25, 2016 - This work would never have been possible without all the help and support that I have received from friends and colleagues during my time as ...

Identifying Nocuous Ambiguity in Natural Language ...
We present an automated approach to determine whether ambiguities in text are ...... tabular specifications in the SCR (Software Cost Reduction) requirements method aim ...... to, for instance, international dialing codes or IP addresses. ..... Gavas

Ambiguity Management in Natural Language Generation
to the domain or to a company's requirements. .... WYSIWYM a promising system for us to be working with. ..... Resolution in Software Development: a Linguistic.

Rule-based Approach in Arabic Natural Language ...
based approach in developing their Arabic natural processing tools and systems. ...... at homes and businesses through the Web, Internet and Intranet services.

Measuring Human-Robots Interactions - Springer Link
Published online: 3 May 2012. © Springer Science & Business Media BV 2012 ... should be intuitive and easy: these two key characteristics strongly define the ...

Storage of Natural Language Sentences in a Hopfield Network
This paper looks at how the Hopfield memory can be used to store and recall ... We view the need for machine learning of language from examples and a self- ...

Rule-based Approach in Arabic Natural Language ...
structures and entities are neither available nor easily affordable, and 2) for ... Edinburgh, UK (phone: 971-4-3671963; fax: 971-4-3664698; E-mail:.

[PDF] Natural Gas in Nontechnical Language READ ...
Based on educational material from the Institute of Gas Technology, this new nontechnical guide to the natural gas industry provides a balanced overview of the ...

Context-theoretic Semantics for Natural Language
Figure 2.1 gives a sample of occurrences of the term “fruit” in the British National ... Christmas ribbon and wax fruit can be added for colour. .... tree .041 .847 2.33 -.68 1.35 4.36 1.68 1.78 computer 1.56 .679 .731 3.13 1.62 -1.53 .635 -.455.

Natural Language as the Basis for Meaning ... - Springer Link
Our overall research goal is to explore how far we can get with such an in- ...... the acquisition of Kaltix and Sprinks by another growing company, Google”, into a .... invent, kill, know, leave, merge with, name as, quote, recover, reflect, tell,

Exploiting Syntactic Structure for Natural Language ...
Assume we compare two models M1 and M2 they assign probability PM1(Wt) and PM2(Wt) ... A common choice is to use a finite set of words V and map any word not ... Indeed, as shown in 27], for a 3-gram model the coverage for the. (wijwi-2 ...