CONFIDENTIAL. Limited circulation. For review only.

Dynamic Sensorimotor Model for open-ended acquisition of tool-use

Raphael Braud, Alexandre Pitti and Philippe Gaussier
Laboratory ETIS, UMR CNRS 8051
University of Cergy-Pontoise, ENSEA, France
E-mail: [email protected]

Abstract—Open-ended acquisition of multimodal sensorimotor skills can allow a robot to achieve tasks in unseen scenarios. In this paper, we propose a Dynamic Sensorimotor Model (DSM) to achieve a tool-use task. We use a 6-degree-of-freedom robotic arm and a camera to perform a reaching task. The robot first tries to reach the target directly, fails, then makes a detour to grasp a tool perceived as an extension of its arm, and finally reaches the target with the tool. To do this, the robot only learns to predict sensory variations from its motor activities and a given sensory pattern. By simulating possible motor activities and alternative sensory patterns, the DSM is able to control the robot and to create sub-goals on-the-fly.

[Fig. 1 diagram blocks: Goal, Distance Computation, Dynamic Sensorimotor Model, Working Memory, Priority Selection, Sub-goals.]

I. INTRODUCTION

Open-ended learning is a central issue for autonomous robotics. A major challenge is to define what kind of knowledge robots should learn in order to achieve tasks never encountered before [1], [2]. It remains unclear to what extent low-level skills (e.g., sensorimotor mappings) and higher-level cognitive abilities (e.g., action planning, tool-use) share common learning mechanisms. A continuous development between different levels of cognitive abilities can lead to behaviors as complex as using tools [3], [4]. In a developmental framework, low-level behaviors can form atomic components of high-level learning [5].

This paper proposes a computational model that encodes sensorimotor laws and, through a simulation mechanism, uses those laws in an open-ended manner to solve unseen tasks on-the-fly. The model is applied in an experiment with a 6-degree-of-freedom robotic arm and a camera, and shows how the robot uses the tool in a way it never learned before.

II. DYNAMIC SENSORIMOTOR MODEL

In previous work [6], we compared different strategies to encode sensorimotor transformations in the context of tool-use. Continuing that work, we use what we called a relative strategy to predict sensory variations based on motor activities and sensory patterns (see also [7] for a closely related model). Sensorimotor laws are learned for each sensor i, and those laws are then used in two different simulations to predict possible sensory variations:

1) Motor simulation predicts the sensory variation caused by a motor activity within the current sensory context.
2) Sensory pattern simulation predicts the sensory variation caused by a motor activity in a different context, i.e. in a different sensory pattern.

Fig. 1. Dynamic Sensorimotor Model (DSM): from a given external goal, the model generates motor commands, and possibly sub-goals, kept in a working memory. It selects a goal through a priority mechanism, and sends the variations required for achieving the goal to the Sensorimotor Law Simulator (SLS) of each sensor, which in return provides the desired motor command or sub-goals.

The Dynamic Sensorimotor Model (DSM), see Fig. 1, can then be decomposed as follows. First, the experimenter gives the robot a goal, i.e. a desired sensory pattern $S^{Des}_{G}$. Then the distances between the current sensory pattern and the desired sensory pattern (goal) are computed and sent to the Sensorimotor Law Simulator (SLS). Finally, the two kinds of simulations mentioned above are performed in the SLS. If the motor simulations (1) predict sensory variations that lead closer to the goal, the corresponding motor commands are sent to the actuators. With this method, we performed in [6] a reaching task with the end-effector of a 4-degree-of-freedom robot arm in a 3-dimensional space.

If the motor simulations (1) cannot provide motor activities predicting a sensory variation in the desired direction, the result of the sensory pattern simulation (2) can be used: if, for a simulated sensory pattern $S^{Sim}$, a motor activity can help the robot get closer to the goal, then the robot creates a sub-goal: $S^{Des}_{sG} = S^{Sim}$. Sub-goals created through the SLS are kept in a working memory until they are satisfied.

The robot must then choose between the goal given by the experimenter and the sub-goals given by the SLS. To do this, a simple priority mechanism associates each desired sensory pattern $S^{Des}$ with a priority value P, and the robot selects the $S^{Des}$ with the highest priority. Priority values are assigned as follows. The external goal, received from the experimenter, is set at P = 1. The SLS receives desired sensory variations associated with a priority P = p, coming from the selected $S^{Des}$ (i.e. the one with the highest priority). Then, the SLS sends a desired motor activity, or a desired sensory pattern, with a priority P = p + 1. Thanks to this mechanism, sub-goals are created with a higher priority than the goal pursued at first.

Preprint submitted to 6th International Conference on Development and Learning and on Epigenetic Robotics. Received April 17, 2016.

III. REAL ROBOT EXPERIMENT

We perform an experiment with a 6-degree-of-freedom Katana arm, a camera, a tool and a target. In our experiment, a tool is considered an extension of the body schema (see [8]), so when it is grasped, the robot considers the position of its end-effector to be extended to the tool. The camera is used to recognize and localize the arm, the tool and the target in the image, using a previously developed bio-inspired object recognition algorithm based on local points of interest (see [9]). The goal of the experiment is a reaching task: the end-effector must be close enough to the target. The robot discovers its own sensorimotor laws through a babbling phase, and learns which sensory patterns and motors contribute to the variation of each sensor.
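The mechanism of Section II (motor simulation, sensory pattern simulation, and the priority-based working memory) can be illustrated with a minimal toy sketch. It rests on strong simplifying assumptions that are not taken from the paper: a one-dimensional end-effector height `y`, a binary tactile sensor `touch`, and hand-written laws in `predict` standing in for laws that the robot would learn by babbling. All names (`predict`, `motor_simulation`, `sensory_pattern_simulation`, `run_dsm`), the constants, and the list of candidate contexts are hypothetical.

```python
# Toy sketch of the DSM loop (illustrative assumptions, not the authors' code).
ARM_MAX = 5      # assumed highest position the bare arm can reach
TOOL_LEN = 3     # assumed upward shift of the end-effector when the tool is held
MOTORS = ["up", "down", "grasp"]
# Assumed contexts met during babbling, tried as simulated sensory patterns.
CANDIDATE_CONTEXTS = [{"touch": 1}, {"y": 0}]


def predict(s, motor):
    """Hand-written stand-in for learned laws: variation caused by `motor` in pattern `s`."""
    arm = s["y"] - TOOL_LEN * s["touch"]            # arm height without the tool offset
    if motor == "up" and arm < ARM_MAX:
        return {"y": 1}
    if motor == "down" and arm > 0:
        return {"y": -1}
    if motor == "grasp" and s["touch"] == 0 and s["y"] == 0:   # the tool lies at y = 0
        return {"touch": 1, "y": TOOL_LEN}
    return {}                                       # no variation predicted


def apply_variation(s, ds):
    """Return a new pattern with the variation `ds` added to `s`."""
    return {k: v + ds.get(k, 0) for k, v in s.items()}


def distance(s, goal):
    """Distance between pattern `s` and the desired values of the goal's sensors."""
    return sum(abs(goal[k] - s[k]) for k in goal)


def motor_simulation(s, goal):
    """Simulation (1): the motor whose predicted variation most reduces the distance, else None."""
    best, best_d = None, distance(s, goal)
    for m in MOTORS:
        d = distance(apply_variation(s, predict(s, m)), goal)
        if d < best_d:
            best, best_d = m, d
    return best


def sensory_pattern_simulation(s, goal):
    """Simulation (2): a context in which some motor would help, returned as a sub-goal."""
    for ctx in CANDIDATE_CONTEXTS:
        if set(ctx) & set(goal) or distance(s, ctx) == 0:
            continue                                # skip goal sensors and contexts already true
        if motor_simulation({**s, **ctx}, goal) is not None:
            return ctx
    return None


def run_dsm(state, goal, max_steps=100):
    memory = [(1, goal)]                            # working memory; external goal has P = 1
    motors, subgoals = [], []
    for _ in range(max_steps):
        # Erase satisfied goals and sub-goals from the working memory.
        memory = [(p, g) for p, g in memory if distance(state, g) > 0]
        if not memory:
            break
        p, g = max(memory, key=lambda e: e[0])      # priority selection
        m = motor_simulation(state, g)
        if m is not None:
            state = apply_variation(state, predict(state, m))  # the toy world obeys the laws
            motors.append(m)
            continue
        sub = sensory_pattern_simulation(state, g)
        if sub is None:
            break                                   # neither a motor nor a sub-goal: give up
        memory.append((p + 1, sub))                 # the sub-goal gets priority p + 1
        subgoals.append((p + 1, sub))
    return state, motors, subgoals


if __name__ == "__main__":
    final, motors, subgoals = run_dsm({"y": 2, "touch": 0}, {"y": 7})
    print(final, motors, subgoals)
```

With these assumed constants, the toy arm climbs until its limit, creates the sub-goal `touch = 1` (P = 2) and then `y = 0` (P = 3), descends, grasps, and finally reaches `y = 7` with the extended end-effector. The sketch uses the same function as both the learned law and the world, which is a convenience of the toy setting and says nothing about how well real laws are learned.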
Sensors are the proprioception, the presence and positions of the objects and of the end-effector in the visual field, and the tactile information coming from the end-effector (activated when the tool is grasped). Moreover, since the tool is perceived as an extension of the body schema, the robot discovers during the babbling phase that grasping the rake-like tool visually shifts its end-effector upward.

Fig. 2 shows the position of the end-effector computed by the object recognition algorithm during the reaching experiment. The target is in the top-right corner; at the beginning, the arm is in the bottom-left corner and the rake-like tool is on the left. During phase 1, the motor simulations (1) provide motor commands that reduce the distance between the end-effector and the target. But the arm is limited in its ability to move upward, which the robot has learned during the babbling phase. After a while, because of this predicted limitation, the motor simulations (1) no longer provide motor commands that get closer to the target. As a consequence, the sensory pattern simulation (2) is triggered and provides a sub-goal, which consists in grasping the tool in order to go higher and reach the target. The higher priority of this sub-goal inhibits the main goal, and the robot makes a detour to grasp the tool in phase 2. Once the tool is grasped, the sub-goal is erased from the working memory. The main goal is no longer inhibited and the arm, extended with the tool, tries again to reach the target in phase 3.

Fig. 2. Hand trajectory of the Katana arm (perceived by the object recognition algorithm) with a target out of reach along the Y axis. The robot makes a detour to grasp the appropriate tool.

IV. CONCLUSION

In this paper, we propose a method in which a set of sensorimotor laws is used to achieve goals in unseen scenarios (more details in [10]). In the DSM, the ability to build a sequence on-the-fly is based only on the learned laws, and depends on the effect of motor activity on sensory variations in a specific sensory pattern or context. The ability of the robot to learn useful contexts (e.g. having the tool in the hand moves the end-effector upward) is therefore crucial in this model, and will be the subject of future work.

REFERENCES

[1] J. Weng, J. McClelland, A. Pentland, O. Sporns, I. Stockman, M. Sur, and E. Thelen, “Autonomous mental development by robots and animals,” Science, vol. 291, no. 5504, pp. 599–600, 2001.
[2] C. Prince, N. Helder, and G. Hollich, “Ongoing emergence: A core concept in epigenetic robotics,” 2005.
[3] Y. Nagai and M. Asada, “Predictive learning of sensorimotor information as a key for cognitive development,” in Proc. of the IROS 2015 Workshop on Sensorimotor Contingencies for Robotics, 2015.
[4] J. J. Lockman, “A perception-action perspective on tool use development,” Child Development, pp. 137–144, 2000.
[5] F. Guerin, “Learning like a baby: a survey of artificial intelligence approaches,” The Knowledge Engineering Review, vol. 26, no. 2, pp. 209–236, 2011.
[6] R. Braud, A. Pitti, and P. Gaussier, “Comparison of absolute and relative strategies to encode sensorimotor transformations in tool-use,” in Development and Learning and on Epigenetic Robotics (ICDL), 2015 IEEE International Conference on. IEEE, 2015, pp. 267–268.
[7] D. Bullock, S. Grossberg, and F. H. Guenther, “A self-organizing neural model of motor equivalent reaching and tool use by a multijoint arm,” Journal of Cognitive Neuroscience, vol. 5, no. 4, pp. 408–435, 1993.
[8] A. Maravita and A. Iriki, “Tools for the body (schema),” Trends in Cognitive Sciences, vol. 8, no. 2, pp. 79–86, 2004.
[9] R. Braud, G. Mostafaoui, A. Karaouzene, and P. Gaussier, “Simulating the emergence of early physical and social interactions: A developmental route through low level visuomotor learning,” in From Animals to Animats 13. Springer, 2014, pp. 154–165.
[10] R. Braud, A. Pitti, and P. Gaussier, “A modular dynamic sensorimotor model for affordances learning, sequences planning and tool-use,” Cognitive and Developmental Systems, IEEE Transactions on, submitted.

