Recurrent cerebellar loops simplify adaptive control of redundant and non-linear motor systems

John Porrill & Paul Dean
Centre for Signal Processing in Neural Imaging & Systems Neuroscience, University of Sheffield

We have described elsewhere an adaptive filter model of cerebellar learning in which the cerebellar microcircuit acts to decorrelate motor commands from their sensory consequences (Dean, Porrill & Stone, Proc Roy Soc B, 2002). We found that stable learning for generic motor plants required the cerebellar microcircuit to be embedded in a recurrent loop; this arrangement led to a simple and modular adaptive control architecture when applied to the linearised 3D vestibulo-ocular reflex (Porrill, Dean & Stone, Proc Roy Soc B, 2004). Here we investigate the properties of recurrent loop connectivity for the case of redundant and non-linear motor systems, using the example of kinematic control of a robot arm. We will demonstrate that

• the learning rule does not depend on the availability of 'motor error' signals or require complex neural 'reference structures' to estimate such signals. It is thus a genuinely local learning architecture in which proximal rather than distal error can be used for learning;

• the form of the learning rule is biologically plausible, independent of the details of the controlled plant, and in particular unaffected by motor plant nonlinearities;

• control of redundant systems is not subject to the 'convexity problem', in which incorrect average motor commands are learnt for end-effector positions that can be reached by more than one arm configuration;

• cerebellar connectivity is intrinsically modular and its complexity scales linearly with the dimensionality N of output space, rather than with the product of N and the dimensionality of motor command space (which is usually much higher for highly redundant biological motor systems).

These properties suggest that the highly specific closed cerebellar loops which have been shown to be ubiquitous in motor systems (Kelly & Strick, 2003) may play a central role in simplifying the adaptive control problem.

Figure 1. Recurrent cerebellar architecture applied to kinematic control of a robot arm. The robot arm motor plant (forward kinematics) is denoted by $P$. Desired end-effector positions $x(t)$ are processed by a recurrent loop containing a fixed approximation $B$ to the inverse kinematics $P^{-1}$ and an adaptive element $C = \sum_i w_i G_i$ representing the cerebellar microcircuit. Reaching error $e = \hat{x} - x$ (achieved minus desired end-effector position) is used to train the weights $w_i$ using the local learning rule $\dot{w}_i = -\beta e G_i(y)$. Convergence follows since the sum square synaptic error $V = \frac{1}{2}\sum_i (w_i - w_i^{\mathrm{opt}})^2$ is a Lyapounov function with $\dot{V} = -\beta e^2$. This behaviour contrasts with conventional forward loop architectures, where an approximation to the Jacobian of $P$ is required to enable learning.
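The convergence claim in the Figure 1 caption can be unpacked in a few lines. The sketch below is a reconstruction under stated assumptions, not necessarily the authors' own derivation: it assumes that the loop settles to $y = B(x + C(y))$, that $B$ has a left inverse $B^{\dagger}$ on its range, and that the ideal correction $C^{\mathrm{opt}} = B^{\dagger} - P$ is exactly representable as $\sum_i w_i^{\mathrm{opt}} G_i$. The symbols $z$, $B^{\dagger}$ and $C^{\mathrm{opt}}$ are introduced here for illustration only, and the achieved position is written as $P(y)$.

```latex
\begin{align*}
z &= x + C(y), \qquad y = B(z)
  && \text{(recurrent loop)} \\
x &= B^{\dagger}(y) - C(y)
  && \text{(since } z = B^{\dagger}(y)\text{)} \\
e &= P(y) - x
   = C(y) - \bigl(B^{\dagger}(y) - P(y)\bigr)
   = C(y) - C^{\mathrm{opt}}(y) \\
  &= \sum_i \bigl(w_i - w_i^{\mathrm{opt}}\bigr)\, G_i(y) \\
\dot{V} &= \sum_i \bigl(w_i - w_i^{\mathrm{opt}}\bigr)\, \dot{w}_i
         = -\beta\, e \sum_i \bigl(w_i - w_i^{\mathrm{opt}}\bigr)\, G_i(y)
         = -\beta\, e^{2} \le 0 .
\end{align*}
```

Under these assumptions the reaching error measured at the output equals the error in the output of the adaptive element, which is why no Jacobian of $P$ appears in the learning rule.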

Figure 2. Adaptive re-calibration of a two-joint robot arm. Left panel shows the simulated two-joint arm. Initially $B$ (see Fig 1) is an exact inverse kinematics for arm lengths $L_1 = 1$, $L_2 = 2$ and $C$ is an RBF net with weights $w_i = 0$: in this case the arm reaches accurately to points within the workspace outlined by the blue grid. If arm lengths are changed to $L_1 = 0.9$, $L_2 = 2.2$, the arm reaches to inaccurate positions shown by the distorted pre-training (red) grid. Performance after $10^4$ training examples is shown by the post-training (dotted green) grid. Top right panel shows the stochastic decrease of RMS reaching error during training. Bottom right panel shows the monotonic decrease in $V$ (exactly as predicted by the Lyapounov analysis). Note that much faster convergence can be obtained using optimised bases $G_i$.
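The re-calibration experiment described in the Figure 2 caption can be sketched in a few dozen lines of NumPy. This is a minimal illustration under assumptions, not the authors' simulation: all function names are hypothetical, the recurrent loop is resolved by a short fixed-point iteration, and the Gaussian RBF grid, basis width, learning rate, joint ranges and number of trials are illustrative choices that do not come from the source. Only the arm lengths, the zero initial weights and the form of the learning rule are taken from the captions.

```python
import numpy as np

def forward_kinematics(theta, l1, l2):
    """Plant P: joint angles -> end-effector position of a planar two-joint arm."""
    t1, t12 = theta[0], theta[0] + theta[1]
    return np.array([l1 * np.cos(t1) + l2 * np.cos(t12),
                     l1 * np.sin(t1) + l2 * np.sin(t12)])

def inverse_kinematics(x, l1=1.0, l2=2.0):
    """Fixed element B: exact inverse for the nominal arm (elbow angle in [0, pi])."""
    cos_t2 = (x @ x - l1**2 - l2**2) / (2.0 * l1 * l2)
    t2 = np.arccos(np.clip(cos_t2, -1.0, 1.0))  # clip guards out-of-reach inputs
    t1 = np.arctan2(x[1], x[0]) - np.arctan2(l2 * np.sin(t2), l1 + l2 * np.cos(t2))
    return np.array([t1, t2])

# Gaussian RBF bases G_i on a grid in joint space (grid and width are assumptions).
g1, g2 = np.meshgrid(np.linspace(-0.3, 1.3, 7), np.linspace(0.5, 2.3, 7))
CENTRES = np.column_stack([g1.ravel(), g2.ravel()])
SIGMA = 0.4

def basis(y):
    """Vector of basis activations G_i(y) for motor command y."""
    d2 = np.sum((CENTRES - y) ** 2, axis=1)
    return np.exp(-d2 / (2.0 * SIGMA**2))

def motor_command(x, w, n_iter=5):
    """Approximate the loop solution y = B(x + C(y)) by fixed-point iteration."""
    c = np.zeros(2)
    for _ in range(n_iter):
        y = inverse_kinematics(x + c)
        c = basis(y) @ w  # C(y) = sum_i w_i G_i(y)
    return y

def rms_error(w, targets, l1, l2):
    """RMS reaching error of the true arm over a set of desired positions."""
    errs = [forward_kinematics(motor_command(x, w), l1, l2) - x for x in targets]
    return float(np.sqrt(np.mean(np.sum(np.square(errs), axis=1))))

def train(n_trials=4000, beta=0.05, l1=0.9, l2=2.2, seed=0):
    """Apply the local rule  delta w_i = -beta * e * G_i(y)  once per reach."""
    rng = np.random.default_rng(seed)
    w = np.zeros((len(CENTRES), 2))
    for _ in range(n_trials):
        theta = rng.uniform([0.0, 0.8], [1.0, 2.0])
        x = forward_kinematics(theta, 1.0, 2.0)  # target reachable by nominal arm
        y = motor_command(x, w)
        e = forward_kinematics(y, l1, l2) - x    # reaching error (achieved - desired)
        w -= beta * np.outer(basis(y), e)
    return w

# Fixed grid of test targets inside the training region.
t1s, t2s = np.meshgrid(np.linspace(0.1, 0.9, 5), np.linspace(0.9, 1.9, 5))
TEST_TARGETS = [forward_kinematics(np.array([a, b]), 1.0, 2.0)
                for a, b in zip(t1s.ravel(), t2s.ravel())]

rms_before = rms_error(np.zeros((len(CENTRES), 2)), TEST_TARGETS, 0.9, 2.2)
w_trained = train()
rms_after = rms_error(w_trained, TEST_TARGETS, 0.9, 2.2)
print(f"RMS reaching error before: {rms_before:.3f}, after: {rms_after:.3f}")
```

The update uses only the reaching error and the basis activations at the issued motor command; no Jacobian of the plant is computed anywhere in the sketch. The Lyapounov quantity $V$ is not tracked here because the optimal weights are unknown for an arbitrary basis grid.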
