Biologically-Based Functional Mechanisms of Coarticulation Ashvin Shah , Andrew G. Barto , and Andrew H. Fagg 1

1

1,2

3

Neuroscience and Behavior Program and 2Department of Computer Science, University of Massachusetts Amherst, 3School of Computer Science, University of Oklahoma

http://www-all.cs.umass.edu

2a

1

1

q Seen at many levels: ü ü ü ü

2b

non-coarticulated coarticulated

how fingers are recruited2 2a 2b how a chosen arm3 or hand13 is used Schematic illustrating coarticulation effects. The task is to preshaping12 ,bimanual coordination22 move from the top region to region 1, and then to either region 2a or 2b, with the shortest possible path. Redundancy transfer of sensory representation11 in the target regions allow for coarticulation.

Mechanisms Attributable to Brain Areas q Different areas perform different functions

ü cortical areas: represent task , devise reasonable solutions6, working memory8 ü cerebellum: error correction15 ü basal ganglia: exploration10 and reward-mediated learning4,21, critical for coarticulation20 6,19

Basal Ganglia

Cerebellum

q Exploration occurs on several levels Movement Control

ü coarse action (e.g., which arm or finger to use) - possibly due to coarse segregation of pathways1,16 ü fine action (e.g., how to use an arm or hand) - possibly due to fine integration of pathways9 ü sensory5,7,9 (e.g., which sensory modality to use)

q Different functions cooperate to solve task

(Thalamus, Brain Stem, Spinal Cord)

Environment

The three basic pathways through which voluntary movement is controlled.

ü BG explores and uses rewards to find better solutions ü cortex and cerebellum restrict exploration to the null space of the subtask - space of action and sensory choices such that the subtask is always solved

Hypotheses q In a redundant system, coarticulation and, hence, better movement, is elicited by 1. evaluating movements based on overall task, not subtask, performance 2. exploring over several levels simultaneously

References and Acknowledgements

1.

2.

3.

4.

5.

6. 7.

Alexander GE, Delong MR, and Strick PL (1986). Parallel Organization of Functionally Segregated Circuits Linking Basal Ganglia and Cortex. Annual Review of Neuroscience. 9:357-381. Baader A, Kasennikov O, and Wiesendanger M (2005). Coordination of Bowing and Fingering in Violin Playing. Cognitive Brain Research. 23:436-443. Breteler MK, Hondzinski J, and Flanders M (2003). Drawing Sequences of Segmens in 3D: Kinetic Influences on Arm Configuration. Journal of Neurophysiology. 89:3253-3263. Centonze D, Picconi B, Gubellini P, Bernari G, and Calabresi P (2001). Dopaminergic Control of Synaptic Plasticity in the Dorsal Striatum. European Journal of Neuroscience.13:1071-1077. Debaere F, Wenderoth N, Sunaert S, Hecke PV, and Swinner S (2003). Internal vs External Generation of Movements: Differential Neural Pathways Involved in Bimanual Coordination Performed in the Presence or Absence of Augmented Visual Feedback. Neuroimage. 19:764-776. Duncan J (2001). An Adaptive Coding Model of Neural Function in the Prefrontal Cortex. Nature Reviews Neuroscience. 2:820-829. Flaherty AW and Graybiel AM (1991). Corticostriatal Transformations in the Primate Somatosensory System. Projections From Physiologically Mapped Body-Part Represen-

8. 9.

10.

11.

12.

13. 14. 15.

tations. Journal of Neurophysiology. 66:12491263. Goldman-Rakic PS (1995). Celluar Basis of Working Memory. Neuron. 14:477-485. Graybiel AM, Aosaki T, Flaherty AW, and Kimura M (1994). The Basal Ganglia and Adaptive Motor Control. Science. 265:18261831. Gurney K, Prescott T, and Redgrave R (2001). A Computational Model of Action Selection in the Basal Ganglia. I. A New Functional Anatomy. Biological Cybernetics.84:401-410. Hikosaka O, Nakahara H, Rand MK, Sakai K, Lu X, Nakamura K, Miyachi S, and Doya K (1999). Parallel Neural Networks for Learning Sequential Procedures. Trends in Neuroscience. 22:464-471. Jeannerod M (1981). Intersegmental Coordination During Reaching at Natural Visual Objects. Attention and Performance IX. Long J and Baddeley A (eds). pages 153-169, Hillsdale, NJ: Lawrence Erlbaum Associates. Jerde T, Soechting J, and Flanders M (2003). Coarticulation in Fluent Finger Spelling. The Journal of Neuroscience. 23:2383-2393. Jordan MI (1992). Constrained Supervised Learning. Journal of Mathematical Psychology. 36:396-425. Kitazawa S, Kimura T, and Yin P (1998). Cerebellar Complex Spikes Encode Both Destinations and Errors in Arm Movements.

16.

17.

18.

19.

20.

21.

22.

q Termination of movement depends on sensory modality (s.m.):

Nature. 392:494-497. Middleton FA and Strick PL (2000). Basal Ganglia and Cerebellar Loops: Motor and Cognitive Circuits. Brain Research Reviews. 31:236-250. Platt R, Fagg A, and Grupen R (2002). Nullspace Composition of Control Laws for Grasping. Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems. Rohanimanesh K, Platt R, Mahadevan S, and Grupen R (2004). Coarticulation in Markov Decision Processes. 18th Annual Conference on Neural Information Processing Systems, Vancouver, BC, Canada. Tanj J. (2001). Sequential Organization of Multiple Movements: Involvement of Cortical Motor Areas. Annual Review of Neuroscience. 24:631-651. Tyrone M, Kegl J, and Poizner H (1999). Interarticulator Coordination in Deaf Signers with Parkinson’s Disease. Neuropsychologia. 37:1271-1283. Wickens J, Reynolds J, and Hyland B (2003). Neural Mechanisms of Reward-Related Motor Learning. Current Opinion in Neurobiology. 13:685-690. Wiesendanger M and Serrien D (2001). Toward a Physiological Understanding of Human Dexterity. News in Physiological Science. 15:228-233.

ü States: current target ü Actions: RA

ü s.m. A, analogous to vision - based on extrinsic information - terminates movement when hand hits target or expected end-point - only directed towards one hand at a time ü s.m. B, analogous to proprioceptive - based on intrinsic information - terminates movement when q = qi for chosen arm and base - includes a penalty of 10 time steps ü if one arm uses s.m. B, the other can move with s.m. A concurrently

General Control Scheme

xtarg

q Train on two sets of targs.: ascending and descending (first targ. same for both) q Hypothesis #1 supported

specific

r

ag

q Generic controller, G ü can find a reasonable qi (most direct solution) to hit any target ü provides initial solutions and corrections ü uses Jacobian matrix ü requires s.m. A

Cortex

q Inspired by behavioral3 and theoretical14 studies q Task: hit three targets with right arm

xtarg s

q Specific controller, S ü uses exploration and reward information to find better solutions

x

τ

τ=

s

as

ag

generic

ü arm configuration for 1st target depends on context ü solution suboptimal for 1st target in isolation

x, s

plant and environment

a s first a g after, if target not hit

Control flow diagram. S suggests a control signal, such as a candidate configuration. After the system moves, if the target isn’t hit, G calculates a configuration from the current one that does hit the target. S uses reward information to update its solutions.

Neural Representation

q Inspired by behavioral studies2 q Task: Hit sequence of four targets with either arm ü States: current target + previous action ü Actions: LA, RA

q Hypotheses #1 and #2 supported ü initial solution uses right arm for all 4 targets ü after actions modified, best solution uses left arm for 2nd target (not found in every run)

1

2

3

4

ü fine action exploration: search in continuous space coarse action exploration: allows for discrete learning mechanisms and more effective search ü for additional leverage of excess DOFs, can allow other arm to move while one arm moves towards target

Cortex

Task 3: Action and Sensory Exploration q Task: use either arm to hit sequence of three targets (primary task) and a secondary target at any time

ü represented in model as best reward received for taking that action in that state

BG

DA Actions

LA

LB

RA

RB

Learning

ü ü ü ü

States: current primary target Actions (for primary targets): LA, LB, RA, RB restricted to always use G for secondary target initial solution restricted to use only s.m. A

q Hypotheses #1 and #2 supported:

Sensory FB

Cerebellum

Thalamus

Movement

q For each state, make a movement: 1. select an action based on reward information (e-greedy, coarse action & sensory exploration) 2. add noise, N(0,s), to selected qi (fine action exploration) 3. move towards noisy target configuration until termination 4. if necessary, use G to make a corrective movement 5. record reward and new configuration as qi* 6. transition to next state q After entire task is completed, for each selected action, if total reward > current best reward, replace stored qi with qi* and update current best reward ü analogous to modifying weights of corticostriatal mapping

This research was made possible by NIH grant # NS 044393-01A1

Descending

Task 2: Coarse and Fine Action Exploration

q Notes

q State: current target + limited history (none or previous action) Action: choose arm (e.g., L or R) and sensory modality (e.g., A or B) (coarse action and sensory exploration) q qi stored in action and can be modified States (PFC, SMA, ...) (fine action exploration) q Reward information (e.g., DA) moduPlanning (PFC, PMAs, ...) lates weights of corticostriatal mapping Neural representation of control scheme, illustrated for a task consisting of four targets and a system which represents the current target with no history and has four Actions (LA, LB, RA, RB). The Planning area and Cerebellum provide the functions of the generic controller, while the BG provide the functions of the specific controller. The thin arrows from State 1 to Actions LA, RA, and RB indicate that the associated rewards are less than the current best choice, LB.

Ascending

Before Learning

ü differ depending on overall task ü be suboptimal in isolation

Task 1: Fine Action Exploration

s.m. B

After Learning

q Exploit excess DOFs to best solve multiple subtasks in sequence or concurrently q For a given subtask, the coarticulated strategy may

Base: 2 DOFs Each Arm: 4 DOFs

ü initial solution uses RA for each of the three primary targets and then LA for the secondary target. ü learned solution uses RB for some primary targets, allowing LA to move left arm to secondary target concurrently

q Note

ü without secondary target, best to use RA for all primary targets

Conclusion and Remarks

Before Learning

Coarticulation

q System: 10 DOF planar kinematic “robot ” q Task: hit a set sequence of extrinsic targets (xitarg) with its “hands” in minimum time q Must specify a qi to hit xitarg q Movement: constant velocity from q to qi

After Learning

Often, a complex motor task can be decomposed into a set sequence of subtasks. When there is redundancy in how each subtask is performed, we choose a way that tends to be best for the overall task. This behavior, termed coarticulation, is characteristic of a learned motor skill. Previous theories of motor control suggest that coarticulation may be elicited by explicitly combining motor commands of contiguous movements17,18 or by introducing tertiary objectives, such as smoothness14, in solving a task. While these theories provide valuable clues as to what strategies are useful in learning a task, they were not based on biologically-plausible mechanisms. In this poster, we present a model in which functional mechanisms attributable to brain areas control a redundant system in order to solve a set sequence of subtasks. Resulting behavior displays characteristics of coarticulation.

s.m. A

14

Before Learning

Model Description

After Learning

Introduction

The following figures illustrate the kinematic robot’s behavior for several tasks. Shown are the robot’s configurations when it hits the targets. On the top of each graph are the rewards received for each movement, including a corrective movement if necessary, and the total reward. For each figure, the top graph shows the robot’s configurations before learning (using just the generic controller to find configurations), and the bottom graph shows the robot’s configurations after learning for about 15,000 trials. For clarity, the left arm is plotted in red and the right arm is plotted in blue. In addition, the configuration of the robot is plotted with an alternating pattern of thick and thin lines.

Coarticulation is a measurable behavioral characteristic of a learned motor skill. We used a learning scheme, based on functional mechanisms attributable to brain areas, to show that a search in the null space of subtasks and an evaluation based on the overall task produces improved movements and coarticulated behavior in a redundant system. We also showed that a multi-level search strategy, including sensory exploration, produces improved movements and coarticulated behavior. Finally, the strategies used do not rely on any assumptions as to what constitutes better movements; they rely on a reward signal as defined by the task. Such a strategy allows for flexibility in what objectives are optimized. For example, when signing two letters that are easily distinguishable, sign language users may choose configurations that are as similar as possible to expedite transition13. When signing two letters that look similar, signers may choose configurations that are as distinct as possible to expedite discrimination13. The learning process presented in this poster can be used for both objectives.

Ashvin Shah1, Andrew G. Barto1,2, and Andrew H ...

ger M (2005). Coordination of Bowing and. Fingering in Violin Playing. Cognitive Brain ... Working Memory. Neuron. 14:477-485. Graybiel AM, Aosaki T, Flaherty ...

441KB Sizes 3 Downloads 280 Views

Recommend Documents

Andrew G. Reiter
Transitional Justice Review 1:3 (2015): 123-132, with Karen Zamora Surian (Mount Holyoke. College '13). • “Overcoming .... “Amnesty in the Age of Accountability: Brazil in Comparative Context,” in Real Social Science: ..... American Abroad Me

Andrew Murphy.pdf
Whoops! There was a problem loading more pages. Whoops! There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Andrew Murphy.pdf. Andrew Murphy.pdf. Open. Extract.

Andrew Zhang - Groups
Election Place: Your school. If you want to become a student leader and/or get involved with SGA, this is a great opportunity to develop your leadership skills!

Andrew Bair
Programmed in Java to apply unreleased, and constantly updated, user-interface system to IBM's Webspehre Transfer Extension (WTX). • Learned large legacy WTX project in short amount of time in order to deploy WTX as web application, instead of desk

Andrew Zhang - Groups
If Andrew wins, it will be a great victory for our Chinese community in. Montgomery county. Let's all go vote for Andrew on April 24, 2013, Wednesday!

Andrew Cribb.pdf
Whoops! There was a problem loading more pages. Retrying... Whoops! There was a problem previewing this document. Retrying... Download. Connect more ...

andrew jackson
Cherokee case, did Jackson uphold federal or state authority? __State___. Indian Removal Act. Law passed by Congress authorizing Forced Removal of Native Americans to designated Indian Territory (present-day Oklahoma). (Remember) Phase 2: Assimilatio

gerald andrew mcdermott
Secondary Appointment, Department of Political Science, University of ... Wharton Executive MBA Program – Global Strategic Management: Summer 2008. Wharton Executive ..... Strategic Management Society, Miami, FL, November 7, 2011.

I Don't Know - Andrew Little
Aug 29, 2017 - Page 1 ... experts “fake it", decision-makers may be misled into poor business ...... Since there is a chance that the deviation is successful (if they guess the state ...... surement: Evidence from big field experiments at facebook.

andrew rayel - impulse.pdf
Andrew rayel find. your harmony. Andrew rayel find your harmony 2014 mp3 320 Ðoб/Ñ trance. Andrew rayel feat. christian burns miracles lyrics musixmatch.

andrew loomis pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. andrew loomis ...

02 Andrew Witt.pdf
2014 Park City Cup Champions. • 2014 Arsenal 'Keeper Wars' Finalist. • 2013 State League Champions. • 2013 Adidas Cup Finalist. • 2013 Park City Cup Finalist. • 2012 StrikerFest Finalist. • 2012 Dixie Cup Finalist. • 2011 Dixie Cup Finalist. • 2010 D

Andrew Bowie - sikkim university library
retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying .... why philosophical positions become generally accepted in a wider ...... the world in which the tree is where one leaves one's signature in a.

gerald andrew mcdermott
“The Communist Aftermath: Industrial Networks and the Politics of .... [10] McDermott, Gerald A., Rafael Corredoira & Greg Kruse, 2009, “Public-Private ... [19] Giuliani, Elisa, Carlo Pietrobelli, & Gerald McDermott, 2014, “Social Network.

pdf-0425\twiggy-the-high-stakes-life-of-andrew-forrest-by-andrew ...
Page 1 of 7. TWIGGY: THE HIGH-STAKES LIFE OF. ANDREW FORREST BY ANDREW BURRELL. DOWNLOAD EBOOK : TWIGGY: THE HIGH-STAKES LIFE OF ANDREW. FORREST BY ANDREW BURRELL PDF. Page 1 of 7 ...

pdf-1374\the-red-fairy-book-andrew-lang-by-andrew-lang.pdf
pdf-1374\the-red-fairy-book-andrew-lang-by-andrew-lang.pdf. pdf-1374\the-red-fairy-book-andrew-lang-by-andrew-lang.pdf. Open. Extract. Open with. Sign In.

ocsb-st-andrew-boundary-map.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item.

The Trial of Andrew Johnson.pdf
Download. Connect more apps... Try one of the apps below to open or edit this item. The Trial of Andrew Johnson.pdf. The Trial of Andrew Johnson.pdf. Open.

andrew heywood political theory pdf
pdf. Download now. Click here if your download doesn't start automatically. Page 1 of 1. andrew heywood political theory pdf. andrew heywood political theory ...