On Ability to Autonomously Execute Agent Programs with Sensing Sebastian Sardi˜ na

Giuseppe De Giacomo

Dept. of Computer Science University of Toronto [email protected]

Dip. Informatica e Sistemistica Univer. di Roma “La Sapienza” [email protected]

Yves Lesp´erance

Hector J. Levesque

Dept. of Computer Science York University [email protected]

Dept. of Computer Science University of Toronto [email protected]

http://www.cs.toronto.edu/∼cogrobo

CogRobo-2004, Valencia

Sardi˜ na et al.

Motivation

Agent programming languages should support planning/deliberation under incomplete information with sensing actions. Need a spec. for this. Most existing formal models of deliberation are epistemic, while agent programming languages have transition system semantics. Hard to relate. Here, develop new non-epistemic formal model of deliberation. Set within situation calculus and IndiGolog. But important lessons for agent programming languages in general.

CogRobo-2004, Valencia

1

Sardi˜ na et al.

High-Level Programs and Histories

Very simple deterministic language: a, δ1 ; δ 2 , if φ then δ1 else δ2 endIf, while φ do δ endWhile,

primitive action sequence conditional while loop

A history σ = (a1, µ1), (a2, µ2), ..., (an, µn) describes a run with actions and their sensing results; e.g.: (go(Airport), 1) · (check departures, 0) · (go(GateB), 1). A configuration is a pair (δ, σ) with a program δ and a history σ specifying the actions performed so far and the sensing results obtained. CogRobo-2004, Valencia

2

Sardi˜ na et al.

IndiGolog Semantics for Online Executions

Based on two notions: • Configuration (δ, σ) may evolve to (δ 0, σ 0) w.r.t. a model M (relative to an underlying theory of action D). • A configuration is final, i.e. may legally terminate.

The theory D, augmented with the sensing results in σ, must entail that the transition is possible. Model M above represents a possible environment and is just used to generate the sensing results. CogRobo-2004, Valencia

3

Sardi˜ na et al.

Deliberation: An EC-based Account

The idea:

An agent can/knows how to execute program δ in history σ iff he can always choose a next action/transition and eventually reach a terminating configuration no matter what sensing results are obtained.

Agent must know that the chosen action/transition is allowed by the program and is executable. In what follows, we we will define KHowEC (δ, σ) using both entailment and consistency.

CogRobo-2004, Valencia

4

Sardi˜ na et al.

Deliberation: An EC-based Account (cont.)

We define KHowEC (δ, σ) to be the smallest relation R(δ, σ) such that:

CogRobo-2004, Valencia

5

Sardi˜ na et al.

Deliberation: An EC-based Account (cont.)

We define KHowEC (δ, σ) to be the smallest relation R(δ, σ) such that: (E1) if (δ, σ) is final, then R(δ, σ);

CogRobo-2004, Valencia

5

Sardi˜ na et al.

Deliberation: An EC-based Account (cont.)

We define KHowEC (δ, σ) to be the smallest relation R(δ, σ) such that: (E1) if (δ, σ) is final, then R(δ, σ); (E2) if (δ, σ) may evolve to configurations (δ 0, σ · (a, µi)) w.r.t. some models Mi with k = 1..k for some k ≥ 1, and R(δ 0, σ · (a, µi)) holds for all i = 1..k, then R(δ, σ). Obs.: always consider the underlying theory plus the observations in history σ.

CogRobo-2004, Valencia

5

Sardi˜ na et al.

E.g. Tree Chopping Problem

Agent wants to cut down a tree. Possible actions are: • chop, take one chop at the tree, and • look, which tells the agent whether the tree has fallen (i.e., senses fluent Down). It is known that tree will eventually fall, but number of chops needed is not bounded.

CogRobo-2004, Valencia

6

Sardi˜ na et al.

E.g. Tree Chopping Problem

Agent wants to cut down a tree. Possible actions are: • chop, take one chop at the tree, and • look, which tells the agent whether the tree has fallen (i.e., senses fluent Down). It is known that tree will eventually fall, but number of chops needed is not bounded. Intuitively, the following program δtc: while ¬Down do chop; look endWhile solves the problem. But . . . CogRobo-2004, Valencia

6

Sardi˜ na et al.

Formalizing the Tree Chopping E.g.: D tc • Dss contains the following successor state axiom: RemainingChops(do(a, s)) = n ≡ a = chop ∧ RemainingChops(s) = n + 1 ∨ a 6= chop ∧ RemainingChops(s) = n. • Dap contains the following 2 precondition axioms: P oss(chop, s) ≡ (RemainingChops(s) > 0),

P oss(look, s) ≡ T rue.

• Dsf contains the following sensing axiom: SF (look, s) ≡ (RemainingChops(s) = 0). • DS0 = {RemainingChops(S0 ) > 0} (initial situation). def

Down(s) = (RemainingChops(s) = 0) CogRobo-2004, Valencia

7

Sardi˜ na et al.

Tree Chopping E.g. (cont.) Intuitively, the following program δtc: while ¬Down do chop; look endWhile solves the problem. But . . .

CogRobo-2004, Valencia

8

Sardi˜ na et al.

Tree Chopping E.g. (cont.) Intuitively, the following program δtc: while ¬Down do chop; look endWhile solves the problem. But . . . Theorem 1. For all k ∈ N, KHowEC (δtc, [(chop, 1) · (look, 0)]k ) does not hold. In particular, when k = 0, KHowEC (δtc, ) does not hold. In fact, ...

CogRobo-2004, Valencia

8

Sardi˜ na et al.

Tree Chopping E.g. (cont.) Intuitively, the following program δtc: while ¬Down do chop; look endWhile solves the problem. But . . . Theorem 1. For all k ∈ N, KHowEC (δtc, [(chop, 1) · (look, 0)]k ) does not hold. In particular, when k = 0, KHowEC (δtc, ) does not hold. In fact, ... Theorem 2. Whenever KHowEC (δ, σ) holds, there is simple kind of conditional plan, a TREE program, that can be followed to execute δ in σ. which means that ...

CogRobo-2004, Valencia

8

Sardi˜ na et al.

Tree Chopping E.g. (cont.) Intuitively, the following program δtc: while ¬Down do chop; look endWhile solves the problem. But . . . Theorem 1. For all k ∈ N, KHowEC (δtc, [(chop, 1) · (look, 0)]k ) does not hold. In particular, when k = 0, KHowEC (δtc, ) does not hold. In fact, ... Theorem 2. Whenever KHowEC (δ, σ) holds, there is simple kind of conditional plan, a TREE program, that can be followed to execute δ in σ. which means that ... Corollary 1. KHowEC is only correct when problem has a bounded solution CogRobo-2004, Valencia

8

Sardi˜ na et al.

Why does KHowEC Fail?

ments

δtc

look ; δtc

(chop, 1)

(chop, 1)

(look, 0) chop

(look, 0) look (look, 0) chop (chop, 1) (chop, 1)

0 δtc

look ; δtc

[ ] chop (chop, 1)

δtc

look ; δtc

(chop, 1)

(chop, 1)

(look, 0)

1

(chop, 1) (look, 1)

(look, 0) (chop, 1)

look

1

def

δtc

δtc

(chop, 1) (look, 0) (chop, 1) (look, 1)

(chop, 1) (look, 0) (chop, 1) (look, 0) (chop, 1) (look, 1)

δtc = while ¬Down do chop; look endWhile

CogRobo-2004, Valencia

(look, 0) (chop, 1)

0

look

1 δtc √

0

9

Sardi˜ na et al.

Why does KHowEC Fail? (cont.)

If KHowEC (δtc, ), then for all j ∈ N: KHowEC (δtc, [(chop, 1) · (look, 0)]j ) and KHowEC (look; δtc, [(chop, 1) · (look, 0)]j · (chop, 1)).

CogRobo-2004, Valencia

10

Sardi˜ na et al.

Why does KHowEC Fail? (cont.)

If KHowEC (δtc, ), then for all j ∈ N: KHowEC (δtc, [(chop, 1) · (look, 0)]j ) and KHowEC (look; δtc, [(chop, 1) · (look, 0)]j · (chop, 1)). KHowEC is “taking into account” the execution where the tree never comes down!

Every finite prefix of the execution is consistent with D tc . But, the set of all of them together is not!

CogRobo-2004, Valencia

10

Sardi˜ na et al.

Deliberation: ET-based Account

To solve the problem, we consider each environment/model individually. We define KHowInM(δ, σ, M ) to be the smallest relation R(δ, σ) such that:

CogRobo-2004, Valencia

11

Sardi˜ na et al.

Deliberation: ET-based Account

To solve the problem, we consider each environment/model individually. We define KHowInM(δ, σ, M ) to be the smallest relation R(δ, σ) such that: (T1) if (δ, σ) is final, then R(δ, σ);

CogRobo-2004, Valencia

11

Sardi˜ na et al.

Deliberation: ET-based Account

To solve the problem, we consider each environment/model individually. We define KHowInM(δ, σ, M ) to be the smallest relation R(δ, σ) such that: (T1) if (δ, σ) is final, then R(δ, σ); (T2) if (δ, σ) may evolve to (δ 0, σ · (a, µ)) w.r.t. M and R(δ 0, σ · (a, µ)), then R(δ, σ). Uses truth in a model!

CogRobo-2004, Valencia

11

Sardi˜ na et al.

Deliberation: ET-based Account (cont.)

KHowET (δ, σ): iff it knows how to execute it in every model of the history (i.e., in every possible environment). Formally: iff KHowInM(δ, σ, M ) holds for every model M of the theory + sensing at σ.

CogRobo-2004, Valencia

12

Sardi˜ na et al.

Deliberation: ET-based Account (cont.)

KHowET (δ, σ): iff it knows how to execute it in every model of the history (i.e., in every possible environment). Formally: iff KHowInM(δ, σ, M ) holds for every model M of the theory + sensing at σ.

KHowET handles the tree chopping example. It can be generalized to deal with non-deterministic programs.

CogRobo-2004, Valencia

12

Sardi˜ na et al.

Lessons for Agent Programming Languages

Most agent programming languages have transition semantics and use entailment to evaluate tests and action preconditions (e.g., ConGolog, 3APL, FLUX, etc.). Most agent programming languages only do on-line reactive execution; no deliberation/lookahead.

Deliberation is only a different control regime involving search over the agent program’s transition tree. With sensing, need to find more than just a path to a final configuration, need a plan/subtree with branches for all possible sensing results. Can determine possible sensing results by checking consistency with KB. Essentially an EC-based approach. CogRobo-2004, Valencia

13

Sardi˜ na et al.

Lessons for Agent Programming Languages (cont.)

As methods for implementing deliberation in restricted cases, EC-based approaches are fine. But as a semantics or specification, we claim they are wrong. ⇓

CogRobo-2004, Valencia

14

Sardi˜ na et al.

Lessons for Agent Programming Languages (cont.)

As methods for implementing deliberation in restricted cases, EC-based approaches are fine. But as a semantics or specification, we claim they are wrong. ⇓ KHowEC gives the wrong results on problems that can’t be solved in a bounded number of actions.

What’s required is something like our ET-based account.

CogRobo-2004, Valencia

14

Sardi˜ na et al.

Summary

Agent programming languages should support planning/deliberation under incomplete information with sensing actions. We develop some non-epistemic formal models of deliberation. The obvious model is one where agent makes decisions about what to do in terms of what is entailed by or consistent with his KB. This model doesn’t work properly for problems that have no bounded solution and require iteration. We propose an alternative entailment and truth-based model that works. There are important lessons for agent programming languages in general.

CogRobo-2004, Valencia

15

Sardi˜ na et al.

Further Research

Relate this metatheoretic account of deliberation to epistemic ones. Re-state everything in terms of other agent formalisms: 3APL, AgentSpeak, FLUX, etc. Implementation of search/planning with non-binary sensing actions. More general accounts of sensing and knowledge change. Multiagent planning. Etc.

CogRobo-2004, Valencia

16

Sardi˜ na et al.

IndiGolog Semantics for Online Executions — Formal Details

Configuration (δ, σ) may evolve to (δ 0, σ 0) w.r.t. a model M (relative to an underlying theory of action D) iff (i) M is a model of D ∪ C ∪ {Sensed[σ]}, and (ii) D ∪ C ∪ {Sensed[σi]} |= T rans(δ, end[σ], δ 0, end[σ 0]) and (iii)  σ · (a, 1) if end[σ 0] = do(a, end[σ])     and M |= SF (a, end[σ])  σ · (a, 0) if end[σ 0] = do(a, end[σ]) σ0 =   and M 6|= SF (a, end[σ]).    σ if end[σ 0] = end[σ],

CogRobo-2004, Valencia

17

On Ability to Autonomously Execute Agent Programs ...

Dept. of Computer Science. University of ... Sardi˜na et al. High-Level Programs and Histories ... IndiGolog Semantics for Online Executions. Based on two ...

236KB Sizes 0 Downloads 142 Views

Recommend Documents

Agent Programming via Planning Programs
May 12, 2009 - Possibly non-terminating. • Atomic instructions: requests to “achieve goal φ while maintaining goal ψ”. • Meant to run in a dynamic domain: the ...

To autonomously recognize and manipulate ...
real time, such as the placement of a read head on a disk drive suspension or ..... proprietary code under WINE on linux. .... depth data via a point cloud.

Proprioception and imitation: on the road to agent ...
As a result, after few minutes of interactions, the robot can imitates the facial expression of the human partner. We will show the different proprioceptive signals.

Introduction to the special issue on agent-based ...
exploring the potential economic applicability of evolutionary game theory with ... In the lead-o! article, John Du!y focuses on the well-known search model of ..... with multiple agent use depends on their assumption that a server can e$ciently.

Extending Modern PaaS Clouds with BSP to Execute ...
Extending Modern PaaS Clouds with BSP to Execute Legacy. MPI Applications. Hiranya Jayathilaka, Michael Agun. Department of Computer Science. UC Santa Barbara, Santa Barbara, CA, USA. Abstract. As the popularity of cloud computing continues to in- cr

Comprehension is the ability to understand something.
available online.) · Ask your child to ... Retelling: Good readers can recall the important parts of a text in order. ... Every time we use someone‟s body language ...

Studies on combining ability studies in sesame ...
VS 9701 and CO 1 were the best combiners for five traits including seed yield per ... Recommended package of practices ... Email: [email protected].

Studies on hybrid vigour and combining ability for ... - Semantic Scholar
and combining ability analysis were carried out in line x tester model using five lines viz., Kanakamany, ... cm standard package of practices were followed with.

Studies on combining ability studies in sesame ...
crossing of nine lines viz., RT 125, VS 9701, Rama,. Uma, Cordeborega, TVS 0011, ... Email: [email protected]. Observations are days to maturity, ...

1.4.2 The fetch-execute cycle.pdf
Memory Address Register (MAR) - the address in main memory that is currently being read or. written. Memory Data Register (MDR) - a two-way register that ...

quantitative ability - Rediff.com
The following data refers to a survey carried out in four cities Mumbai, Delhi, Kolkata and Chennai regarding ... Kolkata Chennai. 1. 100. 75. 60. 80 .... A. Overcoming the notion that the black audience is not worth big advertising bucks will take t

quantitative ability - Rediff.com
D. Smiley, a BET talk show host, was instrumental in coordinating 10 weeks of protests ... or asked to give back pay and benefits for the 'good of the company'. b.

quantitative ability - Rediff.com
QUANTITATIVE ABILITY. 1. In a school of exactly 300 students, every student reads exactly 5 magazines and every magazine is read by exactly 60 students. The number of magazines is a. 25 b. 30 c. 360 d. 75. 2. If. 1. 1. 1. 1 a b c d. 3. 4. 5. 6. = = =

The effect of error correction on learners' ability to write ...
In the last decade, there has been a great deal of discussion on the value of ... Cohen (1992) offered the following rule of thumb for interpreting d: small ... One can either average all the individual effect sizes found or use only one for each stu

The effect of error correction on learners' ability to ... - Semantic Scholar
changes in students' ability to write accurately, i.e. their learning. Studies of learning look at the difference between a measure of accuracy at one time and a comparable measure done at a later time. A writing task that students do with help from

Inequality and ability
explain very little of the increase in inequality, technology is suspected to be responsible .... indicate that the OLS returns to mental skills and education are biased downward in the ... E.D. Gould / Labour Economics 12 (2005) 169–189. 171 ...

(initialization; condition; update) { execute this code } For Loops
char name[] = "milo"; for (int i = 0, j = strlen(name); i < j; i++). { name[i] = toupper(name[i]);. } Page 5. Check condition. Execute code in body if true if false.

Developmental Changes in Infants' Ability to Cope with ...
Towards the end of their first year of life, infants' overly specified word repre- ..... along different stages of language development (e.g., Werker & Curtin,. 2005).

Comprehension is the ability to understand something.
page. Their comprehension age gives the level of text that they can understand, talk about and evaluate. If your child can understand a text they can learn from it ...