On Character of the Programmed Iteration Method Convergence for Control Problems with Elements of Uncertainty Yu. Averboukh, A. G. Chentsov Institute of Mathematics and Mechanics, Ural Branch of the Russian Academy of Sciences, Ekaterinburg, Russia [email protected], [email protected]

Symposium on Functional Differential Equations , Ariel, Israel, September, 11-15, 2006.

Yu. Averboukh, A. G. Chentsov

Programmed Iteration Method

Differential game

The control systems of the form: x˙ = f (t, x, u, v), t ∈ [t0 , ϑ0 ], x ∈ Rn , u ∈ P, v ∈ Q. are considered. The variables u and v are called the controls of first and second players, respectively. Purposes Let M ⊂ [t0 , ϑ0 ] × Rn . The first player is trying to bring the system onto the set M . The second player is trying to prevent this meeting.

Yu. Averboukh, A. G. Chentsov

Programmed Iteration Method

Condition

M is a close set. P and Q are finite-dimensional compacts. the function f : [t0 , ϑ0 ] × Rn × P × Q → Rn is continuous. the function f is locally Lipschitz with respect to x. f satisfies the sublinear growth condition with respect to x. for any t ∈ [t0 , ϑ0 ], x ∈ Rn , s ∈ Rn , min max < s, f (t, x, u, v) >= max min < s, f (t, x, u, v) > . u∈P v∈Q

Yu. Averboukh, A. G. Chentsov

v∈Q u∈P

Programmed Iteration Method

Strategies and Motions Consider a position (t∗ , x∗ ). The positional (feedback) control is used. Let U : [t0 , ϑ0 ] × Rn → P be a positional strategy. Choose a partition ∆ = t∗ = τ0 < τ1 < . . . < τn = ϑ0 . Any solution of the inclusions x(t) ˙ ∈ {f (t, x(t), U (τi , x(τi )), v) : v ∈ Q}, t ∈ [τi , τi+1 ] x(t∗ ) = x∗ is called step-by-step motion. Constructive motions Limits of these step-by-step motions is called constructive motions generated by a positional strategy U and emerging from the position (t∗ , x∗ ). This formalization was suggested by N. N. Krasovskii. Yu. Averboukh, A. G. Chentsov

Programmed Iteration Method

Positional Absorption Set The structure of differential games solution is given by Krasovskii-Subbotin alternative theorem. The set of successful solvability of approach problem is called positional absorption set. The positional absorption set is the maximal u-stable bridge by the alternative theorem. Definition The set W ⊂ [t0 , ϑ0 ] × Rn is called u-stable bridge if the following conditions holds: 1

M ⊂W

2

∀v∗ ∈ Q, ∀(t∗ , x∗ ) ∈ W ∃y(·) y(t) ˙ ∈ co{f (t, x, u, v∗ ) : u ∈ P }, y(t∗ ) = x∗ , ∃θ ∈ [t∗ , ϑ0 ] : ((θ, y(θ) ∈ M )& ((t, y(t)) ∈ W, ∀t ∈ [t∗ , θ]). Yu. Averboukh, A. G. Chentsov

Programmed Iteration Method

The Programmed Iteration Method Let E ⊂ [t0 , ϑ0 ] × Rn . A(E) , {(t, x) ∈ E| for all controls v(t) there exists a solution of differential inclusion y(t) ˙ ∈ co{f (t, y(t), u, v(t))|u ∈ P }, y(t) = x such that (θ, y(θ)) ∈ M for some θ ∈ [t, ϑ0 ] and (t, y(t)) ∈ E ∀t ∈ [t, θ] } A(E) , {(t, x) ∈ E| for all constant control v ∗ there exists a solution of differential inclusion y(t) ˙ ∈ co{f (t, y(t), u, v ∗ )|u ∈ P }, y(t) = x such that (θ, y(θ)) ∈ M for some θ ∈ [t, ϑ0 ] and (t, y(t)) ∈ E ∀t ∈ [t, θ] }

Yu. Averboukh, A. G. Chentsov

Programmed Iteration Method

The sequences of sets Definition W (0) , [t0 , ϑ] × Rn , W (k) = A(W (k−1) ), k > 0. W0 , [t0 , ϑ] × Rn , Wk = A(Wk−1 ), k > 0. Properties W (k) ↓ W. Wk ↓ W. W – Positional Absorption set. W (k) ⊂ Wk . Yu. Averboukh, A. G. Chentsov

Programmed Iteration Method

Results:

Let M be a compact set. Then 1

2

3

Sequences W (k) and Wk convergence to W in Hausdorff metric. Either W[t] = ∅ and there exists K, such that W (k) [t] = ∅, Wk [t] = ∅ for any k > K, or W (k) [t] 6= ∅, Wk [t] 6= ∅ for any natural k and W[t] 6= ∅. Let t ∈ [t0 , ϑ0 ] be a moment such that W[t] 6= ∅. In this case the Hausdorff convergence of W (k) [t] and Wk [t] to W[t] takes place. E[t] , {x ∈ Rn |(t, x) ∈ E}.

Yu. Averboukh, A. G. Chentsov

Programmed Iteration Method

Extremal Shift to an Unstable Set (Analog of Krasovskii-Subbotin rule) Let (t∗ , x∗ ) be a position, ∆ = {τi }N i=0 be a partition of the segment [t∗ , ϑ0 ]. Formation of control by first player Let xi be a location of the system at the moment τi , and let (k) (k) yi be a closest element of W (k) [τi ] to the xi . The control ui is defined by the rule: (k) (k) max < yi − xi , f (τi , xi , ui , v) >= v∈Q (k) = min max < yi − xi , f (τi , xi , u, v) > . u∈P v∈Q

Motions The motion on [τi , τi+1 ] isZdefined as a solution of equation: t (k) x(t) = xi + f (ξ, x(t), ui , v[ξ])dξ. τi

Function v[·] is a second player control. Yu. Averboukh, A. G. Chentsov

Programmed Iteration Method

Extremal Shift to an Unstable Set Theorem Let τ∗ ∈ I0 be a moment such that W[τ∗ ] 6= ∅, and let ε > 0. Then there exist δ > 0 such that for any partition ∆ = {τi }N i=0 of segment [τ∗ , ϑ0 ], satisfying the condition max (τi+1 − τi ) ≤ δ,

i=0,N −1

one can choose J ∈ N with property for all j > J and x∗ ∈ W (j) [τ∗ ] ∃θ ∈ [τ∗ , ϑ0 ] : d[x[θ], M [θ]] ≤ ε. Here x[·] is a motion defined by a extremal to the set W (j) shift rule. d(x, A) is a distance between x and set S ⊂ Rn . Yu. Averboukh, A. G. Chentsov

Programmed Iteration Method

The End

Questions?

Yu. Averboukh, A. G. Chentsov

Programmed Iteration Method

1On Character of the Programmed Iteration Method ...

The sequences of sets. Definition. W(0). [t0,ϑ] × Rn,. W(k) = A(W(k−1)), k> 0. W0. [t0,ϑ] × Rn,. Wk = A(Wk−1), k> 0. Properties. W(k) ↓ W. Wk ↓ W. W – Positional Absorption set. W(k) ⊂ Wk. Yu. Averboukh, A. G. Chentsov. Programmed Iteration Method ...

189KB Sizes 1 Downloads 164 Views

Recommend Documents

Explanation of the HDFE iteration with 3 FEs - GitHub
Jul 30, 2014 - DVs, we would have to create a lot of dummies which is impractical (b/c of out of memory errors, etc.). Instead, we'll apply a fixed-point iteration ...

An Iteration Method.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. An Iteration ...

Random Iteration of Rational Functions
For the case of an RDS consisting of rational functions, there is only one previously known result due to Jonsson ('00):. Theorem (Jonsson, '00). Suppose that (T,Ω,P,θ) is a random dynamical system on ̂C consisting of rational functions Tω, such

Metric regularity of Newton's iteration
Under ample parameterization, metric regularity of the mapping associated with convergent Newton's ... programs “José Castillejo” and “Juan de la Cierva.”.

A new handwritten character segmentation method ...
Mar 29, 2012 - Telecommunications and Pattern Recognition and Intelligent. System Laboratory. The database contains 3755 frequently used simplified ...

Safety, Explanation, Iteration
Good: Appeal to some general principles about gravity and potential energy, to show ... together with non-mental environmental conditions (e.g., facts about the ..... argument.16 But if there is a well-motivated alternative conception of safety, ...

Rollout sampling approximate policy iteration
Jun 22, 2008 - strated experimentally in two standard reinforcement learning domains: inverted pendulum and mountain-car. ... Recent studies have investigated the use of super- ..... could be replaced by fresh ones which might yield meaningful result

Comparison of fine structural mice via coarse iteration
Apr 30, 2014 - α+1, we associate an ex- tender F∗ α+1 = E∗ ↾ β where β is .... (i) iU (Cn(NΛ)) ⊴ IT ; or. (ii) bT does not drop in model or degree and IT = Cm(N.

A MODIFICATION OF MANN'S ITERATION SCHEME ...
Key words and phrases: Continuous pseudo-contractive mappings, strong and weak convergence, Mann's iteration, errors. 1. Introduction. Let E be a real Banach space with dual E∗ and K a non-empty closed convex subset of E. Let J : E → 2E* denote t

The Method of Punctured Containers - Semantic Scholar
Feb 12, 2007 - circular arc of radius a is dilated into an elliptic arc with horizontal semi axis a and vertical semi axis ...... E-mail address: [email protected].

Method for the production of levulinic acid
Nov 8, 1996 - tives are of interest for conversion into pyridaZiones and for the preparation of .... method results in a high yield of sugars as an intermediate.

The Method of Punctured Containers - Semantic Scholar
Feb 12, 2007 - This geometric method extends to more general solids we call ..... The same is true for the slice of the shell and for the center of mass of a.

Emerson, The Method of Nature.pdf
Emerson, The Method of Nature.pdf. Emerson, The Method of Nature.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying Emerson, The Method of ...

The connected locus for complex cubic iteration - Mark McClure
SupplementaryFiles directory, which should come with this notebook. The initialization .... 9 » a »2 +2 M, then the orbit of z0 will diverge to ¶. (See [4], page 266.) ...

The connected locus for complex cubic iteration - Mark McClure
SupplementaryFiles directory, which should come with this notebook. The initialization .... 9 » a »2 +2 M, then the orbit of z0 will diverge to ¶. (See [4], page 266.) ...

Partitioned External-Memory Value Iteration
External Memory Value Iteration To solve the scala- bility problem, Edelcamp ..... is another grid world, which is a popular testbed in reinforcement learning. The.

Focused Topological Value Iteration
Classical dynamic programming algorithms, such as ..... Dynamic program- ..... Number of goal states. R unn ing tim e. (s econd s). ILAO*. BRTDP. TVI. FTVI. 0. 5.