Approximate Constraint Satisfaction requires Large LP Relaxations David Steurer Cornell

Siu On Chan MSR

James R. Lee Washington TCS+ Seminar, December 2013

Prasad Raghavendra Berkeley

best-known (approximation) algorithms for many combinatorial optimization problems:

Max Cut, Traveling Salesman, Sparsest Cut, Steiner Tree, …

common core = linear / semidefinite programming (LP/SDP)

LP / SDP relaxations particular kind of reduction from hard problem to LP/SDP running time: polynomial in size of relaxation

what guarantees are possible for approximation and running time?

example: basic LP relaxation for Max Cut Max Cut: Given a graph, find bipartition 𝑥 ∈ ±1 that cuts as many edges as possible

maximize subject to

1 𝐸

𝑥𝑖 = 1

𝑥𝑖 = −1

𝑛

intended solution

𝜇𝑖𝑗 𝑖𝑗∈𝐸

𝜇𝑖𝑗 − 𝜇𝑖𝑘 − 𝜇𝑘𝑗 ≤ 0

𝜇𝑥

𝑖𝑗

1, if 𝑥𝑖 ≠ 𝑥𝑗 , = 0, otherwise.

𝜇𝑖𝑗 + 𝜇𝑖𝑘 + 𝜇𝑘𝑗 ≤ 2

integer linear program

𝜇𝑖𝑗 ∈ 0,1

(relax integrality constraint)

𝜇𝑖𝑗 ∈ 0,1

𝑂 𝑛3 inequalities

approximation guarantee

depend only on instances size (but not instance itself)

optimal value of instance vs. optimal value of LP relaxation

challenges many possible relaxations for same problem small difference syntactically  big difference for guarantees goal: identify “right” polynomial-size relaxation hierarchies = systematic ways to generate relaxations best-known: Sherali-Adams (LP), sum-of-squares/Lasserre (SDP); best possible? goal: compare hierarchies and general LP relaxations often: more complicated/larger relaxations  better approximation P ≠ NP predicts limits of this approach; can we confirm them? goal: understand computational power of relaxations Rule out that poly-size LP relaxations show 𝐏 = 𝐍𝐏?

hierarchies

[Lovász–Schrijver, Sherali–Adams, Parrilo / Lasserre]

great variety (sometimes different ways to apply same hierarchy) current champions: Sherali–Adams (LP) & sum-of-squares / Lasserre (SDP) connections to proof complexity

(Nullstellensatz and Positivstellensatz refutations)

lower bounds Sherali-Adams requires size 2𝑛

[Mathieu–Fernandez de la Vega Charikar–Makarychev–Makarychev] Ω 1

to beat ratio ½ for Max Cut [Grigoriev, Schoenebeck]

sum-of-squares requires size 2Ω

𝑛

to beat ratio 7

8

for Max 3-Sat

upper bounds implicit: many algorithms (e.g., Max Cut and Sparsest Cut) explicit: Coloring, Unique Games, Max Bisection

[Goemans-Williamson, Arora-Rao-Vazirani]

[Chlamtac, Arora-Barak-S., Barak-Raghavendra-S., Raghavendra-Tan]

lower bounds for general LP formulations (extended formulations) characterization; symmetric formulations for TSP & matching [Yannakakis’88] [Fiorini–Massar–Pokutta –Tiwary–de Wolf’12]

general, exact formulations for TSP & Clique approximate formulations for Clique

[Braun–Fiorini–Pokutta–S.’12 Braverman–Moitra’13]

general, exact formulation for maximum matching geometric idea:

complicated polytopes can be projections of simple polytopes

[Rothvoß’13]

universality result for LP relaxations of Max CSPs

[this talk]

general polynomial-size LP relaxations are no more powerful than polynomial-size Sherali-Adams relaxations concrete consequences

also holds for almost quasi-polynomial size unconditional lower bound in powerful computational model

confirm non-trivial prediction of P≠NP: poly-size LP relaxations cannot achieve 0.99 approximation for Max Cut, Max 3-Sat, or Max 2-Sat (NP-hard approximations)

approximability and UGC: poly-size LP relaxation cannot refute Unique Games Conjecture (cannot improve current Max CSP approximations)

separation of LP relaxation and SDP relaxation: poly-size LP relaxations are strictly weaker than SDP relaxations for Max Cut and Max 2Sat

universality result for LP relaxations of Max CSPs

[this talk]

general polynomial-size LP relaxations are no more powerful than polynomial-size Sherali-Adams relaxations also holds for almost quasi-polynomial size

for concreteness: focus on Max Cut notation: cut 𝐺 𝑥 = fraction of edges that bipartition 𝑥 cuts in 𝐺 Max Cut 𝑛 = Max Cut instances / graphs on 𝑛 vertices compare: general 𝑛 1−𝜀 𝑑 -size LP relaxation for Max Cut 𝑛 vs. 𝑛𝑑 -size Sherali-Adams relaxations for Max Cut 𝑛

general LP relaxation for 𝐌𝐚𝐱 𝐂𝐮𝐭 𝐧

example linearization 1 𝐸

𝐿𝐺 𝜇 =

linearization 𝐺 ↦ 𝐿𝐺 : ℝ𝑚 → ℝ linear 𝑥 ↦ 𝜇𝑥 ∈ ℝ𝑚

polytope of size R 𝑃𝑛 ⊆ ℝ𝑚 , at most 𝑅 facets, 𝜇𝑥 𝑥∈ ±1 𝑛 ⊆ 𝑃𝑛

𝜇𝑥

such that

𝑖𝑗

1, 0,

=

𝑖𝑗∈𝐸 𝜇𝑖𝑗

if 𝑥𝑖 ≠ 𝑥𝑗 , otherwise.

𝐿𝐺 𝜇x = cut 𝐺 𝑥

𝜇𝑥

.

𝑃𝑛 ℝ𝑚 same polytope for all instances of size 𝑛 makes sense because solution space for Max Cut depends only on 𝑛

computing with size-𝑹 LP relaxation 𝓛 input

computation

output

graph G on n vertices

maximize 𝐿𝐺 𝜇 subject to 𝜇 ∈ 𝑃𝑛

value ℒ 𝐺 = max 𝐿𝐺 𝜇

poly(𝑅)-time computation

approximation ratio 𝛼

𝜇∈𝑃

always upper-bounds Opt G how far in the worst-case?

𝑐, 𝑠 -approximation

ℒ 𝐺 ≤ 𝛼 ⋅ Opt 𝐺

Opt 𝐺 ≤ 𝑠 ⇒ ℒ 𝐺 ≤ 𝑐

for all 𝐺 ∈ Max Cut 𝑛

for all 𝐺 ∈ Max Cut 𝑛

general computational model—how to prove lower bounds?

geometric characterization (à la Yannakakis’88) every size-R LP relaxation 𝓛 for Max Cu𝐭 𝒏 corresponds to nonnegative functions 𝒒𝟏 , … , 𝒒𝑹 : ±1 𝑛 → ℝ≥0 such that ℒ 𝐺 ≤𝑐

iff

𝑐 − cut 𝐺 =

for all 𝐺 ∈ Max Cut 𝑛 example 2𝑛 standard basis functions correspond to exact 2𝑛 -size LP relaxation for Max Cut 𝑛

𝑟 𝜆𝑟 𝑞𝑟

and 𝜆1 , … , 𝜆𝑅 ≥ 0

certifies cut 𝐺 ≤ 𝑐 over ±1 canonical linear program of size 𝑅

𝑛

geometric characterization (à la Yannakakis’88) every size-R LP relaxation 𝓛 for Max Cu𝐭 𝒏 corresponds to nonnegative functions 𝒒𝟏 , … , 𝒒𝑹 : ±1 𝑛 → ℝ≥0 such that ℒ 𝐺 ≤𝑐 for all 𝐺 ∈ Max Cut 𝑛

iff

𝑐 − cut 𝐺 =

𝑟 𝜆𝑟 𝑞𝑟

and 𝜆1 , … , 𝜆𝑅 ≥ 0

intuition: all inequalities for functions on ±1 with local proofs

connection to Sherali-Adams hierarchy 𝑛𝑑 -size Sherali-Adams relaxation for Max Cut 𝑛 exactly corresponds to

𝑑-junta = function on ±1 𝑛 depends on ≤ d coordinates

nonnegative combinations of nonnegative 𝑑-juntas on ±1 𝑛 )

n

geometric characterization (à la Yannakakis’88) every size-R LP relaxation 𝓛 for Max Cu𝐭 𝒏 corresponds to nonnegative functions 𝒒𝟏 , … , 𝒒𝑹 : ±1 𝑛 → ℝ≥0 such that ℒ 𝐺 ≤𝑐

iff

𝑐 − cut 𝐺 =

𝑟 𝜆𝑟 𝑞𝑟

and 𝜆1 , … , 𝜆𝑅 ≥ 0

for all 𝐺 ∈ Max Cut 𝑛

𝑐 − cut 𝐺

cone 𝑞1 , … , 𝑞𝑅 = 𝑟 𝜆𝑟 𝑞𝑟 𝜆𝑟 ≥ 0 to rule out (c,s)-approx. by size-R LP relaxation, show:

for every size-𝑅 nonnegative cone, exists 𝐺 ∈ Max Cut 𝑛 with Opt 𝐺 ≤ 𝑠 but 𝑐 − cut 𝐺 outside of cone

lower-bound for Sherali–Adams relaxations of size 𝑛𝑑

lower-bounds for size-𝑛𝑑 nonneg. cones with restricted functions

𝑑-juntas

𝑛𝜀 -juntas

non-spiky

lower-bound for general LP relaxations of size 𝑛

general

1−𝜀 𝑑

from 𝒅-juntas to 𝒏𝜺 -juntas let 𝑞1 , … , 𝑞𝑅 be nonneg. 𝑛𝜀 -juntas on ±1 want:

𝑛

for 𝑅 = 𝑛

1−10𝜀 𝑑

subset 𝑆 ⊆ 𝑛 of size 𝑚 ≈ 𝑛𝜀 where functions behave like 𝑑-juntas

let 𝐽1 , … , 𝐽𝑅 be junta-coordinates of 𝑞1 , … , 𝑞𝑅

[n]

claim: there exists subset 𝑆 ⊆ [𝑛] of size 𝑚 = 𝑛𝜀 such that 𝐽𝑟 ∩ 𝑆 ≤ 𝑑 for all 𝑟 ∈ 𝑅 proof: choose 𝑆 at random ℙ 𝑆 ∩ 𝐽𝑟 > 𝑑 ≤

𝑆 𝑛

⋅ 𝐽𝑟

𝑑

= 𝑛−

1−2𝜀 𝑑

 can afford union bound over 𝑅 junta sets 𝐽1 , … , 𝐽𝑅

𝐽𝑛𝑑/2

𝐽1

𝐽2

S 𝐽3

𝐽4

lower-bound for Sherali–Adams relaxations of size 𝑛𝑑

lower-bounds for size-𝑛𝑑 nonneg. cones with restricted functions

𝑑-juntas

𝑛𝜀 -juntas

non-spiky

lower-bound for general LP relaxations of size 𝑛

general

1−𝜀 𝑑

from 𝒏𝜺 -juntas to non-spiky functions let 𝑞 be a nonnegative function on ±1 𝑛 with 𝔼𝑞 = 1 non-spiky: max 𝑞 ≤ 2𝑡 small low-degree junta structure lemma: can approximate 𝑞 by nonnegative 𝑛𝜀 -junta 𝑞′, Fourier coefficients error 𝜂 = 𝑞 − 𝑞′ satisfies 𝜂𝑆 2 ≤ 𝑡𝑑/𝑛𝜀 for 𝑆 < 𝑑

proof: nonnegative function 𝑞 non-spiky

 probability distribution over ±1 𝑛 , +1/-1 rand. variables 𝑋1 , … , 𝑋𝑛 (dependent)  entropy 𝐻 𝑋1 , … , 𝑋𝑛 ≥ 𝑛 − 𝑡

want: 𝐽 ⊆ [𝑛] of size 𝑛𝜀 such that ∀𝑆 ⊈ 𝐽.

𝑋𝑆 ∣ 𝑋𝐽 ≈ uniform, that is, 𝑡𝑑 ( 𝑆 < 𝑑) S − 𝐻 𝑋𝑆 𝑋𝐽 ≤ 𝛽 for 𝛽 = 𝑛𝜀 construction: start with 𝐽 = ∅; as long as bad 𝑆 exists, update 𝐽 ← 𝐽 ∪ 𝑆 𝑡 𝛽

analysis: total entropy defect ≤ 𝑡  stop after iterations  𝐽 ≤

𝑑𝑡 𝛽

= 𝑛𝜀

lower-bound for Sherali–Adams relaxations of size 𝑛𝑑

lower-bounds for size-𝑛𝑑 nonneg. cones with restricted functions

𝑑-juntas

𝑛𝜀 -juntas

non-spiky

lower-bound for general LP relaxations of size 𝑛

general

1−𝜀 𝑑

from non-spiky functions to general functions let 𝑞1 , … , 𝑞𝑅 be general nonneg. functions on ±1

𝑛

for 𝑅 = 𝑛𝑑

non-spiky

claim: exists nonneg. 𝑞1′ , … , 𝑞𝑅′ such that 𝑞𝑖′ ≤ 𝑛2𝑑 , 𝔼𝑞𝑖′ = 1 and cone 𝑞1 , … , 𝑞𝑅 ≈ cone(𝑞1′ , … , 𝑞𝑅′ ) proof: truncate functions carefully intuition: 𝑐 − cut 𝐺 is non-spiky. Thus, spiky 𝑞𝑖 don’t help!

lower-bound for Sherali–Adams relaxations of size 𝑛𝑑

lower-bounds for nonneg. cones of size 𝑛𝑑 with restricted functions

𝑑-juntas

𝑛𝜀 -juntas

non-spiky

lower-bound for general LP relaxations of size 𝑛

general

1−𝜀 𝑑

open problems 1. LP size 𝟐𝒏

𝜺

2. beyond CSPs (e.g., TSP)

3. SDPs

lower-bound for Sherali–Adams relaxations of size 𝑛𝑑

lower-bounds for nonneg. cones of size 𝑛𝑑 with restricted functions

𝑑-juntas

𝑛𝜀 -juntas

non-spiky

general

Lower-bound for general LP relaxations of size 𝑛 1−𝜀 𝑑 Recent: for symmetric relaxations [Lee-Raghavendra-S.-Tan’13]

Thank you!

open problems 1. LP size 𝟐𝒏

𝜺

2. beyond CSPs (e.g., TSP)

3. SDPs

Approximate Constraint Satisfaction requires Large LP ...

many possible relaxations for same problem small difference syntactically → big difference for guarantees goal: identify “right” polynomial-size relaxation.

738KB Sizes 1 Downloads 153 Views

Recommend Documents

Validation of a constraint satisfaction neural network for ...
In addition, the effect of missing data was evaluated in more detail. Medical Imaging .... Receiver Operating Characteristics (ROC) analysis. We used the ROCKIT ...

Partial and Fuzzy Constraint Satisfaction to Support ...
challenges to the integration of Information Systems. Coalitions ... formation of such coalitions and their security management are time-consuming and error prone, if we .... 2.1 Problem formulation - Shared resources access example. Through ...

Partial and Fuzzy Constraint Satisfaction to Support ...
challenges to the integration of Information Systems. Coalitions ... Electronic Notes in Theoretical Computer Science 179 (2007) 75–86. 1571-0661/$ ..... D is a best solution if the degree of joint satisfaction of all the constraints. C((c1,c2, ..,

New exact algorithms for the 2-constraint satisfaction ...
bound on the size of a vertex separator for graphs in terms of the average degree of the graph. We then design a simple algorithm solving MAX-2-CSP in time O∗(2cdn), cd = 1 − 2α ln d d for some α < 1 and d = o(n). Keywords: exact exponential ti

Interactive Activation and Mutual Constraint Satisfaction ...
ISSN: 0364-0213 print / 1551-6709 online ... cDepartment of Psychology, University of Maryland ..... 2b, shows the time course of activation, demonstrating.

Computational Efficiency Requires Simple Taxation
Aug 2, 2016 - enough domains the taxation complexity is at most the communication ..... Theorem: Let A be an n-player truthful mechanism in any domain.

Approximate Time-Optimal Control via Approximate ...
and µ ∈ R we define the set [A]µ = {a ∈ A | ai = kiµ, ki ∈ Z,i = 1, ...... [Online]. Available: http://www.ee.ucla.edu/∼mmazo/Personal Website/Publications.html.

Channel Coding LP Decoding and Compressed Sensing LP ...
Los Angeles, CA 90089, USA ... San Diego State University. San Diego, CA 92182, ..... matrices) form the best known class of sparse measurement matrices for ...

LP rankwise.pdf
MARY`S L P S. KUZHIKKATTUSSERY A 226. 35 2 ANGEL MARIYA WILSON 23510 - ST. J B C L P S ALOOR A 211. 36 3 JOHN K.B 23539 - ST. PAUL`S L P S.

LP HIPERTENSI.pdf
Darah tersebut mengalir melalui. vena kava superior, vena kava inferior, serta sinus koronariusyang. berasal dari jantung sendiri. Kemudian darah dipompakan ...

lp general.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. lp general.pdf.Missing:

LP TELUGU.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. LP TELUGU.pdf.

LP Gen.pdf
Kozhikode C. 13 RINAS K P 17229 - S. R. K. M. L. P. S C. 14 DILSHA KV 17207 - G. L. P. S. Kannancheri C. 15 AMINA MINNATH .S.V. 17212 - G. L. P. S. Pallikandi C. 16 AJAY T K 17226 - Mathrubhandhu V. S. L. P. S C. 17 ABHIRAMI.T 17253 - St. Angelas A.

Stabilizing climate requires near-zero emissions
curvefit to observed temperature data from 1880 to 2005. From 2005 to 2500, ..... Matthews, H. D., Emissions targets for CO2 stabilization as mod- ified by carbon ...

Self-deception requires vagueness
The task was a video game in which players started with the cursor on the left side of a computer screen and moved the cursor as fast as possible to a .... and higher cognitive capacities share underlying computa- tional processes which cause ...

lp general overall.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. lp general ...

LP GRADEWISE lp.pdf
Page 3 of 8. LP GRADEWISE lp.pdf. LP GRADEWISE lp.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying LP GRADEWISE lp.pdf.

Support Constraint Machines
by a kernel-based machine, referred to as a support constraint machine. (SCM) ... tor machines. 1 Introduction. This paper evolves a general framework of learning aimed at bridging logic and kernel machines [1]. We think of an intelligent agent actin

lp 111.pdf
13093 - Govt.Higher Secondary School. Vayakkara(Kannur) 0 B 3. 27 VINAYRAJ.P.V. 13905 - ERAMAM SOUTH L P S(Kannur) 0 B 3. 28 VARSHA.K.V 13904 ...