On the Configuration-LP for Scheduling on Unrelated Machines José Verschae and Andreas Wiese Technische Universität Berlin, Straße des 17. Juni 136, 10623 Berlin, Germany {verschae,wiese}@math.tu-berlin.de

Abstract. Closing the approximability gap between 3/2 and 2 for the minimum makespan problem on unrelated machines is one of the most important open questions in scheduling. Almost all known approximation algorithms for the problem are based on linear programs (LPs). In this paper, we identify a surprisingly simple class of instances which constitute the core difficulty for LPs: the so far hardly studied unrelated graph balancing case in which each job can be assigned to at most two machines. We prove that already for this basic setting the strongest known LP-formulation – the configuration-LP – has an integrality gap of 2, matching the best known approximation factor for the general case. This points towards an interesting direction of future research. The result is shown by a sophisticated construction of instances, based on deep insights on two key weaknesses of the configuration-LP. For the objective of maximizing the minimum machine load in the unrelated graph balancing setting we present an elegant purely combinatorial 2-approximation algorithm with only quadratic running time. Our algorithm uses a novel preprocessing routine that estimates the optimal value as good as the configuration-LP. This improves on the computationally costly LP-based (2 + ε)-approximation algorithm by Chakrabarty et al. [6].

1

Introduction

The problem of minimizing the makespan on unrelated machines, usually denoted R||Cmax , is one of the most prominent and important problems in the area of machine scheduling. In this setting we are given a set of n jobs and a set of m unrelated machines to process the jobs. Each job j requires pi,j ∈ N+ ∪{∞} time units of processing if it is assigned to machine i. The scheduler must find an assignment of jobs to machines with the objective of minimizing the makespan, i. e., the largest completion time of a job. In a seminal work, Lenstra, Shmoys, and Tardos [16] give a 2-approximation algorithm based on a natural LP-relaxation. On the other hand, they show that the problem is N P -hard to approximate within a better factor than 3/2, unless 

This work was partially supported by Berlin Mathematical School (BMS) and by the DFG Focus Program 1307 within the project “Algorithm Engineering for Real-time Scheduling and Routing”.

C. Demetrescu and M.M. Halldórsson (Eds.): ESA 2011, LNCS 6942, pp. 530–542, 2011. c Springer-Verlag Berlin Heidelberg 2011 

On the Configuration-LP for Scheduling on Unrelated Machines

531

P = N P . Reducing this gap is considered to be one of the most important open questions in the area of machine scheduling [20] and it has been opened for more than 20 years. The best known approximation algorithms for this problem and its special cases are derived by linear programming techniques [8,16,23]. A special role plays the configuration-LP (which has been successfully used for Bin-Packing [13] and was first used for a scheduling problem by Bansal and Sviridenko [4]). It is the strongest linear program for the problem considered in the literature and it implicitly contains a vast class of inequalities. In fact, for the most relevant cases of R||Cmax the best known approximation factors match the best known upper bounds on the integrality gap of the configuration-LP. Given the apparent difficulty of this problem, researchers have turned to consider simpler cases. One special case that has drawn a lot of attention is the restricted assignment problem. In this setting each job can only be assigned to a subset of machines, and it has the same processing time on all its available machines. That is, the processing times pi,j of a job j equal either a machine-independent processing time pj ∈ N+ or infinity. Surprisingly, the best known approximation algorithm for this problem continues to be the 2-approximation algorithm by Lenstra et al. [16]. However, Svensson [23] shows that the configuration-LP has an integrality gap of 33/17 ≈ 1.94. Thus, it is possible to compute in polynomial time a lower bound that is within a factor 33/17 + ε to the optimum. However, no polynomial time algorithm is known to construct an α-approximate solution for α < 2. The restricted assignment case seems to capture the complexity of the general case to a major extend. However, we show that the core complexity of the problem, in terms of the configuration-LP, lies in the unrelated graph balancing case, where each job can be assigned to at most two machines, but with possibly different processing times on each of them. In the second part of this paper we study a different objective function which has been actively studied by the scheduling community in recent years, see e. g. [1,3,4,6]. In the MaxMin-allocation problem we are also given a set of jobs, a set of unrelated machines, and processing times pi,j as before. The load of a machine i, denoted by i , is the sum of the processing times assigned to machine i. The objective is to maximize the minimum load of the machines, i. e., to maximize mini i . The idea behind this objective function is a fairness property: Consider that jobs represent resources that must be assigned to machines. Each machine i has a personal valuation of job (resource) j, namely pi,j . The objective of maximizing the minimum machine load is equivalent to maximizing the total valuation of the machine that receives the least total valuation. 1.1

Related Work

Minimum Makespan Problem. Besides the paper by Lenstra et al. [16] that we have already mentioned, there has not been much progress on how to diminish the approximability gap for R||Cmax . Shchepin and Vakhania [21] give a more sophisticated rounding procedure for the LP by Lenstra et al. and improve the

532

J. Verschae and A. Wiese

approximation guarantee to 2 − 1/m, which is best possible among all rounding algorithms for this LP. On the other hand, Gairing, Monien, and Woclaw [11] propose a more efficient combinatorial 2-approximation algorithm based on unsplittable flow techniques. Also, the preemptive version of this problem has been studied [7,14]. An important special case is the restricted assignment case. Besides the result by Svensson [23] mentioned above, some special cases are studied depending on the structure of the machines and the processing times of the jobs [17,18]. For the graph balancing case—where each job can be processed on at most two machines with the same processing time on both machines—Ebenlendr, Krčál, and Sgall [8] give a 1.75-approximation algorithm based on an tighter version of the LP-relaxation by Lenstra et al. [16]. They also show that the problem is N P -hard to approximate with a better factor than 3/2, which matches the best known N P -hardness result for the general case. If the underlying graph is a tree, there is an FPTAS [15]. In this paper we consider a slight generalization of the graph balancing problem, where each job can have different processing times on its two available machines. We call this setting the unrelated graph balancing problem. To the best of our knowledge, this problem has not been studied in its own right before and, hence, all results for this case follow from the general case. Generalized Assignment. A well-known generalization of the minimum makespan problem is the generalized assignment problem. In this setting we are given a budget B, and jobs have machine-dependent processing times and costs. The objective is to find a schedule minimizing the makespan among all schedules with cost at most B. Shmoys and Tardos [22] extend the rounding procedure given by Lenstra et al. [16] to this setting, obtaining a 2-approximation algorithm for this problem. MaxMin-Allocation Problem. The MaxMin-allocation problem has drawn a lot of attention recently. In contrast to R||Cmax , for the general case Bansal and Sviridenko√[4] show that the configuration-LP has a super-constant integrality gap of Ω( m). On the other hand, Asadpour and Saberi [3] show constructively that this is tight up √ to logarithmic factors and provide an algorithm with approximation ratio O( m log3 m). Relaxing the bound on the running time, Chakrabarty, Chuzhoy, and Khanna [6] present a poly-logarithmic approximation algorithm that runs in quasi-polynomial time. For the restricted assignment case of MaxMin-allocation, the integrality gap of the configuration-LP is much better. In a series of papers, it is shown to be at most O(log log m/ log log log m) by Bansal et al. [4], in O(1) by Feige [10], and bounded by 5 and subsequently by 4 by Asadpour, Feige, and Saberi [1,2]. Only the first bound is given by a constructive proof. However, Haeupler et al. [12] make the proof by Feige [10] constructive, yielding a polynomial time constant factor approximation algorithm. For the unrelated graph balancing case of MaxMin-allocation (each job can be assigned to at most two machines with possibly different execution times on them) Bateni et al. [5] give a 4-approximation algorithm. Chakrabarty et al. [6] improve this result by showing that the configuration-LP has an integrality gap

On the Configuration-LP for Scheduling on Unrelated Machines

533

of 2, yielding a (2 + ε)-approximation algorithm. Moreover, it is N P -hard to approximate even this special case with a better ratio than 2 [5,6]. This bound matches the best known complexity result for the general case. Interestingly, the case that every job can be assigned to at most three machines is essentially equivalent to the general case [5]. Bin-packing. In the bin-packing problem we are given a list of n items that need to be assigned to the least amount of unit size bins, without overloading them. For this problem Karamarkar, and Karp [13] show that an analogue to the Configuration-LP approximates the optimal value up to an additive error of O(log2 (n)). It has been further conjectured that this additive error is at most one [9,19]. 1.2

Our Contribution

Almost all known approximation algorithms for R||Cmax and its special cases are based on linear programs [8,16,23]. The strongest known LP is the configurationLP which implicitly contains a vast class of inequalities. In this paper, we identify a surprisingly basic class of instances which captures the core complexity of the problem for LPs: the unrelated graph balancing setting. We show that even the configuration-LP has an integrality gap of 2 in the unrelated graph balancing setting and hence cannot help to improve the best known approximation factor. Interestingly, if one additionally requires that each job has the same processing time on its two machines, the integrality gap of the configuration-LP is at most 1.75 (implicitly in [8]). We prove our result by presenting a sophisticated family of instances for which the configuration-LP has an integrality gap of 2. The instances have two novel technical properties which together lead to this large integrality gap. The first property is the usage of gadgets that we call high-lowgadgets. These gadgets form the seed of the inaccuracy of the configurationLP. Secondly, the machines of our instances are organized in a large number of layers. Through the layers, the introduced inaccuracy is amplified such that the integrality gap reaches 2. To the best of our knowledge, the unrelated graph balancing case has not been considered in its own right before. Therefore, our result points to an interesting direction of future research to eventually improve the approximation factor of 2 for the general case. We note that for the restricted assignment case the configuration-LP has an integrality gap of 33/17 < 2 [23]. We conclude that—at least for the configuration-LP—the restricted assignment case is easier than the unrelated graph balancing case. Table 1 shows an overview of the integrality gap of the configuration-LP for the respective cases. Only few approximation algorithms are known for scheduling unrelated machines which do not rely on solving a linear program. As seen above, LP-based algorithms have certain limitations and can be costly to solve. It is then preferable to have combinatorial algorithm with lower running times. For the unrelated graph balancing case of the MaxMin-allocation problem, we present an elegant combinatorial approximation algorithm with only quadratic running time and an approximation guarantee of 2. In the algorithm we use a new method of preprocessing that simplifies the complexity of a given instance and also yields a lower bound on the optimal makespan. This lower bound is as strong as the

534

J. Verschae and A. Wiese

Table 1. The integrality gap of the configuration-LP for R||Cmax in the respective scenarios. Job degree: maximum number of machines on which a job can have a finite processing time. Proc. times: whether each job has the same processing time on all its available machines.

hhhh hhhh Job Degree Processing Times hhhhh h Arbitrary pi,j Restricted Assignment

≤2

Unbounded 2 3 2

,

 33 17

[8,16,23]

3 2

,

2  7 4

[8]

worst case bound given by the configuration-LP. Then, we introduce a novel construction of a bipartite graph to link pairs of jobs which can be assigned to the same machine. A coloring for the graph then implies an assignment of the jobs to machines which ensures the claimed approximation factor. This improves on the LP-based (2 + ε)-approximation algorithm by Chakrabarty et al. [6]. Their algorithm resorts to the ellipsoid method to approximately solve the configurationLP with its exponentially many variables. Our new algorithm is fast, very simply to implement, and moreover best possible, unless P = N P .

2

LP-Based Approaches

In this section we revise the classical LP-formulation by Lenstra et al. [16] as well the configuration-LP. Moreover we study the relationship of these two LPs and give an explicit characterization of which kind of inequalities are implicitly implied by the configuration-LP. In the sequel, we denote by J the set of jobs and M the set of machines of a given instance. The Natural LP-Relaxation. The natural IP-formulation used by Lenstra et al. [16] uses assignment variables xi,j ∈ {0, 1} that denote whether job j is assigned to machine i. This formulation, which we denote by LST-IP, takes a target value for the makespan T (which will be determined later by a binary  x = 1 for all j ∈ J; search) and does not use any objective function: i,j i∈M  p · x ≤ T for all i ∈ M ; x = 0 for all i, j such that p i,j i,j i,j > T . Here the j∈J i,j variables xij are binary for all i ∈ M, j ∈ J. The corresponding LP-relaxation of this IP, which we denote by LST-LP, can be obtained by replacing the integrality condition by xi,j ≥ 0. Let CLP be the smallest integer value of T so that LST-LP is feasible, and let C ∗ be the optimal makespan of our instance. Thus, since the LP is feasible for T = C ∗ we have that CLP is a lower bound on C ∗ . Moreover, we can easily find CLP in polynomial time with a binary search procedure. Lenstra et al. [16] give a rounding procedure that takes a feasible solution of LST-LP with target makespan T and returns an integral solution with makespan at most 2T . By taking T = CLP ≤ C ∗ this yields a 2-approximation algorithm.

On the Configuration-LP for Scheduling on Unrelated Machines

535

Integrality gaps and the configuration-LP. Shmoys and Tardos [22] implicitly show that the rounding just mentioned is best possible by means of the integrality gap of LST-LP. For an instance I of R||Cmax , let CLP (I) be the smallest integer value of T so that LST-LP is feasible, and let C ∗ (I) the minimum makespan of this instance. Then the integrality gap of this LP is defined as supI C ∗ (I)/CLP (I). It is easy to see that the integrality gap is the best possible approximation guarantee that can be shown by using CLP as a lower bound. Shmoys and Tardos [22] give an example showing that the the integrality gap of LST-LP is arbitrarily close to 2, and thus the rounding procedure by Lenstra et al. [16] is best possible. Equivalently, the integrality gap of LST-LP equals 2. It is natural to ask whether adding a family of cuts can help to obtain a formulation with smaller integrality gap. Indeed, for special cases of our problem it has been shown that adding certain inequalities reduces  the integrality gap. In particular, Ebenlendr et al. [8] consider the star-cuts, j∈J:pi,j >T /2 xi,j ≤ 1, for all i ∈ M. They show that adding these inequalities to LST-LP yields an integrality gap of at most 1.75 in the graph balancing setting. The results of this paper have important implications on whether it is possible to add similar cuts to strengthen the LP for the unrelated graph balancing problem or for the general case of R||Cmax . We study the configurationLP which we define below. Let T be a target makespan, and define Ci (T ) as the collection of all subsets of jobs   with total processing time at most T , i.e.,  Ci (T ) := C ⊆ J : j∈C pi,j ≤ T . We introduce a binary variable yi,C for all i ∈ M and C ∈ Ci (T ), representing whether the set of jobs assigned to machine i equals C. The configuration-LP is defined as follows:  

yi,C = 1

for all i ∈ M,

yi,C = 1

for all j ∈ J,

yi,C ≥ 0

for all i ∈ M, C ∈ Ci (T ).

C∈Ci (T )



i∈M C∈Ci (T ):Cj

It is not hard to see that an integral version of this LP is a formulation for R||Cmax . Also notice that the configuration-LP suffers from an exponential number of variables, and thus it is not possible to solve it directly in polynomial time. However, it is easy to show that the separation problem of the dual corresponds to an instance of Knapsack and thus we can solve the LP approximately in polynomial time. More precisely, given a target makespan T there is a polynomial time algorithm that either asserts that the configuration-LP is infeasible or computes a solution which uses only configurations whose makespan is at most (1 + ε)T , for any constant ε > 0 [23]. The following section, we will show that the integrality gap of this formulation is as large as the integrality gap of LST-LP even for the unrelated graph balancing case. Notice that a solution of the configuration-LP yields a feasible solution to LST-LP with the same target makespan. On the other hand, there are

536

J. Verschae and A. Wiese

solutions to LST-LP that do not have corresponding feasible solutions to the configuration-LP. Now we elaborate on the relation of the two LPs, by giving a formulation in the space of the xi,j variables that is equivalent to the configuration-LP. Intuitively, the configuration-LP contains all possible (local) information for any single machine. Indeed, we show that any cut in the xi,j variables that involves only one machine is implied by the configuration-LP. Indeed, let α ∈ QJ be an  arbitrary row column. The configuration-LP will imply any cut of the form j∈J αj xi,j ≤ δα,i , where the right-hand-side is properly chosen so that no single machine schedule for machine i is removed by the cut. The proof of the following proposition can be found in [24]. Proposition 1. Fix a target makespan T . For each α ∈ QJ we define δα,i :=  max{ j∈S αj : S ∈ Ci (T )}. The feasibility of  the configuration-LP is equivalent to the feasibility of the linear program: i∈M xij = 1 for all j ∈ J and  J j∈J αj xi,j ≤ δα,i for all α ∈ Z , i ∈ M. As an example of the implications of this proposition, we note that adding the star-cuts does not help diminishing the integrality gap of LST-LP for unrelated graph balancing. This follows by taking α as the characteristic vector of the set {j ∈ J : pi,j > T /2} for each i ∈ M .

3

Integrality Gap of the Configuration-LP

In this section we prove that the configuration-LP has an integrality gap of 2, already for the special case of unrelated graph balancing. This shows that the core complexity of the configuration-LP is already captured by the unrelated graph balancing case. We construct a family of instances Ik of the unrelated graph balancing case such that pi,j ∈ { k1 , 1, ∞} for each machine i and each job j for some integer k. We will show that for Ik there is a solution of the configuration-LP which uses only configurations with makespan 1 + k1 . However, every integral solution for Ik has a makespan of at least 2 − k1 . Let k ∈ N and let N be the smallest integer satisfying k N /(k − 1)N +1 ≥ 12 . Consider two k-ary trees of height N − 1, i.e., two trees of height N − 1 in which apart from the leaves every vertex has k children. We say that all vertices with the same distance to the root are in the same layer. For every leaf v, we introduce another vertex v  and k edges between v and v  . (Hence, v is no longer a leaf.) We call such a pair of vertices v, v  a high-low-gadget. Observe that the resulting “tree” has height N . Based on this, we describe our instance of unrelated graph balancing. For each vertex v we introduce a machine mv . For each edge e = {u, v} we introduce a job je . Assume that u is closer to the root than v. We define that je has processing time k1 on machine mu , processing time 1 on machine mv , and infinite processing time on any other machine. This motivates the term “high-low-gadget”: each job inside such a gadget – given by two vertices v and v  as above – has a

On the Configuration-LP for Scheduling on Unrelated Machines

537 (1)

high processing time on mv and a low processing time on mv . Finally, let mr (2) and mr denote the two machines corresponding to the two root vertices. We (1) (2) introduce a job jbig which has processing time 1 on mr and mr . Denote by Ik the resulting instance. To gain some intuition for the construction, consider a high-low-gadget consisting of two machines mv , mv where v  is a leaf. In any solution with a makespan of at most 1 + k1 it is clear that mv can schedule only the jobs whose respective edges connect v and v  . However, we will see in the sequel that there are solutions for the configuration-LP with makespan 1 + k1 in which also a fraction of the job with processing time 1 is scheduled on mv . Since we chose a large number of layers, this fraction will be amplified through the layers to the root until we obtain a feasible solution to the configuration-LP using only configurations with makespan at most 1 + k1 . However, any integral solution has a makespan of at least 2 − k1 as we will prove in the following lemma. This implies that the configuration-LP has an integrality gap of 2. Lemma 1. Any integral solution for Ik has a makespan of at least 2 − k1 . (1)

Proof. We can assume w.l.o.g. that job jbig is assigned to machine mr . If the makespan of the whole schedule is less than 2 then there must be at least one (1) (1) job which has processing time k1 on mr which is not assigned to mr but to some other machine m. We can apply the same argumentation to machine m. Iterating the argument shows that there must be a leaf v such that machine mv has a job with processing time 1 assigned to it. Hence, either mv has a load of at least 2 − k1 or machine mv has a load of at least 2.

Now we want to show that there is a feasible solution of the configuration-LP for Ik which uses only configurations with makespan 1 + k1 . To this end, we introduce the concept of j-α-solutions for the configuration-LP. A j-α-solution is a solution for the configuration-LP whose right hand side is modified as follows: job j does not need to be fully assigned but only to an extent of α ≤ 1. This value α corresponds to the fraction of the big job assigned to a machine like mv as described above. (h) For any h ∈ N denote by Ik a subinstance of Ik defined as follows: Take a vertex v of height h and consider the subtree T (v) rooted at v. For the subin(h) stance Ik we take all machines and jobs which correspond to vertices and edges in T (v). (Note that since our construction is symmetric it does not matter which vertex of height h we take.) Additionally, we take the job which has processing time 1 on mv . We denote the latter by j (h) . We prove inductively that there (h) are j (h) -α(h) -solutions for the subinstances Ik for values α(h) which depend (h) increase for increasing h. The important point is only on h. These values α that α(N ) ≥ 12 . Hence, there are solutions for the configuration-LP which dis(1) (2) tribute jbig on the two machines mr and mr (which correspond to the two root vertices). The following lemma gives the base case of the induction. It explains the inaccuracy of the configuration-LP in the high-low-gadgets.

538

J. Verschae and A. Wiese (1)

1 Lemma 2. There is a j (1) - k−1 -solution for the configuration-LP for Ik uses only configurations with makespan at most 1 + k1 .

which

(1)

(1)

Proof. Let mv be the machine in Ik which corresponds to the root of Ik . Similarly, let mv denote the machine which corresponds to the leaf v  . For (0)  ∈ {1, ..., k} let j be the jobs which have processing time 1 on mv and 1 processing time k on mv .   (0)

For mv the configurations with makespan at most 1 + k1 are C := j

for (0)

each  ∈ {1, ..., k}. We define ymv ,C := k1 for each . Hence, for each job j a fraction of k−1 k remains unassigned.  mv there are the following   For machine (0)

(0)

each  ∈ {1, ..., k}. We define ymv ,Cbig  :=

1 k(k−1)

(maximal) configurations: Csmall :=

1 k

1 . − k−1 1 · k(k−1)

This assigns each job 1 = k−1 .

(0) j

j1 , ..., jk

 and Cbig :=

(0)

j (1) , j

for

for each  and ymv ,Csmall :=

completely and job j (1) to an extent of

After having proven the base case, the following lemma yields the inductive step. It shows how the value α of our j-α-solutions is increased by the layers of our construction, and thus the effect of the high-low-gadgets is amplified. Note that the intermediate nodes have similarities to the nodes which constitute the high-low-gadgets.  Lemma 3. Assume that we are given a j (n) - k n /(k − 1)n+1 -solution for the (n) configuration-LP for Ik which uses only configurations with makespan at most 1 1 + k . Then, there is a j (n+1) - k n+1 /(k − 1)n+2 -solution for the configuration(n+1) LP for Ik which uses only configurations with makespan at most 1 + k1 . (n+1)

(n)

Proof. Note that Ik consists of k copies of Ik , one additional machine and one additional job. Denote by mv the additional machine (which forms the “root” (n+1) of Ik ). Recall that j (n+1) is the (additional) job that can be assigned to mv (n+1) (n) but to no other machine in Ik . For  ∈ {1, ..., k} let j be the jobs which have processing time k1 on mv . (n) Inside of the copies of Ik we use the solution defined in the induction hypoth  (n) esis. Hence, each job j is already assigned to an extent of k n /(k − 1)n+1 . Like in Lemma configurations for mv are given by Csmall :=   2 the (maximal)   (n) (n)  (n+1) (n) j1 , ..., jk and Cbig := j for each  ∈ {1, ..., k}. We define the , j value ymv ,Cbig := k n /(k − 1)n+2 for each  and ymv ,Csmall := 1 − k n+1 /(k − 1)n+2 .  (n)

This assigns each job j completely and the job j (n+1) is assigned to an extent

of k · k n /(k − 1)n+2 = k n+1 /(k − 1)n+2 . Theorem 1. The integrality gap of the configuration-LP is 2 for the unrelated graph balancing problem.

On the Configuration-LP for Scheduling on Unrelated Machines

539

Proof. Due to the above reasoning and the choice of N for each of the two (N ) subinstances Ik there are jbig - 12 -solutions. Hence, there is a solution for the configuration-LP using only configurations with makespan at most 1 + k1 . Since by Lemma 1 any integral solution has a makespan of at least 2 − k1 the claim follows (as k can be chosen arbitrarily large).

4

Unrelated Graph Balancing Case of MaxMin-Allocation

In this section we study the MaxMin-allocation problem in the unrelated graph balancing setting. We present an elegant purely combinatorial 2-approximation algorithm with quadratic running time which is quite easy to implement. To this end we introduce a new preprocessing procedure that already gives an estimate to the optimal makespan that is within a factor of 2, and thus it is as strong as the worst-case bound of the configuration-LP. Let I be an instance of the problem and let T be a positive integer. Our algorithm either finds a solution with value T /2 or asserts that there is no solution with value T or larger. With an additional binary search this yields a 2-approximation algorithm. For each machine i denote by Ji = {ji,1 , ji,2 , ...} the list of all jobs which can be assigned to i. We partition this set into the sets ˙ i where Ai = {ai,1 , ai,2 , ...} denotes the jobs in Ji which can be assigned Ai ∪B to two machines (machine i and some other machine) and Bi denotes the jobs in Ji which can only be assigned to i. We define Ai to be the set Ai without the job with largest processing time (or one of those jobs in  case there is a tie). For any machine i and any set of jobs Ji we define p(Ji ) := j∈J  pi,j . Denote by pi, the processing time of job ai, on machine i. We assume that the elements of Ai are ordered non-increasingly by processing time, i.e., pi, ≥ pi,+1 for all respective values of . The preprocessing phase of our algorithm works as follows. If there is a machine i such that p(Ai ) + p(Bi ) < T we output that there is no solution with value T or larger. So now assume that p(Ai ) + p(Bi ) ≥ T for all machines i. If there is a machine i such that p(Ai ) + p(Bi ) < T (but p(Ai ) + p(Bi ) ≥ T ) then any solution with value at least T has to assign ai,1 to i. Hence, we assign ai,1 to i. This can be understood as moving ai,1 from Ai to Bi . We rename the remaining jobs in Ai accordingly and update the values p(Ai ), p(Ai ), and p(Bi ). We do this procedure until either one of the following two conditions holds: (1) There is one machine i such that p(Ai ) + p(Bi ) < T ; or (2) for all machines i we have that p(Ai ) + p(Bi ) ≥ T . In case (1) holds we output that there is no solution with value T or larger. If during the preprocessing phase the algorithm outputs that no solution with value T or larger exists, then clearly there can be no such solution. As we will see below, this preprocessing phase together with the binary search procedure already gives a lower bound that is within a factor 2 to the optimal solution. This matches the worst-case bound given by the configuration-LP. Now we construct a graph G as follows: For each machine i and each job ai, ∈ Ai we introduce a vertex ai, . We connect two vertices ai, , ai , if ai, and ai , represent the same job (but on different machines). Also, for each

540

J. Verschae and A. Wiese

machine i we introduce an edge between the vertices ai,2k+1 and ai,2k+2 for each respective value k ≥ 0. It can be shown that G is bipartite (see [24] for details). It follows that we can color G with two colors, black and white, such that two adjacent vertices have different colors. Let i be a machine. We assign each job ai, to i if and only if ai, is black. Also, we assign each job in Bi to i. We will prove in Theorem 2 that this procedure assigns each machine at least a load of T /2. In order to transform the previously described procedure into a 2-approximation algorithm, it is necessary to find the largest value T for which the preprocessing phase does not assert that T is too large. This can be easily achieved by embedding the preprocessing algorithm into a binary search framework. With appropriate data structures and a careful implementation the whole algorithm has a running time of O(|I|2 ) where |I| denotes the overall input length in binary encoding, see [24] for details. Theorem 2. There is a 2-approximation algorithm with running time O(|I|2 ) for the unrelated graph balancing case of the MaxMin-allocation problem. Proof. It remains to prove the approximation ratio. Let i be a machine. We show that the total weight of jobs assigned to i is at least p(Ai )/2 + p(Bi ). For each connected pair of vertices ai,2k+1 , ai,2k+2 ai,2k+1  we have that either  or ai,2k+2 is assigned to i. We calculate that p ≥ p(A )/2. Since i,2k+2 i k∈N pi,2k+1 ≥ pi,2k+2 (for each respective value of k) we conclude that the total weight of the jobs assigned to i is at least p(Ai )/2+p(Bi ). Since p(Ai )+p(Bi ) ≥ T the claim follows.

Further Results. The best known N P -hardness reductions for R||Cmax create instances with the property that pi,j ∈ {1, 2, 3, ∞} for all jobs j and all machines i (up to scaling), see [8,16]. In fact, we can show that if the (finite) execution times of the jobs in an instance of R||Cmax differ by at most a factor of 10/3 then there is a (1 + 56 )-approximation algorithm, see [24]. Also, if the greatest common divisor of the processing times arising in an instance is bounded, we also obtain a better approximation factor than 2, see [24]. These results imply key properties for reductions that rule out an approximation factor of (2 − ε) for R||Cmax . Acknowledgments. We would like to thank Annabell Berger, Matthias MüllerHannemann, Thomas Rothvoß, and Laura Sanità for helpful discussions.

References 1. Asadpour, A., Feige, U., Saberi, A.: Santa claus meets hypergraph matchings. In: Goel, A., Jansen, K., Rolim, J.D.P., Rubinfeld, R. (eds.) APPROX and RANDOM 2008. LNCS, vol. 5171, pp. 10–20. Springer, Heidelberg (2008) 2. Asadpour, A., Feige, U., Saberi, A.: Santa claus meets hypergraph matchings. Technical report, Standford University (2009), Available for download at http://www.stanford.edu/~asadpour/publication.htm

On the Configuration-LP for Scheduling on Unrelated Machines

541

3. Asadpour, A., Saberi, A.: An approximation algorithm for max-min fair allocation of indivisible goods. In: Proceedings of the 39th annual ACM symposium on Theory of computing (STOC 2007), pp. 114–121 (2007) 4. Bansal, N., Sviridenko, M.: The santa claus problem. In: Proceedings of the 38th Annual ACM Symposium on Theory of Computing (STOC 2006), pp. 31–40 (2006) 5. Bateni, M., Charikar, M., Guruswami, V.: Maxmin allocation via degree lowerbounded arborescences. In: Proceedings of the 41st Annual ACM Symposium on Theory of Computing (STOC 2009), pp. 543–552 (2009) 6. Chakrabarty, D., Chuzhoy, J., Khanna, S.: On allocating goods to maximize fairness. In: Proceedings of the 50th Annual Symposium on Foundations of Computer Science (FOCS 2009), pp. 107–116 (2009) 7. Correa, J.R., Skutella, M., Verschae, J.: The power of preemption on unrelated machines and applications to scheduling orders. In: Dinur, I., Jansen, K., Naor, J., Rolim, J. (eds.) APPROX 2009. LNCS, vol. 5687, pp. 84–97. Springer, Heidelberg (2009) 8. Ebenlendr, T., Krčál, M., Sgall, J.: Graph balancing: a special case of scheduling unrelated parallel machines. In: Proceedings of the 19th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2008), pp. 483–490 (2008) 9. Eisenbrand, F., Palvoelgyi, D., Rothvoss, T.: Bin packing via discrepancy of permutations. In: Proceedings of the 22th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2011), pp. 476–481 (2011) 10. Feige, U.: On allocations that maximize fairness. In: Proceedings of the 19th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2008), pp. 287–293 (2008) 11. Gairing, M., Monien, B., Woclaw, A.: A faster combinatorial approximation algorithm for scheduling unrelated parallel machines. Theoretical Computer Science 380, 87–99 (2007) 12. Haeupler, B., Saha, B., Srinivasan, A.: New constructive aspects of the lovasz local lemma. In: Proceedings of the 51st Annual IEEE Symposium on Foundations of Computer Science (FOCS 2010), pp. 397–406 (2010) 13. Karmarkar, N., Karp, R.M.: An efficient approximation scheme for the onedimensional bin-packing problem. In: Proceedings of the 23rd Annual IEEE Symposium on Foundations of Computer Science (FOCS 1982), pp. 312–320 (1982) 14. Lawler, E.L., Labetoulle, J.: On preemptive scheduling of unrelated parallel processors by linear programming. Journal of the ACM 25, 612–619 (1978) 15. Lee, K., Leung, J.Y., Pinedo, M.L.: A note on graph balancing problems with restrictions. Information Processing Letters 110, 24–29 (2009) 16. Lenstra, J.K., Shmoys, D.B., Tardos, E.: Approximation algorithms for scheduling unrelated parallel machines. Mathematical Programming 46, 259–271 (1990) 17. Leung, J.Y., Li, C.: Scheduling with processing set restrictions: A survey. International Journal of Production Economics 116, 251–262 (2008) 18. Lin, Y., Li, W.: Parallel machine scheduling of machine-dependent jobs with unitlength. European Journal of Operational Research 156, 261–266 (2004) 19. Scheithauer, G., Terno, J.: Theoretical investigations on the modified integer roundup property for the one-dimensional cutting stock problem. Operations Research Letters 20, 93–100 (1997) 20. Schuurman, P., Woeginger, G.J.: Polynomial time approximation algorithms for machine scheduling: Ten open problems. Journal of Scheduling 2, 203–213 (1999) 21. Shchepin, E.V., Vakhania, N.: An optimal rounding gives a better approximation for scheduling unrelated machines. Operations Research Letters 33, 127–133 (2005)

542

J. Verschae and A. Wiese

22. Shmoys, D.B., Tardos, E.: An approximation algorithm for the generalized assignment problem. Mathematical Programming 62, 461–474 (1993) 23. Svensson, O.: Santa claus schedules jobs on unrelated machines. In: Proceedings of the 43th Annual ACM Symposium on Theory of Computing, STOC 2011 (2011) (to appear) 24. Verschae, J., Wiese, A.: On the configuration-LP for scheduling on unrelated machines. Technical Report 025-2010, Technische Universität Berlin (November 2010)

LNCS 6942 - On the Configuration-LP for Scheduling ... - Springer Link

insights on two key weaknesses of the configuration-LP. For the objective of maximizing the minimum machine load in the unrelated graph balancing setting ...... length. European Journal of Operational Research 156, 261–266 (2004). 19. Scheithauer, G., Terno, J.: Theoretical investigations on the modified integer round-.

260KB Sizes 2 Downloads 293 Views

Recommend Documents

LNCS 6683 - On the Neutrality of Flowshop Scheduling ... - Springer Link
Scheduling problems form one of the most important class of combinatorial op- .... Illustration of the insertion neighborhood operator for the FSP. The job located.

LNCS 4731 - On the Power of Impersonation Attacks - Springer Link
security or cryptography, in particular for peep-to-peer and sensor networks [4,5]. ... entity capable of injecting messages with arbitrary content into the network.

LNCS 4261 - Image Annotations Based on Semi ... - Springer Link
MOE-Microsoft Key Laboratory of Multimedia Computing and Communication ..... of possible research include the use of captions in the World Wide Web. ... the Seventeenth International Conference on Machine Learning, 2000, 1103~1110.

LNCS 3352 - On Session Identifiers in Provably Secure ... - Springer Link
a responding server process in network service protocol architecture [23]. A ... 3. proposal of an improved 3PKD protocol with a proof of security using a.

LNCS 4261 - Image Annotations Based on Semi ... - Springer Link
Keywords: image annotation, semi-supervised clustering, soft constraints, semantic distance. 1 Introduction ..... Toronto, Canada: ACM Press, 2003. 119~126P ...

LNCS 6361 - Automatic Segmentation and ... - Springer Link
School of Eng. and Computer Science, Hebrew University of Jerusalem, Israel. 2 ... OPG boundary surface distance error of 0.73mm and mean volume over- ... components classification methods are based on learning the grey-level range.

LNCS 3174 - Multi-stage Neural Networks for Channel ... - Springer Link
H.-S. Lee, D.-W. Lee, and J. Lee. In this paper, we propose a novel multi-stage algorithm to find a conflict-free frequency assignment with the minimum number of total frequencies. In the first stage, a good initial assignment is found by using a so-

LNCS 4233 - Fast Learning for Statistical Face Detection - Springer Link
Department of Computer Science and Engineering, Shanghai Jiao Tong University,. 1954 Hua Shan Road, Shanghai ... SNoW (sparse network of winnows) face detection system by Yang et al. [20] is a sparse network of linear ..... International Journal of C

LNCS 7601 - Optimal Medial Surface Generation for ... - Springer Link
parenchyma of organs, and their internal vascular system, powerful sources of ... but the ridges of the distance map have show superior power to identify medial.

LNCS 4258 - Privacy for Public Transportation - Springer Link
Public transportation ticketing systems must be able to handle large volumes ... achieved in which systems may be designed to permit gathering of useful business ... higher powered embedded computing devices (HPDs), such as cell phones or ... embedde

LNCS 7575 - Multi-component Models for Object ... - Springer Link
visual clusters from the data that are tight in appearance and configura- tion spaces .... Finally, a non-maximum suppression is applied to generate final detection ...

LNCS 6683 - On the Neutrality of Flowshop Scheduling ...
the minimization of makespan in flowshop problems, iterated local search (ILS) approaches ... scheduled jobs), and the size of the search space is then |S| = N!.

A Scheduling Framework for UWB & Cellular Networks - Springer Link
Dec 9, 2005 - C 2005 Springer Science + Business Media, Inc. Manufactured in The ... The max-min fair scheduling problem in wireless ad-hoc networks is a ...

LNCS 6621 - GP-Based Electricity Price Forecasting - Springer Link
learning set used in [14,9] at the beginning of the simulation and then we leave ..... 1 hour on a single core notebook (2 GHz), with 2GB RAM; the variable ...

LNCS 7335 - Usage Pattern-Based Prefetching: Quick ... - Springer Link
Oct 8, 2010 - The proposed scheme is implemented on both Android 2.2 and Linux kernel 2.6.29. ... solution for reducing page faults [1, 2, 3, 10]. The number ...

LNCS 4191 - Registration of Microscopic Iris Image ... - Springer Link
Casey Eye Institute, Oregon Health and Science University, USA. {xubosong .... sity variance in element m, and I is the identity matrix. This is equivalent to.

LNCS 6719 - Multiple People Activity Recognition ... - Springer Link
Keywords: Multiple Hypothesis Tracking, Dynamic Bayesian Network, .... shared space and doing some basic activities such as answering phone, using.

LNCS 4325 - An Integrated Self-deployment and ... - Springer Link
The VFSD is run only by NR-nodes at the beginning of the iteration. Through the VFSD ..... This mutual effect leads to Ni's unpredictable migration itinerary. Node Ni stops moving ... An illustration of how the ZONER works. The execution of the ...

LNCS 650 - Emergence of Complexity in Financial ... - Springer Link
We start with the network of boards and directors, a complex network in finance which is also a social ... from the board of Chase Manhattan Bank. Boards of ...

LNCS 3973 - Local Volatility Function Approximation ... - Springer Link
S&P 500 call option market data to illustrate a local volatility surface estimated ... One practical solution for the volatility smile is the constant implied volatility approach .... Eq. (4) into Eq. (2) makes us to rewrite ˆσRBF (w; K, T) as ˆσ

LNCS 6621 - GP-Based Electricity Price Forecasting - Springer Link
real-world dataset by means of a number of different methods, each cal- .... one, that we call GP-baseline, in which the terminal set consists of the same variables ...