IEICE TRANS. FUNDAMENTALS, VOL.E88–A, NO.12 DECEMBER 2005

3412

PAPER

Special Section on VLSI Design and CAD Algorithms

Efficient Large Scale Integration Power/Ground Network Optimization Based on Grid Genetic Algorithm Yun YANG†a) , Student Member, Atsushi KUROKAWA†† , Yasuaki INOUE† , Members, and Wenqing ZHAO††† , Nonmember

SUMMARY In this paper we propose a novel and efficient method for the optimization of the power/ground (P/G) network in VLSI circuit layouts with reliability constraints. Previous algorithms in the P/G network sizing used the sequence-of-linear-programming (SLP) algorithm to solve the nonlinear optimization problems. However the transformation from nonlinear network to linear subnetwork is not optimal enough. Our new method is inspired by the biological evolution and use the grid-geneticalgorithm (GGA) to solve the optimization problem. Experimental results show that new P/G network sizes are smaller than previous algorithms, as the fittest survival in the nature. Another significant advance is that GGA method can be applied for all P/G network problems because it can get the results directly no matter whether these problems are linear or not. Thus GGA can be adopted in the transient behavior of the P/G network sizing in the future, which recently faces on the obstacles in the solution of the complex nonlinear problems. key words: SLP algorithm, GGA method, P/G network optimization, global optimum

1.

Introduction

Power/ground (P/G) network is the power routes which supply voltage from the P/G pads to the on-chip circuits. With the development of VLSI, the engineers face challenges in the design of the reliable power/ground distribution because of the increasing susceptibility to current-induced reliability and functional failures [1], [2]. Electromigration, excessive IR drop and delta-I (Ldi/dt) noise are the typical reasons for those reliability and functional failures [3]. Thus the proper design for the wires of P/G network becomes a crucial task in high performance VLSI circuits. Common method used by the designers is enlarging the P/G network to avoid the reliability problems. However this method causes unnecessary chip area waste and impedes the VLSI’s further development. Especially in very deep submicron designs the circuit size decreases rapidly and the P/G network area occupies larger portion in the whole chip. Then the network constraints also become more stringent and more design requirements are introduced. Thus how to realize the miniManuscript received March 14, 2005. Manuscript revised June 16, 2005. Final manuscript received July 28, 2005. † The authors are with the Graduate School of Information, Production and Systems, Waseda University, Kitakyushu-shi, 8080135 Japan. †† The author is with the Semiconductor Technology Academic Research Center, Yokohama-shi, 222-0033 Japan. ††† The author is with the Micro-Electronics Department, Fudan University, Shanghai, 200433 China. a) E-mail: [email protected] DOI: 10.1093/ietfec/e88–a.12.3412

mum chip area for P/G network sizing under the restrictions of delta-I, excessive IR drop and electromigration becomes the recent hot-topic in deep submicron VLSI designs [1], [2]. Present P/G network design focuses on three parts: topology design, P/G network sizing based on the resistoronly model and P/G network design of transient behavior due to capacitive and inductive influences. Topology design achieves its aim by adjusting the sequence or order of the P/G network topology. While the P/G network sizing neglects the delta-I noise and mainly focuses on the voltage loss caused by the P/G network segment resistance. There are several algorithms developed to deal with the problems, such as augmented-Lagrangian method [4], conjugate-gradient method [5], feasible-direction method [6], SLP algorithm [2] and penalty algorithm [10]. All these algorithms convert the P/G network area sizing to a constraint nonlinear programming problem. In this paper we also consider the P/G network sizing problem with resistoronly model in fixed P/G network topology and propose our method based on the large scale P/G network sizing SLP algorithm [1]. The transient P/G network sizing considers the effects of resistance, capacitance and inductance in each segment. Thus delta-I noise, IR drop and electromigration can cause the whole P/G network instability. Several practices are applied to reduce the transient noise and optimize the global area under these restraints. Some methods redesign the decoupling capacitance allocation and placement [7], [8]. Others use wire sizing based on the time-domain sensitivity to reach the P/G network optimization [9]. Because of the complexity of P/G network many methods are used to reduce model order and increase process speed, including HiPRIME [11], PRIMA [12], Krylov [13], and Pad´e [14], etc. In the paper, we propose a novel grid-genetic-algorithm (GGA) to realize P/G network optimization under reliability constraints. Based on the SLP algorithm we observe its deficiency and replace it by improved genetic algorithm. Some early correlative research were presented to use the basic gene algorithm in the P/G network analysis [15]. In our new method we improve the basic gene algorithm to gridgenetic-algorithm and use it to get the global P/G network optimization. Experimental results show that the sizes of P/G network are smaller than previous algorithms, as the fittest survival in the nature.

c 2005 The Institute of Electronics, Information and Communication Engineers Copyright 

YANG et al.: EFFICIENT VLSI POWER/GROUND NETWORK OPTIMIZATION BASED ON GRID GENETIC ALGORITHM

3413

This paper is organized as follows. The P/G network optimization problem and the SLP algorithm are discussed in Sect. 2. Section 3 presents the new grid-genetic-algorithm (GGA). Some P/G network experimental results and comparisons are described in Sect. 4. Finally the conclusions are given in Sect. 5. 2.

Background

In this section we formulate the P/G network sizing problem at first, then we introduce the basic SLP method. At last we present the Tan-Shi SLP algorithm briefly [1], [2], [4], [16].

1. Voltage IR drop constraints: On the P/G network the voltage drop (IR drop) from pads to leaf nodes should not exceed the bound, otherwise the chip cannot work correctly. The detailed voltage constraints are [1]: Vi∈N ≥ Vmin , for power network, Vi∈N ≤ Vmax , for ground network,

where Vmin and Vmax are the lower and upper bounds decided by the given layer parameters. 2. Minimum width constraints: The P/G branches’ widths have minimal bounds because of the technological restrictions on the layers where the P/G branches lie. These bounds are as follows [1]:

2.1 Problem Formulation One example of P/G network G = {N, B} with n nodes N = {1,. . . ,n} and b branches B = {1,. . . ,b} is shown in Fig. 1. In B the segment between nodes i1 and i2 is the branch i with length li and width wi respectively. If we suppose ρ is the sheet resistivity, then the resistance ri of branch i is [1], [16]: ri = (Vi1 − Vi2 )/(Ii ) = ρ(li )/(wi ).

(1)

2.1.1 Objective Function Our aim of the P/G network sizing can be realized by minimizing the total P/G routing area in terms of voltages, currents, and lengths of branches as below [1]: f (V, I) =



li wi =

i∈B

 i∈B

ρIi l2i . Vi1 − Vi2

(2)

In this paper we suppose the length of each branch is constant because the P/G network’s connection is decided previously by the designers. Thus the routing area is related with the branch current variables I and node voltage variables V. 2.1.2 Constraints To satisfy the P/G network’s feasibility and reliability, its routing area is subject to the following constraints:

(3)

wi∈B =

ρli Ii ≥ wi(min) , Vi1 − Vi2

(4)

where wi(min) is width bound. 3. Electromigration constraints: In the P/G network electromigration is caused by high current density and has an upper limit [3]. In the routing layer with fixed thickness, the detailed limits are [1]: |Ii | ≤ σ, or wi |Vi1 − Vi2 | ≤ ρli σ,

(5)

where σ is the current density for fixed thickness. 4. Equal width constraints(coupling constraints): To satisfy the demands of the cost and performance, most adjacent P/G branch segments should not be changed largely. Thus all the segments in a chain can be considered to share the same width. The constraint can be expressed as wi = w j , or [1] Vi1 − Vi2 V j1 − V j2 = . li Ii ljIj

(6)

5. Kirchoff’s current law(KCL): The current influx and efflux in each node of the P/G network should keep balance. That is [1]:  Ii = 0, (7) i∈B( j)

where B( j) is the set of branches to node j. The P/G network optimization problem is to get the minimal P/G routing area under the constraints of voltage IR drop, minimum width, electromigration, equal width and Kirchoff’s current law. From Eqs. (2)–(7) the formulated problem is a nonlinear problem with its nonlinear constraints. 2.2 Two-Phase SLP Optimization Method

Fig. 1

Power/ground grid example.

In 1989 Chowdhury proposed basic SLP method to solve the P/G problem [5]. The method separates the original problem P into two phases (P − V, P − I), and only considers one variable in each phase, then iteratively solves P−V and P− I until the area result reaches the minimal value.

IEICE TRANS. FUNDAMENTALS, VOL.E88–A, NO.12 DECEMBER 2005

3414

1. Phase one (P-V): In this phase all branch currents are fixed and all node voltages are variables [1]:    1 (ρIi l2i ) × . (8) f (V) = Vi1 − Vi2 i∈B Because phase P − V is still nonlinear problem, it is further relaxed by Taylor’s expansion around the initial solution if the feasible result exists. Then the original problem P is converted to linear problem and can be back solved by linear programming (LP). The final objective function is [1]:  2αi fV (V) = 0 (Vi1 − Vi20 ) i∈B  αi − (Vi1 − Vi2 ), (9) 0 (Vi1 − Vi20 )2 i∈B where αi = ρIi l2i . The optimization constraints are Eqs. (3)–(6) with the follows [1]: Vi1 − Vi2 ≥ 0, Ii ξsign(Ii )(Vi10 − Vi20 ) ≤ sign(Ii )(Vi1 − Vi2 ),

of all the resistances in series. And the equivalent end current Ie1 and Ien replace the contributions from all the current sources. Then the P/G network is reduced and can be solved more quickly. Once the voltages at the end nodes are known, the intermediate nodes’ voltages can be back solved one by one [1]: Rs =

n−1  i=1

Ie1 =

n−2  i=1

Ien =

n−2 

Ri ,

(13)

n−1 j=i+1

j=1

where βi = (ρl2i )/(Vi1 − Vi2 ), the related constraints are Eqs. (4), (6), (7). 2.3 Tan-Shi SLP Algorithm Based on the two-phase SLP method, Tan and Shi propose new SLP algorithm with equivalent circuit modeling [1]. By exploring the architecture of P/G network and the relation of all node voltages with branch currents, Tan and Shi reduce the P/G network by using equivalent circuit modeling and apply the SLP method to realize the optimal routing area of the P/G network more efficiently. Shown in Fig. 2 the equivalent resistance R s is the sum

Ii ,

(15)

Ri V s − Ri Iei , Rs = Iei − Ii .

Vi+1 = Vi −

(16)

Ie(i+1)

(17)

|V s1 − V s2 | ≤

(11)

i∈B

Rj

Rs

(14)

After the reduced equivalent network is constructed, the SLP algorithm is used with new constrains [1]:

(10)

where Ii is the given constant and ξ ∈ (0, 1). 2. Phase two (P-I): In this phase the voltages in each node are fixed and branch currents become variables. Similarly, the objective function can be expressed as follows [1]:  βi Ii , (12) fI (I) =

Ii ,

Rs

i

i=1

Rj

|V s1 − V s2 | ≤

ρl s σ 1+

Ie1 Is

ρl s σ 1+

Ien Is

, for power network, , for ground network.

(18)

Then the new phase P − V is minimizing function (9) under the constraints (3), (4), (6), (11), and (18) using the SLP method. Similarly the new phase P − I can get the minimal objective function (12) under the constraints of (4), (6), (7), and (18). From an initial solution Tan-Shi SLP algorithm iteratively solves two new linear programming problems: P − V, then P − I, until the ultimate result is reached. 3.

New P/G Optimization Method

Developing new method is essential because the SLP method can not reach the minimal result under the normal situation. In addition the linear transformation of the SLP method also has problems. Based on the analogy between the P/G network optimization and the rapid colony evolution in biology science, we propose new grid-genetic-algorithm (GGA) to solve the optimization problems. In this section, we first point out the deficiency in the Tan-Shi SLP algorithm. Then we describe the new GGA method in detail and show its process flow. 3.1 The Deficiency of Tan-Shi SLP Method 1. Snare Effect: Tan-Shi SLP algorithm cannot reach the minimal P/G routing area in practical circuits. From Eq. (2) we can get:

Fig. 2

Equivalent circuits for a series-resistor chain.

f (V, I) =

 i∈B

li wi =

 i∈B

ρIi l2i Vi1 − Vi2

YANG et al.: EFFICIENT VLSI POWER/GROUND NETWORK OPTIMIZATION BASED ON GRID GENETIC ALGORITHM

3415

Fig. 4 P − I phase’s problem in practice. IB = IC − I : IC ⇒|IB |⇒P/G Area, (IB > 0) IC ⇒|IB|⇒P/G Areaor.(IB < 0) =⇒P − I phase is no longer linear programming problem.

Fig. 3 Snare effect in Tan-Shi SLP algorithm. (1) Xi /Yv means X units current with Y units voltage. (2) The dashed circles mean these values don’t exist. (3) Route1: (5i /3v ⇒5i /4v ⇒1i /1v ⇒4i /5v ⇒3i /5v ). (4) Route2: (5i /3v ⇒4i /3v ⇒1i /1v ⇒2i /3v ).

=

 i∈B

ρl2i

 Ii = kg(∆V, I), Vi1 − Vi2 i∈B

(19)

Ii Ii where k = ρl2i , g(∆V, I) = ∆V = Vi1 −V . i i2 Figure 3 shows the optimization flag g(∆V, I) in different routes. If all the values in the route exist, the TanShi SLP algorithm can get the minimal result. However, in practical circuits the values of ∆V and I are subject to constraints (3), (4), (6), (7), (11) and (18). Consequently many values in the routes do not exist and Tan-Shi SLP algorithm cannot reach the real optimization. For example, route2’s path is (5i /3v ⇒4i /3v ⇒1i /1v ⇒2i /3v ). Because the values around (2i /3v ) do not exist, the route2 cannot go ahead and the final result is (2i /3v ). In fact the global minimal value is (3i /5v ) if we choose route1 (5i /3v ⇒5i /4v ⇒1i /1v ⇒4i /5v ⇒3i /5v ). We name these situations “Snare Effect.” Thus the new method is required. 2. The solution space limit ξ: In phase P − V, the original problem is nonlinear programming problem in terms of V shown in Eq. (8). To replace it to linear programming problem the sequence-of-linearprogramming (SLP) method was suggested by Tan and Shi in 1999 [2]. Using the Taylor expansion the objective function can be transformed to Eq. (9). However the solution space limit ξ ∈ (0, 1) in Eq. (11) restricts the optimization step and increases the sizing complexity and time. Thus the limit should be improved to eliminate the Taylor expansion disadvantages. 3. P − I phase’s problem in practice: In the individual element situation the P − I phase (12) is linear programming problem, but elements in the P/G network is involved with adjacent circuits. Therefore the phase P − I is not always linear. For instance in Fig. 4, IB = IC − I2 . When IC decreases IB also reduces its

value, and fI (I) value in Eq. (12) changes in the same direction. However when Ic drops its value below I2 , IB < 0 and the absolute value of IB increases with the IC reduction. Then the P/G network area, which is related with the absolute value of branch currents in P − I phase, increases its value when IC is reduced. But the correct linear programming character must own the fixed change direction for the network area and the current IC . If IC rises, the network area increases (or decreases). Reversely when IC drops, the area also decreases (or increases). Thus Tan-Shi SLP algorithm loses the linear programming character in network optimization and cannot deal with whole network current interaction. As a result the phase P − I should be modified and improved by better practice method. 3.2 The Grid-Genetic-Algorithm (GGA) In the nature world the species evolution is rapid and optimal under the environment constraints. And the gene selection and mutation rules define the characters of each successful species. Similarly the P/G network optimization is finding the optimal P/G area under several constraints. Inspired by the analogy between the biology evolution and the P/G network optimization, we propose a new gene algorithm for the large scale P/G network called “Grid Genetic Algorithm (GGA).” Basic gene algorithm was presented by J. H. Holland in 1960’s [15]. We change its genetic code to meet the demand of the grid network and reduce the process complexity. In our method only the independent variables are coded. Other currents and nodal voltages are related with the independent variables and can be computed by the network configuration. Then we let the P/G gene “crossover” and “mutation” under the restrictions of “environment selection.” After many generations’ genetic evolution we can get the minimal P/G network area. Since the benefits of colony search, fast convergence, and global optimization, the gridgenetic-algorithm can approach the minimal P/G network area if the evolution generations are large enough. In addition another virtue is that GGA can process the objective function directly and avoid the error in the transformation from the nonlinear problem to linear problem, which is the crucial obstacle in the recent research in the P/G network optimization. As an illustration Fig. 5 shows a 4 × 4 P/G network

IEICE TRANS. FUNDAMENTALS, VOL.E88–A, NO.12 DECEMBER 2005

3416

riod (−DV max )∼(+DV max ) in each segment. Similarly the 1% variation for the branch current Ii is suitable for the evolution requirements of precision and time. Thus we use the seven bit coding for the branch current Ii . Then the scope for Ii is (−Imax )∼(+Imax ) and its scale is (Imax /64). By the combination of Ii and DV i we can get the final 12 bits genetic codes “VVVVVIIIIIII” for the independent variables in the whole P/G network. 3.2.3 Population Initialization

Fig. 5 4 × 4 P/G network and independent variables. (The code “VVVVV IIIIIII” in the network.) (V: the voltage bit; I: the current bit.)

Each feasible result of the P/G network is called “individual” which has an analogy with single creature in the nature. And just as many creatures form the colony, several individuals comprise “population.” From tests some initial feasible results are obtained and set as the population. The optimal P/G network is evolved from the population pool because most suited genes are included in these feasible results. 3.2.4 Crossover

with 3 segments in each chain [1]. There are 18 branches from “A” to “R,” as well as 16 nodes in the P/G network. In addition three voltage sources and ten inner current outputs compose the whole P/G network. The detailed explanation of the new P/G Optimization Algorithm (GGA) is shown as follows: 3.2.1 Find the Independent Variables Based on the Tan-Shi SLP Algorithm, the widths of all the branches in a chain are identical [1]. As a result each branch in a chain can be expressed by one independent variable. For an m×n P/G network with m rows, n columns and several strips, the numbers of independent variables are (m − 1) × (strip − 1). For example in Fig. 5 the top route of P/G network has three segments A, B and C. We select the segment C as the independent variable. Similarly other variables are segment D and L in the whole P/G network.

In the propagation the genes in same place of different individuals are exchanged to get next generation. This transform is named after “crossover.” Two steps compose this process: 1. Grouping: For population crossover these individuals are randomly divided into some groups which contain two individuals as the combination of male and female. 2. Crossing: After grouping each group’s genes are exchanged at random. This is called “Crossing.” In GGA method the exchange method is updated to suit the P/G network. At first one random value “n,” which is not more than the genetic codes’ bit number, is generated. Then two initializations in one group exchange their last “n” bits in terms of the crossover probability Pc (25%∼75%). Unlike the random crossing for basic gene algorithm, the directional end bits crossing of the GGA method can reduce the evolution complexity and increase the process speed.

3.2.2 Set the Genetic Codes of Independent Variables

3.2.5 Mutation

After the independent variables are found the genetic codes are given to prepare the genetic algorithm. Each branch includes two parts: the branch current Ii and the voltage difference DV i between its two nodes. And the values of Imax and DV max are determined by the given chip parameters. In GGA method the Imax means the summation of all the branch currents flowing out the circuit and the DV max is the maximal voltage difference in the P/G network. In the P/G network segments each part is signed and defined by the binary digits. And the appropriate bit distribution is crucial for the result precision and process time. For the ±3.0 V voltage source, the common switch threshold is 0.3 V, which is about 1/10 of the source voltage. Thus we can use five bits to code the voltage difference DV i . Then each step of the voltage difference is (DV max /16) with the maximal pe-

Mutation is the necessary step which changes the special positions of individual genetic codes. Because not all new genes can be created by the crossover, several code segments in special position alter to engender new offspring. GGA method firstly gets the special positions in the genetic codes at random, then inverses the selected positions’ value under the restrictions of mutation probability Pm whose typical value is between 1% to 20%. The mutation step of GGA method can guarantee the searching completeness and reach the proper result. The selection of Pm can also influence the evolution speed and complexity. 3.2.6 Compute the Whole P/G Network’s Parameter In spite of the width equality, the P/G network is subject to

YANG et al.: EFFICIENT VLSI POWER/GROUND NETWORK OPTIMIZATION BASED ON GRID GENETIC ALGORITHM

3417

Kircho f f ’s current law. Thus the branch currents and nodal voltages can be calculated from the known individual genetic codes, such as the 4 × 4 P/G network in Fig. 5 where branch B’s current is IB = IC − I2 , and its voltage difference DV B = DV C ×(LB /LC )×(IB /IC ). In the same way the P/G network area and other branches’ parameters can be acquired under the P/G network restrictions. 3.2.7 Selection Based on the survival of the fittest theory the next generation is selected under the environment condition. If the individual’s adaptive ability is higher than others, its genes can be preserved and inherited to offspring in all possibilities. The next generation can suit the surroundings more than its parents. In the P/G network problem the new generation is better result for the area optimization. The environment condition is the P/G network restrictions. In GGA method the selection is subject to the limits of (3)–(7) and avoid the difficult transformation from the nonlinear problem to linear problem. Thus the GGA selection is the attempt and approach to the optimal result shown as follows: 1. Individual selection: After the crossover and mutation in each group GGA method selects those new results under (3)–(7) limits. If the results are fit for restrictions they are preserved. Otherwise those unsuccessful results are rejected and their parents are forced to reproduce again, until the suited results are created. If the propagation times exceed the maximum number Mi , the individual selection stops and the parents in the group are kept. These suitable parent genes are used to attend the next generation’s propagation. 2. Population selection: For the given group results the GGA method calculates each group’s whole network area. If it is more optimal result than the last evolution, the area and related parameters in each segment are recorded as this evolution’s computing result. Otherwise this selection fails and GGA method comes back to continue crossing and repeats the above steps. The whole P/G network is evolved again to get the new generation. 3.2.8 Convergence to the Global Optimum This step gives an ending flag to reach the global optimum. There are two flags to determine when the evolution finishs. 1. Global optimum: If the global P/G network area keeps stable and just swings in a given little range A s , the P/G network can be regarded to reach the optimum. Then GGA method finishes and sends out the result. 2. Evolution maximum: If the network cannot get the steady result and the global optimization is not convergent, the evolution will come into the endless circle. Thus setting the maximum evolution number Me is necessary. When the evolution numbers exceed Me ,

Fig. 6

The flow of Grid-Genetic-Algorithm (GGA).

the GGA method’s optimization process stops and the final result is the optimal result acquired in the previous generations, which is considered as the global optimum result of the whole P/G network. If the above flags cannot be reached, GGA method returns to grouping and iterates the above related steps. 3.3 The Flow of Grid-Genetic-Algorithm The flow chart of entire grid-genetic-algorithm procedure is shown in Fig. 6. And the chart gives more detailed explanation of GGA’s process which is discussed in Sect. 3.2. 4.

Results and Discussion

Using GGA method, we can optimize the practical P/G network, and get the process time, network area and the evolution efficiency. Because the linear transformation in Tan-Shi SLP algorithm is not completely successful, we use the idea of searching the solution space step by step and get the accurate result to compare with GGA method. We test the result of the 4 × 4 power/ground network in Fig. 5. In this network the input voltage source is 3.0 V and the genetic code is 12 bits “VVVVVIIIIIII” for each segment. Then the largest segment current is 0.1 A and the highest voltage

IEICE TRANS. FUNDAMENTALS, VOL.E88–A, NO.12 DECEMBER 2005

3418 Table 1

The optimization result of 4 × 4 P/G network.

Methods

Power/Ground network area (um2 )

Process time(s)

Tan-Shi SLP (step search)

201.5382

0.015

Test1 GGA

Test2 Test3 Test4 Test5

109.0300 (optimal size & final result) 140.6160 134.9440 126.9540 137.0550

16.688 15.109 16.203 18.375 16.640

drop for each segment is 0.05 V. We also select the crossover rate 50% and the mutation rate 2% with the initial population size 30. After the computation we can get several P/G network areas because the GGA results float in each evolution as the diversity of species in the nature. The number of generations varies between 100 and 1000 mostly and no more than 10000 in each evolution group. From these results we can select the minimal area and get the final P/G network optimization size shown in Table 1. In Table 1, it is obvious that the GGA are better than the Tan-Shi SLP. If we use the Tan-Shi SLP, the P/G network area is constant and the linear transformation is not successful. By the GGA, we can get the minimal area 109.0300 µm2 and the improvement rate is 45%. Though the GGA behavior is affected by probability, we can get the minimal area by several experiments and the final result is more close to the global optimum than the Tan-Shi SLP, which can not realize the global optimization in fact. But the process time for GGA is longer than Tan-Shi SLP. The average process time is 100–1000 times than the SLP. Thus in the small scale P/G network, the GGA method can realize the better global P/G network sizing with the cost of process time. If analyze the time cost of GGA, we can find that it is mainly decided by the core evolution times. There are two evolution factors to affect the core evolution times. One is outside evolution factor eout and the other one is inside evolution factor ein . The former controls the group numbers and the latter decides the evolution times in each group. If these two factors vary the process time will be changed. The detailed relation is shown in Table 2. If the evolution factors are low the process time is short, but it is not easy to get the minimal area. On the contrary if the evolution factors are high, the global optimization is easy, but the process time is much larger. The different combinations of the two factors can also influence the final results. Thus proper selection of the evolution factors is the key point for the global P/G optimization. In this paper we use the 10000 outside evolution factor and 10000 inside evolution factor. The result is better than the Tan-Shi SLP and more close to the global optimum. In addition the process time is not large and can be accepted. For the Tan-Shi SLP algorithm the complexity and process time is based on the scale and network architecture. If the P/G network nodes increase, the complexity and time for the SLP algorithm are determined by the segment numbers and node quantities, which are linear with the square of P/G network size. Thus the process time of Tan-Shi SLP

Table 2 Outside evolution factor eout 100 100 100 1000 1000 1000 10000 10000 10000

The evolution result for 4 × 4 P/G network. Inside evolution factor ein 100 1000 10000 100 1000 10000 100 1000 10000

Optimal power/ ground network area (um2 ) 152.671 109.030 109.030 148.399 135.730 109.030 133.513 109.030 109.030

Optimal process time (s) 0.2030 1.6720 18.594 0.1400 1.8430 12.985 0.1880 1.5780 14.703

algorithm increases rapidly. In addition the snare effect and linear transform can also restraint its efficiency and accuracy. For the GGA method, though it is also necessary to calculate all branch currents and node voltages in the P/G network, there are many advantages and merits to use this new global optimization method. Firstly, the SLP computes each segment results step by step with the ordinal exchange process of two phases P − V and P − I. But in the GGA case each segments can realize their own evolution computation directly without the partition of computation phases. Secondly, the equivalent circuit modeling in Tan-Shi SLP algorithm is not completely successful. If the current directions are not same in a chain, the nodes can not be suppressed and the P/G segments can not be equivalent to simple circuit model. Then the SLP computation and back solved process become much more complex. For the GGA method, each chain has its own independent variable and the current directions can not influence the value establishment for each segment, once the genetic codes are decided. Thirdly, in the large scale network the GGA method has many evolution segments, then the whole system evolution speed will be accelerated rapidly because of the higher parallel global evolvement. There is an analogy between the P/G network optimization and the biology evolution history in the earth. The evolution time from the single cell animals to human being is almost ten times longer than the evolution period from the multicellular animals to human being, because the multicellular animals have much more evolution parts and can evolve together under the given natural selection. Thus all these advantages of GGA method can partly counteract the rapid increase of the computation complexity in large scale P/G network. Consequently the proposed GGA method can reach the final optimal result faster than SLP algorithm in the situation of same P/G network scale. The values of evolution factors are also important to determine the process time and P/G network area. If the evolution factors are limited improperly the network evolution can not easily reach the global optimum. And if the evolution factors are too large the process time will be extended intolerantly. Thus the suitable balance between the evolution factors and the P/G network optimization becomes significant problem. In our experiments the common evolution factors are 10000 outside evolution factor and 10000 inside

YANG et al.: EFFICIENT VLSI POWER/GROUND NETWORK OPTIMIZATION BASED ON GRID GENETIC ALGORITHM

3419

evolution factor, which is enough for the P/G network below the 100 × 100 scale. For the larger P/G network the detailed exploration of evolution factor effects should be carried on in the future work. Based on the above merits of direct computation, genetic variable independence and whole system evolution, the GGA method can optimize the large scale P/G network more efficiently than the SLP algorithm with high possibility. The skillful selection of evolution factors can also ensure the P/G network sizing. For example, we experiment the 20 × 20 P/G network. If using the SLP algorithm, the process time increase more than 10 times because the network nodes increase from 16 to 400. But the process time for GGA method is still between the 15 seconds and 20 seconds with the evolution factors (10000, 10000). In addition, the P/G network area is also more close to the global optimum than the SLP algorithm. 5.

Conclusion and Future Work

The sequence-of-linear-programming (SLP) algorithm is the common method to solve the global P/G network optimization. But it can not reach the really global optimum and faces many problems, such as the snare effect and inadequate linear transformation. To get rid of SLP’s deficiency, we introduce the evolution idea in the biology world and propose the grid-genetic-algorithm (GGA). This method can get the P/G network area more close to the global optimum and its optimization process is more efficient with the growing scale of P/G network. Experimental results show that the proposed method is better than the SLP method especially in the global optimization. Future work will be focused on decreasing the process time in the optimization and find the detailed relationship between the evolution factors and the process complexity. More Large and complex P/G network should be tested to demonstrate the benefits of GGA method. In addition, the GGA method can also be used in the P/G network design which considers the transient noise due to the parasitic capacitance and inductance driven by time-varying current sources. Further efforts will introduce the GGA method into the transient behavior research in the large scale P/G network optimization.

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

Acknowledgments [16]

We would like to thank the anonymous reviewers for invaluable and helpful comments on this paper. This work was supported by the National Science Foundation of China (NSFC) under grant No. 69928402 and 69806004, and by funds from the MEXT via Kitakyushu and Fukuoka innovative cluster projects. References [1] X.-D. Tan and C.-J. Shi, “Efficient very large scale integration power/ground network sizing based on equivalent circuit modeling,”

IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst., vol.22, no.3, pp.277–284, March 2003. X.-D. Tan and C.-J. Shi, “Reliability-constrained area optimization of VLSI power/ground network via sequence of linear programming,” IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst., vol.22, no.12, pp.1678–1684, Dec. 2003. J.R. Black, “Electromigration failure modes in aluminum metallization for semiconductor devices,” Proc. IEEE, vol.57, no.9, pp.1587– 1594, Sept. 1969. S. Chowdhury and M.A. Breuer, “Minimal area design of power/ground nets having graph topologies,” IEEE Trans. Circuits Syst., vol.CAS-34, no.12, pp.1441–1451, Dec. 1987. S. Chowdhury, “Optimum design of reliable IC power networks having general graph topologies,” Proc. 26th ACM/IEEE Design Automation Conference, pp.787–790, June 1989. R. Dutta and M. Marek-Sadowska, “Automatic sizing of power/ground (p/g) networks VLSI,” Proc. 26th ACM/IEEE Design Automation Conference, pp.783–786, June 1989. G. Bai, S. Bobba, and I.N. Hajj, “Simulation and optimization of the power distribution network in VLSI circuits,” Proc. IEEE International Conference on Computer-Aided Design, pp.481–486, Nov. 2000. S. Zhao, C. Koh, and K. Roy, “Decoupling capacitance allocation and its application to power supply noise aware floorplanning,” IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst., vol.21, no.1, pp.81–92, Jan. 2002. H. Su, J. Hu, S.S. Sapatnekar, and S.R. Nassif, “Congestion-driven codesign of power and signal networks,” Proc. 39th ACM/IEEE Design Automation Conference, pp.64–69, June 2002. X. Wu, X. Hong, Y. Cai, Z. Luo, C.-K. Cheng, J. Gu, and W. Dai, “Area minimization of power distribution network using efficient nonlinear programming techniques,” IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst., vol.23, no.7, pp.1086–1094, July 2004. Y. Cao, Y. Lee, T. Chen, and C.C. Chen, “HiPRIME: Hierarchical and passivity reserved interconnect macromodeling engine for RLKC power delivery,” Proc. 39th ACM/IEEE Design Automation Conference, pp.379–384, June 2002. A. Odabasioglu, M. Celik, and L.T. Pilagge, “PRIMA: Passive reduced order interconnect macromodeling algorithm,” IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst., vol.17, no.8, pp.645– 654, Aug. 1998. J.M. Wang and T.V. Nguyen, “Extended Krylov subspace method for reduced order analysis of linear circuits with multiple sources,” Proc. 37th ACM/IEEE Design Automation Conference, pp.247–252, June 2000. J.C. Shah, A.A. Younis, and S.S. Sapatnekar, “An algorithm for simulating power/ground networks using Pad´e approximants and its symbolic implementation,” IEEE Trans. Circuits Syst., vol.45, no.10, pp.1372–1382, Oct. 1998. S. Zhao, K. Roy, and C. Koh, “Estimation of inductive and resistive switching noise on power supply network in deep sub-micron CMOS circuits,” Proc. IEEE International Conference On Computer Design, pp.65–72, Sept. 2000. T. Wang and C.C. Chen, “Optimization of the power/ground network wire sizing and spacing based on sequential network simplex algorithm,” Proc. 3rd IEEE International Symposium on Quality Electronic Design, pp.157–162, March 2002.

IEICE TRANS. FUNDAMENTALS, VOL.E88–A, NO.12 DECEMBER 2005

3420

Yun Yang received the B.S. degree in electronic engineering from Fudan University, Shanghai, China, in 1998 and the M.S. degree in Micro-Electronics Department from Fudan University, Shanghai, China, in 2004. Now he is pursuing the Ph.D. degree at Graduate School of Information, Production and Systems, Waseda University, Kitakyushu, Japan. His research interests include pipeline operation, MPEG chip design, VLSI CAD development and analog/digital circuit design.

Atsushi Kurokawa received the B.E. degree in electrical engineering from Seikei University, Tokyo, Japan, in 1986 and the D.E. degree in information, production and systems engineering from Waseda University, Fukuoka, Japan, in 2005. From 1986 to 2002, he was with Sanyo Electric Co., Ltd., Gunma, Japan. He is now with Semiconductor Technology Academic Research Center (STARC). His research interests include high-speed/low-power design techniques and timing analysis for VLSI circuits. Dr. Kurokawa is a member of IPSJ and IEEE.

Yasuaki Inoue was born in Niigata, Japan, on September 6, 1945. He received a diploma from the Department of Electronics, Nagaoka Technical High School, Niigata, Japan, in 1964 and the D.E. degree in electronics and communication engineering from Waseda University, Tokyo, Japan, in 1996. From 1964 to 2000, he was with Sanyo Electric Co., Ltd., Gunma, Japan, where he was engaged in research and development in analog integrated circuits and analog/digital CAD systems. In Sanyo Semiconductor Company, he was General Manager of the CAD Engineering Department from 1993 to 1998 and the Memory Development Department from 1998 to 2000. He holds over forty patents. From 2000 to 2003, he was a Professor with the Department of Integrated Cultures and Humanities, also a Professor with the Graduate School of Integrated Science and Art, University of East Asia, Shimonoseki, Japan. Since 2003, he has been a Professor with the Graduate School of Information, Production and Systems, Waseda University, Kitakyushu, Japan. His research interests include numerical analysis of nonlinear circuits and systems, analog circuits, and LSI CAD systems. He was an Associate Editor of the IEEE Transactions on Circuits and Systems Part II from 1997 to 1999. He received the Ishikawa Award from the Union of Japanese Scientists and Engineers in 1988, the Distinguished Service Award from the Science and Technology Agency, the Japanese Government in 1999, the TELECOM System Technology Award from the Telecommunications Advancement Foundation in 2002, the Achievement Award from the Information Processing Society of Japan (IPSJ) in 2003, and the Funai Information Technology Prize from the Funai Foundation for Information Technology in 2004. Dr. Inoue is a member of IEEE, IPSJ, IEEJ and JSST.

Wenqing Zhao was born in 1950. He received B.S. and M.S. degree from Physics Department of Fudan University, Shanghai, China, in 1973 and 1983 respectively. Now he is a professor in Micro-Electronics Department of Fudan University, Shanghai, China. His research interests are R&D of VLSI CAD, including layout synthesis, verification and logic synthesis.

Efficient Large Scale Integration Power/Ground ...

of delta-I, excessive IR drop and electromigration becomes the recent hot-topic in deep submicron VLSI designs [1],. [2]. Present P/G network design focuses on ...

474KB Sizes 1 Downloads 183 Views

Recommend Documents

practical valves and pumps for large-scale integration ...
Micro Total Analysis Systems 2002, Nov. ... nL) dead volumes to the fluidic system. ... metering specific nanoliter- or microliter-scale volumes of fluids may be ...

Large-scale integration of wind power into different energy systems
efficient distribution of wind power and other renewable energy sources in the grids [23–25] .... Regulation strategy II: meeting both heat and electricity demands.

Large-scale integration of wind power into different ... - Publications
Department of Development and Planning, Aalborg University, Fibiger Straede 13, DK-9220 Aalborg, ... designing integrated regulation strategies of overall energy systems. .... 50% more CHP: In the reference system 21.21 TWh equal to app.

Efficient computation of large scale transfer ... - Mathematical Institute
Imag axis (rad/s). Figure 4: Pole-zero map of the 46 × 46 transfer matrix for disturbance channels. Circles denote the transfer function zeros and stars denote the ...

Sparse Semantic Hashing for Efficient Large Scale ...
Nov 7, 2014 - explosive growth of the internet, a huge amount of data have been ... its fast query speed and low storage cost. ..... The test time for SpSH is sufficiently fast especially when compared to the nonlinear hashing method SH. The reason i

CloudRAMSort: fast and efficient large-scale distributed ...
May 24, 2012 - sorting of large-scale in-memory data of current and future systems. ... gation of tens to hundreds of commodity server processors. We carefully ...

Efficient Indexing for Large Scale Visual Search
local feature-based image retrieval systems, “bag of visual terms” (BOV) model is ... with the same data structure, and answer a query in a few milliseconds [2].

Efficient computation of large scale transfer ... - Mathematical Institute
zeros can be computed as the dominant poles of the inverse transfer function by .... As the controller K is determined by the designer and is, in general, of small ...

Efficient Large-Scale Distributed Training of Conditional Maximum ...
computer vision [12] over the last decade or more. ..... online product, service, and merchant reviews with a three-label output (positive, negative .... Our analysis and experiments give significant support for the mixture weight method for training

on production infrastructure Efficient large-scale replica ... - GitHub
1. Introduction. Replica-exchange (RE) [1,2] methods have been used to understand physical ... the resources, network and other components are homogeneous. .... Manager, (ii) the BigJob-Agent, and (iii) the advert service which is a central.

cost-efficient dragonfly topology for large-scale ... - Research at Google
radix or degree increases, hop count and hence header ... 1. 10. 100. 1,000. 10,000. 1985 1990 1995 2000 2005 2010. Year .... IEEE CS Press, 2006, pp. 16-28.

Efficient Topologies for Large-scale Cluster ... - Research at Google
... to take advantage of additional packing locality and fewer optical links with ... digital systems – e.g., server clusters, internet routers, and storage-area networks.

Efficient Large-Scale Distributed Training of ... - Research at Google
Training conditional maximum entropy models on massive data sets requires sig- ..... where we used the convexity of Lz'm and Lzm . It is not hard to see that BW .... a large cluster of commodity machines with a local shared disk space and a.

Deep Learning Methods for Efficient Large Scale Video Labeling
Jun 14, 2017 - We present a solution to “Google Cloud and YouTube-. 8M Video ..... 128 samples batch size) achieved private leaderboard GAP score of ...

Large-Scale Manifold Learning - Cs.UCLA.Edu
ever, when dealing with a large, dense matrix, as in the case of Isomap, these products become expensive to compute. Moreover, when working with 18M data ...

Efficient Mining of Large Maximal Bicliques - CiteSeerX
Graphs can be used to model a wide range of real world applications. In this ... increasingly large multigene data sets from sequence databases [18]. Deter-.

TensorFlow: Large-Scale Machine Learning on Heterogeneous ...
Nov 9, 2015 - containers in jobs managed by a cluster scheduling sys- tem [51]. These two different modes are illustrated in. Figure 3. Most of the rest of this section discusses is- sues that are common to both implementations, while. Section 3.3 di

LARGE SCALE NATURAL IMAGE ... - Semantic Scholar
1MOE-MS Key Lab of MCC, University of Science and Technology of China. 2Department of Electrical and Computer Engineering, National University of Singapore. 3Advanced ... million natural image database on different semantic levels defined based on Wo

Large-scale Incremental Processing Using Distributed ... - USENIX
collection of machines, meaning that this search for dirty cells must be distributed. ...... to create a wide variety of infrastructure but could be limiting for application ...