A Non-Expansive Convolution for Nonlinear-Phase Paraunitary Filter Banks and Its Application to Image Coding Yuichi Tanaka, Akihiro Ochi, and Masaaki Ikehara Department of Electronics and Electrical Engineering Keio University 3-14-1, Hiyoshi, Kohoku-ku, Yokohama, Kanagawa, 223-8522 JAPAN Email: [email protected]

z-1 M

z-1

z-1

E(z)

R(z)

M

z-1

z-1

z-1 M

M

^ x(n)

Fig. 1. The polyphase representation of an M -channel maximally decimated filter bank.

I. I NTRODUCTION The lapped orthogonal transform (LOT) and its generalized version (GenLOT) play an important role in the transform coding and have been extensively studied [1], [9]. From a filter bank perspective, the LOT and the GenLOT are particular class of M -channel linear-phase paraunitary filter banks (LPPUFBs), where the length L of each filter is KM where K is a positive integer [2], [3]. They can avoid the blocking effect which is a shortcoming of the DCT [4], because they have long basis functions. However, they cause the problem that the total number of transform coefficients is more than the input signals by the signal convolution if the signal lengths are finite. This expansive effect is undesirable in image compression applications. One simple non-expansive approach is to use the circular convolution for the finite length signals. Though this approach does not increase the amount of information to be coded, the periodic extension causes the signal discontinuities at the boundaries and requires more bits to code large transform coefficients in the high frequency bands. The symmetric extension is an efficient method for linearphase filter banks (LPFBs) to overcome the problem of the boundary distortion [5]. Since the symmetric extension improves the performance of image coding applications, most works in the field of filter bank systems are focused on LPFBs. However, the linear-phase property is the additional constraint on the filter design. In [6], it has been shown that the performance of stopband attenuation and coding gain of the LPPUFB is worse than the general (nonlinear-phase) PUFBs. Although the signal extension method for NLPPUFBs has been proposed [7], it increases the computational complexity

1­4244­0132­1/05/$20.00 ©2005 IEEE

M

M

x(n)

PROCESSING

Abstract— This paper proposes a new non-expansive convolution for nonlinear-phase paraunitary filter banks (NLPPUFBs). First, we present that any NLPPUFBs can be implemented by connecting several block transforms. Next, we show a signal extension method at the analysis bank by exploiting the characteristics of that structure. Furthermore, we prove that the signal can be reconstructed at the synthesis bank without any redundant signals. Finally, we apply the proposed extension to image coding to validate our method.

because it has to calculate the extension signals at the analysis bank. In order to find a simpler extension method, we focus on that any NLPPUFBs can be implemented by connecting several block transforms. In this paper, we propose a novel nonexpansive convolution method for M -channel NLPPUFBs. With the proposed extension, it is shown that NLPPUFBs have superior coding performance to LPPUFBs. II. E XISTI NG L ATTIC E S TRUCTUR E Fig. 1 shows a polyphase implementation of a typical M channel maximally decimated filter bank [2], [3]. If an M channel NLPPUFB has length KM , the analysis polyphase matrix E(z) is represented as [6] E(z) = GK−1 (z)GK−2 (z) · · · G1 (z)E0 .

(1)

If M is even, each matrix of (1) is represented as follows: Gi (z) = Pi Wi Λ(z)Wi where Pi = diag(Ui , Vi ) and  Wi =

Ci Si

Si −Ci

 , Λ(z) =



IM 2 0M 2

(2)

0M 2 z −1 I M

 .

2

E0 is an M × M unitary matrix, Ui and Vi are M/2 × M/2 unitary matrices and Ci and Si are M/2 × M/2 diagonal matrices as Ci = diag(cos αll ) and Si = diag(sin αll ) (l =

54

Authorized licensed use limited to: Keio University. Downloaded on October 16, 2008 at 21:48 from IEEE Xplore. Restrictions apply.

E0

W1

input signals

E0

W1

E0

W1

W1

P1

W2

W1

P1

W2

Fig. 2. A form of an M -channel NLPPUFB connecting several block transforms.

1, 2, · · · , M/2), respectively. Furthermore, in the real coefficients case, the synthesis polyphase matrix R(z) is shown as R(z) = z −(K−1) ET0 GT1 (z)GT2 (z) · · · GTK−1 (z)

(3)

signals at each image boundary, by using J (K−1)M for smooth 2 extension. Each block which has the size M × M in Fig. 3 is represented as follows:   A11 A12 = W1 E0 K=2 : A= A21 A22 ⎧   B11 B12 ⎪ ⎪ = W1 E0 ⎨ B=  B21 B22  K=3 : C11 C12 ⎪ ⎪ = W2 P1 W1 ⎩ C= C21 C22 ⎧   D11 D12 ⎪ ⎪ = W1 E0 D= ⎪ ⎪ ⎪ ⎪  D21 D22  ⎨ F11 F12 = W2 P1 W1 K=4 : F= F ⎪ ⎪  21 F22  ⎪ ⎪ H11 H12 ⎪ ⎪ = W3 P2 W2 ⎩ H= H21 H22 where each submatrix has the size M/2 × M/2. B. Signal reconstruction at the synthesis bank

from the orthogonality of E(z). III. A N ON - EXPANSI VE C ONVOLUTION FOR NLPPUFB S When FBs are applied to image coding, the signal increasing problem occurs at boundaries of images due to the signal convolution. The LPFB can avoid this problem by utilizing the symmetric extension. In contrast, the NLP one can not use the symmetric extension. Hence, we present a signal extension/reconstruction method using the lattice structure of the NLPPUFB in this section. Attention: For simplicity, we represent each line with/without arrow in Figs. 2–4 as a signal vector consisted ˆ in the synthesis of M/2 signals. We also denote signal p bank which corresponds to signal p in the analysis bank. A. Signal extension at the analysis bank We consider an M -channel NLPPUFB with its length KM (M : even). In (2), Λ(z) makes M/2 signals delay and their delayed signals are input to next building block. Therefore, an NLPPUFB is represented in Fig. 2 by connecting several M × M block transforms. M/2 samples cross between each Wi due to the delay elements. Fig. 3(a) shows the upper image boundary of Fig. 2 when K = 2. From that figure, M/2 signals have to be extended at the upper and the lower image boundaries, respectively. Consequently, the whole input signal has to be extended by M signals. Now we extend signals by using J M for smooth 2 extension. With the reverse operation, the signal has to be reconstructed at the synthesis bank. However, transmitted signals are only yn (n = 0, 1, 2, · · ·). Hence, we have to solve the “nonexpansive” problem. That is, we have to find a solution to reconstruct x0 from zn which are calculated from yn easily. This problem is solved in the next subsection. We also show the upper image boundary when K = 3 and 4 in Fig. 3(b) and (c), respectively. This is similar to K = 2. After all, the input signal has to be extended by (K − 1)M/2

In this subsection, our purpose is to reconstruct xn from zn , an and bn in Fig. 3. We show the signal reconstruction method for some K. Hereafter, we consider only the upper image boundary because the signal can be reconstructed by the same way at the lower image boundary. 1) K = 2: In Fig. 3(a), we have  

Jx0 = z0 . (4) A11 A12 x0 ˆ0 from ˆz0 which is transmitted The problem is to reconstruct x ˆ 0 is represented as the to the synthesis bank. From (4), x solution of the matrix equation (A11 J + A12 )ˆ x0 = ˆ z0 . Therefore, we get ˆ 0 = (A11 J + A12 )−1 ˆz0 . x

(5)

This procedure is shown in Fig. 4(a). In this case, the matrix (A11 J + A12 ) has to be nonsingular. In the next section, we make consideration of this condition to design the NLPPUFB. ˆ1 in Fig. 3(b). In 2) K = 3: The problem is to calculate a this case, we have  

x1 (6) a0 = B11 Jx1 + B12 Jx0 = B11 J B12 J x0 and



x0 x1



 =

BT11 BT12

BT21 BT22



a2 a1

 .

Substitute (7) into (6), then (6) is rewritten as   

BT11 BT21 a2 . a0 = B12 J B11 J BT12 BT22 a1 Furthermore, we also have z0 =



C11

C12

55 Authorized licensed use limited to: Keio University. Downloaded on October 16, 2008 at 21:48 from IEEE Xplore. Restrictions apply.





a0 a1

(7)

(8)

 (9)

x0

x0

J

A

x0

z0 W1

z1

x1

A

x2

P1

J

x1

boundary

y0

x0

y1

x1

B

a1 B

x2

z2

a0

z0

z1 W2

a2 a3

B

x3

C C

boundary

P2

y0 y1

z2

(a) (b) x0 x1 x2 x0 x1 x2 x3 x4

J

D D D

b0 b1

F

a0 a1

b2 b3

F

H

z1 W3

a2 a3

b4

boundary

z0

H

P3

y0 y1

z2

F D

(c) The proposed extension for NLPPUFBs (dotted lines represent the upper image boundaries): (a) K = 2, (b) K = 3 and (c) K = 4.

Fig. 3.

ˆ 1 can be calculated as the solution of from Fig. 3(b). Hence a ˆ 0 and the simultaneous matrix equation with two unknowns a ˆ1 represented as (8) and (9). Finally we get a ˆ1 = (C11 ∆ + C12 )−1 (ˆ z0 − C11 Ξˆ a2 ) a

(10)

if (C11 ∆ + C12 ) is nonsingular. We make consideration of this condition to design the NLPPUFB in the next section as well as K = 2. In (10), we define ∆ = Ξ =

(B12 JBT11 + B11 JBT12 ) (B12 JBT21 + B11 JBT22 ).

The reconstruction procedure is illustrated in Fig. 4(b). 3) K ≥ 4: In any K, the signal can be reconstructed as the solution of the simultaneous matrix equation with (K − 1) unknowns as well as the previous cases. As an example, the solution in K = 4 is shown as follows: ˆ 1 = (H11 F11 ΥFT22 + H11 F12 ΦFT21 + H12 )−1 a ˆ 4 ). (11) ×(ˆz0 −H11 (F11 ΥFT12 +F12 ΦFT11 )ˆ a2 −H11 F11 Ψb The condition is that (H11F11 ΥFT22 +H11F12 ΦFT21 +H12 ) is nonsingular, and we define Υ = Φ =

D12JDT21 + D11JDT22 (D21 J + D22 )(D11 J + D12 )−1

Ψ

D12JDT11 + D11JDT12 .

=

It is similar to K = 2 to reconstruct the signal from the next block transform.

IV. R ESULT S In this section, we design some NLPPUFBs and apply these to image coding with the proposed extension. We compare our proposed method with the traditional DCT, LOT with the symmetric extension and NLPPUFB with the traditional extension [7]. The cost function to design NLPPUFBs is defined as the linear combination of coding gain, stopband attenuation, DC leakage and symmetric property of filters. First three motivations are quite popular [2], and symmetric property of filters is defined as Csym =

M −1 i=0

MK 2

−1 (hi (n) − shi (M K − 1 − n)),

(12)

n=0

where hi (n)s are the impulse responses of the Hi (z). Additionally, we set s = 1 in the channel with even index and s = −1 in the channel with odd index. This restriction is required to satisfy the condition that the matrices in (5) and (10) are nonsingular. We designed two 8-channel NLPPUFBs with its length 16 (K = 2) and 24 (K = 3). Their magnitude and impulse responses are shown in Fig. 5. Table I shows the comparison of PSNRs of the reconstructed images. Each image is coded by 6-level SPIHT [8]. As a result, the proposed method indicates higher PSNR in all images. The 8×16 NLPPUFB with the proposed extension signifies 0.3–0.4 dB higher PSNR than the 8 × 16 LOT [9] with the symmetric extension in Lena image (especially 0.43 dB higher at 1.0 bpp). This result represents the proposed method can extend the

56 Authorized licensed use limited to: Keio University. Downloaded on October 16, 2008 at 21:48 from IEEE Xplore. Restrictions apply.

boundary

^y0 ^y1

T

P1

W1

T

^ y0

boundary

^z0 (A11J+A12)-1 ^z1

^ x0 ^ x1

AT

z^0 PT 2

^ y1

C12)-1

(C

C11

W2 ^ z1 T

P2T

^ y3

^ x0 BT

^ a2

CT

^ z2

^ y2

^ x2

-

^ a1

^ x1

^ a3

^ x2

W2T

BT

^ x3

CT

(a)

(b) The signal reconstruction method at the synthesis bank: (a) K = 2 and (b) K = 3.

-0.5 h6(n)

0.5

-45 0

0.2

0.4 0.6 Normalized Frequency

0.8

1

5

10

-0.5

5

10

15

5

10

15

0 -0.5 0.5

5

10

15

-15

0.5

-25 -30

5

10

-0.5 0.5

0

(a) Fig. 5.

0.2

0.4 0.6 Normalized Frequency

0.8

h1(n)

15

20

0.5

5

10

15

5

10

15

0.5

5

10

15

0.5

15

20

5

10

15

20

5

10

15

20

5

10

15

20

0 -0.5

20

10

0 -0.5

20

5

0 -0.5

20

0 -0.5

1

10

0

-35

-45

15

5

0 -0.5

-20

-40

0 -0.5

0.5

-10

0 -0.5

h3(n)

15

-0.5

h5(n)

10

0.5

0

h2(n)

5

0

0.5 h0(n)

-5

0.5

15

0

-0.5

-0.5

15

0

-35 -40

10

0

0.5

h7(n)

-30

5

5

0

h7(n)

0.5

-25

15

h5(n)

-0.5

-20

10

0

0.5

h4(n)

h2(n)

-15

5

h3(n)

0.5

-10

h4(n)

Magunitude Response [dB]

0 -0.5

-5

h1(n)

h0(n)

0.5

0

h6(n)

5

Magunitude Response [dB]

Fig. 4.

(b)

The magnitude and impulse responses of the NLPPUFBs (the analysis banks): (a) 8×16 and (b) 8×24.

TABLE I T HE COMPARISON OF PSNR [ D B] (S YM .: THE SYMMERTIC EXTENSION , P ROP.: THE PROPOSED EXTENSION ).

Image Transform/ extension 8×8 DCT 8×16 LOT/ Sym. 8×24 GenLOT/ Sym. 8×16 NLPPUFB/ [7] 8×24 NLPPUFB/ [7] 8×16 NLPPUFB/ Prop. 8×24 NLPPUFB/ Prop.

0.25 31.81 32.86 33.08 33.17 32.59 33.21 33.36

Lena bpp 0.5 35.61 36.24 36.45 36.06 36.13 36.55 36.66

1.0 39.29 39.38 39.77 39.35 39.33 39.81 39.86

signal without spoiling the property of the NLPPUFB, which has higher coding gain.

0.25 29.34 29.81 29.87 29.73 29.84 29.95 29.96

Goldhill bpp 0.5 32.00 32.44 32.49 32.46 32.49 32.57 32.56

1.0 35.47 35.78 35.86 35.86 35.85 35.91 35.90

0.25 31.39 32.14 32.31 31.55 31.61 32.38 32.54

Pepper bpp 0.5 34.54 34.79 34.91 34.27 34.20 35.02 35.05

1.0 37.13 37.11 37.30 36.82 36.72 37.38 37.36

ACKNOWLEDGEMENT This work is supported by in part by a Grant in Aid for the 21st century Center of Excellence for Optical and Electronic Device Technology for Access Network from the Ministry of Education, Culture, Sports, Science and Technology in Japan.

V. C ONCLUSION This paper proposed a new non-expansive convolution method for NLPPUFBs to apply to image coding. We presented that the signal can be reconstructed at the synthesis bank without any redundant signals. In the result of image coding, the proposed method has higher PSNR than the traditional ones. Our future work is to find more effective extension matrices to improve the coding performance.

R EFERENCE S [1] R. L. de Queiroz, T. Q. Nguyen, and K. R. Rao, “The GenLOT: generalized linear-phase lapped orthogonal transform,” IEEE Trans. Signal Processing, vol.40, pp.497–507, Mar. 1996. [2] G. Strang and T. Q. Nguyen, Wavelets and Filter Banks, Cambridge, MA: Wellesley-Cambridge, 1996.

57 Authorized licensed use limited to: Keio University. Downloaded on October 16, 2008 at 21:48 from IEEE Xplore. Restrictions apply.

[3] P. P. Vaidyanathan, Multirate Systems and Filter Banks, Englewood Cliffs, NJ: Prentice-Hall, 1993. [4] K. R. Rao and P. Yip, Discrete Cosine Transform: Algorithms, Advantages, Applications, New York: Academic, 1990. [5] M. J. T. Smith and S. L. Eddins, “Analysis/synthesis techniques for subband image coding,” IEEE Trans. Signal Process., vol. 38, no. 8, pp. 1446–1456, Aug. 1990. [6] X. Gao, T. Q. Nguyen and G. Strang, “On factorization of M -channel paraunitary filterbanks,” IEEE Trans. Signal Process., vol. 49, no. 5, pp. 1433–1446, Jul. 2001. [7] T. Oka, T. Uto and M. Ikehara, “Smooth signal extension for M -channel paraunitary filterbanks and its application to image coding,” Proc. ICIP 2003, pp. 229–232, Sept. 2003. [8] A. Said and W. A. Pearlman, “A new, fast and efficient image codec based on set partitioning in hierarchical trees,” IEEE Trans. Circuits Syst. Video Technol., vol. 6, no. 3, pp. 243–250, Jun. 1996. [9] H. S. Malvar and D. H. Staelin, “The LOT: Transform coding without blocking effects,” IEEE Trans. Signal Process., vol. 37, no. 4, pp. 553– 559, Apr. 1989.

58 Authorized licensed use limited to: Keio University. Downloaded on October 16, 2008 at 21:48 from IEEE Xplore. Restrictions apply.

A Non-Expansive Convolution for Nonlinear-Phase ...

In this case, the matrix. (A11J + A12) has to be nonsingular. In the next section, we make consideration of this condition to design the NLPPUFB. 2) K = 3: The problem is to calculate ˆa1 in Fig. 3(b). In this case, we have a0 = B11Jx1 + B12Jx0 = [. B11J B12J. ] [ x1 x0. ] (6) and. [ x0 x1. ] = [. BT. 11. BT. 21. BT. 12. BT. 22. ][ a2.

709KB Sizes 4 Downloads 208 Views

Recommend Documents

Viscosity approximation methods for nonexpansive ...
E-mail addresses: [email protected] (Y. Song), [email protected] (R. Chen), [email protected] (H. Zhou). 0362-546X/$ - see front matter c ...

Convolution and Differential Subordination for ...
Mar 2, 2009 - 4Department of Mathematics, University of Delhi, Delhi 110 007, India ..... [13] M. S. Kasi and V. Ravichandran, On starlike functions with ...

Iterative approximations for multivalued nonexpansive mappings in ...
Abstract. In this paper, we established the strong convergence of Browder type iteration {xt} for the multivalued nonexpansive nonself-mapping T satisfying the ...

MEX based Convolution For Image Gradient Filtering And Detection ...
MEX based Convolution For Image Gradient Filtering And Detection.pdf. MEX based Convolution For Image Gradient Filtering And Detection.pdf. Open. Extract.

Text-Line Extraction using a Convolution of Isotropic ...
... of text-lines. For a sample document image, the smoothing results of isotropic, ... of applying a set of filters, instead of one, for a given data processing task. ..... [12] W. T. Freeman and E. H. Adelson, “The design and use of steerable fil

Shape Modeling by Sketching using Convolution ...
challenge for the modeling software. One of the simplest and .... August 1999, pages 409-416. [7] O. Karpenko, J. F. Hughes, and R. Raskar, “Free-form.

Improvement of the Orthogonal Code Convolution ...
[7] B. Sklar, Digital Communications Fundamentals and Applications,. Prentice Hall, 1998. [8] Qualcomm, The CDMA Network Engineering Handbook, Vol. I,.

The Application of Convolution to Suffix to Prefix Rule ...
So, we know that “bac” is a suffix of T of length three which is also a prefix of pattern P. In this way, we can find out all the suffixes of T which is also a prefix of pattern P. Thus, we can also find the longest suffix of T which is also a pr

The Need for a CMS There is currently a market gap for a Content ...
large following and is a powerful piece of software, it can be difficult to extend, and is ... pushed by Zend, the company behind PHP, and therefor receives good press coverage ... market and trying to avoid being all things to all people. This is a 

sheniblog_revision_practices for A+ LEARNERS_PART4.pdf ...
She thought that it would be the end.But her efforts. were on.She went on with her strong will.Therefore she is there today.She advises him not to get. easily fed ...

Questions for a Socratic Dialogue
Recently, R.W. Paul's six types of Socratic Questions were expanded to nine types. These ques- tions are reproduced with permission from the Foundation for ...

A Case for XML - IJEECS
butions. For starters, we propose a novel methodology for the investigation of check- sums (TAW), which we use to verify that simulated annealing and operating systems are generally incompatible. We use decen- tralized archetypes to verify that super

Protective gear for a limb
Sep 14, 2004 - from contact With the interior of the segments. 39 Claims, 3 Drawing .... In another aspect of the invention, the tangent to the center point of the ...

A Case for XML - IJEECS
With these considerations in mind, we ran four novel experiments: (1) we measured instant messenger and RAID array throughput on our 2-node testbed; (2) we ...

PDF English for Business Studies Student's Book: A Course for ...
PDF English for Business Studies Student's Book: A Course for Business Studies and Economics. Students (Cambridge Professional English) Best. Book By #A#.