Differential Evolution for Frequency Modulation Sound Synthesis Parameter Identification Kai Xiong

Hui Wang

Dept. of Information Engineering Politecnico di Torino Turin, Italy [email protected]

School of Information Engineering Nanchang Institute of Technology Nanchang 330099, China [email protected]

Abstract—Frequency Modulation (FM) sound synthesis provides a neat synthesis method by which complex sound forms can be created simply and efficiently. However, it is difficult for traditional methods to find a best set of parameters in FM sounds synthesis because of the nonlinear parameter space. Differential Evolution (DE) is a recently proposed optimization technique which has shown excellent performance in many complex optimization problems. This paper presents an application of DE for FM sound synthesis parameter identification. Simulation results demonstrate that it is applicable to use DE for FM sound synthesis parameter identification.

which is a type of heterogeneous distributed real-coded GA. But GD-RCGA applied a different crossover operator to each subpopulation. Simulation studies were conducted on several benchmark functions and three real-world problems, including FM sounds parameter identification problem. The reported results show that GD-RCGA achieves better results than other compared algorithms. Mitchell presented an exploration of evolutionary computation applied to FM audio synthesis parameter optimization [2].

Keywords-differential evolution; particle swarm optimization; frequency modulation; parameter identification

I.

INTRODUCTION

Many real-world problems can be converted into optimization problems. An unconstrained minimization problem can be formulated as follows.

min f ( x ) where x = [ x1 , x2 , ..., xD ] , D is the dimension of the problem, and f() is some evaluation function. Frequency Modulation (FM) sound synthesis was proposed by John Chowning in 1973, which is a simple method to create complex sound [1]. FM aims to optimize a set of parameters to match the target sounds. However, it is difficult to achieve the desired target sounds through manual control of the parameters because of the nonlinear relationship between the FM synthesis parameters and the spectral shape of the synthesized sound [2]. Recently, some evolutionary algorithms haven been used to optimize synthesis parameters in sound matching with FM synthesis. Horner et al. presented the first evolutionary sound matching system for matching sounds produced by real acoustic instruments with FM and wavetable synthesis [3-4]. In [3], genetic algorithm (GA) is used to select a small number of spectral snapshots to reproduce the entire sound. The presented results show the spectrum error curves, and time/frequency plots to provide visual comparison between the target and the corresponding matched sounds. Herrera and Lozano [5] proposed gradual distributed real-coded GA (GD-RCGA),

Eberhart and Kennedy in 1995 developed the Particle Swarm Optimization (PSO), a population based stochastic optimization technique [10]. It shares many similarities with GA. However, PSO has no evolution operators such as crossover and mutation. The potential solutions, called particles, fly through the problem space by following the current optimum particles. Each particle keeps track of its coordinates in the problem space which are associated with the best solution (fitness) achieved so far. When all the population has become the particle’s topological neighbors, the best value will be a global. In this study, there effective evolutionary algorithms were proposed to match sounds with FM synthesis. Differential Evolution (DE), proposed by Storn and Price [6], is an effective, robust, and simple global optimization algorithm. According to frequently reported experimental studies, DE has shown better performance than many other evolutionary algorithms in terms of convergence speed and robustness over several benchmark functions and real-world problems [7]. In this paper, we present an application of DE for FM sound synthesis parameter identification. The rest of the paper is organized as follows. Section II describes the problem definition. Section III briefly introduces the DE algorithm. Section IV delivers the DE for FM sound synthesis parameter identification. The simulation results and discussions are presented in Section V. Finally, the conclusions and future work are given in Section VI. II.

PROBLEM DESCRIPTION

Frequency Modulation (FM) sound synthesis provides a computationally efficient means of creating complex sound timbres, which has seen wide application in commercial system. Let use consider a simple FM method as follows. Assume that

fm and fc are the modulator and carrier frequencies, respectively. I is index of modulation, and A controls the output amplitude. The amplitude function for simple FM is given as follows [2]:

y(t ) = A sin ( wct + I sin ( wmt ) )

Vi (t ) = X i (t ) + F ( X i 2 (t ) − X i 3 (t ))

(1)

wc = 2π fc d fm

where y(t) is the modulated carrier output. The modulating oscillator varies the carrier frequency in the range specified by the peak frequency deviation d, which is the product of the modulation index I and the modulating frequency fm [2]. If I=0, there is no modulation of the carrier oscillator frequency, and the generated single equals with the sine wave at frequency fc. When I > 0, frequency partials are around of two sides of the carrier with I =0. The FM sound synthesis aims to optimize the parameter of an FM synthesizer with a D-dimensional vector X. In this paper, we only consider the case of D=6 [5,8] as follows.

(

)

(2)

(

)

(3)

y (t ) = a1 sin w1tθ + a 2 sin ( w2 tθ + a3 sin ( w3tθ ) ) y 0 (t ) = 1.0sin 0.5tθ −1.5sin ( 4.8tθ + 2.0sin ( 4.9tθ ) )

i = 1, 2, ..., N p

Like other GAs, DE also employs a crossover operator to reproduce a trial individual Ui(t). Compared to GAs, DE utilizes a different crossover method. In GAs, offspring are usually generated by the crossover of two parents randomly selected from the population, while the crossover is conducted on the mutant individual and the current parent individual in DE. There are several different crossover strategies in DE, and the binary one is usually used. The offspring Ui(t) generated by binary crossover is defined by: ⎧⎪Vij (t ), U ij (t ) = ⎨ ⎪⎩ X ij (t ),

FM sound synthesis, θ = 2π /100 , and the parameters are defined in the range [-6.4, 6.35].

The FM sound synthesis problem is to generate a sound (using equation (2)) to approximate the target sound (equation (3)). According to the suggestions of [5], the fitness function is defined as the summation of square errors between the evolved data and the target model data as follows.

f ( X ) = ∑ ( y (t ) − y 0 (t ) )

otherwise

(6)

After the mutation and crossover, an elite selection is used to select a fitter individual between Xi(t) and Ui(t) as the new Xi(t). The selection model can be formulated as follows. ⎧⎪U (t ), X i (t + 1) = ⎨ i ⎪⎩ X i (t ),

if f (U i (t ))≤ f ( X i (t )) otherwise

(7)

Without loss of generality, this paper only considers minimization problem. If the trial vector Ui(t) is better than Xi(t), then Xi(t) is set to Ui(t); otherwise, the Xi(t) remains unchanged.

IV.

DE FOR FM SOUND SYNTHESIS PARAMETER IDENTIFICATION

2

(4)

t =0

III.

if rand (0,1)≤Cr ∧ j ==l

where j=1,2,…,D, Cr is the probability of crossover, and l is a random integer between 0 and Np.

where X = [ a1 , w1 , a2 , w2 , a3 , w3 , ] is the parameter of the

100

(5)

where Xi1(t), Xi2(t), Xi3(t) are three mutually different individuals randomly selected from the current population, i ≠ i1 ≠ i 2 ≠ i 3 , and F is the control parameter.

wm = 2π f m I=

For each individual Xi(t) in population P(t), a mutant individual Vi(t) is generated as follows.

In this section, we present an implementation of DE for FM sound synthesis parameter identification. The main steps of the algorithm are described in Algorithm 1. Algorithm 1: DE for FM Sound Synthesis

DIFFERENTIAL EVOLUTION

DE is a population-based stochastic search algorithm, which starts with an initial random population. By conducting a series of operations, such as mutation, crossover and selection, the fitness values are improved towards some optimal solution(s). Let us assume that Xi(t) is the ith individual in population P(t), where t is the index of generation, and P(t) is the population in the tth generation. The main idea of DE is to generate trial vectors. Mutation and crossover are used to produce new trial vectors, and selection determines which of the vectors will be successfully selected into the next generation.

Start While FEs < MAX_FEs do For i = 1 to Np do

Generate a mutant individual Vi(t) according to equation (5); Generate a trial individual Ui(t) according to equation (6); Calculate the estimated wave of Ui(t) according to

TABLE I.

equation (2); Calculate the fitness value of Ui(t) according to equation (4); According to equation (7), select a fitter one between Xi(t) and Ui(t) as new Xi(t);

RESULTS ACHIEVED BY PSO, GD-RCGA AND DE

Algorithms

Mean

Std Dev

PSO

9.81

5.67

GD-RCGA

7.0e-16

6

DE

0

0

End For End While End

V.

EXPERIMENTS

In this section, we used DE to optimize the parameters of FM sound synthesis. In the experiments, we compare the performance of DE with particle swarm optimization (PSO) [9], and GD-RCGA [5]. For DE and PSO, we use the same settings for the common parameters. The population size, Np, is set to 100, and the maximum number of function evaluations (MAX_FES) is set to 100,000. For DE, the control parameters, F and Cr, are fixed to 0.5 and 0.9, respectively. For PSO, w, c1, and c2 are set to 10, 0.72984, 1.49618 and 1.49618, respectively. For GD-RCGA, each subpopulation has 20 individuals. The mutation operator applied is non-uniform mutation. The probability of updating a chromosome by mutation is 0.125, and the crossover probability is 0.6. The maximum number of generations is set to 1,000. All the algorithms are run 30 times, and the mean best function values and standard deviation are recorded. The computational platform is described as follows. z

System: Win XP (SP3)

z

CPU: Intel (R) Core (TM) Duo T6400 2.00 GHz

z

RAM: 2.0 GHz

z

Complier: Microsoft Visual C++ 6.0

Table I presents the computational results of PSO, GDRCGA and DE, where “mean” represents the mean best function evaluations (the sum error between estimate wave and the target wave), and “Std Dev” indicates the standard deviation. From the results, DE can successfully find the optimal set of parameter for FM sound synthesis. GD-RCGA can also achieve promising results, while PSO falls into local minima.

Figure 1. The convergence curve of PSO and DE.

Table II shows the comparison between PSO and DE, where “AFE” means the obtained average number of function evaluations when searching the global optimum, and “SR” represents the successful running rate. As seen, DE obtains 100% successful running rate. It means that DE can search the global optimum every runs, while PSO only has a probability of 20% to find the global optimum. Fig. 1 presents evolutionary curves of PSO and DE. It is clear that DE converges faster than PSO.

TABLE II.

COMPARISON BETWEEN PSO AND DE

Algorithms

AFE

SR

PSO

94080

20%

DE

68760

100%

[3]

A. Horner, J. Beauchamp, L. Haken, “Machine tongues XVI: genetic algorithms and their application to FM matching synthesis,” Computer Music Journal, vol. 17, no. 4, pp. 17-29, 1993.

[4]

A. Horner, J. Beauchamp, L. Haken, “Methods for multiple wavetable synthesis of musical instrument tones,” Journal of the Audio Engineering Society, vol. 4,1 no. 5, pp. 336-356, 1993.

[5]

F. Herrera, M. Lozano, “Gradual distributed real-coded genetic algorithms,” IEEE Transaction on Evolutionary Computation, vol. 4, no. 1, pp. 43-62, 2000.

[6]

R. Storn, K. Price, “Differential evolution--A simple and efficient heuristic for global optimization over continuous spaces”, Journal of Global Optimization, vol. 11, pp. 341-359, 1997.

[7]

J. Vesterstrom, R. Thomsen, “A comparative study of differential evolution, particle swarm optimization, and evolutionary algorithms on numerical benchmark problems,” Proceedings of the IEEE Congress on Evolutionary Computation, 2004, pp. 1980-1987.

[8]

S. Das, P. N. Suganthan, “Problem definitions and evaluation criteria for CEC 2011 competition on testing evolutionary algorithms on real world optimization problems”, Technical Reprot, 2010.

[9]

J. Kennedy and R. C. Eberhart, “Particle swarm optimization”, IEEE International Conference on Neural Networks, Perth, Australia., pp. 1942-1948, 1995.

[10] Abdelbar, A.M.Abdelshahid, “Swarm optimization with instinct-driven particles”, the 2003 Congress on Evolutionary Computation, 2003

Figure 2. The comparison of target wave, estimated wave achieved by DE, and estimated wave achieved by PSO.

Fig. 2 shows the comparison of target wave, estimated wave achieved by DE, and estimated wave achieved by PSO. It can be seen that DE perfectly approximates the target wave, while PSO fails.

VI.

CONCLUSION

In this paper, we apply DE algorithm for FM sound synthesis parameter identification. Experimental studies are conducted on a benchmark problem. The results demonstrate that DE is applicable to optimize the parameters of FM sound synthesis. DE shows faster convergence speed and more robust performance than PSO. DE can successfully find the optimal set of parameters, while PSO fails. In our experiments, we only consider a six-dimensional problem. A more complex problem with higher dimensions will be investigated in the future work.

REFERENCES

[1]

J. M. Chowning, “The synthesis of complex audio spectra by means of frequency modulation,” Journal of the Audio Engineering Society, vol. 21, no. 7, pp. 526-534, 1973.

[2]

T. J. Mitchell, An exploration of evolutionary computation applied to frequency modulation audio synthesis parameter optimisation, Ph.D Thesis, University of the West of England, 2010.

Paper Title (use style: paper title)

Turin, Italy [email protected]. Hui Wang. School of Information Engineering. Nanchang Institute of Technology. Nanchang 330099, China [email protected]. Abstract—Frequency Modulation (FM) sound synthesis provides a neat synthesis method by which complex sound forms can be created simply ...

101KB Sizes 0 Downloads 372 Views

Recommend Documents

Paper Title (use style: paper title) - Sites
Android application which is having higher graphics or rendering requirements. Graphics intensive applications such as games, internet browser and video ...

Paper Title (use style: paper title) - GitHub
points in a clustered data set which are least similar to other data points. ... data mining, clustering analysis in data flow environments .... large than the value of k.

Paper Title (use style: paper title)
College of Computer Science. Kookmin ... of the distinct words for clustering online news comments. In ... This work was supported by the Basic Science Research Program through .... is performed on class-wise reviews as depicted in Fig. 1(b).

Paper Title (use style: paper title)
School of Electrical Engineering, KAIST .... [Online]. Available: http://yann.lecun.com/exdb/mnist/. [5] Design Compiler User Guide, Synopsys, Mountain View, CA, ...

Paper Title (use style: paper title)
on the substrate, substrate pre-deposition process, and Pd deposition .... concentration is below the ignition threshold, which is often important for such a sensor.

Paper Title (use style: paper title)
mobile wireless networking, it is becoming possible to monitor elderly people in so-called ... sensor network that might be used in order to recognize tasks described in Table 1. ..... its advantages, and their relative merits and demerits are still.

Paper Title (use style: paper title)
zero which means cosθ tends to 1. The distance between each of the test vectors and profile vectors were obtained using (2). If the cosine value between the test vector and profile hub vector was greater than the cosine value between the same test v

Paper Title (use style: paper title)
communication channel between the sensors and the fusion center: a Binary ..... location estimation in sensor networks using binary data," IEEE Trans. Comput., vol. ... [9] K. Sha, W. Shi, and O. Watkins, "Using wireless sensor networks for fire.

Paper Title (use style: paper title)
search and compact storage space. Although search ... neighbor search methods in the binary space. ... Given a query ∈ { } , we list the online search algorithm.

Paper Title (use style: paper title)
Research Program Fellowships, the University of Central Florida – Florida. Solar Energy Center (FSEC), and a NASA STTR Phase I contract. NNK04OA28C. ...... Effluents Given Off by Wiring Insulation," Review of Progress in. QNDE, vol. 23B ...

Paper Title (use style: paper title)
In Long term Evolution. (LTE), HARQ is implemented by MAC level module called .... the receiver is decoding already received transport blocks. This allows the ...

use style: paper title
helps learners acquire scientific inquiry skills. One of ... tutoring systems; LSA; natural language processing ..... We collected data from 21 college students who.

Paper Title (use style: paper title)
Reducing Power Spectral Density of Eye Blink Artifact through Improved Genetic ... which could be applied to applications like BCI design. MATERIALS AND ...

Paper Title (use style: paper title)
general, SAW technology has advantages over other potentially competitive ... SAW devices can also be small, rugged, passive, wireless, and radiation hard,.

Paper Title (use style: paper title)
provide onboard device sensor integration, or can provide integration with an .... Figure 2 Schematic diagram of a 7 chip OFC RFID tag, and. OFC measured and ..... [3] C. S. Hartmann, "A global SAW ID tag with large data capacity," in Proc.

Paper Title (use style: paper title) - Research at Google
decades[2][3], but OCR systems have not followed. There are several possible reasons for this dichotomy of methods: •. With roots in the 1980s, software OCR ...

Paper Title (use style: paper title) - Research
grams for two decades[1]. Yet the most common question addressed to the author over more than two decades in OCR is: “Why don't you use a dictionary?

Paper Title (use style: paper title)
determine the phase error at unity-gain frequency. In this paper, while comparing some topologies we ... degrees at the integrator unity gain frequency result in significant filter degradation. Deviations from the .... due to gm/Cgd occur at a much h

Paper Title (use style: paper title)
Abstract— The Open Network and Host Based Intrusion Detection. Testbed .... It is unique in that it is web-based. .... sensor is also the application web server.

Paper Title (use style: paper title)
Orlando, FL 32816-2450 (email: [email protected]). Brian H. Fisher, Student .... presentation provides a foundation for the current efforts. III. PALLADIUM ...

Paper Title (use style: paper title)
A VLSI architecture for the proposed method is implemented on the Altera DE2 FPGA board. Experimental results show that the proposed design can perform Chroma-key effect with pleasing quality in real-time. Index Terms—Chroma-key effect, K-means clu

Paper Title (use style: paper title)
the big amount of texture data comparing to a bunch of ... level and a set of tile data stored in the system memory from ... Figure 1: Architecture of our algorithm.

Paper Title (use style: paper title)
printed texts. Up to now, there are no ... free format file TIFF. ... applied on block texts without any use of pre- processing ... counting [12, 13] and the reticular cell counting [1]. The main ..... Computer Vision and Image Understanding, vol. 63

Paper Title (use style: paper title)
School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore 639798. Abstract— ... For 60GHz wireless communication systems, the ... the benefit of isolated DC noise from the tuning element. The load on ...