JOURNAL OF COMPUTER SCIENCE AND ENGINEERING, VOLUME 1, ISSUE 1, MAY 2010 99

Fuzzy Modeling and Natural Language Processing for Panini’s Sanskrit Grammar P. Venkata Subba Reddy Abstract—Indian languages have long history in World Natural languages. Panini was the first to define Grammar for Sanskrit language with about 4000 rules in fifth century. These rules contain uncertainty information. It is not possible to Computer processing of Sanskrit language with uncertain information. In this paper, fuzzy logic and fuzzy reasoning are proposed to deal to eliminate uncertain information for reasoning with Sanskrit grammar. The Sanskrit language processing is also discussed in this paper. . Index Terms— Fuzzy logic, Fuzzy reasoning, Natural language processing, Panini’s Sanskrit grammar

——————————  ——————————

1 INTRODUCTION Computer languages are defined based on particular grammar, for example C, Pascal etc. Computer processing of natural languages has become an application area such as natural language processing, Machine translation etc.Natural languages are to be defined with the grammer. Panini defined Sanskrit grammar with 4000 rules long back in sixth century B.C. [4]. The rules of Panini’s Sanskrit grammar contain uncertainty. Computer processing of this grammar is very difficult unless uncertainty is eliminated. There are many logics available to deal with uncertainty like fuzzy logic, probabilistic logic etc. Fuzzy logic and reasoning [7, 8, 13, 14] are discussed to deal with uncertainty in the rules of Panini’s Sanskrit grammar. The Paninian approach to Natural Language Processing (NLP) is reviewed and compared with the current computer-based understanding systems [6, 12, 15]. The Paninian-style generative rules and meta-rules could assist in further advances in NLP.

2 FUZZY LOGIC Zadeh[9] proposed Fuzzy logic to deal with Uncertain and Incomplete information. Here we discuss Fuzzy logic briefly and later we apply it for Panini’s Sanskrit Grammar [2,3]. Zadeh[9] has introduced Fuzzy set as a model to deal with imprecise, inconsistent and inexact information. Fuzzy set is a class of objects with a continuum of grade of membership. The Fuzzy set A of X is characterized by its membership function A = µA(x ) and ranging values in the unit interval [0, 1] µA(x ): X [0, 1], x ЄX, where X is Universe of discourse. or A = µA(x 1)/x1 + µA(x 2)/x2 + … + µA(x n)/xn

Where “+” is union For instance, x is Tall is defined as Tall = µTall(x) [0, 1], where “Tall” is fuzzy set. Tall = 0.56/x1 + 0.6/x2 + 0.65/x3 + 0.67/x4 + 0.69/x5 The combination of Fuzzy sets are given as follows AΛB = min {µA(x), µB(x)}/x (disjunction) AVB = max {µA(x), µB(x)}/x (conjunction) A’ = 1 - µA(x)/x (negation) A  B= max {1 - µA(x), µB(x)}/x (implication) µA(x, y)= {µA(x) X µB(y)}/x,y = min{µA(x) X µB(y)}/x,y A o R = max {µA(x), µA(x, y)}/x where R(x, y) is Fuzzy relation and “o” is composition The propositions may contain quantifiers. Square operator is used for ‘very’, ‘most’ (concentration). The square root operator is used for ‘more or less’ (diffusion). For instance, µvery young(x) = µyoung(x)2 µnotvery young(x) = 1-µyoung(x)2 µmore or less young(x) = µyoung(x)1/2 Fuzzy reasoning[8] is a drawing conclusion from Fuzzy propositions using fuzzy inference rules. Some of the Fuzzy inference rules are given bellow R1:

x is A x and y are B _____________ y is AΛB

R3: x and y are A y and z are B ______________ y and z are B

R2: x is A x or y is B ____________ y is AVB R4: x or y are A y or z are B ____________ x or z are B

100

5. R5: x is A if x is A then y is B _______________ y is A o (AB)

µSyadasti(x) Λµdifferent times (x)1/2 where yet is diffusion 6.

3 FUZZY MODELLING SANSKRIT GRAMMAR

FOR

PANINI’S

Grammars are defined to either programming languages or natural languages. Computer processing of natural languages and language translations is an application area in the computer field. Indian languages are having long history. Panini proposed grammar with 4000 rules for Sanskrit. These are categorized into different sets. One of them is Syadvada set. The Syadvada set contains seven possibilities they are given below. 1. May be, it is. (Syadasti) 2. May be, it is not (Sada nasti) 3. May be it is, and it is not at different times (Syad astinasti) 4. May be it is and it is not at the same time and is indescribable (Syad avaktavya) 5. May be it is, and yet indescribable (Syad asti avaktavya) 6. May be it is not, and also indescribable (Syad asti nasti avaktavya) 7. May be it is, and it is not and also indescribable ( Syad asti nasti avaktavya) The above rules contain uncertainty. The uncertainty has to be eliminated for further computer processing of the Sanskrit language. Fuzzy logic can be used to eliminate the uncertainty. The fuzzy logic is applied for the above rules to eliminate uncertainty and the rules are given below. 1.

May be, it is. ( Syadasti) µSyadasti(x) →[0,1]

2.

May be, it is not (Syad nasti) Syad nasti = 1 - µSyadasti(x) May be it is, and it is not at different times (Syad asti-nasti)

3.

4.

where “t” is constant. May be it is and yet indescribable. (Syad asti avaktavya)

May be it is not, and also indescribable (yad asti nasti avaktavya) (1- µSyadasti(x)) Λµdifferent times (x)

7. May be it is, and it is not and also indescribable (Syad asti nasti avaktavya) µSyadasti(x) Λ(1- µSyadasti(x) Λµdifferent times (x)) This fuzzy represention of the sanskrit sentences shall be further used for fuzzy reasoning [7, 13, 14, 16]. For istance, consider two sentences May be, it is. (Syadasti) May be it is, and it is not at different times (Syad astinasti) The inference will be given as using R1 “it is not at different times “ with the fuzziness (Syadasti) Λ (Syad asti-nasti)

4 SANSKRIT LANGUAGE PROCESSING Beattie[1] presents an introductory review of some aspects of the computer processing of natural language in the form of a string of alphabetic characters, for example, spoken word. Applications of such processing in fields like information storage and retrieval and computer-assisted instruction are discussed for a computer to “understand” natural language[3]. The Sanskrit Language can be processed by defining English alphabetic characters using OM SETUP. This Sanskrit language representation in English shall be used for reasoning with the Sankrit language

µSyadasti(x) Λ(1 - µSyadasti(x) o µdifferent times (x,y)) where “o” is composition and “y” is time variable

Zenon [16] describes a new programming language FLISP which provides a number of facilities for efficiently representing and manipulating fuzzy knowledge. It is based on fuzzy sets and fuzzy logic theories. The language contains a collection of fuzzy-set operations and procedures for solution of fuzzy relational equations with triangular norms. FLISP forms implement and examine the fuzzy control algorithms. FLISP may be used to deal with Uncertainty in Panini’s Sanskrit Grammer.

May be it is and it is not at the same time and is indescribable

PROLOG can also be used for reasoning with the Sanskrit language.

(µSyadasti(x) Λ (1 - µSyadasti(x) o µdifferent µdifferent times (x)

times

(x,t)) Λ 5 COCLUSION

101

Panini’s proposed Sanskrit Grammar to Sanskrit language. The rules contain uncertainty information. The elimination of uncertainty information with Fuzzy logic is discussed. Fuzzy logic and fuzzy reasoning are discussed to deal with uncertainty information in Panini’s Sanskrit Grammar to make it convenient for further computer processing. The Computer processing of Sanskrit language is discussed.

AKNOWLEDGEMENT Our special thanks to Prof. V.V.S.Sarma, Computer Science and Automation, Indian Institute of Science, Bangalore for discussion, and reviewers for their valuable suggestions.

[9] [10]

[11] [12]

[13] [14]

REFERENCES [15] [1] [2]

[3]

[4] [5] [6] [7] [8]

J.D. Beattie, Natural language processing by computer,International Journal of Man-Machine Studies, Volume 1, Issue 3, Pages 311-329, July 1969. S. Herath, S. Ishizaki, Y. Anzai, H. Aiso, T. Ikeda , Machine processing of a natural language with interchangeable phrases, Information Sciences, Volume 66, Issues 1-2, Pages 139-165, December 1992. James Pustejovsky, Branimir Boguraev, Lexical knowledge representation and natural language processing, Artificial Intelligence, Volume 63, Issues 1-2, Pages 193-223, October 1993. Sarma, V.V.S., “A survey of Indian Logic from the point of view of Computer Science”, Sadhana –Academy Proceedings in Engineering Sciences, 19,6,971-983, 1994. Sarma, V.V.S., “Computers,Sanskrit and Indian Sanskrit Traditions”, Presentation at the Seminar at Kakatiya University on 25 September 2000. Subhash C. Kak The Paninian approach to natural languageprocessing, International Journal of Approximate Reasoning, Volume 1, Issue 1, Pages 117-130, January 1987. Venkataa Reddy, P. and Syam Babu, M., “Some methods of reasoning for fuzzy conditional propositions”, Fuzzy Sets and Systems, 52,229-250, 1992. L. A Zadeh, “Calculus of Fuzzy Restrictions”, In Fuzzy Sets and their Applications to Cognitive and Decision Processes, L. A. Zadeh, King-Sun FU, Kokichi Tanaka and Masamich Shimura (Eds.), Academic Press, New York, 1-40, 1975.

[16]

Zadeh, L.A., “Fuzzy Sets”, Information Control, 8, 338-353, 1965. L.A. Zadeh, Fuzzy sets and information granularity, Advances in Fuzzy Set Theory and Applications, M. Gupta, R. Ragade and R. Yager (eds.), 3-18. Amsterdam: NorthHolland Publishing Co., 1979. L.A. Zadeh, A theory of approximate reasoning, Machine Intelligence 9, J. Hayes, D. Michie, and L.I. Mikulich (eds.), 149-194. New York: Halstead Press, 1979. L.A. Zadeh, Test-score semantics for natural languages and meaning representation via PRUF, Empirical Semantics, B. Rieger (ed,), 281-349. Bochum, W. Germany: Brockmeyer, 1982. Also Technical Memorandum 246, AI Center, SRI International, Menlo Park, CA., 1981. L.A. Zadeh, Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic, Fuzzy Sets and Systems 90, 111-127, 1997. L.A. Zadeh, Some reflections on soft computing, granular computing and their roles in the conception, design and utilization of information/intelligent systems, Soft Computing 2, 23-25, 1998. L.A. Zadeh, Precisiated Natural Language (PNL), AI Magazine, Vol. 25, No. 3, 74-91, 2004. Zenon A. Sosnowski ,FLISP — A language for processing fuzzydata,Fuzzy Sets and Systems, Volume 37, Issue 1, Pages 23-32, August 1990.

P. Venkata Subba Reddy is working as Associate Professor in Department of Computer science and Engineering, College of Engineering, Sri Venkateswara University, Tirpathi, India since 2001. He joined as Asst. Professor in Department of Computer science and Engineering, College of Engineering, Sri Venkateswara University in 1992 and promoted as Associate Professor in 2001. He did Post Graduate degree in Applied Mathematics with Computer Programming as Specilization during 1984-86. He did his Post Graduation Diploma in Computer Methods & Programming from Computer Society of India, Hyderabad. He did M.Phii in Database Management systems during 1986-88 and Ph.D in Artificial Intelligence during 1988-1992 in Sri Venkateswara University, Tirpathi, India. . He did Post Doctoral/Visiting fellowship in Fuzzy Algorithms from IISC/JNCAR, Bangalore, India under Prof. V. Rajaraman in 1996. He is actively engaged in Teaching and Research work to B.Tech., M.Tech., and Ph.D students. He published papers in reputed journals. He is an Editor for JCSE.

PDF only - arXiv

JOURNAL OF COMPUTER SCIENCE AND ENGINEERING, VOLUME 1, ISSUE 1, MAY 2010. 99. Fuzzy Modeling and Natural Language. Processing for ...

295KB Sizes 4 Downloads 344 Views

Recommend Documents

PDF only - arXiv
He did Post Graduate degree in. Applied Mathematics with Computer Programming as Spe- cilization during 1984-86. He did his Post Graduation Diplo-.

PDF only - arXiv
The data initially stored in the tree nodes of a path representing motif ACT. (A). (B). Fig. 3. (A) An example ... pass currently stored data to next node(s) down. (2) if tick count ≤ m set x=d+1 else set x=0 input x to the ..... M. C. Herbordt, J.

PDF only - arXiv
Abstract—A technique using a systolic array structure is proposed for solving the common approximate substring (CAS) ... be implemented on a FPGA in a design spe- cifically targetted for this DNA string. In other works this may be referred to as th

PDF only - arXiv
system, an existing software with components that can run over the Grid (at the moment for .... management of this considerable quantity of data represents a problem: ... accounting methods and protected data access only by users owning a ...

ProjectionNet - arXiv
Aug 9, 2017 - ing the computation-intensive operations from device to the cloud is not a feasible strategy in many real-world scenarios due to connectivity issues (data ..... [4] D. Bahdanau, K. Cho, and Y. Bengio, “Neural machine translation by jo

AWG - arXiv
Apr 17, 2009 - network protection are crucial issues in network operators for ... Simplified Passive Optical network architecture model. ...... Egypt, in 1976.

ProjectionNet - arXiv
Aug 9, 2017 - The two networks are trained jointly using backpropagation, where the projection network learns from the full network similar to apprenticeship learning. Once trained, the smaller network can be used directly for inference at low memory

Is “Sampling” - arXiv
Aug 26, 2016 - trade-offs between competing goals. This is particularly ... 1) What are smallest set of test cases that cover all program branches? 2) What is ... Hence, in recent years, there has been an increasing interest in search-based ... like

arXiv neutrino published
Nov 11, 2011 - Can apparent superluminal neutrino speeds be explained as a quantum ... in a birefringent optical fibre, is based on the fact that the vacuum is.

arxiv-v2.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. arxiv-v2.pdf.

Transactions Template - arXiv
registered with respect to the centre of the fingerprint image. The dimensionality of .... tions are then normalized into the domain from 0 to , and the certain values ...

arXiv:1306.2931v1
Jun 12, 2013 - tion [1, 10] is that the maximum edge 2-coloring number is at least the size of the maximum matching of the graph. Therefore, if a graph G has a matching of size at least k, then G is a YES-instance. .... |c(Fv)| ≤ q for all v ∈ V

Bachelor Thesis - arXiv
Jun 26, 2012 - system such as Solr or Xapian and to design a generic bridge ..... application server. ..... document types including HTML, PHP and PDF.

Bachelor Thesis - arXiv
Jun 26, 2012 - Engine. Keywords. Document management, ranking, search, information ... Invenio is a comprehensive web-based free digital library software.

Catalogue of Spacetimes - arXiv
Nov 4, 2010 - 2.10 Gödel Universe . ...... With the Hamilton-Jacobi formalism it is possible to obtain an effective potential fulfilling 1. 2. ˙r2 + 1. 2. Veff(r)=. 1. 2.

101 Formulaic Alphas - arXiv
Dec 9, 2015 - Free University of Tbilisi, Business School & School of Physics. 240, David Agmashenebeli ... Business School and the School of Physics at Free University of Tbilisi. ...... Grinold, R.C. and Kahn, R.N. “Active Portfolio Management.â€

Molten Air arXiv
Molten Air - A new, highest energy class of rechargeable batteries ... (rechargeable), and have amongst the highest intrinsic battery electric energy storage capacities. ... We have recently presently a new pathway for the CO2-free synthesis of iron

Egg of Columbus - arXiv
where F is the force, x is the displacement,. AF. = σ is the stress,. 0 lx. = ε is the strain, 0 l is the end-to-end length of the fibre (see Fig.1),. ( ) 1. 0. 0. 1. ≤. −=≤lll.

τ λ - arXiv
Key words: Reliability, Markov process, Petri nets, Fault tree analysis. 1. INTRODUCTION ... assumption about the original data. However, fuzzy logic provides ...

Variational Program Inference - arXiv
If over the course of an execution path x of ... course limitations on what the generated program can do. .... command with a prior probability distribution PC , the.

Catalogue of Spacetimes - arXiv
Nov 4, 2010 - 2.10 Gödel Universe . ..... We will call a local tetrad natural if it is adapted to the symmetries or the ...... 2.17 Oppenheimer-Snyder collapse.

Novel Code Family - arXiv
system performance is degraded because of incoherency of the received light fields. ..... =1+( + 1)(2 − ). Fig.5. (a) Marked-set and (b) Detection-set of a user.

Variational Program Inference - arXiv
reports P(e|x) as the product of all calls to a function: .... Evaluating a Guide Program by Free Energy ... We call the quantity we are averaging the one-run free.