4. OpenFst: An Open-Source, Weighted Finite-State Transducer Library and its Applications to Speech and Language Michael Riley, Cyril Allauzen, and Martin Jansche, Google Inc. Finite-state methods are well established in language and speech processing. OpenFst (available from www.openfst.org) is a free and open-source software library for building and using finite automata, in particular, weighted finite-state transducers (FSTs). This tutorial is an introduction to weighted finitestate transducers and their uses in speech and language processing. While there are other weighted finite-state transducer libraries, OpenFst (a) offers, we believe, the most comprehensive, general and efficient set of operations; (b) makes available full source code; (c) exposes high- and low-level C++ APIs that make it easy to embed and extend; and (d) is a platform for active research and use among many colleagues.

4.1

Tutorial Outline

1. Introduction to OpenFst The first part of the tutorial introduces operations on weighted automata such as determinization and intersection/composition as well as the corresponding OpenFst binaries and library-level APIs that implement those operations. We describe how to read FSTs from simple textual descriptions and combine them into larger and more complex machines, and optimize them using simple command-line and library calls. • Introduction – Motivating examples – Finite-state methods in NLP and Speech • A quick tour of OpenFst – – – –

Finite-state machines and operations OpenFst binaries High-level C++ API Human-readable file formats

• Comparison of OpenFst and competing libraries – Comparison with the AT&T FSM LibraryTM – Brief comparison with SFST and related libraries • Advanced usage of OpenFst – – – – –

Low-level C++ API Constructing and modifying Fst objects programmatically Implementing new concrete Fst classes Adding new weight semirings Customizing matchers and filters for composition

305 Proceedings of the North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL HLT) 2009 conference, Companion Volume, pages 305–306, Boulder, Colorado, May/June 2009. c

2009 Association for Computational Linguistics

2. Applications The second part of the tutorial focuses on several application areas of interest to the NAACL HLT audience, including speech recognition, speech synthesis, and general text processing. In each application area we discuss a straightforward example in detail, then delve into an advanced example that highlights important features of OpenFst, common pitfalls, efficiency considerations, new research directions, etc. These examples are drawn from our extensive experience in applying OpenFst to problems in speech and language processing. • Automatic speech recognition – Context dependency transducers – Language models and grammars • Natural Language Processing – Unicode processing – Text analysis, text normalization – Pronunciation models • Other areas – Computational biology – Pattern matching

4.2

Target Audience

This tutorial is intended for students, researchers, and practitioners interested in applying finite-state methods. It is suitable for participants of a variety of backgrounds, including those without any background in finite-state techniques as well as advanced users of existing software packages. For those unfamiliar with finite-state transducers, the first portion of the tutorial provides an introduction to the theory and application areas. Users of other finite-state libraries will particularly benefit from the contrastive description of the unique features of OpenFst, its C++ API, and the examples drawn from real-world applications.

4.3

Presenters

Michael Riley began his career at Bell Labs and AT&T Labs where he, together with Mehryar Mohri and Fernando Pereira, introduced and developed the theory and use of weighted finite-state transducers (WFSTs) in speech and language. This work was recognized in best paper awards from the journals Speech Communication and Computer Speech and Language. He has been a research scientist at Google, Inc. since 2003. He is a principal author of the OpenFst library and the AT&T FSM LibraryTM . He has given several tutorials on WFSTs before: at ACL 1994, Coling 1997 and Interspeech 2002. Cyril Allauzen is another key author of the OpenFst library. His main research interests are in finite-state methods and their applications to text, speech and natural language processing and machine learning. Before joining Google, he worked as a researcher at AT&T Labs – Research and at NYU’s Courant Institute of Mathematical Sciences. Martin Jansche has applied the OpenFst library to several speech and language problems at Google. His FST-related interests are in text processing for speech tasks and in learning and applying pronunciation and transliteration models. 306

4. OpenFst: An Open-Source, Weighted Finite ... - Research at Google

and its Applications to Speech and Language. Michael ... APIs that make it easy to embed and extend; and (d) is a platform for active research and use among.

60KB Sizes 6 Downloads 378 Views

Recommend Documents

Music Identification with Weighted Finite-State ... - Research at Google
tle, album and recording artist(s) of a song with just a short au- .... In the following, we describe the construction of the factor au- tomaton .... We applied a support.

Encoding linear models as weighted finite-state ... - Research at Google
be used to apply the model to lattice input (or other more gen- eral automata) ..... of smoothing methods and n-gram orders on the development set, and settled ...

Learning with Weighted Transducers - Research at Google
b Courant Institute of Mathematical Sciences and Google Research, ... over a vector space are the polynomial kernels of degree d ∈ N, Kd(x, y)=(x·y + 1)d, ..... Computer Science, pages 262–273, San Francisco, California, July 2008. Springer-.

Improved Consistent Sampling, Weighted ... - Research at Google
simple data statistics to reduce the running time of hash ... statistics (such as histograms of color or texture). .... To obtain the distribution P(z), let us analyze its.

Pynini: A Python library for weighted finite-state ... - Research at Google
Aug 12, 2016 - is a patented predictive text entry system. In T9, each character in the “plaintext” alphabet is as- signed to one of the 9 digit keys (0 is usually re-.

Weighted Proximity Best-Joins for Information ... - Research at Google
1Department of Computer Science, Duke University; {rvt,junyang}@cs.duke.edu. 2Google ... input to the problem is a set of match lists, one for each query term, which ... Two matchsets with different degrees of clusteredness but equal-size enclosing .

Filters for Efficient Composition of Weighted ... - Research at Google
degree of look-ahead along paths. Composition itself is then parameterized to take one or more of these filters that are selected by the user to fit his problem.

Weighted Flowtime on Capacitated Machines - Research at Google
clouds which share resources and machines among ... farms and clouds often have excess capacity provi- ...... com/solutions/cloud-computing/index.html〉.

What is the Computational Value of Finite ... - Research at Google
Aug 1, 2016 - a substantial constant overhead against physical QA: D-Wave 2X again runs up ... ization dynamics of a system in contact with a slowly cooling.

Efficient and Robust Music Identification with Weighted Finite-State ...
of Mathematical Sciences, New York, NY USA, and Google Inc. e-mail: {mohri ... tion II presents an overview of our music identification ap- ...... he worked for about ten years at AT&T Labs - ... M.Eng. and B.S. degree in Computer Science from.

N-way composition of weighted finite-state transducers
as a finite automaton and painlessly integrated in our 3-way composition. ...... for a non-Ç« move in Tn. We call a maximal block of xs in a move m = (m1,...,mn).

3-Way Composition of Weighted Finite-State Transducers
recognition or information extraction system using a fundamental transducer al- ..... state, instead of the moves ab or ba, the matching or diagonal transition c can ..... New York State Office of Science Technology and Academic Research ...

Filters for Efficient Composition of Weighted Finite ... - Semantic Scholar
ter and presents various filters that process epsilon transitions, look- ahead along ... in OpenFst [3], an open-source weighted transducer library. 2 Composition ...

Efficient and Robust Music Identification with Weighted Finite-State ...
large-scale music identification system can be constructed for a database of over 15 .... In a second step we apply ordinary k-means clustering to refine the clusters ... 3) Measuring Convergence: In speech recognition, each utterance is usually ...

Music Identification with Weighted Finite-State ...
ply an unsupervised training process to learn an inventory of music phone units ... database of over 15,000 songs running faster than real time. Index Terms— ...

3-Way Composition of Weighted Finite-State Transducers
algorithm. We report the results of several experiments demonstrating ..... composition, it is possible to remain at the same state of T1 and the same state of.

N-Way Composition of Weighted Finite-State Transducers
weighted transducer representing the relational composition of the two transducers. Composition is ... with input labels. Weighted transducers are finite-state transducers in which each transition carries some weight in addition to the input and outp

Efficient and Robust Music Identification with Weighted Finite-State ...
that it helps speed up the construction of the weighted suffix automaton in our task by a ... In a practical setting, the test recording provided to a music identification ...... to be r (line 19) and that of r is set to be s[q′] (18), and l[r] set

Efficient and Robust Music Identification with Weighted Finite-State ...
be used to give a compact representation of all song snippets for a large .... song during the course of three iterations of acoustic model training. mpx stands for ...... transducers in speech recognition,” Computer Speech and Language, vol.

Efficient and Robust Music Identification with Weighted Finite-State ...
a database of over 15 000 songs while achieving an identification ... our system is robust to several different types of noise and ..... can be distributed along a path. ...... [9] D. Pye, “Content-based methods for the management of digital music,

An Information Avalanche - Research at Google
Web-page editors, blogging soft- ware, image- and video-sharing ser- vices, Internet-enabled mobile devices with multimedia recording capability, and a host of ...

Mathematics at - Research at Google
Index. 1. How Google started. 2. PageRank. 3. Gallery of Mathematics. 4. Questions ... http://www.google.es/intl/es/about/corporate/company/history.html. ○.

Measuring Nonprofit Results - OpenSource Leadership Strategies
services and systems, prevented higher-cost behaviors and activities, and/or delivered ... a “portfolio” of groups across a geographic region or field of work.