An Improved Text Entry Tool for PDAs a

Massimo ANCONAa , Gianluca QUERCINIa and Luca DOMINICIa Dipartimento di Informatica e Scienze dell’Informazione (DISI), University of Genoa Via Dodecaneso 35 16146, Genova, ITALY [email protected], [email protected], [email protected]

Abstract. Textual data entry on PDAs and, more in general, on mobile devices, is still far from being as comfortable as on desktop PCs. Different solutions have been envisaged, ranging from small hardware keyboards and new layouts of software keyboards to predictive input techniques and handwriting recognition systems. Although each of them has its bright sides, no method is universally accepted by the users. This paper aims at describing the new features of a tool, named Wtx, that cleverly combines different text entry techniques and allows users to choose among them. Particularly interesting are the integration of a handwriting recognition system and the FTL rule, that remarkably improves text prediction in Wtx.

1. Introduction PDAs (Personal Digital Assistants) were born as simple personal electronic books, with limited functionalities and computational power. Nowadays their performances are comparable to those of old desktop PCs; as a result, most of the applications running on PCs are supported by PDAs, with all the benefits springing from the portability of the devices. However, the interaction is affected by their limited size and the lack of tools such as keyboard and mouse, replaced by a touch-screen and a tiny pen. Whereas a pen allows a fast and comfortable selection of files and directories (MacKenzie et al., 1991), the same can not be claimed for text entry. This is a critical point, as text entry is at the basis of human-computer interaction. In the dreams of Alan Kay, who devised the first prototype of portable device (the Dynabook), the interaction with a PDA should not differ from that obtained with a simple notepad; it is the so-called pen and paper metaphor (Meyer, 1995). Handwriting recognition is the key of this dream. Although good results have been reached over years of researches, many users are still reluctant to use them, as they feel them slow and inaccurate. For this reason, any PDA operating system typically offers two text entry tools, a handwriting recognition system and a virtual Qwerty keyboard. This paper is the continuation of our previous work on a powerful and comfortable text entry tool named Wtx (Ancona et al., 2005). By integrating into a common interface different technologies, it provides users the possibility of choosing their favorite one. The final aim of Wtx is not speed (in terms of number of words per minute), but comfort. In particular, the last version includes a handwriting recognition system and a new rule, which we called FTL rule (First-Third-Last rule), that imposes to insert letters of a word in a different order than usual. The remainder of the paper is organized as follows: in section 2 a brief overview of the main text entry tools is given; in section 3 we describe the key concepts involved in Wtx; in section 4 we introduce the FTL rule; in section 5 we explain the main reasons behind our choice of integrating a handwriting recognition system in Wtx; in section 6 some experimental results are discussed and finally in section 7 we sketch future directions of our research.

2. State of Art Text entry tools for PDAs may be roughly classified into four categories: small hardware keyboards, software (or virtual) keyboards, handwriting recognition systems and voice recognition systems. Also, there are midway approaches between the second and the third one, named gesture-based systems. However tiny they are, hardware keyboards require the user to lay the device on a desk. This approach strongly limits the portability of the device and make it similar to a notebook. Although users generally prefer a hardware keyboard (Rodriguez et al., 2005), a software or virtual keyboard is more suitable on a PDA. A virtual keyboard is the image of a tiny keyboard with which users interact by “tapping” the keys by the pen. A virtual Qwerty keyboard is inefficient as Qwerty has been thought for typewriters; in order to prevent mechanical jamming, letters forming frequent digraphs in English are kept apart on the keyboard. Since such letters should be as close as possible on a virtual keyboard, in order to minimize hand movements (thus improving the speed), alternative keyboard layouts have been proposed (MacKenzie & Soukoreff, 2002). Because of the presence of the pen, handwriting recognition systems seem to be a more suitable approach. However, the metaphor dreamed by Alan Kay is not a reality yet. Most of the systems, in fact, force users to write one character at a time: this makes the writing process unnatural, slow and stressful. Moreover, since handwriting varies from person to person, each system proposes a customized alphabet with which users have to comply, in order to get the characters correctly interpreted. Speech recognition systems would be the most effective interaction method, as they require less attention than the other techniques. In the last decade many efforts have been invested in this research field, with the development of

valid systems; however, their accuracy can not be compared to that achieved by a virtual keyboard or even a handwriting recognition system. Moreover, some situations prevent speech recognition to be used, especially when users would like to preserve their privacy.

3. Key Concepts in Wtx In a previous work we defined Wtx as an “input method for selecting words from a long list (called a dictionary) of specialized terms like a long list of therapies or pathologies” (Ancona et al., 2005). The first version of Wtx, called WordTree, visualized the words of a dictionary into a tree structure (Ancona & Comes, 1999). Tests immediately showed that such interface was not so comfortable as expected, due to the limited size of the area devoted to text entry on PDAs. For that reason, the following versions opted for the interface showed in figure 1: on the left the selection area displays up to ten words to select, whereas on the right the composition area contains an alphabetical virtual keyboard (figure 1(a)) or a handwriting recognition system (figure 1(b)). An option exists to swap the two areas in order to make Wtx usable also for left-handed people.

(a)

(b)

Figure 1. Wtx interface (a) with a virtual keyboard and (b) with handwriting recognition system

For the sake of brevity, in this section and in the next one, unless otherwise specified, we will refer to the interface displayed in figure 1(a). At the start-up the selection area contains the ten most frequent English words; whenever a letter is inserted, the ten most frequent words beginning with that letter are showed. If the desired word comes up in the selection area, it can be inserted by simply tapping on it and this action also forces the insertion of a space. This is what we call the traditional running mode of Wtx. Wtx was first used to select words from a dictionary of names of therapies and pathologies; these words, often very long, can be comfortably selected by performing only 2 or 3 taps on the virtual keyboard. Comfort has never been taken into account when evaluating a text entry method; usually speed (in terms of number of words per minute) and accuracy (number of committed errors) are used in the evaluation. However, not only does speed depend on the system, but also on users. When users feel tired, in fact, they decrease their speed or at least tend to commit more errors, even if they are using the fastest and most accurate virtual keyboard. Our first results show that Wtx is slower than Qwerty, but the number of errors is remarkably lower, whereas it increases with the time by using Qwerty. Thanks to the dictionary and the selection area, in fact, in Wtx the words can be inserted without tapping on all their characters. Therefore, the less is the number of taps the more is comfort. Although Wtx reaches its best performances when it comes to select words from a specialized dictionary, it can be also used in continuous writing. The dictionary used by Wtx for a general purpose usage counts more than 13000 English words. Moreover, it is possible to easily switch among different dictionaries, depending on the context. In the current version dictionary changes have to be performed manually; however, in future releases dynamic contextualization will be implemented, so as to make Wtx a comfortable, effective and context-aware tool for inserting text on PDAs.

4. The FTL rule The number of taps needed to insert a word is an important parameter in Wtx. In (Ancona et al., 2005) we presented some results that were not completely correct, for two reasons. First of all we compared a non-weighted mean (the mean length of the words) to a weighted mean (the number of taps)1 ; secondly the dictionary of Wtx had only 5000 words. We repeated the same calculations and we obtained that the mean length (weighted on the frequency of the words) of the words in the dictionary is 4.4 and the mean number of taps to perform in order to insert all the words is 2.43. This means that Wtx in the traditional mode almost halves the number of required taps. Improving the available space to the selection area would provide an even better result; however, the keyboard in the composition area would have too small a size. Instead of modifying the interface, we devised a new easy-to-learn method of writing the words to guarantee a number of taps less than 2.43 (Quercini, 2007). After the first letter, the third and the last one of a word have to be inserted, instead of the second an third one. If the word does not come up in the selection area, the user must insert the second letter and then the fourth, the fifth and so on. This is the so-called FTL rule (First-Third-Last), that is included in the last release of Wtx. The mean number of taps with this rule turns out to be 2.32. Moreover, 87% of the words in the dictionary are inserted with ≤ 4 taps with the FTL rule, whereas only 64% of the words enjoy this property in the traditional mode (Quercini, 2007). This is due to the fact that the FTL 1

We thank Prof. G. Nagy for correcting us

rule performs a stronger selection of the words than the traditional mode. Obviously it requires users to learn it, but, as we will see in section 6, this seems not to be an issue.

5. Handwriting Recognition in Wtx The last version of Wtx includes an option that allows the use of a handwriting recognition system instead of a virtual keyboard. A virtual keyboard is usually faster and more accurate than a handwriting recognition system, but requires a higher level of attention, especially in Wtx, where the composition area length is half the length of the SIP2 . As a result, comfort could be negatively affected. Figure 1(b) displays the interface of Wtx with handwriting recognition system: in the composition area a box, where letters have to be written, replaces the virtual keyboard. The advantage brought by the recognition system is that users can write the letters in the box and look at the selection area at the same time; thus, only looking for the desired word in the selection area demands their attention. Whereas in the previous section we measured comfort in terms of taps, here we use the concept of stroke, that we define as the gesture performed to write a letter by the pen. “Tapping” is faster than “stroking”, but requires users to search for the key over the keyboard; therefore the difference between taps and strokes is not much significant to evaluate comfort. However, the box in the composition area could be further reduced in order to display up to 15 words in the selection area. This choice will decrease the mean number of strokes, but looking for the desired word in the selection area will be more time-consuming. In order to obtain a quick evaluation, we interfaced Wtx to the handwriting engine provided by the operating system. Although we experienced some troubles in interfacing Wtx to this engine and the engine itself is not very accurate, obtained results are good. For this reason we are planning to improve this part of Wtx, by embedding into it a better recognition system, such as that described in (Joshi & Nagy, 2005).

6. Experimental Results Evaluating a text entry method is not a straightforward process. Usually two are the considered parameters: speed and accuracy. While speed can be easily computed in terms of words per minute (WPM), different measurements exist for accuracy (Soukoreff & MacKenzie, 2001). Moreover, tests can be set up in different ways. Typically participants have to write down a text, but results may change depending on whether the text is dictated or directly copied by users. Finally, both beginners and expert users should attend to tests, to estimate the maximum achievable performances and highlight potential difficulties in learning. Due to the complexity of Wtx, our tests have been concentrated only on beginners so far. Thus, we chose 10 people not used to PDAs (but skilled in using a PC) and we asked them to copy and write down a text (454 words) in three different sessions. During the first session they were required to use the Qwerty virtual keyboard; in the second one they had to get acquainted with Wtx, running in traditional mode with virtual keyboard, and in the last one with Wtx, running in FTL mode and with handwriting recognition system. Each session was preceded by a fast warm-up aimed at making participants a bit confident with the tool to use. The PDAs used during the trials were all Compaq iPAQ H3630, running a Microsoft CE 3.0 operating system.

Session I II III

Table 1. Wtx Test Results WPM Standard Deviation 13.33 2.71 9.64 2.28 6.43 1.87

Accuracy 2.35 0.22 0.28

The values in Table 1 are the mean of the results obtained by each participant. The speed is computed as the mean of the number of words written over a minute; the standard deviation is the measure of the variation of the speed over the time and the accuracy is the average number of incorrectly written words over one minute. In computing the accuracy we only ignored errors generated by the recognition system, as we know that it is very inaccurate. Thus, the higher are the values in the third column of Table 1, the worse is the accuracy of the evaluated method. As expected, the maximum speed is obtained with Qwerty, of which users have a better knowledge. However, the speed in Wtx is not affected only by the low level of participant’s experience, but also by the movements imposed to the hand from the composition area to the selection area. Moreover, users said that they had to spend some time to search the desired word in the selection area. This overhead could be considerably reduced if the words were sorted in alphabetical order instead of by their frequencies. Finally, the low value for speed reported in session III is due to the inaccuracy of the recognition system and to the novelty introduced by the FTL rule. Despite a lower speed, users felt Wtx (both in traditional and in the FTL mode) more comfortable than Qwerty, as they had not to write completely long words. The level of comfort is related to accuracy and standard deviation. On one hand a high number of misspelled words is clue of lack of comfort, as it is mainly the result of the tiredness of the user. Moreover, as a text should 2 SIP (Soft Input Panel) is the area of the screen devoted to text entry in PDAs running a Microsoft PocketPC operating system. Its size is set up by the operating system to 320x80 pixels.

be error-free, an additional effort is required to make the needed corrections. On the other hand, a low value of the standard deviation means a good level of comfort. In fact usually speed considerably decreases as tiredness increases; we noticed that, whenever the speed reaches a very low level, users tend to take a rest. When they resume their work, speed considerably goes up. This behavior, that we most observed during the first session, gets the standard deviation to increase. Thus, the values of the standard deviation and accuracy reported in Table 1 confirm users’ perceptions about the comfort of Wtx.

7. Conclusions and Future Works In this paper we presented Wtx, a text entry tool for PDAs that exploits different well-known technologies in order to provide users with a powerful and comfortable tool. Through a simple interface, users can choose two different running modes (traditional and FTL) and whether to use a virtual keyboard instead of a handwriting recognition system. We also described the FTL rule, that requires users to change the way they write a word, in order to decrease the number of taps or strokes. First experimental results confirm that the new rule is quite challenging for beginners, but it is not so hard to learn as we expected and contributes to increase the level of comfort. Coupling handwriting recognition to Wtx seems to be a successful choice, as users report a high degree of comfort with respect to the keyboard. It has not to be forgotten that the recognition engine we use in the current release is pretty far from being efficient and comfortable. Thus, including a more reliable and accurate recognition system might result in even better performances. In the next releases we plan to enrich the dictionary of Wtx by adding also all the plurals of the regular nouns and the forms of the paradigms of all the regular verbs; moreover, we would like to extend the use of Wtx to other languages. Also, dynamic contextualization of the dictionaries will be implemented and deeply tested. Finally, more accurate tests will be performed on a number of different users.

8. References Ancona, M., & Comes, D. (1999). WordTree: A pen-based editor of short texts. In Proceedings of the 9th biennial conference of the international graphonomics society (p. 263-269). Ancona, M., Quercini, G., Locati, S., Mancini, M., Locati, S., & Romagnoli, A. (2005). Comfortable textual data entry for PocketPC: the Wtx system. In Proceedings of the 12th biennial conference of the international graphonomics society (p. 261-265). Joshi, A., & Nagy, G. (2005). Online handwriting recognition using time-order of lexical and signal co-occurrences. In Proceedings of the 12th biennial conference of the international graphonomics society (p. 201-205). MacKenzie, I. S., Sellen, A., & Buxton, W. A. S. (1991). A comparison of input devices in element pointing and dragging tasks. In Chi ’91: Proceedings of the sigchi conference on human factors in computing systems (pp. 161–166). New York, NY, USA: ACM Press. MacKenzie, I. S., & Soukoreff, R. W. (2002). Text entry for mobile computing: Models and methods, theory and practice. HumanComputer Interaction, 17(2), 147–198. Meyer, A. (1995). Pen computing: a technology overview and a vision. ACM SIGCHI Bulletin, 27(3), 46–90. Quercini, G. (2007). Fast text prediction in Wtx: the FTL rule (Tech. Rep. No. DISI-TR-07-04). Via Dodecaneso 35, 16146 Genoa: University of Genoa - Disi. (http://www.disi.unige.it/person/QuerciniG/publications/DISI-TR-07-04.pdf) Rodriguez, N. J., Borges, J. A., & Acosta, N. (2005). A study of text and numeric input modalities on PDAs. In Proceedings of the 11th international conference on human-computer interaction. Soukoreff, R. W., & MacKenzie, I. S. (2001). Measuring errors in text entry tasks: an application of the Levenshtein string distance statistic. In Chi ’01: Chi ’01 extended abstracts on human factors in computing systems (pp. 319–320). New York, NY, USA: ACM Press.

An Improved Text Entry Tool for PDAs

virtual) keyboards, handwriting recognition systems and voice recognition systems. Also ... This is what we call the traditional running mode of Wtx. .... In Proceedings of the 12th biennial conference of the international graphonomics society (p.

128KB Sizes 1 Downloads 334 Views

Recommend Documents

A Tool for Text Comparison
The data to be processed was a comparative corpus, the. METER ..... where xk denotes the mean value of the kth variables of all the entries within a cluster.

Improved Text-Detection Methods for a Camera-based ...
visually impaired persons have become an important ... The new method we present in the next section of the .... Table 1 Results of Small Character Data set p.

An Improved FloatBoost Algorithm for Naïve Bayes Text ...
20-Newsgroups consists of 20,000 Usenet articles collected by K. Lang from 20 different newsgroups. For this data set, about 70% documents in each newsgroup are used for training (700 documents per class), while left documents are used for testing. (

SeRT - a tool for knowledge extraction from text ...
text is used as language and meta-language. - paradigmatic relations can be ... parallel search of terms and relations. - term extraction. - search for surface ...

An Improved Control Law Using HVDC Systems for ...
Aug 28, 2013 - Systems for Frequency Control. Jing Dai1. Gilney Damm2 ... information. ... This lead to a smaller degree of primary reserve sharing, than the ...

An Improved Degree Based Condition for Hamiltonian ...
Lenin Mehedy1, Md. Kamrul Hasan1 and Mohammad Kaykobad2. 1Department of Computer Engineering, Kyung Hee University, South Korea. 2Department of Computer Science and Engineering, North South University, Dhaka, Bangladesh. Email: 1{lenin, kamrul}@oslab

An Improved Divide-and-Conquer Algorithm for Finding ...
Zhao et al. [24] proved that the approximation ratio is. 2 − 3/k for an odd k and 2 − (3k − 4)/(k2 − k) for an even k, if we compute a k-way cut of the graph by iteratively finding and deleting minimum 3-way cuts in the graph. Xiao et al. [23

An Improved Likelihood Model for Eye Tracking
Dec 6, 2005 - This property makes eye tracking systems a unique and effective tool for disabled people ...... In SPIE Defense and Security Symposium,. Automatic ... Real-time eye, gaze, and face pose tracking for monitoring driver vigilance.

Design Principles for an Extendable Verification Tool for ... - SpaceEx
Basic components of analysis algorithms are post- and .... The waiting list contains the symbolic states whose ..... v.4.37. http://www.gnu.org/software/glpk.

An Improved Particle Swarm Optimization for Prediction Model of ...
An Improved Particle Swarm Optimization for Prediction Model of. Macromolecular Structure. Fuli RONG, Yang YI,Yang HU. Information Science School ...

AN IMPROVED CONSENSUS-LIKE METHOD FOR ...
{haihua xu,zhujie}@sjtu.edu.cn, [email protected], [email protected]. ABSTRACT ... for lattice rescoring and lattice-based system combination, versus baselines such .... similar approximations as used in Minimum Phone/Word Error.

OMEGA: An Improved Gasoline Blending System for ... - EBSCOhost
refinery data bases and on-line data acquisition and exploits detailed nonlinear models of gasoline attributes. Texaco uses. OMEGA in all seven US refineries ...

An improved method for the determination of saturation ...
2) use of cost effective excitation source with improved voltage regulation and ..... power electronic applications, renewable energy systems, energy efficiency.

An Improved Control Law Using HVDC Systems for ...
distance transmission, as well as for connecting offshore wind farms. Despite its ... HVDC system, only one keeps the power balance of the. DC grid by ..... cumulative effects in the long term would make the converters' DC voltage drift to an ...

An Improved Hybrid Model for Molecular Image ...
1 Introduction. Image denoising is an active area of interest for image ... e-mail: [email protected]. K. Kannan e-mail: [email protected]. M.R. Kaimal.

An Improved Acquisition Algorithm for the Uplink ...
CDMA (Code Division Multiple Access) wireless communications system is a .... on the acquisition receiver invented in [4]. The accumulation of the signal ...

An Improved Acquisition Algorithm for the Uplink ...
and the error floor of the traffic transmission caused by imperfect synchronization can be .... The length of a frame is 20 ms, and the system load is 4 or 8 users.

An improved algorithm for anisotropic nonlinear ...
development of a software program for filtering cryot- omograms based on AND, ... AND can be considered as an adaptive gaussian filtering technique in which, ...

An Improved Caching Strategy for Training SVMs - CiteSeerX
is a good criteria to evaluate the performance of training .... admits an analytical solution, eliminating the need ... lar software(LIBSV M and SV Mlight) of training.

An Improved Algorithm for the Solution of the ...
tine that deals with duplicate data points, a routine that guards against ... numerical algorithm has been tested on several data sets with duplicate points and ...

The Development of an Improved Method for ...
between scraping to allow more time for the froth to build up and mature. This method of ... opinion thereof. It has been proposed that machine vision technologies could be used to extract useful data ... For the modified method of batch flotation th