US007647349B2

(12) United States Patent
     Hubert et al.

(10) Patent No.: US 7,647,349 B2
(45) Date of Patent: Jan. 12, 2010

(54) SYSTEM WITH USER DIRECTED ENRICHMENT AND IMPORT/EXPORT CONTROL

(75) Inventors: Laurence Hubert, St Bernard du Touvet (FR); Nicolas Guerin, Grenoble (FR)

(73) Assignee: Xerox Corporation, Norwalk, CT (US)

(*) Notice: Subject to any disclaimer, the term of this patent is extended or adjusted under 35 U.S.C. 154(b) by 734 days.

(21) Appl. No.: 11/284,878

(22) Filed: Nov. 22, 2005

(65) Prior Publication Data
     US 2006/0080314 A1    Apr. 13, 2006

Related U.S. Application Data

(62) Division of application No. 09/683,237, filed on Dec. 5, 2001, now Pat. No. 7,133,862.

(60) Provisional application No. 60/311,857, filed on Aug. 13, 2001.

(51) Int. Cl. G06F 17/00 (2006.01)

U.S. Cl. ...

(58) Field of Classification Search: 707/1-10, 707/100-104.1, 200-206
     See application file for complete search history.

Primary Examiner: Isaac M Woo

(57) ABSTRACT

A system for enriching document content using enrichment themes includes a directed search service and an import-export service. The directed search service allows users to author documents while querying information providers using the directed searches that are inserted as part of the authored documents. The import-export service enables meta-document exchanges between systems that provide document enrichment by binding imported meta-documents to identical or similar information providers.

6 Claims, 69 Drawing Sheets
[Front-page drawing: flowchart of FIG. 17, from receiving specified documents and/or folders (1702) through defining a personality using filtered services and an entity database.]
U.S. Patent    Jan. 12, 2010    Sheet 6 of 69    US 7,647,349 B2
U.S. Patent    Jan. 12, 2010    Sheet 7 of 69    US 7,647,349 B2

[Figure labels: META-DOCUMENT SERVER (200), PERSONALITY ID, DOCUMENT ID, DOCUMENT. Reference numerals include 704 and 710.]
U.S. Patent    Jan. 12, 2010    Sheet 8 of 69    US 7,647,349 B2

FIG. 8: Print dialog.
Printer section: Name, Status (Idle), Type, Where, Comment; Properties button; Print to file.
Page range: Current page, Selection, Pages (enter page numbers and/or page ranges separated by commas).
Copies: Collate.
Enrichment section: Personality selection with a "Select a personality for me" option; Apply enrichment to: Current page, Selection; Insert enrichment as: Links or Content.
Zoom: Pages per sheet, Scale to paper size.
Buttons: Options..., OK, Cancel.
Reference numerals include 800, 802, 808, and 810.
U.S. Patent    Jan. 12, 2010    Sheet 9 of 69    US 7,647,349 B2

FIG. 9: Print document properties dialog with Layout, Paper/Quality, and Enrichment tabs.
Enrichment tab: Personality selection with a "Select personality for me" option.
Provide enrichment by: Printing with document, or Storing at server.
Insert enrichment as: Links or Content.
When storing enrichment at server: Email me when completed, or Do not notify me.
Capture point: At workstation, or At print server.
Reference numerals include 900, 902, 906, and 908.
U.S. Patent    Jan. 12, 2010    Sheet 11 of 69    US 7,647,349 B2

FIG. 11: Category hierarchy listing Architecture, Buildings, Design, Homes, Brick homes, Hay bale homes, Log homes, and Timber homes. Reference numerals include 1104.
U.S. Patent    Jan. 12, 2010    Sheet 12 of 69    US 7,647,349 B2

FIG. 12: Properties dialog for user jsmith with Layout, Sharing, and Personalities tabs.
Personalities tab: Default personality selection with a "Recommend personality for me" option.
Insert enrichment as: Links, Content, or Automatically determine to link or insert content.
When document enriched notify me by: Voice message, Email, or SMS text message.
Further options: Propagate enrichment between documents; Automatically complete citations; a No / Yes (depth=4) choice.
Buttons: Organize my personalities; Create/modify my personalities.
Reference numerals include 1204, 1205, 1206, 1207, 1216, 1220, 1226, 1230, 1240, and 1242.
U.S. Patent    Jan. 12, 2010    Sheet 15 of 69    US 7,647,349 B2

FIG. 15: Create/modify my personalities window with a Name field and Browse buttons.
Options:
  Create/modify a group of existing personalities
  Use specified documents and/or folders to create a new personality
  Use specified file/website to create a new personality
  Create question form using personality lexicons (with a Question field)
Knowledge level: Novice or Expert.
Reference numerals include 1502, 1504, 1506, 1509, 1510, and 1520.
U.S. Patent    Jan. 12, 2010    Sheet 16 of 69    US 7,647,349 B2

FIG. 16: Create/modify a group of existing personalities window (1600).
Name: Technology watch in organic chemistry
Operation: Merge (1602) or Subtract (1604)
Operands: Personality A with/from Personality B
Buttons: OK, Cancel.
U.S. Patent    Jan. 12, 2010    Sheet 17 of 69    US 7,647,349 B2

FIG. 17 (flowchart):
1702  Receive specified document(s) and/or folder(s) with document(s) to define a level N=0 document set.
1704  Extract all links from the level N document set.
1706  Fetch content of extracted links and define fetched content as level N+1 document set.
1708  Descend one more level? Yes: repeat for the next level. No: continue.
1709  Define all the N level document sets defined at 1702 and 1706 to be an expanded document.
1710  Construct entity database using the expanded document.
1712  Identify each form in the expanded document.
      Create a service for each form identified in the expanded document.
1716  Filter the created services using entity database.
      Define a personality using filtered services and entity database.
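
The flow of FIG. 17 can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions, not the patented implementation: the function `build_personality`, the `fetch` callback, the in-memory `WEB` dictionary, the rule that an "entity" is any capitalized word, and the rule that a service is kept when its form action mentions an entity are all hypothetical choices made so the example runs without a network.

```python
from html.parser import HTMLParser
import re


class LinkAndFormParser(HTMLParser):
    """Collects href targets of <a> tags and action targets of <form> tags."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.forms = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])
        elif tag == "form" and attrs.get("action"):
            self.forms.append(attrs["action"])


def parse(html):
    parser = LinkAndFormParser()
    parser.feed(html)
    return parser


def build_personality(seed_docs, fetch, max_depth=1):
    """seed_docs: list of HTML strings; fetch: callable url -> HTML or None."""
    levels = [list(seed_docs)]            # level N=0 document set
    seen = set()
    for _ in range(max_depth):            # "descend one more level?"
        fetched = []
        for doc in levels[-1]:
            for url in parse(doc).links:  # extract links from level N
                if url in seen:
                    continue
                seen.add(url)
                content = fetch(url)      # fetch content of extracted links
                if content is not None:
                    fetched.append(content)
        if not fetched:
            break
        levels.append(fetched)            # level N+1 document set

    # All level document sets together form the expanded document.
    expanded = [doc for level in levels for doc in level]

    # Entity database (assumption: capitalized words in the visible text).
    entities = set()
    for doc in expanded:
        text = re.sub(r"<[^>]+>", " ", doc)
        entities.update(re.findall(r"\b[A-Z][a-z]+\b", text))

    # One service per form identified in the expanded document.
    services = sorted({a for doc in expanded for a in parse(doc).forms})

    # Filter services (assumption: keep those whose action names an entity).
    lowered = {e.lower() for e in entities}
    kept = [s for s in services if any(e in s.lower() for e in lowered)]

    return {"services": kept, "entities": entities}


# Hypothetical in-memory "web" so the sketch runs offline.
WEB = {
    "http://example.test/wine":
        '<p>Bordeaux and Burgundy</p>'
        '<form action="http://example.test/search-bordeaux"></form>',
    "http://example.test/other":
        '<form action="http://example.test/login"></form>',
}
seed = ['<a href="http://example.test/wine">Wine</a> '
        '<a href="http://example.test/other">x</a>']

personality = build_personality(seed, WEB.get, max_depth=1)
print(personality["services"])
```

Here `max_depth` plays the role of the "descend one more level?" decision, and the returned dictionary stands in for a personality defined by its filtered services and entity database. A real system would presumably use a richer entity extractor and a filter that probes each service, neither of which is specified on this drawing sheet.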