research highlights doi:10.1145/ 2133806.2 1 3 3 8 2 9

Technical Perspective Who Knows? Searching for Expertise on the Social Web By Ed H. Chi

It is difficult to remember what people had to do to find the answer to a question before the Web. Imagine it is 1990, before the age of search engines, and of course, Wikipedia. You have no access to a library, and available reference books are not enough. The only option you might have is to call a friend who might know the answer. In fact, this option is so important, it is baked into the game “Who Wants to be a Millionaire?” as one of the three lifeline options to take when you are stumped for an answer. This natural instinct to call someone is also baked into the DNA of Aardvark, the social question and answering (QA) engine described in the following paper by D. Horowitz and S. Kamvar. When you turn to the phone, one of the first steps you have to figure out is who to call. This is the expertise location or question-routing problem in social QA research. At a high level, this seems like a great computer science problem. You have people as nodes, and their relationships and interactions as edges, and you want to model people’s interests and expertise, as well as the frequency and recency of their interactions with each other. You will use these models to route questions. Your mind races with possible algorithms and user modeling approaches to apply. Yes, conceptually you would probably be correct with many of these ideas, but in practice, building Aardvark is much more complex and difficult. First, you must figure out how to build accurate user models for each user on the whole Web, even with sparse data for many users, including brand new users of your system. Second, you have to scale this system to millions of users, and be able to do the question routing in milliseconds. You also want to try and get answers that are good and return them as quickly as possible.

110

co mm unicatio ns o f t h e ac m

Third, potential answerers (read: humans) are finicky: they do not want you to spam them; they do not like being interrupted if they are in the middle of another conversation; and they don’t like it when you call them up at weird hours of the day (no matter if you know their time zone or not). In order words, you must deal with the real human context and its associated social interaction. Finally, you need to socially engineer the growth of this system, so that early users get experiences good enough to rave about your service and recommend it to other users. You want to build trust, and you want a network effect, such that, as each user joins the system, the whole system becomes even more useful to those already there. Before Aardvark, social QA systems used a wide variety of techniques to route questions, most often using experience/reputation points or monetary rewards as incentives. Many services, such as Yahoo! Answers focused on building communities, and turning the act of answering into a pointbased game. Instead, Horowitz, Kamvar and their team at Aardvark pushed the envelope and built a different service that focused on getting answers as quickly as possible; from someone you are socially connected to; and who is likely to be an expert on the topic. How did they do it? On speed: By connecting Aardvark to a chat service (Google Chat), it exploited and delivered on that expectation of immediacy. In our age of instant knowledge via search engines and Wikipedia, this impedance match is a particularly nice touch that, perhaps ironically, humanizes the experience so it feels like a phone call. Interestingly enough, in a pioneering QA system called AnswerGarden, Mark Ackerman observed that users were often more satisfied when an answer came back

| a p r i l 201 2 | vo l . 5 5 | no. 4

quickly, even if the answer was somewhat less than perfect.1 On social interactions: Aardvark places emphasis on the social interaction just as much as getting the information—just as if there was a smart assistant who knew your Rolodex and made that phone call for you. Indeed, in Evans and Chi,2 we showed how social interactions were present and pervasive throughout the information seeking episode—before, during, and after the core search task. On expertise: Aardvark skillfully exploited the design knowledge gained from years of search engine research to scale the algorithms so that it can route those questions to others who are most likely able to answer it at that very moment. In fact, they followed the original meme of describing the “anatomy” of a search engine in describing their own system—a meme worth repeating in all areas of computer science involved in engineering real-world systems. Users want one thing—getting their questions answered immediately. Search engines have played that role for many years now. It can be argued that the greatest impact computers have had on the human endeavor is the Web search engine, whose development and refinement seems to be the epitome of computer science. That was before the Web truly became social. In the brave new social Web, search will be different, and reading this paper will give you a sense of the direction social search engines are headed. References 1. Ackerman, M.S. Augmenting organizational memory: A field study of answer garden. ACM Trans. Inf. Syst. 16, 3 (July 1998), 203-224; http://doi.acm. org/10.1145/290159.290160 2. Evans, B.M. and Chi, E.H. An elaborated model of social search. Information Processing & Management; http://dx.doi.org/10.1016/j.ipm.2009.10.012 Ed H. Chi ([email protected]) is staff research scientist at Google Research, Mountain View, CA. © 2012 ACM 0001-0782/12/04 $10.00

Who knows?: searching for expertise on the ... - ACM Digital Library

ple had to do to find the answer to a question before the Web. Imagine it is. 1990, before the age of search engines, and of course, Wikipedia. You have.

461KB Sizes 1 Downloads 265 Views

Recommend Documents

practice - ACM Digital Library
This article provides an overview of how XSS vulnerabilities arise and why it is so difficult to avoid them in real-world Web application software development.

Incorporating heterogeneous information for ... - ACM Digital Library
Aug 16, 2012 - A social tagging system contains heterogeneous in- formation like users' tagging behaviors, social networks, tag semantics and item profiles.

searching for expertise on the social web ... - Research
ready there. Before Aardvark, social QA systems used a wide variety of techniques to route questions, most often using ex- perience/reputation points or mon-.

Challenges on the Journey to Co-Watching ... - ACM Digital Library
Mar 1, 2017 - Examples they gave include watching video to avoid interacting with ... steps that people take to co-watch and the main challenges faced in this ...... 10. Erving Goffman and others. 1978. The presentation of self in everyday life. Harm

6LoWPAN Architecture - ACM Digital Library
ABSTRACT. 6LoWPAN is a protocol definition to enable IPv6 packets to be carried on top of low power wireless networks, specifically IEEE. 802.15.4.

Kinetic tiles - ACM Digital Library
May 7, 2011 - We propose and demonstrate Kinetic Tiles, modular construction units for kinetic animations. Three different design methods are explored and evaluated for kinetic animation with the Kinetic Tiles using preset movements, design via anima

On the Automatic Construction of Regular ... - ACM Digital Library
different application domains. Writing ... oped a tool based on Genetic Programming capable of con- ... We developed a web application containing a suite of ex-.

The Chronicles of Narnia - ACM Digital Library
For almost 2 decades Rhythm and Hues Studios has been using its proprietary software pipeline to create photo real characters for films and commercials. However, the demands of "The Chronicles of. Narnia" forced a fundamental reevaluation of the stud

BlueJ Visual Debugger for Learning the ... - ACM Digital Library
Science Education—computer science education, information systems education. General Terms: Experimentation, Human Factors. Additional Key Words and ...

On Effective Presentation of Graph Patterns: A ... - ACM Digital Library
Oct 30, 2008 - to mine frequent patterns over graph data, with the large spectrum covering many variants of the problem. However, the real bottleneck for ...

A Framework for Technology Design for ... - ACM Digital Library
Internet in such markets. Today, Internet software can ... desired contexts? Connectivity. While the Internet is on the rise in the Global South, it is still slow, unreliable, and often. (https://developers.google.com/ billions/). By having the devel

Borg, Omega, and Kubernetes - ACM Digital Library
acmqueue | january-february 2016 71 system evolution. As more and more applications were developed to run on top of Borg, our application and infrastructure ...

A Framework for Technology Design for ... - ACM Digital Library
learning, from the technological to the sociocultural, we ensured that ... lives, and bring a spark of joy. While the fields of ICTD and ..... 2015; http://www.gsma.com/ mobilefordevelopment/wp-content/ uploads/2016/02/Connected-Women-. Gender-Gap.pd

Proceedings Template - WORD - ACM Digital Library
knowledge-level approach (similarly to many AI approaches developed in the ..... 1 ArchE web: http://www.sei.cmu.edu/architecture/arche.html by ArchE when ...

Computing: An Emerging Profession? - ACM Digital Library
developments (e.g., the internet, mobile computing, and cloud computing) have led to further increases. The US Bureau of Labor Statistics estimates 2012 US.

Towards a Relation Extraction Framework for ... - ACM Digital Library
to the security domain are needed. As labeled text data is scarce and expensive, we follow developments in semi- supervised Natural Language Processing and ...

Designing Unbiased Surveys for HCI Research - ACM Digital Library
May 1, 2014 - enable the creation of unbiased surveys, this course ... Permission to make digital or hard copies of part or all of this work for personal or ...

Privacy-preserving query log mining for business ... - ACM Digital Library
transfer this problem into the field of privacy-preserving data mining. We characterize the possible adversaries interested in disclosing Web site confidential data and the attack strategies that they could use. These attacks are based on different v

An interactive multi-touch sketching interface for ... - ACM Digital Library
Diffusion curves are effective 2D vector-graphics primitives, for creating smoothly-shaded drawings with rich colors and unique styles. Conventional drawing ...

Evolutionary Learning of Syntax Patterns for ... - ACM Digital Library
Jul 15, 2015 - ABSTRACT. There is an increasing interest in the development of tech- niques for automatic relation extraction from unstructured text. The biomedical domain, in particular, is a sector that may greatly benefit from those techniques due

Online Microsurveys for User Experience Research - ACM Digital Library
May 1, 2014 - experience research. We focus specifically on Google. Consumer Surveys (GCS) and analyze a combination of log data and GCSs run by the ...

The multidimensional role of social media in ... - ACM Digital Library
informed consent to informed choice in medical decisions. Social media is playing a vital role in this transformation. I'm alive and healthy because of great doctors. Diagnosed with advanced kidney cancer, I received care from a great oncologist, a g

The word-gesture keyboard: reimagining ... - ACM Digital Library
Sep 1, 2012 - tion in the form of email, text chat, and Web posting. As com- ... desktop, the need for effective text entry on mobile devices has been ...