February 3–6, 2010, New York City, New York, USA

Table of Contents

Welcome from the Conference Chairs
Brian D. Davison (Lehigh University)
Torsten Suel (Polytechnic Institute of NYU)

Foreword from the Program Chairs
Nick Craswell (Microsoft)
Bing Liu (University of Illinois at Chicago)

WSDM 2010 Conference Organization 

WSDM 2010 Program Committees

WSDM 2010 Additional Reviewers

WSDM 2010 Sponsors & Supporters

Session 1: Web Search

Leveraging Temporal Dynamics of Document Content in Relevance Ranking (Page 1)
Jonathan L. Elsas (Carnegie Mellon University)
Susan T. Dumais (Microsoft Research)

Towards Recency Ranking in Web Search (Page 11)
Anlei Dong (Yahoo! Inc.)
Yi Chang (Yahoo! Inc.)
Zhaohui Zheng (Yahoo! Inc.)
Gilad Mishne (Yahoo! Inc.)
Jing Bai (Yahoo! Inc.)
Ruiqiang Zhang (Yahoo! Inc.)
Karolina Buchner (Yahoo! Inc.)
Ciya Liao (Yahoo! Inc.)
Fernando Diaz (Yahoo! Inc.)

Ranking Mechanisms in Twitter-like Forums (Page 21)
Anish Das Sarma (Yahoo Research)
Atish Das Sarma (Georgia Institute of Technology)
Sreenivas Gollapudi (Microsoft Research)
Rina Panigrahy (Microsoft Research)

Learning Concept Importance Using a Weighted Dependence Model (Page 31)
Michael Bendersky (University of Massachusetts, Amherst)
Donald Metzler (Yahoo! Laboratories)
W. Bruce Croft (University of Massachusetts, Amherst)

Query Reformulation Using Anchor Text (Page 41)
Van Dang (University of Massachusetts, Amherst)
W. Bruce Croft (University of Massachusetts, Amherst)

Session 2: Tagging and Recommendation

Tagging Human Knowledge (Page 51)
Paul Heymann (Stanford University)
Andreas Paepcke (Stanford University)
Hector Garcia-Molina (Stanford University)

Precomputing Search Features for Fast and Accurate Query Classification (Page 61)
Venkatesh Ganti (Microsoft Research)
Arnd Christian König (Microsoft Research)
Xiao Li (Microsoft Research)

I Tag, You Tag: Translating Tags for Advanced User Models (Page 71)
Robert Wetzker (Technische Universität Berlin)
Carsten Zimmermann (University of San Diego)
Christian Bauckhage (Fraunhofer IAIS)
Sahin Albayrak (Technische Universität Berlin)

Pairwise Interaction Tensor Factorization for Personalized Tag Recommendation (Page 81)
Steffen Rendle (Osaka University)
Lars Schmidt-Thieme (University of Hildesheim)

fLDA: Matrix Factorization through Latent Dirichlet Allocation (Page 91)
Deepak Agarwal (Yahoo! Research)
Bee-Chung Chen (Yahoo! Research)

(Return to Top)

Session 3: Information Extraction

Coupled Semi-Supervised Learning for Information Extraction (Page 101)
Andrew Carlson (Carnegie Mellon University)
Justin Betteridge (Carnegie Mellon University)
Richard C. Wang (Carnegie Mellon University)
Estevam R. Hruschka Jr. (Federal University of Sao Carlos)
Tom M. Mitchell (Carnegie Mellon University)

Adapting Information Bottleneck Method for Automatic Construction of Domain-oriented Sentiment Lexicon (Page 111)
Weifu Du (Haerbin Institute of Technology)
Songbo Tan (Chinese Academy of Sciences)
Xueqi Cheng (Chinese Academy of Sciences)
Xiaochun Yun (Chinese Academy of Sciences)

Data-oriented Content Query System: Searching for Data into Text on the Web (Page 121)
Mianwei Zhou (University of Illinois at Urbana-Champaign)
Tao Cheng (University of Illinois at Urbana-Champaign)
Kevin Chen-Chuan Chang (University of Illinois at Urbana-Champaign)

Corroborating Information from Disagreeing Views (Page 131)
Alban Galland (INRIA Saclay - Ile-de-France)
Serge Abiteboul (INRIA Saclay - Ile-de-France)
Amélie Marian (Rutgers University)
Pierre Senellart (Institut Télécom; Télécom ParisTech)

(Return to Top)

Session 4: Learning and Optimization

Ranking with Query-Dependent Loss for Web Search (Page 141)
Jiang Bian (Georgia Institute of Technology)
Tie-Yan Liu (Microsoft Research Asia)
Tao Qin (Microsoft Research Asia)
Hongyuan Zha (Georgia Institute of Technology)

IntervalRank - Isotonic Regression with Listwise and Pairwise Constraints (Page 151)
Taesup Moon (Yahoo! Laboratories)
Alex Smola (Yahoo! Laboratories)
Yi Chang (Yahoo! Laboratories)
Zhaohui Zheng (Yahoo! Laboratories)

An Optimization Framework for Query Recommendation (Page 161)
Aris Anagnostopoulos (Sapienza University of Rome)
Luca Becchetti (Sapienza University of Rome)
Carlos Castillo (Yahoo! Research)
Aristides Gionis (Yahoo! Research)

Improving Quality of Training Data for Learning to Rank Using Click-Through Data (Page 171)
Jingfang Xu (Microsoft Research Asia)
Chuanliang Chen (Beijing Normal University)
Gu Xu (Microsoft Research Asia)
Hang Li (Microsoft Research Asia)
Elbio Renato Torres Abib (Microsoft)

A Model to Estimate Intrinsic Document Relevance from the Clickthrough Logs of a Web Search Engine (Page 181)
Georges Dupret (Yahoo! Laboratories)
Ciya Liao (Yahoo! Laboratories)

(Return to Top)

Session 5: Users and Measurement

Large Scale Query Log Analysis of Re-Finding (Page 191)
Sarah K. Tyler (University of California, Santa Cruz)
Jaime Teevan (Microsoft Research)

Anatomy of the Long Tail: Ordinary People with Extraordinary Tastes (Page 201)
Sharad Goel (Yahoo Research)
Andrei Broder (Yahoo! Research)
Evgeniy Gabrilovich (Yahoo! Research)
Bo Pang (Yahoo! Research)

Inferring Search Behaviors Using Partially Observable Markov (POM) Model (Page 211)
Kuansan Wang (Microsoft Corporation)
Nikolas Gloy (Microsoft Corporation)
Xiaolong Li (Microsoft Corporation)

Beyond DCG: User Behavior as a Predictor of a Successful Search (Page 221)
Ahmed Hassan (University of Michigan, Ann Arbor)
Rosie Jones (Yahoo! Laboratories)
Kristina Lisa Klinkner (Carnegie Mellon University)

Measuring the Reusability of Test Collections (Page 231)
Ben Carterette (University of Delaware)
Evgeniy Gabrilovich (Yahoo! Research)
Vanja Josifovski (Yahoo! Research)
Donald Metzler (Yahoo! Research)

(Return to Top)

Session 6: Social 

Learning Influence Probabilities in Social Networks (Page 241)
Amit Goyal (University of British Columbia)
Francesco Bonchi (Yahoo! Research)
Laks V. S. Lakshmanan (University of British Columbia)

You Are Who You Know: Inferring User Profiles in Online Social Networks (Page 251)
Alan Mislove (MPI-SWS, Rice University, and Northeastern University)
Bimal Viswanath (MPI-SWS)
Krishna P. Gummadi (MPI-SWS)
Peter Druschel (MPI-SWS)

TwitterRank: Finding Topic-sensitive Influential Twitterers (Page 261)
Jianshu Weng (Singapore Management University)
Ee-Peng Lim (Singapore Management University)
Jing Jiang (Singapore Management University)
Qi He (Pennsylvania State University)

Folks in Folksonomies: Social Link Prediction from Shared Metadata (Page 271)
Rossano Schifanella (University of Turin)
Alain Barrat (CNRS UMR and ISI Foundation)
Ciro Cattuto (ISI Foundation)
Benjamin Markines (Indiana University)
Filippo Menczer (ISI Foundation and Indiana University)

GeoFolk: Latent Spatial Semantics in Web 2.0 Social Media (Page 281)
Sergej Sizov (University of Koblenz)

(Return to Top)

Session 7: Temporal Interaction

Learning Similarity Metrics for Event Identification in Social Media (Page 291)
Hila Becker (Columbia University)
Mor Naaman (Rutgers University)
Luis Gravano (Columbia University)

Early Online Identification of Attention Gathering Items in Social Media (Page 301)
Michael Mathioudakis (University of Toronto)
Nick Koudas (University of Toronto)
Peter Marbach (University of Toronto)

Evolution of Two-Sided Markets (Page 311)
Ravi Kumar (Yahoo! Research)
Yury Lifshits (Yahoo! Research)
Andrew Tomkins (Google, Inc.)

(Return to Top)

Session 8: Ads 

A Novel Click Model and Its Applications to Online Advertising (Page 321)
Zeyuan Allen Zhu (Tsinghua University and Microsoft Research Asia)
Weizhu Chen (Microsoft Research Asia)
Tom Minka (Microsoft Research Cambridge)
Chenguang Zhu (Microsoft Research Asia andTsinghua University)
Zheng Chen (Microsoft Research Asia)

Adaptive Weighing Designs for Keyword Value Computation (Page 331)
John W. Byers (Boston University)
Michael Mitzenmacher (Harvard University)
Georgios Zervas (Boston University and Adverplex Inc.)

Automatic Generation of Bid Phrases for Online Advertising (Page 341)
Sujith Ravi (ISI/USC)
Andrei Broder (Yahoo! Research)
Evgeniy Gabrilovich (Yahoo! Research)
Vanja Josifovski (Yahoo! Research)
Sandeep Pandey (Yahoo! Research)
Bo Pang (Yahoo! Research)

Personalized Click Prediction in Sponsored Search (Page 351)
Haibin Cheng (Yahoo! Laboratories)
Erick Cant\'u-Paz (Yahoo! Laboratories)

Improving Ad Relevance in Sponsored Search (Page 361)
Dustin Hillard (Yahoo! Laboratories)
Stefan Schroedl (Yahoo! Laboratories)
Eren Manavoglu (Yahoo! Laboratories)
Hema Raghavan (Yahoo! Laboratories)
Chris Leggetter (Yahoo! Laboratories)

(Return to Top)

Session 9: Systems and Efficiency

Revisiting Globally Sorted Indexes for Efficient Document Retrieval (Page 371)
Fan Zhang (Nankai University)
Shuming Shi (Microsoft Research Asia)
Hao Yan (Polytechnic Institute of New York University)
Ji-Rong Wen (Microsoft Research Asia)

Learning URL Patterns for Webpage De-duplication (Page 381)
Hema Swetha Koppula (Yahoo! Labs)
Krishna P. Leela (Yahoo! Labs)
Amit Agarwal (Picsquare.com)
Krishna Prasad Chitrapura (Yahoo! Labs)
Sachin Garg (Yahoo! Labs)
Amit Sasturkar (Yahoo! Inc.)

On Compressing the Textual Web (Page 391)
Paolo Ferragina (Università di Pisa)
Giovanni Manzini (Università del Piemonte Orientale)

A Sketch-Based Distance Oracle for Web-Scale Graphs (Page 401)
Atish Das Sarma (Georgia Institute of Technology)
Sreenivas Gollapudi (Microsoft Research)
Marc Najork (Microsoft Research)
Rina Panigrahy (Microsoft Research)

Early Exit Optimizations for Additive Machine Learned Ranking Systems (Page 411)
B. Barla Cambazoglu (Yahoo! Research)
Hugo Zaragoza (Yahoo! Research)
Olivier Chapelle (Yahoo! Research)
Jiang Chen (Yahoo! Research)
Ciya Liao (Yahoo! Laboratories)
Zhaohui Zheng (Yahoo! Laboratories)
Jon Degenhardt (Yahoo! Laboratories)

(Return to Top)

Session 10: Web Mining 

SBotMiner: Large Scale Search Bot Detection (Page 421)
Fang Yu (Microsoft Research Silicon Valley)
Yinglian Xie (Microsoft Research Silicon Valley)
Qifa Ke (Microsoft Research Silicon Valley)

Gathering and Ranking Photos of Named Entities with High Precision, High Recall, and Diversity (Page 431)
Bilyana Taneva (Max-Planck Institute for Informatics)
Mouna Kacimi (Free University of Bozen-Bolzano)
Gerhard Weikum (Max-Planck Institute for Informatics)

Boilerplate Detection Using Shallow Text Features (Page 441)
Christian Kohlschütter (L3S Research Center / Leibniz Universität Hannover)
Peter Fankhauser (L3S Research Center / Leibniz Universität Hannover)
Wolfgang Nejdl (L3S Research Center / Leibniz Universität Hannover)