Program - Full Papers (PDF)

The 18th ACM Conference on Information and Knowledge Management
Hong Kong, November 2-6, 2009

Session 1-Tuesday, Nov 3, 10:10-12:00

Session 2-Tuesday, Nov 3, 13:45-15:15

Session 3-Tuesday, Nov 3, 15:40-17:30

Session 4-Wednesday, Nov 4, 10:10-12:00

Session 5-Wednesday, Nov 4, 13:40-15:30

Session 6-Thursday, Nov 5, 10:10-12:00

Session 7-Thursday, Nov 5, 13:45-15:15

Session 8-Thursday, Nov 5, 15:40-17:30

Session 1-Tuesday, Nov 3, 10:10-12:00

1A - KM Track - Information Extraction I

(Session Chair: Wai Lam, CUHK)
Location: Rm 201A

StereoTrust: A Group Based Personalized Trust Model
Xin Liu, Anwitaman Datta, Krzysztof Razdca, Ee-Peng Lim

An Empirical Study on Using Hidden Markov Model for Search Interface Segmentation
Ritu Khare, Yuan An

Query by Analogical Example: Relational Search Using Web Search Engine Indices
Makoto Kato, Hiroaki Ohshima, Satoshi Oyama, Katsumi Tanaka

Semi-Supervised Learning of Semantic Classes for Query Understanding -- from the Web and for the Web
Wang Ye-Yi, Raphael Hoffmann, Xiao Li, Jakub Syzmanski

Efficient Record-Level Wrapper Induction
Shuyi Zheng, Ruihua Song, Ji-Rong Wen, C. Lee Giles


         
1B - IR Track - Web Search

(Session Chair: Xiaoming Li, Beijing University)
Location: Rm 201B

What Happens after an Ad Click? Quantifying the Impact of Landing Pages in Web Advertising
Hila Becker, Andrei Broder, Evgeniy Gabrilovich, Vanja Josifovski, Bo Pang

Characterizing Commercial Intent
Azin Ashkan, Charles Clarke

Analyzing and Evaluating Query Reformulation Strategies in Web Search Logs
Jeff Huang, Efthimis Efthimiadis

Characterizing and Predicting Search Engine Switching Behavior
Ryen White, Susan Dumais

Clustering and Exploring Search Results using Timeline Constructions
Omar Alonso, Michael Gertz, Ricardo Baeza-Yates


         
1C - DB Track - XML Data Processing, Filtering, Routing, and Algorithms

(Session Chair: Wook-Shin Han, Kyungpook National University)
Location: Rm 201C

Effective, Design-Independent XML Keyword Search
Arash Termehchy, Marianne Winslett

Efficient Processing of Twig Pattern Matching in Fuzzy XML
Jian Liu, Z. M. Ma, Li Yan

Dissemination of Heterogeneous XML Data in Publish/Subscribe Systems
Yuan Ni, Chee-Yong Chan

Linear Inclusion for XML Regular Expression Types
Dario Colazzo, Giorgio Ghelli, Luca Pardini, Carlo Sartiani

Effective XML Content and Structure Retrieval with Relevance Ranking
Liu Xiping, Wan Changxuan, Chen Lei


         
1D - IR Track - Domain-Specific Retrieval I

(Session Chair: David Carmel, IBM Research, Haifa Research Lab)
Location: Rm 204

Intention-Focused Active Reranking for Image Object Retrieval
Jen-Hao Hsiao, Ming-Syan Chen

A Translation Model for Matching Reviews to Objects
Nilesh Dalvi, Ravi Kumar, Bo Pang, Andrew Tomkins

Learning Better Transliterations
Jeffrey Pasternack, Dan Roth

Supervised Semantic Indexing
Bing Bai, Jason Weston, David Grangier, Ronan Collobert, Yanjun Qi, Kunihiko Sadamasa, Olivier Chapelle, Kilian Weinberger

Ranking Model Adaptation for Domain-Specific Search
Bo Geng, Linjun Yang, Chao Xu, Xian-Sheng Hua

 

Session 2-Tuesday, Nov 3, 13:45-15:15

2A - KM Track - Information Extraction II

(Session Chair: Lei Chen, HKUST)
Location: Rm 201A

Data-driven Compound Splitting Method for English Compounds in Domain Names
Sanjeet Khaitan, Arumay Das, Sandeep Gain, Adithi Sampath

Named Entity Disambiguation by Leveraging Wikipedia Semantic Knowledge
Xianpei Han, Jun Zhao 

Helping Editors Choose Better Seed Sets for Entity Set Expansion
Vishnu Vyas Sethumadhavan, Patrick Pantel, Eric Crestan

Using Multiple Ontologies in Information Extraction
Daya Wimalasuriya, Dejing Dou


         
2B - IR Track - Personalization and Social Search I

(Session Chair: Efthimis N. Efthimiadis, University of Washington)
Location: Rm 201B

Computational Community Interest for Ranking
Xiaozhong Liu, Vadim von Brzeski

Adaptive Relevance Feedback in Information Retrieval
Yuanhua Lv, ChengXiang Zhai

The Use of Categorization Information in Language Models for Question Retrieval
Xin Cao, Gao Cong, Cui Bin, Christian S. Jensen, Ce Zhang

Improving Search Engines Using Human Computation Games
Hao Ma, Raman Chandrasekar, Chris Quirk, Abhishek Gupta


2C - DB Track - String Databases, Blogs and Social Search

(Session Chair: Illhoi Yoo, University of Missouri , Columbia)
Location :Rm 201C

Space-Economical Partial Gram Indices for Exact Substring Matching
Tang Nan , Lefteris Sidirourgos, Peter Boncz

AS-Index: A Structure For String Search Using n-grams and Algebraic Signatures
Cedric du Mouza, Witold Litwin, Philippe Rigaux, Thomas Schwarz

Robust Record Linkage Blocking using Suffix Arrays
Timothy de Vries, Hui Ke, Sanjay Chawla, Peter Christen

Efficient Algorithms for Approximate Member Extraction Using Signature-based Inverted Lists
Jiaheng Lu, Jialong Han


         
2D - KM Track - Advance Mining Techniques

(Session Chair: Ada Fu, CUHK)
Location: Rm 204

An Integrated Discriminative Probabilistic Approach to Information Extraction
Xiaofeng Yu, Wai Lam, Bo Chen

Mining Linguistic Cues for Query Expansion: Applications to Drug Interaction Search
Sheng Guo, Naren Ramakrishnan

Message Family Propagation for Ising Mean Field Based on Iteration Tree
Yarui Chen, Shizhong Liao

Efficient Itemset Generator Discovery over a Stream Sliding Window
Chuancong Gao, Jianyong Wang

 

Session 3-Tuesday, Nov 3, 15:40-17:30

3A - KM Track - Text Mining

(Session Chair: Jiantao Sun, Microsoft Asia)
Location: Rm 201A

Learning document aboutness from implicit user feedback and document structure
Deepa Paranjpe

Joint Sentiment/Topic Model for Sentiment Analysis
Chenghua Lin, Yulan He

Generating Comparative Summaries of Contradictory Opinions in Text
Hyun Duk Kim, ChengXiang Zhai

sDoc: Exploring Social Wisdom for Document Enhancement in Web Mining
Zhang Xiaoxun, Yang Lichun, Wu Xian, Guo Honglei, Guo Zhili, Bao Shenghua, Yu Yong, Su Zhong

Terminology Mining in Social Media
Magnus Sahlgren, Jussi Karlgren


         
3B - IR Track - Crawling and Indexing

(Session Chair: Bruce Croft, University of Massachusetts, Amherst)
Location: Rm 201B   

Compact Full-Text Indexing of Versioned Document Collections
Jinru He, Hao Yan, Torsten Suel

On the Feasibility of Multi-Site Web Search Engines
Ricardo Baeza-Yates, Aristides Gionis, Flavio Junqueira, Vassilis Plachouras, Luca Telloli

On-line Index Maintenance Using Horizontal Partitioning
Sairam Gurajada, Sreenivasa Kumar P

Adaptive Geospatially Focused Crawling
Dirk Ahlers, Susanne Boll

Low-cost Management of Inverted Files for Online Full-Text Search
Giorgos Margaritis, Stergios Anastasiadis


         
3C - DB Track - Novel Data Management and Data Mining Tools

(Session Chair: Xue-Wen Chen, Unveristy of Kansas)
Location: Rm 201C

Bitmap Indexes for Relational XML Twig Query Processing
Kyong-Ha Lee, Bongki Moon

Answering XML Queries Using Materialized Views Revisited
Xiaoying Wu, Dimitri Theodoratos, Wendy Hui Wang

A Query Language for Analyzing Networks
Anton Dries, Siegfried Nijssen, Luc De Raedt

Probabilistic Models for Topic Learning from Images and Captions in Online Biomedical Literatures
Xin Chen, Caimei Lu, Yuan An, Palakorn Achananuparp

Learning to Rank with a Novel Kernel Perceptron Method
Xue-wen Chen, Haixun Wang, Xiaotong Lin


         
3D - KM Track - Semantic Techniques and Applications

(Session Chair: Evgeniy Gabrilovich, Yahoo! Research)
Location: RM 204

Towards a Universal Wordnet by Learning from Combined Evidence
Gerard de Melo, Gerhard Weikum

Event Detection from Flickr Data through Wavelet-based Spatial Analysis
Ling Chen, Abhishek Roy

Msuggest: A Semantic Recommender Framework for Traditional Chinese Medicine Book Search Engine
Shi Shaomin, Wei Baogang, Yang Yan

Interactive, Topic-based Visual Text Summarization and Analysis
Shixia Liu, Michelle Zhou, Shimei Pan, Weihong Qian, Weijia Cai, Xiaoxiao Lian

 

Session 4-Wednesday, Nov 4, 10:10-12:00

4A - KM Track - Graph Mining

(Session Chair: Michael Lyu, CUHK)
Location: Rm 201A

P-Rank: A Comprehensive Structural Similarity Measure over Information Networks
Peixiang Zhao, Han Jiawei, Yizhou Sun

Independent Informative Subgraph Mining for Graph Information Retrieval
Bingjun Sun, Prasenjit Mitra, C. Lee Giles

Graph Classification Based on Pattern Co-occurrence
Nin Jin, Calvin Young, Wei Wang

Frequent Subgraph Pattern Mining on Uncertain Graph Data
Zhaonian Zou, Jianzhong Li, Hong Gao, Shuo Zhang

L2 Norm Regularized Feature Kernel Regression For Graph Data
Hongliang Fei, Luke Huan


         
4B - IR Track - Evaluation

(Session Chair: Kalervo Jarvelin, University of Tampere)
Location: Rm 201B

Improvements That Don't Add Up: Ad-Hoc Retrieval Results Since 1998
Timothy Armstrong, Alistair Moffat, William Webber, Justin Zobel

Empirical Justification of the Gain and Discount Function for nDCG
Evangelos Kanoulas, Javed Aslam

Expected Reciprocal Rank for Graded Relevance
Olivier Chapelle, Donald Metzler, Ya Zhang, Pierre Grinspan

Usage Based Effectiveness Measures
Leif Azzopardi, Mark Baillie

Post-Rank Reordering: Resolving Preference Misalignments between Search Engines and End Users
Chao Liu


         
4C - DB Track - Information Integration, Data Provenance, Probabilistic Databases

(Session Chair: Nikos Mamoulis, HKU)
Location: Rm 201C

Probabilistic Skyline Queries
Christian Boehm, Frank Fiedler, Annahita Oswald, Claudia Plant, Bianca Wackersreuther

Density-based Clustering using Graphics Processors
Christian Boehm, Robert Noll, Claudia Plant, Bianca Wackersreuther

Scalable Continuous Range Monitoring of Moving Objects in Symbolic Indoor Space
Bin Yang, Hua Lu, Christian S. Jensen

Provenance Query Evaluation: What's so special about it?
Anastasios Kementsietsidis, Min Wang

Navigational Path Privacy Protection
Ken C.K. Lee, Wang-chien Lee, Hong Va Leong, Baihua Zheng

 


4D - Industry Track - Information Retrieval

(Session Chair: Anupam Joshi, University of Maryland, Baltimore County)
Location: Rm 204

Automatic Retrieval of Similar Content Using Search Engine Query Interface
Ali Dasdan, Paolo D'Alberto, Chris Drome, Santanu Kolay

Mashup-based Information Retrieval for Domain Experts
Anand Ranganathan, Anton Riabov, Octavian Udrea

A Study of Information Retrieval on Accumulative Social Descriptions using the Generation Features
Lichun, Yang, Shengliang Xu, Shenghua Bao, Dingyi Han, Zhong Su, Yong Yu

iMecho: An Associative Memory Based Desktop Search System
Jidong Chen, Hang Guo, Wentao Wu, Wei Wang 

Product Query Classification
Dou Shen

 

Session 5-Wednesday, Nov 4, 13:40-15:30

5A - KM Track - Information Filtering and Recommender Systems

(Session Chair: Ben Kao, HKU)
Location: RM 201A

Learning to Recommend Questions Based on User Ratings
Ke Sun, Cao Yunbo , Xinying Song, Chin-Yew Lin, Song Young-In, Xiaolong Wang

Probabilistic Latent Preference Analysis for Collaborative Filtering
Nathan Liu, Min Zhao, Qiang Yang

Semi-Nonnegative Matrix Factorization with Global Statistical Consistency for Collaborative Filtering
Hao Ma, Haixuan Yang, Irwin King, Michael Lyu

Voting in Social Networks
Paolo Boldi, Francesco Bonchi, Carlos Castillo, Sebastiano Vigna

User-induced Links in Collaborative Tagging Systems
Ching Man Au Yeung, Nicholas Gibbins, Nigel Shadbol


         
5B - IR Track - Ranking and Retrieval Models I

(Session Chair: Torsten Suel, Polytechnic Institute of NYU)
Location: Rm 201B

A Signal-to-Noise Approach to Score Normalization
Avi Arampatzis, Jaap Kamps

Nonlinear Static-Rank Computation
Shuming Shi, Bin Lu, Yunxiao Ma, Ji-Rong Wen

A General Magnitude-Preserving Boosting Algorithm for Search Ranking
Chenguang Zhu, Weizhu Chen, Zeyuan Zhu, Gang Wang

Learning to Rank from Bayesian Decision Inference
Jen-Wei Kuo, Pu-Jen Cheng, Hsin-Min Wang

Reducing the Risk of Query Expansion via Robust Constrained Optimization
Kevyn Collins-Thompson


         
5C - DB Track - Streams, Network Databases

(Session Chair: Raymond Wong, HKUST)
Location: Rm 201C

A Code Generation Approach to Optimizing High-Performance Distributed Data Stream Processing
Bugra Gedik, Henrique Andrade, Kun-Lung Wu

Efficient Join Processing on Uncertain Data Streams
Xiang Lian, Lei Chen

Fast Shortest Path Distance Estimation in Large Networks
Michalis Potamias, Francesco Bonchi, Carlos Castillo, Aristides Gionis

Evaluating Top-k Queries over Incomplete Data Streams
Haghani Parisa, Sebastian Michel, Karl Aberer

Mining Data Streams with Periodically Changing Distributions
Yingying Tao, Tamer Ozsu
         

 

Session 6-Thursday, Nov 5, 10:10-12:00

6A - KM Track - Classification and Clustering I

(Session Chair: Joost Kok, Leiden University)
Location: Rm 201A

Clustering Web Queries
John Whissell, Charles Clarke, Azin Ashkan

Evidence of Quality of Textual Features on the Web 2.0
Figueiredo Flavio, Belém Fabiano, Henrique Pinto, David Fernandes, Jussara Almeida, Marcos Gonçalves, Edleno Moura, Marco Cristo

Exploiting Internal and External Semantics for the Clustering of Short Texts Using World Knowledge
Xia Hu, Nan Sun, Chao Zhang, Tat-Seng Chua

SELC: A Self-Supervised Model for Sentiment Classification
Likun Qiu, Weishi Zhang, Changjian Hu, Kai Zhao

Graph-based Transfer Learning
Jingrui He, Yan Liu, Richard Lawrence

 

6B - IR Track - Domain-Specific Retrieval II

(Session Chair: ChengXiang Zhai, University of Illinois at Urbana-Champaign)
Location: Rm 201B       

A Unified Relevance Model for Opinion Retrieval
Xuanjing Huang, Bruce Croft

Detecting Topic Evolution in Scientific Literature: How Can Citations Help?
Qi He, Bi Chen, Jian Pei, Baojun Qiu, Prasenjit Mitra, C. Lee Giles

Efficient Information Retrieval in Mobile Peer-to-Peer Networks
Lijiang Chen, Cui Bin, Heng Tao Shen, Wei Lu, Xiaofang Zhou

Language-model-based Ranking for Queries on RDF-Graphs
Shady Elbassuoni, Maya Ramanath, Ralf Schenkel, Marcin Sydow, Gerhard Weikum

Heterogeneous Cross Domain Ranking in Latent Space
Bo Wang, Jie Tang, Wei Fan, Songcan Chen, Yanzhu Liu

 

6C - DB Track - Data Warehousing and OLAP

(Session Chair: Xiaofeng Meng, Renmin University)
Location: Rm 201C

Supporting Ranking Pattern-Based Aggregate Queries in Sequence Data Cubes
Chun-Kit Chui, Eric Lo, Ben Kao, Wai-Shing Ho

Fuzzy Semantic Web Ontology Learning from Fuzzy UML Model
Fu Zhang, Z. M. Ma, Jingwei Cheng, Xiangfu Meng

Efficient Joins with Compressed Bitmap Indexes
Kamesh Madduri, Kesheng Wu

A Framework for Semantic Link Discovery over Relational Data
Oktie Hassanzadeh, Anastasios Kementsietsidis, Lipyeow Lim, Renee J. Miller, Min Wang

POkA : Identifying Pareto-Optimal k-Anonymous Nodes in a Domain Hierarchy Lattice
Rinku Dewri, Indrajit Ray, Indrakshi Ray, Darrell Whitley

 

6D - Industry Track - Data Mining framework and applications

(Session Chair: Sanjay Madria, University of Missouri-Rolla)
Location: Rm 204

Practical Lessons of Data Mining at Yahoo!
Ye Chen, Dmitry Pavlov, Pavel Berkhin, Aparna Seetharaman, Albert Meltzer

Domain Driven Data Mining to Improve Promotional Campaign ROI and Select Marketing Channels
Thomas Piton, Julien Blanchard, Henri Briand, Fabrice Guillet

Framework for Timely and Accurate Ads on Mobile Devices
Alex Penev, Raymond K. Wong

Improving Web Page Classification by Label-propagation over Click Graphs
Soo-Min Kim, Patrick Pantel, Lei Duan, Scott Gaffney

Product Feature Categorization with Multilevel Latent Semantic Association
Honglei Guo, Huijia Zhu, Zhili Guo, Zhang Xiaoxun, Zhong Su

 

Session 7-Thursday, Nov 5, 13:45-15:15

7A - KM Track - Link Analysis and Social Computing

(Session Chair: Irwin King, CUHK)
Location: Rm 201A

Completing Wikipedia's Hyperlink Structure through Dimensionality Reduction
Robert West, Joelle Pineau, Doina Precup

Scalable Learning of Collective Behavior Based on Sparse Social Dimensions
Lei Tang, Huan Liu

Blog Cascade Affinity: Analysis and Prediction
Sourav S Bhowmick, Hui Li, Aixin Sun

Socializing or Knowledge Sharing? Characterizing Social Intent in Community Question Answering
Eduarda Mendes Rodrigues, Natasa Milic-Frayling

 

7B - KM Track - Data Summarization

(Session Chair: William Cheung, HKBU)
Location: Rm 201B

Time Sequence Summarization to Scale Up Chronology-dependent Applications
Quang-Khai Pham, Guillaume Raschia, Regis Saint-Paul, Boualem Benatallah, Noureddine Mouaddib

Compressing Tags to Find Interesting Media Groups
Matthijs van Leeuwen, Francesco Bonchi, Börkur Sigurbjörnsson, Arno Siebes

Efficient Feature Weighting Methods for Ranking
Hwanjo Yu, Jinoh Oh, Wook-Shin Han

Fast and Effective Histogram Construction
Felix Halim, Panagiotis Karras, Roland Yap

 

7C - Industry Track - Data and Query Similarity

(Session Chair: Youngja Park, IBM Research - Watson)
Location: Rm 201C

Characterizing, Constructing and Managing Resource Usage Profiles of System S Applications: Challenges and Experience
Sujay Parekh, Deepak Rajan, Kirsten Hildrum, Joel Wolf, Kun-Lung Wu

Generating SQL/XML Query and Update Statements
Matthias Nicola, Tim Kiefer

A System for Detecting XML Similarity in Content and Structure Using Relational Database
Sanjay Madria, Waraporn Viyanon

Characteristics of Document Similarity Measures for Compliance Analysis
Asad Sayeed, Soumitra Sarkar, Yu Deng, Rafah Hosn, Ruchi Mahindru, Nithya Rajamani

 

Session 8-Thursday, Nov 5, 15:40-17:30

8A - IR Track - Personalization and Social Search II

(Session Chair: Avi Arampatzis, University of Amsterdam)
Location: RM 201A

PQC: Personalized Query Classification
Bin Cao

Personalized Social Search Based on the User's Social Network
David Carmel, Naama Zwerdling, Ido Guy, Shila Ofek-Koifman, Nadav Har'el, Inbal Ronen, Erel Uziel, Sivan Yogev, Sergey Chernov

Beyond Hyperlinks: Organizing Information Footprints in Search Logs to Support Effective Browsing
Wang Xuanhui, ChengXiang Zhai

A Social Recommendation Framework Based on Multi-scale Continuous Conditional Random Fields
Xin Xin, Hongbo Deng, Irwin King, Michael Lyu

Enhancing Recommender Systems under Volatile User Interest Drifts
Jie Yang, Enhong Chen, Huanhuan Cao, Hui Xiong


        
8B - IR Track - Ranking and Retrieval Models II

(Session Chair: Kevyn Collins-Thompson, Microsoft Research)
Location: Rm 201B

A Term Dependency-Based Approach for Query Terms Ranking
Chia-Jung Lee, Ruey-Cheng Chen, Pu-Jen Cheng

Classification-Based Resource Selection
Jaime Arguello, Jamie Callan, Fernando Diaz

Probabilistic Models of Ranking Novel Documents for Faceted Topic Retrieval
Ben Carterette, Praveen Chandar

Retrieval Experiments using Pseudo-Desktop Collections
Jinyoung Kim, Bruce Croft

Incident Threading for News Passages
Ao Feng, James Allan


        
8C - KM Track - Classification and Clustering II

(Session Chair: Clement Yu, University of Illinois at Chicago)
Location: Rm 201C

Detection of Orthogonal Concepts in Subspaces of High Dimensional Data
Stephan Günnemann, Emmanuel Müller, Ines Färber, Thomas Seidl

Large Margin Transductive Transfer Learning
Quanz Brian, Luke Huan

Subspace Maximum Margin Clustering
Gu Quanquan, Jie Zhou

A Risk Minimization Framework for Domain Adaptation
Bo Long, Sudarshan Lamkhede, Srinivas Vadrevu, Ya Zhang, Belle Tseng


        
8D - Industry Track - Call and Web Center, E-Commerce related Technologies

(Session Chair: Mukesh Mohania, IBM Research - India)
Location: Rm 204

ExSearch: A Novel Vertical Search Engine for Online Barter Business
Lei JI, Jun Yan, Ning Liu

iLoc: A Framework for Incremental Location-State Acquisition and Prediction based on Mobile Sensors
Ma Yiming

Predicting the Conversion Probability for Items on C2CEcommerce Sites
Xiaoyuan Wu, Alvaro Bolivar

Towards Real-Time Measurement of Customer Satisfaction Using Automatically Generated Call Transcripts
Youngja Park, Stephen Gates

ROSE: Retail Outlet Site Evaluation by Learning with both Sample and Feature Preference
Bin Zhang, Shouchun Chen, Ming Xie, Li Xia, Wenjun Yin, Jin Dong

 

Platinum Supporters

Gold Supporters

Bronze Supporters

Organizations