Lecture Notes in Computer Science 6184 - UQ eSpace

Lecture Notes in Computer Science
Commenced Publication in 1973
Founding and Former Series Editors:
Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen
Editorial Board
David Hutchison
Lancaster University, UK
Takeo Kanade
Carnegie Mellon University, Pittsburgh, PA, USA
Josef Kittler
University of Surrey, Guildford, UK
Jon M. Kleinberg
Cornell University, Ithaca, NY, USA
Alfred Kobsa
University of California, Irvine, CA, USA
Friedemann Mattern
ETH Zurich, Switzerland
John C. Mitchell
Stanford University, CA, USA
Moni Naor
Weizmann Institute of Science, Rehovot, Israel
Oscar Nierstrasz
University of Bern, Switzerland
C. Pandu Rangan
Indian Institute of Technology, Madras, India
Bernhard Steffen
TU Dortmund University, Germany
Madhu Sudan
Microsoft Research, Cambridge, MA, USA
Demetri Terzopoulos
University of California, Los Angeles, CA, USA
Doug Tygar
University of California, Berkeley, CA, USA
Gerhard Weikum
Max-Planck Institute of Computer Science, Saarbruecken, Germany
6184
Lei Chen Changjie Tang Jun Yang
Yunjun Gao (Eds.)
Web-Age
Information Management
11th International Conference, WAIM 2010
Jiuzhaigou, China, July 15-17, 2010
Proceedings
13
Volume Editors
Lei Chen
Hong Kong University of Science and Technology
Department of Computer Science
Clear Water Bay, Kowloon, Hong Kong, China
E-mail: leichen@cs.ust.hk
Changjie Tang
Sichuan University, Computer Department
Chengdu 610064, China
E-mail: cjtang@scu.edu.cn
Jun Yang
Duke University, Department of Computer Science
Box 90129, Durham, NC 27708-0129, USA
E-mail: junyang@cs.duke.edu
Yunjun Gao
Zhejiang University, College of Computer Science
388 Yuhangtang Road, Hangzhou 310058, China
E-mail: gaoyj@zju.edu.cn
Library of Congress Control Number: 2010929625
CR Subject Classification (1998): H.3, H.4, I.2, C.2, H.2, H.5
LNCS Sublibrary: SL 3 – Information Systems and Application, incl. Internet/Web
and HCI
ISSN
ISBN-10
ISBN-13
0302-9743
3-642-14245-1 Springer Berlin Heidelberg New York
978-3-642-14245-1 Springer Berlin Heidelberg New York
This work is subject to copyright. All rights are reserved, whether the whole or part of the material is
concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting,
reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication
or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965,
in its current version, and permission for use must always be obtained from Springer. Violations are liable
to prosecution under the German Copyright Law.
springer.com
© Springer-Verlag Berlin Heidelberg 2010
Printed in Germany
Typesetting: Camera-ready by author, data conversion by Scientific Publishing Services, Chennai, India
Printed on acid-free paper
06/3180
Preface
WAIM is a leading international conference on research, development, and applications of Web technologies, database systems, and information management. Traditionally, WAIM has drawn the strongest participation from the Asia-Pacific region.
The previous WAIM conferences were held in Shanghai (2000), Xi'an (2001), Beijing
(2002), Chengdu (2003), Dalian (2004), Hangzhou (2005), Hong Kong (2006),
Huangshan (2007), Zhangjiajie (2008), and Suzhou (2009). In 2010, WAIM was held
in Jiuzhaigou, Sichuan, China.
This high-quality program would not have been possible without the authors who
chose WAIM for disseminating their contributions. Out of 205 submissions from 16
countries and regions, including Australia, Canada, France, Germany, Hong Kong,
Japan, Korea, Macau, Malaysia, Mainland China, Saudi Arabia, Singapore, Taiwan,
Thailand, UK, and USA, we selected 58 full papers and 11 short papers for publication. The acceptance rate for regular full papers was 28%. The contributed papers
addressed a wide range of topics such as Web, XML, and multimedia data, data processing in the cloud or on new hardware, data mining and knowledge discovery, information integration and extraction, networked data and social networks, graph and
stream processing, similarity search, etc. We are also grateful to our distinguished
keynote speakers Prof. Jianzhong Li, Dr. Divesh Srivastava, Prof. Katsumi Tanaka,
and Prof. Xiaofang Zhou.
A conference like WAIM can only succeed as a team effort. We want to thank the
Program Committee members and the reviewers for their invaluable efforts. Special
thanks go to the local Organizing Committee headed by Changjie Tang, Aoying Zhou,
and Lei Duan. Many thanks also go to our Workshop Co-chairs (Jian Pei and Hengtao
Shen), Tutorial Co-chairs (Liu Wenyin and Jian Yang), Publicity Co-chairs (Hua
Wang and Shuigeng Zhou), Industrial Chairs (Qiming Chen and Haixun Wang), Registration Chair (Chuan Li), and Finance Co-chairs (Howard Leung and Yu Chen). Last
but not least, we wish to express our gratitude for the hard work of our webmaster Jie
Zuo, and for our sponsors who generously supported the smooth running of our
conference.
Lei Chen
Changjie Tang
Jun Yang
Masaru Kitsuregawa
Qing Li
WAIM 2010 Conference Organization
Honorary Chair
Yi Zhang
Sichuan University, China
Conference Co-chairs
Masaru Kitsuregawa
Qing Li
University of Tokyo, Japan
City University of Hong Kong, Hong Kong
Program Committee Co-chairs
Lei Chen
Changjie Tang
Jun Yang
Hong Kong University of Science and Technology,
Hong Kong
Sichuan University, China
Duke University, USA
Local Organization Co-chairs
Aoying Zhou
Lei Duan
East China Normal University, China
Sichuan University, China
Workshops Co-chairs
Jian Pei
Hengtao Shen
Simon Fraser University, Canada
University of Queensland, Australia
Tutorial/Panel Co-chairs
Wenyin Liu
Jian Yang
City University of Hong Kong, Hong Kong
Macquarie University, Australia
Industrial Co-chairs
Qiming Chen
Haixun Wang
HP Labs, Palo Alto, USA
Microsoft Research Asia, China
VIII
Organization
Publication Chair
Yunjun Gao
Zhejiang University, China
Publicity Co-chairs
Hua Wang
Shuigeng Zhou
University of Southern Queensland, Australia
Fudan University, China
Finance Co-chairs
Howard Leung
Yu Chen
Hong Kong Web Society, Hong Kong
Sichuan University, China
Registration Chair
Chuan Li
Sichuan University, China
CCF DB Society Liaison
Xiaofeng Meng
Renmin University of China, China
Steering Committee Liaison
Zhiyong Peng
Wuhan University, China
Web Master
Jie Zuo
Sichuan University, China
Program Committee
James Bailey
Gang Chen
Hong Chen
Yu Chen
Reynold Cheng
David Cheung
Dickson Chiu
Byron Choi
Bin Cui
Alfredo Cuzzocrea
University of Melbourne, Australia
Zhejiang University, China
Chinese Univeristy of Hong Kong, Hong Kong
Sichuan University, China
The University of Hong Kong, Hong Kong
The University of Hong Kong, Hong kong
Dickson Computer Systems, Hong Kong
Hong Kong Baptist University, Hong Kong
Peking University, China
University of Calabria, Italy
Organization
Guozhu Dong
Xiaoyong Du
Lei Duan
Ling Feng
Johann Gamper
Bryon Gao
Yong Gao
Jihong Guan
Giovanna Guerrini
Bingsheng He
Jimmy Huang
Seung-won Hwang
Wee Hyong
Yoshiharu Ishikawa
Yan Jia
Ruoming Jin
Ning Jing
Ben Kao
Yong Kim
Nick Koudas
Wu Kui
Carson Leung
Chengkai Li
Chuan Li
Feifei Li
Tao Li
Tianrui Li
Zhanhuai Li
Zhoujun Li
Xiang Lian
Lipeow Lim
Xuemin Lin
Huan Liu
Lianfang Liu
Qizhi Liu
Weiyi Liu
Wenyin Liu
Eric Lo
Zongmin Ma
Weiyi Meng
Mohamed Mokbel
Yang-Sae Moon
Akiyo Nadamoto
Miyuki Nakano
IX
Wright State University, USA
Renmin University of China, China
Sichuan University, China
Tsinghua University, China
Free University of Bozen-Bolzano, Italy
Texas State University at San Marcos, USA
Univeristy of British Columbia, Canada
Tongji University, China
Università di Genova, Italy
Chinese Univeristy of Hong Kong, Hong Kong
York Univeristy, Canada
Pohang University of Science and Technology,
Korea
Microsoft
Nagoya University, Japan
National University of Defence Technology, China
Kent State University, USA
National University of Defence Technology, China
The University of Hong Kong, Hong Kong
Korea Education & Research Information Service,
Korea
Univeristy of Toronto, Canada
Victoria University, Canada
University of Manitoba, Canada
University of Texas at Arlington, USA
Sichuan University, China
Florida State University, USA
Florida International University, USA
Southwest Jiaotong University, China
Northwestern Polytechnical University, China
Beihang University, China
Hong Kong University of Science and Technology,
Hong Kong
University of Hawaii at Manoa, USA
University of New South Wales, Australia
Arizona State University, USA
Computing Center of Guangxi, China
Nanjing University, China
Yunnan University, China
City Univeristy of Hong Kong
Hong Kong Polytechnic University, Hong Kong
Northeastern University, China
State University of New York at Binghamton, USA
University of Minnesota, USA
Kangwon National University, Korea
Konan University, Japan
University of Tokyo, Japan
X
Organization
Raymond Ng
Anne Ngu
Tadashi Ohmori
Olga Papaemmanouil
Zhiyong Peng
Evaggelia Pitoura
Tieyun Qian
Shaojie Qiao
Markus Schneider
Hengtao Shen
Yong Tang
David Taniar
Maguelonne Teisseire
Anthony Tung
Shunsuke Uemura
Jianyong Wang
Ke Wang
Tengjiao Wang
Wei Wang
Raymond Wong
Raymond Chi-Wing Wong
Xintao Wu
Yuqing Wu
Junyi Xie
Li Xiong
Jianliang Xu
Jian Yang
Xiaochun Yang
Ke Yi
Hwanjo Yu
Jeffrey Yu
Lei Yu
Philip Yu
Ting Yu
Xiaohui Yu
Demetris Zeinalipour
Donghui Zhang
Ji Zhang
Baihua Zheng
Aoying Zhou
Shuigeng Zhou
Xiangmin Zhou
Qiang Zhu
Lei Zou
University of British Columbia, Canada
Texas State University at San Marcos, USA
University of Electro Communications, Japan
Brandeis University, USA
Wuhan University, China
University of Ioannina, Greece
Wuhan University, China
Southwest Jiaotong University, China
University of Florida, USA
University of Queensland, Australia
Sun Yat-sen University, China
Monash University, Australia
University Montpellier 2, France
National University of Singapore, Singapore
Nara Sangyo University, Japan
Tsinghua University, China
Simon Fraser University, Canada
Peking University, China
University of New South Wales, Australia
University of New South Wales, Australia
Hong Kong University of Science and Technology,
Hong Kong
University of North Carolina at Charlotte, USA
Indiana University at Bloomington, USA
Oracle Corp., USA
Emory University, USA
Hong Kong Baptist University, Hong Kong
Macquaire University, Australia
Northeastern University, China
Hong Kong University of Science and Technology,
Hong Kong
Pohang University of Science and Technology,
Korea
Chinese Univeristy of Hong Kong, Hong Kong
State University of New York at Binghamton, USA
University of Illinois at Chicago, USA
North Carolina State University, USA
York University, Canada
University of Cyprus, Cyprus
Microsoft Jim Gray Systems Lab, USA
University of Southern Queensland, Australia
Singapore Management University, Singapore
East China Normal University, China
Fudan University, China
CSIRO, Australia
University of Michigan at Dearborn, USA
Peking University, China
Organization
Organized by
Sichuan University
Sponsored by
华东师范大学
EAST CHINA
NORMAL UNIVERSITY
XI
Table of Contents
Analyzing Data Quality Using Data Auditor (Keynote Abstract) . . . . . . .
Divesh Srivastava
1
Rebuilding the World from Views (Keynote Abstract) . . . . . . . . . . . . . . . .
Xiaofang Zhou and Henning Köhler
2
Approximate Query Processing in Sensor Networks
(Keynote Abstract) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Jianzhong Li
3
Web Data I
Duplicate Identification in Deep Web Data Integration . . . . . . . . . . . . . . . .
Wei Liu, Xiaofeng Meng, Jianwu Yang, and Jianguo Xiao
5
Learning to Detect Web Spam by Genetic Programming . . . . . . . . . . . . . .
Xiaofei Niu, Jun Ma, Qiang He, Shuaiqiang Wang, and
Dongmei Zhang
18
Semantic Annotation of Web Objects Using Constrained Conditional
Random Fields . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Yongquan Dong, Qingzhong Li, Yongqing Zheng, Xiaoyang Xu, and
Yongxin Zhang
Time Graph Pattern Mining for Web Analysis and Information
Retrieval . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Taihei Oshino, Yasuhito Asano, and Masatoshi Yoshikawa
28
40
Networked Data
FISH: A Novel Peer-to-Peer Overlay Network Based on
Hyper-deBruijn . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Ye Yuan, Guoren Wang, and Yongjiao Sun
47
Continuous Summarization of Co-evolving Data in Large Water
Distribution Network . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Hongmei Xiao, Xiuli Ma, Shiwei Tang, and Chunhua Tian
62
Proactive Replication and Search for Rare Objects in Unstructured
Peer-to-Peer Networks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Guoqiang Gao, Ruixuan Li, Kunmei Wen, Xiwu Gu, and
Zhengding Lu
74
XIV
Table of Contents
SWORDS: Improving Sensor Networks Immunity under Worm
Attacks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Nike Gui, Ennan Zhai, Jianbin Hu, and Zhong Chen
Efficient Multiple Objects-Oriented Event Detection over RFID Data
Streams . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Shanglian Peng, Zhanhuai Li, Qiang Li, Qun Chen, Hailong Liu,
Yanming Nie, and Wei Pan
86
97
Social Networks
CW2I: Community Data Indexing for Complex Query Processing . . . . . .
Mei Hui, Panagiotis Karras, and Beng Chin Ooi
103
Clustering Coefficient Queries on Massive Dynamic Social Networks . . . .
Zhiyu Liu, Chen Wang, Qiong Zou, and Huayong Wang
115
Predicting Best Answerers for New Questions in Community Question
Answering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Mingrong Liu, Yicen Liu, and Qing Yang
Semantic Grounding of Hybridization for Tag Recommendation . . . . . . . .
Yan’an Jin, Ruixuan Li, Yi Cai, Qing Li, Ali Daud, and Yuhua Li
Rich Ontology Extraction and Wikipedia Expansion Using Language
Resources . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Christian Schönberg, Helmuth Pree, and Burkhard Freitag
127
139
151
Cloud Computing
Fine-Grained Cloud DB Damage Examination Based on Bloom
Filters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Min Zhang, Ke Cai, and Dengguo Feng
XML Structural Similarity Search Using MapReduce . . . . . . . . . . . . . . . . .
Peisen Yuan, Chaofeng Sha, Xiaoling Wang, Bin Yang,
Aoying Zhou, and Su Yang
Comparing Hadoop and Fat-Btree Based Access Method for Small File
I/O Applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Min Luo and Haruo Yokota
157
169
182
Data Mining I
Mining Contrast Inequalities in Numeric Dataset . . . . . . . . . . . . . . . . . . . . .
Lei Duan, Jie Zuo, Tianqing Zhang, Jing Peng, and Jie Gong
194
Table of Contents
Users’ Book-Loan Behaviors Analysis and Knowledge Dependency
Mining . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Fei Yan, Ming Zhang, Jian Tang, Tao Sun, Zhihong Deng, and
Long Xiao
An Extended Predictive Model Markup Language for Data Mining . . . . .
Xiaodong Zhu and Jianzheng Yang
A Cross-Media Method of Stakeholder Extraction for News Contents
Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Ling Xu, Qiang Ma, and Masatoshi Yoshikawa
XV
206
218
232
Stream Processing
An Efficient Approach for Mining Segment-Wise Intervention Rules in
Time-Series Streams . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Yue Wang, Jie Zuo, Ning Yang, Lei Duan, Hong-Jun Li, and
Jun Zhu
Automated Recognition of Sequential Patterns in Captured Motion
Streams . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Liqun Deng, Howard Leung, Naijie Gu, and Yang Yang
Online Pattern Aggregation over RFID Data Streams . . . . . . . . . . . . . . . .
Hailong Liu, Zhanhuai Li, Qun Chen, and Shanglian Peng
Cleaning Uncertain Streams by Parallelized Probabilistic Graphical
Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Qian Zhang, Shan Wang, and Biao Qin
238
250
262
274
Graph Processing
Taming Computational Complexity: Efficient and Parallel SimRank
Optimizations on Undirected Graphs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Weiren Yu, Xuemin Lin, and Jiajin Le
280
DSI: A Method for Indexing Large Graphs Using Distance Set . . . . . . . . .
Yubo Kou, Yukun Li, and Xiaofeng Meng
297
K-Radius Subgraph Comparison for RDF Data Cleansing . . . . . . . . . . . . .
Hai Jin, Li Huang, and Pingpeng Yuan
309
Query Processing
A Novel Framework for Processing Continuous Queries on Moving
Objects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Liang Zhao, Ning Jing, Luo Chen, and Zhinong Zhong
321
XVI
Table of Contents
Group Visible Nearest Neighbor Queries in Spatial Databases . . . . . . . . .
Hu Xu, Zhicheng Li, Yansheng Lu, Ke Deng, and Xiaofang Zhou
iPoc: A Polar Coordinate Based Indexing Method for Nearest Neighbor
Search in High Dimensional Space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Zhang Liu, Chaokun Wang, Peng Zou, Wei Zheng, and
Jianmin Wang
Join Directly on Heavy-Weight Compressed Data in Column-Oriented
Database . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Gan Liang, Li RunHeng, Jia Yan, and Jin Xin
333
345
357
Potpourri
Exploiting Service Context for Web Service Search Engine . . . . . . . . . . . .
Rong Zhang, Koji Zettsu, Yutaka Kidawara, and Yasushi Kiyoki
363
Building Business Intelligence Applications Having Prescriptive and
Predictive Capabilities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Chen Jiang, David L. Jensen, Heng Cao, and Tarun Kumar
376
FileSearchCube: A File Grouping Tool Combining Multiple Types of
Interfile-Relationships . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Yousuke Watanabe, Kenichi Otagiri, and Haruo Yokota
386
Trustworthy Information: Concepts and Mechanisms . . . . . . . . . . . . . . . . .
Shouhuai Xu, Haifeng Qian, Fengying Wang, Zhenxin Zhan,
Elisa Bertino, and Ravi Sandhu
398
Web Data II
How to Design Kansei Retrieval Systems? . . . . . . . . . . . . . . . . . . . . . . . . . . .
Yaokai Feng and Seiichi Uchida
405
Detecting Hot Events from Web Search Logs . . . . . . . . . . . . . . . . . . . . . . . .
Yingqin Gu, Jianwei Cui, Hongyan Liu, Xuan Jiang, Jun He,
Xiaoyong Du, and Zhixu Li
417
Evaluating Truthfulness of Modifiers Attached to Web Entity Names . . .
Ryohei Takahashi, Satoshi Oyama, Hiroaki Ohshima, and
Katsumi Tanaka
429
Searching the Web for Alternative Answers to Questions on WebQA
Sites . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Natsuki Takata, Hiroaki Ohshima, Satoshi Oyama, and
Katsumi Tanaka
Domain-Independent Classification for Deep Web Interfaces . . . . . . . . . . .
Yingjun Li, Siwei Wang, Derong Shen, Tiezheng Nie, and Ge Yu
441
453
Table of Contents
XVII
Data Mining II
Data Selection for Exact Value Acquisition to Improve Uncertain
Clustering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Yu-Chieh Lin, De-Nian Yang, and Ming-Syan Chen
459
Exploring the Sentiment Strength of User Reviews . . . . . . . . . . . . . . . . . . .
Yao Lu, Xiangfei Kong, Xiaojun Quan, Wenyin Liu, and Yinlong Xu
471
Semantic Entity Detection by Integrating CRF and SVM . . . . . . . . . . . . .
Peng Cai, Hangzai Luo, and Aoying Zhou
483
An Incremental Method for Causal Network Construction . . . . . . . . . . . . .
Hiroshi Ishii, Qiang Ma, and Masatoshi Yoshikawa
495
DCUBE: CUBE on Dirty Databases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Guohua Jiang, Hongzhi Wang, Shouxu Jiang, Jianzhong Li, and
Hong Gao
507
XML and Images
An Algorithm for Incremental Maintenance of Materialized XPath
View . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Xueyun Jin and Husheng Liao
Query Processing in INM Database System . . . . . . . . . . . . . . . . . . . . . . . . .
Jie Hu, Qingchuan Fu, and Mengchi Liu
513
525
Fragile Watermarking for Color Image Recovery Based on Color Filter
Array Interpolation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Zhenxing Qian, Guorui Feng, and Yanli Ren
537
A Hybrid-Feature-Based Efficient Retrieval over Chinese Calligraphic
Manuscript Image Repository . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Yi Zhuang and Chengxiang Yuan
544
Efficient Filtering of XML Documents with XPath Expressions
Containing Ancestor Axis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Bo Ning, Chengfei Liu, and Guoren Wang
551
New Hardware
ACAR: An Adaptive Cost Aware Cache Replacement Approach for
Flash Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Yanfei Lv, Xuexuan Chen, and Bin Cui
GPU-Accelerated Predicate Evaluation on Column Store . . . . . . . . . . . . . .
Ren Wu, Bin Zhang, Meichun Hsu, and Qiming Chen
558
570
XVIII
Table of Contents
MOSS-DB: A Hardware-Aware OLAP Database . . . . . . . . . . . . . . . . . . . . .
Yansong Zhang, Wei Hu, and Shan Wang
582
Similarity Search
Efficient Duplicate Record Detection Based on Similarity Estimation . . .
Mohan Li, Hongzhi Wang, Jianzhong Li, and Hong Gao
A Novel Composite Kernel for Finding Similar Questions in CQA
Services . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Jun Wang, Zhoujun Li, Xia Hu, and Biyun Hu
Efficient Similarity Query in RFID Trajectory Databases . . . . . . . . . . . . . .
Yanqiu Wang, Ge Yu, Yu Gu, Dejun Yue, and Tiancheng Zhang
595
608
620
Information Extraction
Context-Aware Basic Level Concepts Detection in Folksonomies . . . . . . .
Wen-hao Chen, Yi Cai, Ho-fung Leung, and Qing Li
632
Extracting 5W1H Event Semantic Elements from Chinese Online
News . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Wei Wang, Dongyan Zhao, Lei Zou, Dong Wang, and Weiguo Zheng
644
Automatic Domain Terminology Extraction Using Graph Mutual
Reinforcement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Jingjing Kang, Xiaoyong Du, Tao Liu, and He Hu
656
Knowledge Discovery
Semi-supervised Learning from Only Positive and Unlabeled Data
Using Entropy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Xiaoling Wang, Zhen Xu, Chaofeng Sha, Martin Ester, and
Aoying Zhou
668
Margin Based Sample Weighting for Stable Feature Selection . . . . . . . . . .
Yue Han and Lei Yu
680
Associative Classifier for Uncertain Data . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Xiangju Qin, Yang Zhang, Xue Li, and Yong Wang
692
Information Integration
Automatic Multi-schema Integration Based on User Preference . . . . . . . .
Guohui Ding, Guoren Wang, Junchang Xin, and Huichao Geng
704
EIF: A Framework of Effective Entity Identification . . . . . . . . . . . . . . . . . .
Lingli Li, Hongzhi Wang, Hong Gao, and Jianzhong Li
717
Table of Contents
A Multilevel and Domain-Independent Duplicate Detection Model for
Scientific Database . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Jie Song, Yubin Bao, and Ge Yu
XIX
729
Extending Databases
Generalized UDF for Analytics Inside Database Engine . . . . . . . . . . . . . . .
Meichun Hsu, Qiming Chen, Ren Wu, Bin Zhang, and Hans Zeller
742
Efficient Continuous Top-k Keyword Search in Relational Databases . . . .
Yanwei Xu, Yoshiharu Ishikawa, and Jihong Guan
755
V Locking Protocol for Materialized Aggregate Join Views on B-Tree
Indices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Gang Luo
768
Web Information Credibility (Keynote Abstract) . . . . . . . . . . . . . . . . . . . . .
Katsumi Tanaka
781
Author Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
783