APWeb 2016
The 18th Asia Pacific Web Conference
The Asia Pacific Web Conference, a leading international conference on research, development and applications of Web technologies, database systems, information management and software engineering, is aiming at attracting professionals of different communities such as industry and academic from not only Asia Pacific countries but also other continents. The objective is to share the experience in the area of the World Wide Web with the underlying technologies and applications.
Important Dates:
Camera-Ready Deadline: July 1, 2016
Author Registration Deadline: July 31, 2016
Conference Time: Sept 23 – Sept 25, 2016
About Suzhou
The long history of Suzhou City has left behind many attractive scenic spots and historical sites with beautiful and interesting legends. The elegant classical gardens, the old-fashioned houses and delicate bridges hanging over flowing waters in the drizzling rain, the beautiful lakes with undulating hills in lush green, and the exquisite arts and crafts, etc. have made Suzhou a renowned historical and cultural city full of eternal and poetic charm.
Suzhou is best known for its gardens: Humble Administrator's Garden, Lingering Garden, the Surging Wave Pavilion, and the Master of Nets Garden. These gardens weave together the best of traditional Chinese architecture, painting and arts.
Suzhou is also known as the "Venice of the East". The city is sandwiched between Taihu Lake and Grand Canal. Network of cannels, criss-crossed with hump-backed bridges, give Suzhou an image of City on the water.
Previous APWeb
Previous APWeb conferences were held in Beijing (1998), Hong Kong (1999), Xi'an (2000), Changsha (2001), Xi'an (2003), Hangzhou (2004), Shanghai (2005), Harbin (2006), Huangshan (2007), Shenyang (2008), Suzhou (2009), Busan (2010), Beijing (2011), Kunming (2012), Sydney (2013), Changsha(2014) and Guangzhou (2015).
APWeb 2016 will take place at Suzhou, China. It is a city with a long history on the lower reaches of the Yangtze River and on the shores of Lake Taihu in the province of Jiangsu, China.
Keynotes
Professor Zhi-Hua Zhou Nanjing University Title: Big Data Incremental Learning
Abstract: Traditional learning approaches usually try to collect all available data and then train a model. In big data applications, however, the data usually come in an accumulation or streaming way. Thus, it is more desirable to do incremental learning rather than training a new model from scratch when receiving new data. It is noteworthy that some important losses used in machine learning are quite challenging for incremental optimization. Moreover, in addition to new training samples, new classes may also occur. In this talk we will introduce some studies along this direction. Short Biography: Zhi-Hua Zhou is a Professor and Founding Director of the LAMDA Group at Nanjing University. He authored the book "Ensemble Methods: Foundations and Algorithms", and published more than 100 papers in top-tier journals and conference proceedings. His work have received more than 22,000 citations, with a h-index of 71. He also holds 14 patents and has good experiences in industrial applications. He has received various awards, including the National Natural Science Award of China, the IEEE CIS Outstanding Early Career Award, the Microsoft Professorship Award, 12 international journal/conference paper/competition awards, etc. He serves as the Executive Editor-in-Chief of Frontiers of Computer Science, Associate Editor-in-Chief of Science China, and Associate Editor of ACM TIST, IEEE TNNLS, etc. He founded ACML (Asian Conference on Machine Learning) and served as General Chair of ICDM’16, PAKDD’14, etc., Program Chair of IJCAI’15 Machine Learning track, SDM’13, etc. He also serves as Advisory Committee member for IJCAI 2015-2016, and Steering Committee Member of PAKDD and PRICAI. He is a Fellow of the AAAI, IEEE, IAPR, IET/IEE, CCF, and an ACM Distinguished Scientist. |
Professor Yufei Tao The University of Queensland Title: Small and Sweet MapReduce Algorithms Abstract: MapReduce has grown into a matured and powerful paradigm for large-scaled parallel computing. This keynote will introduce principles for designing algorithms on this paradigm that are both (i) small, i.e., they can be implemented in a real system with reasonable efforts, and (ii) sweet, i.e., they possess strong theoretical performance guarantees. Assuming little prior knowledge, we will start with the definition of the massively parallel computation (MPC) model, which has nowadays become a popular model in the database community for studying MapReduce algorithms. We will then move on to discuss MPC algorithms that can solve several fundamental database problems (particular, sorting and joins) optimally. The talk will end with several open problems exciting in the eyes of the speaker. Short Biography: Yufei Tao is a Professor at the School of Information Technology and Electrical Engineering, the University of Queensland (UQ). Prior to joining UQ, he held professorial positions at the City University of Hong Kong, the Chinese University of Hong Kong (CUHK), and the Korea Advanced Institute of Science and Technology (KAIST). He served as an associate editor of ACM Transactions on Database Systems (TODS) from 2008 to 2015, and of IEEE Transactions on Knowledge and Data Engineering (TKDE) from 2012 to 2014. He was a PC chair of International Conference on Data Engineering (ICDE) 2014, and of International Symposium on Spatial and Temporal Databases (SSTD) 2011. He was a keynote speaker at International Conference on Database Theory (ICDT) 2016, and a winner of the SIGMOD best paper award in 2013 and 2015. |
Professor Cyrus Shahabi University of Southern California (USC) Title: Inference of Social Relationships from Location Data Abstract: For decades, social scientists have been studying people's social behaviors by utilizing sparse datasets obtained by observations and surveys. These studies received a major boost in the past decade due to the availability of web data (e.g., social networks, blogs and review web sites). However, due to the nature of the utilized dataset, these studies were confined to behaviors that were observed mostly in the virtual world. Differing from all the earlier work, here, we aim to study social behaviors by observing people's behaviors in the real world. This is now possible due to the availability of large high-resolution spatio-temporal location data collected by GPS-enabled mobile devices through mobile apps (Google's Map/Navigation/Search/Chrome, Facebook, Foursquare, WhatsApp, Twitter) or through online services, such as geo-tagged contents (tweets from Twitter, pictures from Instagram, Flickr or Google+ Photo), etc. In particular, we focus on inferring two specific social measures: 1) pair-wise strength -- the strength of social connections between a pair of users, and 2) pair-wise influence - the amount of influence that an individual exerts on another, by utilizing the available high-fidelity location data representing people's movements. Finally, we argue that due to the sensitivity of location data and user privacy concerns, these inferences cannot be largely carried out on individually contributed data without privacy guarantees. Hence, we discuss open problems in protecting individuals'location information while enabling these inference analyses.
|
Distinguished Lecture Series
Professor Chen Li University of California, Irvine Title: Real-Time Analytics and Visualization on Large-Scale Spatial-Temporal-Textual Data Abstract: We are developing a system called Cloudberry to support analytics and visualization on large data sets with spatial, temporal, and textual attributes, such as social media data and query logs. It supports aggregation queries on various types of attributes, and allows efficient data exploration at different granularities (e.g., state, county, and city). It also supports real-time analytics, which can allow applications to monitor “what’s happening now.” To achieve a high speed, it includes an intelligent middleware for view materialization and cache management. As a general-purpose solution for large data sets, it uses the Apache AsterixDB big data management system that provides rich features and high performance, such as various indexes and data feeds. In this talk, we will give an overview of the system, our initial results, and open challenges in this direction. A live demonstration using tweets is available at http://cloudberry.ics.uci.edu/ . Short Biography: Chen Li is a professor in the Department of Computer Science at UC Irvine. He received his Ph.D. degree in Computer Science from Stanford University, and his M.S. and B.S. in Computer Science from Tsinghua University, China, respectively. His research interests are in the field of data management, including data cleaning, data integration, data-intensive computing, and text analytics. He was a recipient of an NSF CAREER Award, several test-of-time publication awards, and many other grants and industry gifts. He was once a part-time Visiting Research Scientist at Google. He founded a company SRCH2 to develop an open source search engine with high performance and advanced features. |
Professor Jianliang Xu Hong Kong Baptist University Title: Towards Interactive and Social-Aware Spatial Query Services Abstract: Location-based service (LBS) have been gaining in prominence, with about 40% of world's population using smartphones today. As such, there is a growing need to continuously advance the spatial database research for emerging LBS applications, which pose new challenges as well as new opportunities. For example, the convergence of location data and social media has enabled a new class of geo-social queries that combine location and social factors in query processing. In addition, to enhance system usability and user experience, it is important to support instantaneous and interactive responses to queries. In this talk, we will present several of our recent efforts on geo-social queries and "why-not"/"what-if" interactive queries that are aimed to improve the functionality, usability, and performance of spatial query services. We will also discuss some possible future research directions. |
Tutorials
Professor Kun-Ta Chuang National Cheng Kung University Title : Data Science for Epidemic Computing Abstract: The control of epidemic spread is the critical challenge for the authority in recent decades. When people are moving to live in the urban area, the crowded situation inevitably increases the outbreak probability of some contagious diseases such as flu and dengue fever. For the need to prevent the out-of-control infections, it is necessary to develop new technologies, predicting and evaluating the prevention result along with the dynamic deployment of intervention strategies over time. In this tutorial, we will introduce some mechanisms from data science and discuss their extension applied in the outbreak control during the spread of dengue fever in Taiwan 2015. We will also discuss the intervention procedure in Taiwan and show the way to incorporate data mining idea for epidemic computing into the process of decision making in the government side. The audience will know the basic concept of public health and learn the way to devise new computational algorithms for this critical challenge.
|
Professor Hyeonsang Eom Seoul National University Title: Workload-Aware Resource Management Technologies for Improving Server Performance Abstract: Datacenters where various sorts of servers may run have been becoming larger and more heterogeneous, possibly being highly distributed. It is crucial to manage many heterogeneous resources effectively to efficiently and cost-effectivity provide services; it is necessary to allocate “right” resources to Virtual Machines (VMs) in virtualized datacenters in order to decrease the cost of the operation while meeting the SLAs (Service Level Agreements) such as meeting the latency requirement. One of the most effective ways to allocate “right” resources to a VM would be to do it considering the characteristics of the VM such as the memory intensiveness of the workload executed in the VM. However, the existing schedulers do not consider these kinds of characteristics, including the NOVA scheduler of OpenStack and DRS (Distributed Resource Scheduler) of VMWare. In this tutorial, I explain some workload-aware schedulers, and our workload-aware one that schedules VMs on OpenStack clusters of nodes, considering the characteristics of workload executed in the VMs. Our experimental study with Redis and Memcached possibly caching the data and links of Web servers shows that our memory-intensiveness-aware scheduler may outperform the default scheduler of OpenStack and DRS as well in terms of throughput and latency. Short Biography: Hyeonsang Eom received the BS degree in computer science and statistics from Seoul National University (SNU), Seoul, Korea, in 1992, and the MS and PhD degrees in computer science from the University of Maryland at College Park, Maryland, USA, in 1996 and 2003, respectively. He is currently an associate professor in the Department of Computer Science and Engineering at SNU, where he has been a faculty member since 2005. He was an intern in the data engineering group at Sun Microsystems, California, USA, in 1997, and a senior engineer in the Telecommunication R&D Center at Samsung Electronics, Korea, from 2003 to 2004. His research interests include distributed systems, cloud computing, operating systems, high performance storage systems, energy efficient systems, fault-tolerant systems, security, and information dynamics. |
Workshops
2nd International Workshop on Web Data Mining and Applications (WDMA 2016)
http://ada.suda.edu.cn/apweb2016/WDMA_2016.html
1st International Workshop on Graph Analytics and Query Processing (GAP 2016)
http://www.cse.unsw.edu.au/~gap
1st International Workshop on Spatio-temporal Data Management and Analytics (SDMA’2016)
http://ada.suda.edu.cn/apweb2016/SDMA2016/
Accepted Papers
Research Full Paper
Probabilistic Nearest Neighbor Query in Traffic-Aware Spatial Networks
NERank: Bringing Order to Named Entities from Texts
FTS: A Practical Model for Feature-based Trajectory Synthesis
Distributed Text Representation with Weighting Scheme Guidance for Sentiment Analysis
A Real Time Wireless Interactive Multimedia System
Forecasting Career Choice for College Students Based on Campus Big Data
Efficient Group Top-k Spatial Keyword Query Processing
Online Prediction for Forex with an Optimized Experts Selection Model
Flexible and Adaptive Stream Join Algorithm
Practical Study of Subclasses of Regular Expressions in DTD and XML Schema
Finding Frequent Items in Time Decayed Data Streams
Fast Rare Category Detection Using Nearest Centroid Neighborhood
Scalable Private Blocking Technique for Privacy-Preserving Record Linkage
Learn to recommend local event using heterogeneous social networks
Latent Semantic Diagnosis in Traditional Chinese Medicine
Top-k Temporal Keyword Query over Social Media Data
Correlation-based Weighted k-Labelsets for Multi-Label Classification
Classifying Relation via Bidirectional Recurrent Neural Network based on Local Information
Psychological Stress Detection from Online Shopping
When a Friend Online is More Than a Friend in Life: Intimate Relationship Prediction in Microblogs
Community Inference with Bayesian Symmetric Non-Negative Matrix Factorization
Confidence-learning based collaborative filtering with heterogeneous implicit feedbacks
Spica: A Path Bundling Model for Rational Route Recommendation
Mining Co-locations from Continuously Distributed Uncertain Spatial Data
An Online Approach for Direction-Based Trajectory Compression with Error Bound Guarantee
A Target-dependent Sentiment Analysis Method for Micro-blog Streams
Maximizing the influence ranking under limited Cost in social network
A Data Grouping CNN Algorithm for Short-term Traffic Flow Forecasting
Repair Singleton IDs on the Fly
Making Cold Data Identification Efficient in Non-Volatile Memory Systems
Star-Scan : A Stable Clustering by Statistically Finding Centers and Noises
Aggregating Crowd Wisdom with Instance Grouping Methods
FHSM: Factored Hybrid Similarity Methods for Top-N Recommender Systems
Accelerating Path Nearest Neighbor Search in Spatial Networks
Man-O-Meter: Modeling and Assessing the Evolution of Language Usage of Individuals on Microblogs
Modeling for Noisy Labels of Crowd Workers
Online Streaming Feature Selection using Sampling Techniques and Correlations between Features
CoDS:Co-training with Domain Similarity for Cross-domain Image Sentiment Classification
Personalized Resource Recommendation Based on Regular Tag and User Operation
Academic Paper Recommendation Based on Community Detection in Citation-Collaboration Networks
Budget Minimization with Time and Influence Constraints in Social Network
Feature Selection via Vectorizing Feature's Discriminative Information
Incomplete Data Classfication Based on Multiple Views
Preference Join on Heterogeneous Data
A Label Inference Method based on Maximal Entropy Random Walk over Graphs
An Adaptive kNN Using Listwise Approach for Implicit Feedback
Accelerating Time Series Shapelets Discovery with Key Points
Quantifying the Effect of Sentiment on Topic Evolution in Chinese Microblog
Efficient Evaluation of Shortest Travel-time Path Queries in Road Networks by Optimizing Waypoints in Route Requests through Spatial Mashups
Improving Recommendation Accuracy for Travelers by Exploiting POI Correlations
Fuzzy Keywords Query
Towards Efficient Influence Maximization for Evolving Social Networks
Detecting Community Pacemakers of Burst Topic in Twitter
Dynamic User Attribute Discovery on Social Media
Improving Temporal Recommendation Accuracy and Diversity via Long and Short-Term Preference Transfer and Fusion Models
Measuring Directional Semantic Similarity with Multi-Features
Discovering Companion Vehicles from Live Streaming Traffic Data
EPLA:Efficient Personal Location Anonymity
The Competition of User Attentions Among Social Network Services: a Social Evolutionary Game Approach
An Adaptive Partition-Based Caching Approach for Efficient Range Queries on Key-value Data
Mechanism Analysis of Competitive Information Synchronous Dissemination in Social Networks
A Context-aware Method for Top-k Recommendation in Smart TV
A Topic-Specific Contextual Expert Finding Method in Social Network
Real-time Anomaly Detection over ECG Data Stream Based on Component Spectrum
Near-Duplicate Web Video Retrieval and Localization Using Improved Edit Distance
Finding Latest Influential Research Papers through Modeling Two Views of Citation Links
Handling Estimation Inaccuracy in Query Optimization
An Approach for Cross-Community Content Recommendation: A Case Study on Docker
Multi-label Chinese Microblog Emotion Classification via Convolutional Neural Network
Mining Recent High Expected Weighted Itemset from Uncertain Databases
Context-aware Chinese Microblog Sentiment Classification with Bidirectional LSTM
A Secure and Robust Covert Channel Based on Secret Sharing Scheme
A Hybrid Method for POI Recommendation: Combining Check-in Count, Geographical Information and Reviews
A Workload-Driven Vertical Partitioning Approach Based on Streaming Framework
Time-Constrained Sequenced Route Query in Indoor Spaces
Discovering Approximate Functional Dependencies from distributed big data
B-mine: Frequent Pattern Mining and Its Application to Knowledge Discovery from Social Networks
An Efficient Online Event Detection Method for Microblogs via User Modeling
Research Short Paper
Microblog Sentiment Analysis Based on Sentiment Features
FVBM: A Filter-Verification-Based Method for Finding Top-k Closeness Centrality on Dynamic Social Networks
Online Hot Topic Detection from Web News Based on Bursty Term Identification
Grouped Team Formation in Social Networks
Profit Maximizing Route Recommendation for Vehicle Sharing Requests
Ontology-Based Interactive Post-Mining of Interesting Co-location Patterns
AALRSMF: An Adaptive Learning Rate Schedule for Matrix Factorization
A Graph Clustering Algorithm for Citation Networks
A Distributed Frequent Itemsets Mining Algorithm Using Sparse Boolean Matrix on Spark
A Simple Stochastic Gradient Variational Bayes for the Correlated Topic Model
Reasoning with Large Scale OWL 2 EL Ontologies based on MapReduce
Purchase and Redemption Prediction based on Multi-task Gaussian Process and Dimensionality Reduction
RORS: Enhanced Rule-based OWL Reasoning on Spark
A Hadoop-based Database Querying Approach for Non-expert Users
A Collaborative Join Scheme on a MIC-based Heterogeneous Platform
Pairwise Expansion: A New Topdown Search for mCK Queries Problem over Spatial Web
Mentioning the optimal users in the appropriate time on Twitter
Historical Geo-Social Query Processing
WS-Rank: Bringing Sentences into Graph for Keyword Extraction
Efficient Community Maintenance for Dynamic Social Networks
Open Sesame! Web Authentication Cracking via Mobile App Analysis
K-th Order Skyline Queries in Bicriteria Networks
A K-Motifs Discovery Approach for Large Time-Series Data Analysis
User Occupation Prediction on Microblogs
Similarity Recoverable, Format-preserving String Encryption
Industry Full Paper
Combo-Recommendation based on Potential Relevance of Items
Demo Paper
OPGs-Rec: Organized-POI-Groups Based Recommendation
A Demonstration of Encrypted Logistics Information System
PCMiner:An Extensible Framework for Analysing and Detecting Protein Complexes
MASM:A Novel Movie Analysis System based on Microblog
A Text Retrieval System Based on Distributed Representations
A System for Searching Renting Houses Based on Relaxed Query Answering
TagTour: a Personalized Tourist Resource Recommendation System
A Chronic Disease Analysis System Based on Dirty Data Mining
ADDS: An Automated Disease Diagnosis-aided System
INDOOR MAP SERVICE SYSTEM BASED ON WECHAT TWO-DIMENSIONAL CODE
Factorization machine based business credit scoring by leveraging Internet data
OICRM:An Ontology-based Interesting Co-location Rule Miner
CB-CAS: a CAS-Based Cross-Browser SSO System
A Demonstration of QA System based on Knowledge Base
An Alarming and Prediction System for Infections Disease Baced on Combined Modles
Co-location Detector: A System to Find Interesting Spatial Co-locating Relationships
KEIPD:Knowledge Extraction and Inference System for Personal Documents
Workshop Papers
International Workshop on Web Data Mining and Applications (WDMA 2016)
Maximizing the Cooperative Influence Spread in a Social Network Oriented to Viral Marketing
A Multi-Model Based Approach for Big Data Analytics: the Case on Education Grant Distribution
Sentiment Target Extraction Based on CRFs with Multi-features for Chinese Microblog
EMD-DSJoin: Efficient Similarity Join over Probabilistic Data Streams Based on Earth Mover’s Distance
Sentiment Analysis on User Reviews through Lexicon and Rule-based Approach
Social Link Prediction Based on the Nodes' Information Transfer
An Improved ML-kNN Approach Based on Coupled Similarity
A Novel Recommendation Method Based on User’s Interest and Heterogeneous Information
Knee point-driven bottleneck detection algorithm for cloud service system
Confirmatory Analysis on Influencing Factors When Mention Users in Twitter
A stock recommendation strategy based on M-LDA model
Short-term Forecasting and Application about Indoor Cooling Load Based on EDA-PSO-BP Algorithm
International Workshop on Graph Analytics and Query Processing (GAP 2016)
Identifying Relevant Subgraphs in Large networks
User-dependent Multi-relational Community Detection in Social Networks
Compressing Streaming Graph Data Based on Triangulation
International Workshop on Spatio-temporal Data Management and Analytics (SDMA’2016)
Scene Classification in High Resolution Remotely Sensed Images based on PCANet
Finding Top-k Places for Group Social Activities
Temporal Spatial-Keyword Search On Databases Using SQL
Features of Rumor Spreading on WeChat Moments
Distance-Based Continuous Skylines On Geo-Textual Data
Improving Urban Traffic Evacuation Capability in Emergency Response by Using Smart Phones
Context Enhanced Keyword Extraction for Sparse Geo-entity Relation from Web Texts
A Stacked Generalization Framework for City Traffic Related Geospatial Data Analysis
Detection of Statistically Signi cant Bus Delay Aggregation by Spatial-Temporal Scanning
Acquisition and Representation of Knowledge for Academic Field
Using Learning Features to Find Similar Trajectories
An Algorithm for Mining Moving Flock Patterns from Pedestrian Trajectories
Conference Program
Program at a Glance
Detailed Program can be found in Program-v17.pdf. Should you find anything unclear or incorrect, please let us know.
Conference Venue
Suzhou Xi'an Jiaotong-Liverpool International Conference Center(苏州西交利物浦国际会议中心)
Location: No. 99 Renai Road, Suzhou City, Jiangsu Provice, P.R.China
Tel: +86-512-86665555 Introduction: http://05286665555.locoso.com
The conference venue is 20 minutes drive from Suzhou Railway Station (苏州火车站) and 30 minutes drive from Suzhou Highspeed Train Station (苏州高铁北站). Taxi is the most convenient way to get there.
|