Data Mining Projects

Detecting Statistically Significant Communities

Abstract: Community detection is a recurring problem in data analysis, and algorithms to solve it have been proposed for decades. Most community detection research, however, ignores statistical significance. Although statistically significant communities have been mined, deriving an analytical solution of…

Deep Learning for Spatio-Temporal Data Mining: A Survey

Abstract: Spatio-temporal data is increasingly available due to the rapid development of GPS, mobile devices, and remote sensing. Human mobility understanding, smart transportation, urban planning, public safety, health care, and environmental management depend on spatio-temporal…

Deep Learning for Adverse Event Detection from Web Search

Abstract: Adverse event detection is essential for identifying product defects, disasters, and major socio-political events. Adverse drug events cause many hospitalizations and deaths annually. Search query logs are a key detection channel because users start…

Deep Feature-Based Text Clustering and Its Explanation

Abstract: Text clustering is a core part of text data analysis and has been studied extensively by the text mining community. Most text clustering algorithms use the bag-of-words model, which ignores the structural and sequential information in text and is high-dimensional and sparse…

Data Representation by Joint Hypergraph Embedding and Sparse Coding

Abstract: Matrix factorization (MF) is an unsupervised data representation technique used in data mining and machine learning. Different application scenarios can impose different constraints on the factorization to find the desired basis, which captures high-level semantics for…

Context-aware Service Recommendation based on Knowledge Graph Embedding

Abstract: Over the past two decades, recommender systems have used context awareness to offer consumers both top-rated and context-appropriate products. Context-aware service recommendation (CASR) systems use invocation time, location, social profiles, connectivity, and other context…

Consensus One-step Multi-view Subspace Clustering

Abstract: The multimedia, machine learning, and data mining communities are focusing on multi-view clustering. Multi-view subspace clustering (MVSC) is a popular multi-view clustering algorithm because it can reveal the intrinsic low-dimensional clustering structure hidden across views…