Data Mining Projects

Exploiting Reuse for GPU Subgraph Enumeration

Abstract: Network motif discovery, community detection, and frequent subgraph mining require subgraph enumeration. Recent works parallelize subgraph enumeration using GPUs to accelerate execution. Set intersection operations take up to 95% of the processing time in…

Data Mining Projects

ESA-Stream: Efficient Self-Adaptive Online Data Stream Clustering

Abstract: Big data applications generate massive amounts of high-dimensional, real-time, streaming data. These applications require effective and efficient data stream clustering. Well-known data stream clustering algorithms based on the popular online-offline framework still face major…

Data Mining Projects

Efficient Shapelet Discovery for Time Series Classification

Abstract: Time-series shapelets, discriminative subsequences, are effective for time series classification (tsc). Tsc accuracy depends on shapelet quality. However, major research has focused on accurate models from some shapelet candidates. Existing studies use simple methods…

Data Mining Projects

Detecting Statistically Significant Communities

Abstract: Data analysis involves community detection. Algorithms to solve this problem have been proposed for decades. Most community detection research ignores statistical significance. Although statistically significant communities have been mined, deriving an analytical solution of…

Data Mining Projects

Deep Learning for Spatio-Temporal Data Mining: A Survey

Abstract: Spatio-temporal data is increasingly available due to the rapid development of GPS, mobile devices, and remote sensing. Human mobility understanding, smart transportation, urban planning, public safety, health care, and environmental management depend on spatio-temporal…

Data Mining Projects

Deep Learning for Adverse Event Detection from Web Search

Abstract: Adverse event detection is essential for identifying product defects, disasters, and major socio-political events. Adverse drug events cause many hospitalizations and deaths annually. Search query logs are a key detection channel because users start…