Python Machine Learning Projects

A Survey on Data Pricing from Economics to Data Science

Abstract: Data matter. How can we quantitatively and objectively evaluate data? Pricing data, or information goods, has been studied and practiced in many disciplines, including economics, marketing, electronic commerce, data management, data mining, and machine…

Python Machine Learning Projects

Cold-Start Active Sampling Via γ-Tube

Abstract: Active learning (AL) queries labels from unlabeled data to improve classification hypothesis generalization. Informative, representative, or diverse evaluation policies evaluate sampling. In a cold-start hypothesis, the policy, which requires an initial labeled set, may…

Python Machine Learning Projects

A Survey on Large-scale Machine Learning

Abstract: Text mining, visual classification, and recommender systems use machine learning to make accurate predictions from data. When processing large datasets, most sophisticated machine learning methods take a long time. Large-scale machine learning (LML) is…

Python Machine Learning Projects

A Review for Weighted MinHash Algorithms

Abstract: Data similarity (or distance) computation is a fundamental research topic that underpins many high-level machine learning and data mining applications based on similarity measures. Big data’s “3V” nature—volume, velocity, and variety—makes exact similarity computation…