Parallel K-Means Using Map Reduce On Big Data Cluster Analysis
The document describes how to run k-means clustering in parallel on big data using MapReduce. The map step assigns each data point to its closest cluster center. The reduce step updates each cluster center to the mean of the data points assigned to it. The map and reduce steps repeat until the cluster centers converge.
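The map and reduce steps summarized above can be sketched in a few lines of plain Python. This is a minimal single-process illustration, not the document's implementation: the function names (`map_point`, `reduce_cluster`, `kmeans_mapreduce`), the Euclidean distance metric, the `tol` and `max_iter` parameters, and the choice to keep an empty cluster's old center are all assumptions made for the example. On a real cluster the map calls would run in parallel across data partitions and the shuffle would group points by cluster index.

```python
from collections import defaultdict
from math import dist  # Euclidean distance, Python 3.8+


def map_point(point, centers):
    """Map step: emit (index of the closest center, point)."""
    idx = min(range(len(centers)), key=lambda i: dist(point, centers[i]))
    return idx, point


def reduce_cluster(points):
    """Reduce step: the new center is the coordinate-wise mean of the points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def kmeans_mapreduce(points, centers, tol=1e-9, max_iter=100):
    """Repeat map and reduce until no center moves by more than tol (assumed stopping rule)."""
    centers = [tuple(c) for c in centers]
    for _ in range(max_iter):
        # Shuffle: group mapped points by the cluster index they were assigned to.
        groups = defaultdict(list)
        for p in points:
            idx, pt = map_point(p, centers)
            groups[idx].append(pt)

        # Reduce each group; an empty cluster keeps its previous center (assumption).
        new_centers = [
            reduce_cluster(groups[i]) if i in groups else centers[i]
            for i in range(len(centers))
        ]

        # Convergence check: largest distance any center moved in this iteration.
        shift = max(dist(old, new) for old, new in zip(centers, new_centers))
        centers = new_centers
        if shift <= tol:
            break
    return centers


if __name__ == "__main__":
    data = [(0, 0), (0, 2), (10, 10), (10, 12)]
    print(kmeans_mapreduce(data, [(0, 0), (10, 10)]))
```

Separating `map_point` from `reduce_cluster` mirrors the MapReduce contract: the map function only needs the current centers and one point, so it can run independently on each partition, while the reduce function only needs the points sharing one cluster index.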