Introduction to K-means Clustering

Introduction to K-means Clustering

K-means clustering is a type of unsupervised learning, which is used when you have unlabeled data (i.e., data without defined categories or groups).

k-means clustering aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean, serving as a prototype of the cluster.

The goal of this algorithm is to find groups in the data, with the number of groups represented by the variable K. The algorithm works iteratively to assign each data point to one of K groups based on the features that are provided. Data points are clustered based on feature similarity. The results of the K-means clustering algorithm are:

Continue reading “Introduction to K-means Clustering”

Install Python with Jupyter

The easiest way to install the Jupyter Notebook App is installing a scientific python distribution which also includes scientific python packages. The most common distribution is called Anaconda:

  • Download Anaconda Distribution (a few 100MB), Python 3, 64 bits.
  • Install it using the default settings for a single user.

Continue reading “Install Python with Jupyter”

%d bloggers like this: