Introduction to K-means Clustering

Introduction to K-means Clustering

K-means clustering is a type of unsupervised learning, which is used when you have unlabeled data (i.e., data without defined categories or groups).

k-means clustering aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean, serving as a prototype of the cluster.

The goal of this algorithm is to find groups in the data, with the number of groups represented by the variable K. The algorithm works iteratively to assign each data point to one of K groups based on the features that are provided. Data points are clustered based on feature similarity. The results of the K-means clustering algorithm are:

Continue reading “Introduction to K-means Clustering”

Install Python with Jupyter

The easiest way to install the Jupyter Notebook App is installing a scientific python distribution which also includes scientific python packages. The most common distribution is called Anaconda:

  • Download Anaconda Distribution (a few 100MB), Python 3, 64 bits.
  • Install it using the default settings for a single user.

Continue reading “Install Python with Jupyter”

Chapter 2 : Working with SAS Programs.

Introduction

In this lesson, you’ll learn how to work with SAS code. First, you’ll learn the main components of SAS programs. Then you’ll learn the syntax rules and formatting guidelines for writing SAS programs. As you work with SAS programs, you’ll add descriptive comments, and identify and correct common syntax errors. Continue reading “Chapter 2 : Working with SAS Programs.”

Chapter 1. Getting Started with SAS Programming

What You Learn in This Course

Welcome to the SAS Programming 1: Essentials e-course.

In this course, you’ll write SAS programs to access, manage, and analyze your data, and present the results in reports.

1

Continue reading “Chapter 1. Getting Started with SAS Programming”

%d bloggers like this: