Statistics - Cluster sampling


In cluster sampling, groups of elements that ideally speaking, are heterogeneous in nature within group, and are chosen randomly. Unlike stratified sampling where groups are homogeneous and few elements are randomly chosen from each group, in cluster sampling the group with intra group heterogeneity are developed and all the elements within the group become a pan of the sample. Whereas stratified sampling has intra group homogeneity and inter group heterogeneity, cluster sampling has intra group heterogeneity.


One stage cluster sampling

A committee comprising of number of members from different departments has a high degree of heterogeneity. When from number of such committees, few are chosen randomly, and then it is a case of one stage cluster sampling.

Two stage cluster sampling

If from each cluster which has been randomly chosen, few elements are chosen randomly using simple random sampling or any other probability method then it is a two stage cluster sampling.

Multi-stage cluster sampling

A cluster sample can be a multiple stage sampling, when the choice of element in a sample involves selection at multiple stages e.g. if in a national survey on insurance products a sample of insurance companies is to be drawn, then it requires developing clusters at multiple stages.

In the first stage the clusters are formed on the basis of public and private companies. At the next stage a group of companies is chosen randomly from each cluster developed earlier. In the third stage the office location of each chosen company from where data is to be collected is chosen randomly. Thus in multistage sampling, probability sampling of primary units is done, then from each primary unit a sample of secondary sampling units is drawn and then the third levels till we reach the final stage of breakdown for the sample units.

Useful Video Courses


Class 11th Statistics for Economics

40 Lectures 3.5 hours

Madhu Bhatia



40 Lectures 2 hours

Megha Aggarwal


Applied Statistics in Python for Machine Learning Engineers

66 Lectures 1.5 hours

Mike West


An Introduction to Wait Statistics in SQL Server

22 Lectures 1 hours

Mike West


Geospatial Data Science: Statistics and Machine Learning

60 Lectures 12 hours

Michael Miller


Basic Statistics & Regression for Machine Learning in Python

65 Lectures 5 hours

Abhilash Nelson