A compilation of SAS codes and results related to a debit card customer segmentation project

jstephenj14/Debit-Card-Segmentation

Debit Card Segmentation

Disclaimer: Client-sensitive numbers and names have been altered to preserve confidentiality.

The repository consists of five main files that detail the entire process, stage by stage. Each stage is described briefly below, with accompanying graphs (the links lead to the code used at each stage):

Stage 1 - Data Collation - This first stage collects data from a variety of sources and consolidates it into a customer-level data set containing five attributes. Their derivation is described in this chart:
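
As a rough illustration of what collating to customer level can look like, here is a minimal Python sketch. It is not the repository's SAS code. The record layout, the sample values, and the definition of mean_gap (average number of days between consecutive transactions) are assumptions made for the example; only the variable names mean_txn_amt and mean_gap come from this README.

```python
from collections import defaultdict
from datetime import date
from statistics import mean

# Hypothetical transaction records: (customer_id, transaction_date, amount).
transactions = [
    ("C1", date(2020, 1, 1), 20.0),
    ("C1", date(2020, 1, 11), 40.0),
    ("C1", date(2020, 1, 31), 60.0),
    ("C2", date(2020, 1, 5), 100.0),
    ("C2", date(2020, 1, 7), 300.0),
]


def collate(transactions):
    """Roll transaction rows up to one record per customer."""
    by_customer = defaultdict(list)
    for customer_id, txn_date, amount in transactions:
        by_customer[customer_id].append((txn_date, amount))

    customers = {}
    for customer_id, rows in by_customer.items():
        rows.sort()  # chronological order
        dates = [d for d, _ in rows]
        # Days between each pair of consecutive transactions.
        gaps = [(later - earlier).days for earlier, later in zip(dates, dates[1:])]
        customers[customer_id] = {
            "mean_txn_amt": mean(amount for _, amount in rows),
            # Assumed definition; undefined for customers with one transaction.
            "mean_gap": mean(gaps) if gaps else None,
        }
    return customers


customer_level = collate(transactions)
```

Each key of `customer_level` is one customer, which is the shape the later stages expect: one row per customer, one column per attribute.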

Stage 2 - Data Cleaning - At this stage, the collated data is plotted to identify and treat outliers. The data is also log-transformed and standardized. Standardized values are used to profile clusters (discussed in Stage 5), while transformed values help arrive at stable clustering solutions. For example, the variables mean_txn_amt and mean_gap are treated and transformed as shown below:
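
The two transformations named in this stage can be sketched in a few lines of Python. This is an illustration under assumptions, not the repository's SAS code: the sample values are made up, `log1p` is used so that zeros stay finite, and the z-scores are computed on the logged values with the sample standard deviation. The README does not say which of these choices the project made.

```python
import math
from statistics import mean, stdev


def log_transform(values):
    """Natural log of (1 + value); the +1 offset is an assumption."""
    return [math.log1p(v) for v in values]


def standardize(values):
    """Z-scores: subtract the mean, divide by the sample standard deviation."""
    mu = mean(values)
    sigma = stdev(values)
    return [(v - mu) / sigma for v in values]


# Made-up mean_txn_amt values with one large outlier.
mean_txn_amt = [12.0, 18.5, 25.0, 40.0, 950.0]

logged = log_transform(mean_txn_amt)
z_scores = standardize(logged)
```

The log step pulls the outlier toward the rest of the data, which is why transformed values tend to give more stable cluster centers; the z-scores put every variable on a common scale so that cluster profiles are comparable across variables.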

Stage 3 - Clustering Process - After the data is cleaned and transformed, a viable number of clusters is identified using a macro that runs PROC FASTCLUS for different numbers of clusters. Seven clusters were chosen as optimal, going by the scree plot below:
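
The idea of looping over candidate cluster counts and recording one fit statistic per run can be sketched as follows. This is a plain-Python k-means written for illustration, not PROC FASTCLUS and not the repository's macro. The toy points, the farthest-point seeding, and the choice of within-cluster sum of squares as the statistic behind the scree plot are all assumptions.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def assign(points, centers):
    """Index of the nearest center for every point."""
    return [min(range(len(centers)), key=lambda j: sq_dist(p, centers[j]))
            for p in points]


def kmeans(points, k, max_iter=100):
    """Basic k-means; returns (labels, centers, within-cluster sum of squares)."""
    # Deterministic seeding: start from the first point, then repeatedly add
    # the point that is farthest from its nearest existing seed.
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(sq_dist(p, c) for c in centers)))

    for _ in range(max_iter):
        labels = assign(points, centers)
        new_centers = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centers.append(centers[j])  # keep the seed of an empty cluster
        if new_centers == centers:
            break
        centers = new_centers

    labels = assign(points, centers)
    wss = sum(sq_dist(p, centers[lab]) for p, lab in zip(points, labels))
    return labels, centers, wss


# Toy data: three well-separated groups of four points each.
points = [
    (0, 0), (0, 1), (1, 0), (1, 1),
    (10, 10), (10, 11), (11, 10), (11, 11),
    (20, 0), (20, 1), (21, 0), (21, 1),
]

# One run per candidate cluster count, as the macro does with PROC FASTCLUS.
wss_by_k = {k: kmeans(points, k)[2] for k in range(1, 6)}
```

Plotting `wss_by_k` against k gives a scree-style curve; the "elbow" where the drop flattens (k = 3 for this toy data) is the usual visual cue for choosing the number of clusters.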

Stage 4 - Cluster Validation - Clusters created in Stage 3 are further validated using Andrews plots and CANDISC graphs, as shown below:
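
For readers unfamiliar with Andrews plots: each observation is mapped to a curve f(t) = x1/sqrt(2) + x2 sin(t) + x3 cos(t) + x4 sin(2t) + x5 cos(2t) + ..., and observations from the same cluster should trace similar curves. The Python sketch below evaluates that standard formula; it is an illustration, not the repository's SAS code, and the sample observation is made up.

```python
import math


def andrews_curve(x, t):
    """Evaluate the Andrews curve of observation x at angle t (radians)."""
    total = x[0] / math.sqrt(2)
    for i, value in enumerate(x[1:], start=1):
        harmonic = (i + 1) // 2  # 1, 1, 2, 2, 3, 3, ...
        if i % 2 == 1:
            total += value * math.sin(harmonic * t)
        else:
            total += value * math.cos(harmonic * t)
    return total


# A made-up observation with five standardized attributes.
observation = [1.0, 2.0, 3.0, 4.0, 5.0]

# Sample the curve on an even grid over [-pi, pi].
ts = [-math.pi + 2 * math.pi * i / 100 for i in range(101)]
curve = [andrews_curve(observation, t) for t in ts]
```

Drawing one such curve per customer, colored by cluster, gives the plot: tight, well-separated bands of curves suggest that the clusters are distinct.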

Stage 5 - Cluster Profiling - After validation, the resulting clusters are profiled in several ways. Profile and star plots based on the standardized values for each cluster are shown below:
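
The numbers behind a profile or star plot are typically the per-cluster means of the standardized variables. A minimal Python sketch of that summary follows; it is not the repository's SAS code, and the rows, labels, and the use of the mean as the profiling statistic are assumptions.

```python
from collections import defaultdict
from statistics import mean


def profile_clusters(rows, labels):
    """Mean of each standardized variable within each cluster."""
    grouped = defaultdict(list)
    for row, label in zip(rows, labels):
        grouped[label].append(row)
    return {
        label: {name: mean(r[name] for r in members) for name in members[0]}
        for label, members in grouped.items()
    }


# Made-up standardized values for three customers and their cluster labels.
rows = [
    {"mean_txn_amt": 1.2, "mean_gap": -0.8},
    {"mean_txn_amt": 0.8, "mean_gap": -1.2},
    {"mean_txn_amt": -1.0, "mean_gap": 1.0},
]
labels = [1, 1, 2]

profiles = profile_clusters(rows, labels)
```

Because the inputs are z-scores, a cluster mean well above zero marks a variable on which that cluster is high relative to the whole customer base, and a mean well below zero marks one on which it is low.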

A three-dimensional plot based on any three of the five variables may also be plotted using PROC G3D, as shown below:

Subsequent two-dimensional projections can also be plotted for deeper analysis:
