{"id":1385,"date":"2014-01-12T10:03:54","date_gmt":"2014-01-12T04:33:54","guid":{"rendered":"http:\/\/ucanalytics.com\/blogs\/?p=1385"},"modified":"2015-09-07T21:56:34","modified_gmt":"2015-09-07T16:26:34","slug":"customer-segmentation-outliers-telecom-case-study-part-3","status":"publish","type":"post","link":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/","title":{"rendered":"Cluster Analysis and Outliers \u2013 Telecom Case Study Example (Part 3)"},"content":{"rendered":"<div id=\"attachment_1386\" style=\"width: 235px\" class=\"wp-caption alignright\"><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg\"><img aria-describedby=\"caption-attachment-1386\" data-attachment-id=\"1386\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/photo\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?fit=768%2C1024&amp;ssl=1\" data-orig-size=\"768,1024\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}\" data-image-title=\"Groucho &#8211; by Roopam\" data-image-description=\"\" data-image-caption=\"&lt;p&gt;Groucho &#8211; by Roopam&lt;\/p&gt;\n\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?fit=225%2C300&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?fit=640%2C853&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"size-medium wp-image-1386\" 
src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?resize=225%2C300\" alt=\"Groucho - by Roopam\" width=\"225\" height=\"300\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?resize=225%2C300&amp;ssl=1 225w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?w=768&amp;ssl=1 768w\" sizes=\"(max-width: 225px) 100vw, 225px\" data-recalc-dims=\"1\" \/><\/a><p id=\"caption-attachment-1386\" class=\"wp-caption-text\">Groucho &#8211; by Roopam<\/p><\/div>\n<hr \/>\n<h2><span style=\"color: #3366ff;\">Outliers<\/span><\/h2>\n<p><b><i>&#8220;I refuse to join any club that would have me as a member.&#8221; &#8211; <\/i><\/b><b>Groucho Marx<\/b><\/p>\n<p>This witty statement came from (in my opinion) one of the funniest men in the history of American cinema \u2013 Julius Henry Marx, better known as Groucho Marx. Groucho was certainly a very unusual man and might well be considered an outlier. Today we are going to discuss the impact of outliers on cluster analysis and on life in general. An outlier is an observation that is distant \/ different from the others. I know statisticians get nightmares about outliers. A single outlier can create havoc in any analysis, hence the general tendency is to either drop outliers from the analysis or beat them back to normal (read: transform the data so that it looks closer to a normal distribution). At times these techniques are necessary for the sake of the analysis. However, analysts \/ scientists \/ society ignore outliers altogether at their own peril, because outliers could be pointing towards a new emerging trend in the system. Today\u2019s outlier may very well be tomorrow\u2019s normal. 
Also, let&#8217;s face it, outliers are so much more fun!<\/p>\n<p>I have discussed a few outliers in the articles on YOU CANalytics, such as <a href=\"http:\/\/ucanalytics.com\/blogs\/credit-scorecards-predictive-analytics-part-7\/\" target=\"_blank\">Columbus<\/a>, <a href=\"http:\/\/ucanalytics.com\/blogs\/murder-cases-evidence-and-logical-rigor-addendum\/\" target=\"_blank\">Turing<\/a>,\u00a0<a href=\"http:\/\/ucanalytics.com\/blogs\/case-study-banking-part-3-logistic-regression\/\" target=\"_blank\">Euler<\/a>,\u00a0<a href=\"http:\/\/ucanalytics.com\/blogs\/data-visualization-case-study-banking-part-2\/\" target=\"_blank\">Sherlock Holmes<\/a>,\u00a0<a href=\"http:\/\/ucanalytics.com\/blogs\/data-visualization-case-study-banking\/\" target=\"_blank\">Leonardo da Vinci<\/a>, <a href=\"http:\/\/ucanalytics.com\/blogs\/seven-advanced-analytics1-0-solutions-loan-portfolios\/\" target=\"_blank\">Batman<\/a>, and of course Groucho Marx. These men* have changed the course of human history in their own way. I hope to introduce more such outliers in future articles. 
Interestingly, one of the striking features of human outliers is the treatment they receive from society: much like statistical outliers, they are either ignored or beaten up until they conform to normal.<\/p>\n<p><span style=\"font-size: 10px;\">* I just noticed that so far I have not introduced a\u00a0<\/span><span style=\"font-size: 10px; line-height: 1.5em;\">woman\u00a0<\/span><span style=\"font-size: 10px; line-height: 1.5em;\">outlier in my articles; I will do so soon.<\/span><\/p>\n<h2><span style=\"color: #3366ff;\">Telecom Case Study Example and Outliers<\/span><\/h2>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/11\/T3.jpg\"><img data-attachment-id=\"1158\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-cluster-analysis-telecom-case-study-example\/t3\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/11\/T3.jpg?fit=422%2C247&amp;ssl=1\" data-orig-size=\"422,247\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}\" data-image-title=\"Cluster Analysis &#8211; 3nd Scatter Plot\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/11\/T3.jpg?fit=300%2C175&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/11\/T3.jpg?fit=422%2C247&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\" wp-image-1158 alignright\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/11\/T3.jpg?resize=300%2C175\" alt=\"Cluster Analysis - 2nd Scatter Plot\" width=\"300\" 
height=\"175\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/11\/T3.jpg?resize=300%2C175&amp;ssl=1 300w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/11\/T3.jpg?w=422&amp;ssl=1 422w\" sizes=\"(max-width: 300px) 100vw, 300px\" data-recalc-dims=\"1\" \/><\/a>In the last few articles, we have been working on a case study example from the telecom sector in which you play the role of\u00a0the head of customer insights and marketing (read <a href=\"http:\/\/ucanalytics.com\/blogs\/customer-segmentation-cluster-analysis-telecom-case-study\/\" target=\"_blank\">Part 1<\/a> and <a href=\"http:\/\/ucanalytics.com\/blogs\/customer-segmentation-cluster-analysis-telecom-case-study-part-2\/\" target=\"_blank\">Part 2<\/a>). In those articles, you started with some fundamentals of cluster analysis. The business case was to create customer segments to understand your customer base better and enhance your company\u2019s marketing campaigns. In the first part of this case study, we created clusters on our dataset using the following two variables &#8211; average international and average local call duration. We chose a couple of random seeds and produced the clusters shown in the adjacent plot. 
These clusters were produced iteratively, with Euclidean distance playing a crucial role in assigning each data point to its nearest cluster centroid.<\/p>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/outlier-cluster.jpg\"><img data-attachment-id=\"1429\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/outlier-cluster\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/outlier-cluster.jpg?fit=469%2C274&amp;ssl=1\" data-orig-size=\"469,274\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}\" data-image-title=\"outlier cluster\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/outlier-cluster.jpg?fit=300%2C175&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/outlier-cluster.jpg?fit=469%2C274&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"alignright size-medium wp-image-1429\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/outlier-cluster.jpg?resize=300%2C175\" alt=\"outlier cluster\" width=\"300\" height=\"175\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/outlier-cluster.jpg?resize=300%2C175&amp;ssl=1 300w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/outlier-cluster.jpg?w=469&amp;ssl=1 469w\" sizes=\"(max-width: 300px) 100vw, 300px\" data-recalc-dims=\"1\" \/><\/a><\/p>\n<p>Now, let us add one more customer \/ data point to the above dataset with average 
local call duration equal to 20 minutes and average international call duration equal to 10 minutes. This customer is clearly an outlier, as can be seen in the adjacent plot. Now, let us perform cluster analysis on this modified dataset with the same random seeds that we used in the first article. The iterative results are shown in the animation below. There are 4 iterations in total; notice how the cluster centroids move with each iteration. Also keep an eye on how the cluster allegiance of each data point changes across iterations. The color of a data point represents its cluster\u00a0allegiance.<\/p>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/output_0OCngU.gif\"><img data-attachment-id=\"6000\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/output_0ocngu\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/output_0OCngU.gif?fit=487%2C291&amp;ssl=1\" data-orig-size=\"487,291\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"output_0OCngU\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/output_0OCngU.gif?fit=300%2C179&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/output_0OCngU.gif?fit=487%2C291&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-6000 aligncenter\" 
src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/output_0OCngU.gif?resize=487%2C291\" alt=\"output_0OCngU\" width=\"487\" height=\"291\" data-recalc-dims=\"1\" \/><\/a><\/p>\n<p>Clearly, the presence of an outlier has changed the entire result of our analysis. You must have noticed that the initial choice of two clusters (two random seeds) no longer works well once the outlier is added. In this case, the outlier became a cluster of its own and the remaining data points were lumped into another cluster. If we had started with 3 cluster centroid seeds, we would likely have seen more reasonable clusters. This raises an important question about how to choose the number of clusters at the beginning of the analysis. Though one would like a simple answer to this question, trust me, there isn&#8217;t one. There are a few analytical techniques that can help here, but at the end of the day the analyst needs to make a prudent choice based on her domain experience.\u00a0We will discuss these analytical techniques in some other article.<\/p>\n<h4><span style=\"color: #3366ff;\">Sign-off Note<\/span><\/h4>\n<p>I believe there is an outlier hidden somewhere in all of us. Outliers can make the world a better place to live in. We must not let these outliers be beaten into normal by the pressures of life we all go through. Let me finish this article with another classic statement from Groucho Marx, who didn\u2019t lose his wit even on his deathbed.\u00a0A nurse came and told the frail Groucho that she wanted to check whether he had a temperature. Groucho retorted in his quick-witted tone, \u201cDon\u2019t be silly \u2013 everybody has a temperature\u201d. 
Yes, this is similar to saying everybody has an outlier.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Outliers &#8220;I refuse to join any club that would have me as a member.&#8221; &#8211; Groucho Marx This witty statement came from (according to me) one of the funniest men in the history of American cinema \u2013 Julius Henry Marx better known as Groucho Marx. Groucho was certainly a very unusual man and might be<\/p>\n<p><a class=\"excerpt-more blog-excerpt\" href=\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/\">Read More&#8230;<\/a><\/p>\n","protected":false},"author":1,"featured_media":1386,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_newsletter_tier_id":0,"jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false}}},"categories":[1,58],"tags":[7,71,6,10],"jetpack_publicize_connections":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v17.4 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Cluster Analysis and Outliers: Telecom Case Study Example<\/title>\n<meta name=\"description\" content=\"This is a case study example to illustrate significance of outliers in customer segments through cluster analysis.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Cluster Analysis and Outliers: Telecom Case Study Example\" 
\/>\n<meta property=\"og:description\" content=\"This is a case study example to illustrate significance of outliers in customer segments through cluster analysis.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/\" \/>\n<meta property=\"og:site_name\" content=\"YOU CANalytics |\" \/>\n<meta property=\"article:author\" content=\"roopam\" \/>\n<meta property=\"article:published_time\" content=\"2014-01-12T04:33:54+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2015-09-07T16:26:34+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?fit=768%2C1024&#038;ssl=1\" \/>\n\t<meta property=\"og:image:width\" content=\"768\" \/>\n\t<meta property=\"og:image:height\" content=\"1024\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Roopam Upadhyay\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Organization\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\",\"name\":\"YOU CANalytics\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/\",\"sameAs\":[],\"logo\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#logo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120\",\"contentUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120\",\"width\":607,\"height\":120,\"caption\":\"YOU CANalytics\"},\"image\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#logo\"}},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#website\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/\",\"name\":\"YOU CANalytics |\",\"description\":\"Explore the Power of Data Science\",\"publisher\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/ucanalytics.com\/blogs\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#primaryimage\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?fit=768%2C1024&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?fit=768%2C1024&ssl=1\",\"width\":768,\"height\":1024,\"caption\":\"Groucho - by 
Roopam\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#webpage\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/\",\"name\":\"Cluster Analysis and Outliers: Telecom Case Study Example\",\"isPartOf\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#primaryimage\"},\"datePublished\":\"2014-01-12T04:33:54+00:00\",\"dateModified\":\"2015-09-07T16:26:34+00:00\",\"description\":\"This is a case study example to illustrate significance of outliers in customer segments through cluster analysis.\",\"breadcrumb\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/ucanalytics.com\/blogs\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Cluster Analysis and Outliers \\u2013 Telecom Case Study Example (Part 3)\"}]},{\"@type\":\"Article\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#webpage\"},\"author\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6\"},\"headline\":\"Cluster Analysis and Outliers \\u2013 Telecom Case Study Example (Part 
3)\",\"datePublished\":\"2014-01-12T04:33:54+00:00\",\"dateModified\":\"2015-09-07T16:26:34+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#webpage\"},\"wordCount\":829,\"commentCount\":1,\"publisher\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\"},\"image\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?fit=768%2C1024&ssl=1\",\"keywords\":[\"Business Analytics\",\"Customer Segmentation\",\"Predictive Analytics\",\"Roopam Upadhyay\"],\"articleSection\":[\"Marketing Analytics\",\"Telecom Case Study Example\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#respond\"]}]},{\"@type\":\"Person\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6\",\"name\":\"Roopam Upadhyay\",\"image\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#personlogo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g\",\"caption\":\"Roopam Upadhyay\"},\"description\":\"This blog contains my personal views and thoughts on predictive Analytics and big data. - Roopam Upadhyay\",\"sameAs\":[\"roopam\"],\"url\":\"https:\/\/ucanalytics.com\/blogs\/author\/roopam\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Cluster Analysis and Outliers: Telecom Case Study Example","description":"This is a case study example to illustrate significance of outliers in customer segments through cluster analysis.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/","og_locale":"en_US","og_type":"article","og_title":"Cluster Analysis and Outliers: Telecom Case Study Example","og_description":"This is a case study example to illustrate significance of outliers in customer segments through cluster analysis.","og_url":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/","og_site_name":"YOU CANalytics |","article_author":"roopam","article_published_time":"2014-01-12T04:33:54+00:00","article_modified_time":"2015-09-07T16:26:34+00:00","og_image":[{"width":768,"height":1024,"url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?fit=768%2C1024&ssl=1","type":"image\/jpeg"}],"twitter_misc":{"Written by":"Roopam Upadhyay","Est. 
reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Organization","@id":"https:\/\/ucanalytics.com\/blogs\/#organization","name":"YOU CANalytics","url":"https:\/\/ucanalytics.com\/blogs\/","sameAs":[],"logo":{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/#logo","inLanguage":"en-US","url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120","contentUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120","width":607,"height":120,"caption":"YOU CANalytics"},"image":{"@id":"https:\/\/ucanalytics.com\/blogs\/#logo"}},{"@type":"WebSite","@id":"https:\/\/ucanalytics.com\/blogs\/#website","url":"https:\/\/ucanalytics.com\/blogs\/","name":"YOU CANalytics |","description":"Explore the Power of Data Science","publisher":{"@id":"https:\/\/ucanalytics.com\/blogs\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/ucanalytics.com\/blogs\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#primaryimage","inLanguage":"en-US","url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?fit=768%2C1024&ssl=1","contentUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?fit=768%2C1024&ssl=1","width":768,"height":1024,"caption":"Groucho - by Roopam"},{"@type":"WebPage","@id":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#webpage","url":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/","name":"Cluster Analysis and Outliers: Telecom Case Study 
Example","isPartOf":{"@id":"https:\/\/ucanalytics.com\/blogs\/#website"},"primaryImageOfPage":{"@id":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#primaryimage"},"datePublished":"2014-01-12T04:33:54+00:00","dateModified":"2015-09-07T16:26:34+00:00","description":"This is a case study example to illustrate significance of outliers in customer segments through cluster analysis.","breadcrumb":{"@id":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/ucanalytics.com\/blogs\/"},{"@type":"ListItem","position":2,"name":"Cluster Analysis and Outliers \u2013 Telecom Case Study Example (Part 3)"}]},{"@type":"Article","@id":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#article","isPartOf":{"@id":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#webpage"},"author":{"@id":"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6"},"headline":"Cluster Analysis and Outliers \u2013 Telecom Case Study Example (Part 
3)","datePublished":"2014-01-12T04:33:54+00:00","dateModified":"2015-09-07T16:26:34+00:00","mainEntityOfPage":{"@id":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#webpage"},"wordCount":829,"commentCount":1,"publisher":{"@id":"https:\/\/ucanalytics.com\/blogs\/#organization"},"image":{"@id":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?fit=768%2C1024&ssl=1","keywords":["Business Analytics","Customer Segmentation","Predictive Analytics","Roopam Upadhyay"],"articleSection":["Marketing Analytics","Telecom Case Study Example"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/ucanalytics.com\/blogs\/customer-segmentation-outliers-telecom-case-study-part-3\/#respond"]}]},{"@type":"Person","@id":"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6","name":"Roopam Upadhyay","image":{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/#personlogo","inLanguage":"en-US","url":"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g","caption":"Roopam Upadhyay"},"description":"This blog contains my personal views and thoughts on predictive Analytics and big data. 
- Roopam Upadhyay","sameAs":["roopam"],"url":"https:\/\/ucanalytics.com\/blogs\/author\/roopam\/"}]}},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/01\/photo.jpg?fit=768%2C1024&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p3L0jT-ml","jetpack-related-posts":[{"id":8649,"url":"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/","url_meta":{"origin":1385,"position":0},"title":"Bivariate Analysis &#038; Leverage &#8211; Regression Case Study Example (Part 3)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Welcome back to the\u00a0case study example for regression analysis where you are helping an investment firm make money through property price arbitrage. In the last two parts (Part 1 & Part 2) you started with the univariate analysis to identify patterns in the data including missing data and outliers. In\u2026","rel":"","context":"In &quot;Pricing Case Study Example&quot;","block_context":{"text":"Pricing Case Study Example","link":"https:\/\/ucanalytics.com\/blogs\/category\/pricing-case-study-example\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&ssl=1&resize=700%2C400 2x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&ssl=1&resize=1050%2C600 
3x"},"classes":[]},{"id":8488,"url":"https:\/\/ucanalytics.com\/blogs\/data-preparation-regression-pricing-case-study-example-part-2\/","url_meta":{"origin":1385,"position":1},"title":"Data Preparation for Regression &#8211; Pricing Case Study Example (Part 2)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"In the last post we had started a case study example for regression analysis to help an investment firm make money through property price arbitrage\u00a0(read part 1 :\u00a0regression case study example).\u00a0This is an interactive case study example and required your help to move forward. These are some of your observations\u2026","rel":"","context":"In &quot;Analytics Labs&quot;","block_context":{"text":"Analytics Labs","link":"https:\/\/ucanalytics.com\/blogs\/category\/analytics-labs\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-analysis.jpg?fit=448%2C528&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":695,"url":"https:\/\/ucanalytics.com\/blogs\/data-visualization-case-study-banking-part-2\/","url_meta":{"origin":1385,"position":2},"title":"Data Visualization &#8211; Banking Case Study Example (Part 2)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Sherlock Holmes & Data Visualization As a kid, a friend of mine used to own a Sherlock Holmes toy kit \u2013 the source of envy for all the other friends. The kit had a Sherlock Holmes cap, a pipe, a watch and a magnifying glass. 
The magnifying glass was the\u2026","rel":"","context":"In &quot;Banking Risk Case Study Example&quot;","block_context":{"text":"Banking Risk Case Study Example","link":"https:\/\/ucanalytics.com\/blogs\/category\/risk-analytics\/banking-risk-case-study-example\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/09\/Sherlock-Holmes-Copy.jpg?fit=459%2C458&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":1116,"url":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-cluster-analysis-telecom-case-study-example\/","url_meta":{"origin":1385,"position":3},"title":"Customer Segmentation &#038; Cluster Analysis &#8211; Telecom Case Study Example (Part 1)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Galaxies and Cluster Analysis I live in Mumbai (Bombay), the financial capital of India and one of the largest cities in the world. One of the problems of living in a large city is that you rarely see stars in the night sky. 
The limited sky one can see through\u2026","rel":"","context":"In &quot;Marketing Analytics&quot;","block_context":{"text":"Marketing Analytics","link":"https:\/\/ucanalytics.com\/blogs\/category\/marketing-analytics\/"},"img":{"alt_text":"The Night Sky - by Roopam","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/11\/sky-1.jpg?fit=768%2C1024&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/11\/sky-1.jpg?fit=768%2C1024&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/11\/sky-1.jpg?fit=768%2C1024&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/11\/sky-1.jpg?fit=768%2C1024&ssl=1&resize=700%2C400 2x"},"classes":[]},{"id":1259,"url":"https:\/\/ucanalytics.com\/blogs\/customer-segmentation-cluster-analysis-telecom-case-study-part-2\/","url_meta":{"origin":1385,"position":4},"title":"Customer Segmentation &#038; Cluster Analysis \u2013 Telecom Case Study Example(Part 2)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"In one of\u00a0the previous articles, we have started with a case study example from the telecom sector. We learned about cluster analysis using black holes as an analogy. In that article, we used Euclidean distance to form customer segments. 
Let us continue with the same case study and learn about\u2026","rel":"","context":"In &quot;Marketing Analytics&quot;","block_context":{"text":"Marketing Analytics","link":"https:\/\/ucanalytics.com\/blogs\/category\/marketing-analytics\/"},"img":{"alt_text":"Euclid - by Roopam","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/12\/unnamed.jpg?fit=524%2C615&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":8159,"url":"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/","url_meta":{"origin":1385,"position":5},"title":"5 Mistakes at the Beginning of Analytics Projects, and Ways to Avoid Them","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Why do data science and analytics projects fail? At what stage of the project life-cycle are they most vulnerable to failure? Like any living creature, the probability of analytics projects to fail is the highest either in their infancy or at the final stages of their life cycle. 
A successful\u2026","rel":"","context":"In &quot;Analytics Tips and Tricks&quot;","block_context":{"text":"Analytics Tips and Tricks","link":"https:\/\/ucanalytics.com\/blogs\/category\/analytics-tips\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?fit=440%2C625&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]}],"_links":{"self":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts\/1385"}],"collection":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/comments?post=1385"}],"version-history":[{"count":0,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts\/1385\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/media\/1386"}],"wp:attachment":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/media?parent=1385"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/categories?post=1385"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/tags?post=1385"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}