{"id":3973,"date":"2014-09-22T15:39:24","date_gmt":"2014-09-22T10:09:24","guid":{"rendered":"http:\/\/ucanalytics.com\/blogs\/?p=3973"},"modified":"2016-10-13T14:19:34","modified_gmt":"2016-10-13T08:49:34","slug":"model-selection-retail-case-study-example-part-7","status":"publish","type":"post","link":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/","title":{"rendered":"Model Selection &#8211; Retail Case Study Example (Part 7)"},"content":{"rendered":"<hr \/>\n<div id=\"attachment_3974\" style=\"width: 375px\" class=\"wp-caption alignright\"><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg\"><img aria-describedby=\"caption-attachment-3974\" data-attachment-id=\"3974\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/photo-7\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1654%2C1418&amp;ssl=1\" data-orig-size=\"1654,1418\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Model Selection &#8211; by Roopam\" data-image-description=\"\" data-image-caption=\"&lt;p&gt;Model Selection &#8211; by Roopam&lt;\/p&gt;\n\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=300%2C257&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=640%2C548&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"wp-image-3974\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?resize=365%2C314\" alt=\"Model Selection - by Roopam\" width=\"365\" height=\"314\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?w=1654&amp;ssl=1 1654w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?resize=250%2C214&amp;ssl=1 250w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?resize=300%2C257&amp;ssl=1 300w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?resize=1024%2C877&amp;ssl=1 1024w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?w=1280 1280w\" sizes=\"(max-width: 365px) 100vw, 365px\" data-recalc-dims=\"1\" \/><\/a><p id=\"caption-attachment-3974\" class=\"wp-caption-text\">Model Selection &#8211; by Roopam<\/p><\/div>\n<h2><span style=\"color: #3366ff;\">Model Selection<\/span><\/h2>\n<p>This is a continuation of our retail case study example for campaign and marketing analytics. In the previous two parts, we discussed a couple of decision tree algorithms (<a href=\"http:\/\/ucanalytics.com\/blogs\/decision-tree-cart-retail-case-part-5\/\">CART<\/a> and <a href=\"http:\/\/ucanalytics.com\/blogs\/decision-tree-entropy-retail-case-part-6\/\">C4.5<\/a>)\u00a0for classification. Recall a previous case study example on\u00a0banking and risk management where we discussed <a href=\"http:\/\/ucanalytics.com\/blogs\/case-study-banking-part-3-logistic-regression\/\">logistic regression<\/a>\u00a0which is another approach to solving classification problems. Additionally, there are several other\u00a0statistical and machine learning algorithms that are equally powerful for classification tasks such as:<\/p>\n<ul>\n<li>Support Vector Machines<\/li>\n<li>Random Forest<\/li>\n<li>Artificial Neural Networks<\/li>\n<li>Discriminant analysis<\/li>\n<li>Boosting<\/li>\n<li>Na\u00efve Bayes\u00a0Classifiers<\/li>\n<\/ul>\n<p>This list is definitely not complete but covers some of the commonly used approaches. We will discuss all these approaches in later articles\u00a0on YOU CANalytics. Now the question is: why there are so\u00a0many different approaches to solving the same problem?\u00a0A\u00a0more important question which everybody asks is: which one approach of\u00a0these\u00a0is the best? The answer to the second question is none! Yes, the best approach depends on the kind of data you are working with,\u00a0and since data come in all shapes and sizes hence you can&#8217;t have one\u00a0best approach for all problems. Hence, development of models with different approaches, and the best model selection for your data is an important exercise in data science and analytics. In this article, we will discuss the factors that influence the process of model selection. However, before that let us quickly examine some of the tasks data scientists perform, \u00a0as this will help us when we will make our transition to next parts of this case study example.<\/p>\n<h2><span style=\"color: #3366ff;\">Tasks for Data Science<\/span><\/h2>\n<p>Primarily, tasks that data scientists perform could be grouped into the following six broad categories as displayed\u00a0below. Please note that even the modern data science tasks such as web &amp; social media analytics, text mining, image analytics, and sound\u00a0pattern detection are some the used cases of these six broad categories.<\/p>\n<div id=\"attachment_4004\" style=\"width: 650px\" class=\"wp-caption aligncenter\"><img aria-describedby=\"caption-attachment-4004\" data-attachment-id=\"4004\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/data-science-tasks-by-roopam\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/Data-Science-Tasks-by-Roopam.jpg?fit=640%2C377&amp;ssl=1\" data-orig-size=\"640,377\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Data Science Tasks &#8211; by Roopam\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/Data-Science-Tasks-by-Roopam.jpg?fit=300%2C176&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/Data-Science-Tasks-by-Roopam.jpg?fit=640%2C377&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"wp-image-4004 size-full\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/Data-Science-Tasks-by-Roopam.jpg?resize=640%2C377\" alt=\"Data Science Tasks - by Roopam\" width=\"640\" height=\"377\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/Data-Science-Tasks-by-Roopam.jpg?w=640&amp;ssl=1 640w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/Data-Science-Tasks-by-Roopam.jpg?resize=250%2C147&amp;ssl=1 250w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/Data-Science-Tasks-by-Roopam.jpg?resize=300%2C176&amp;ssl=1 300w\" sizes=\"(max-width: 640px) 100vw, 640px\" data-recalc-dims=\"1\" \/><p id=\"caption-attachment-4004\" class=\"wp-caption-text\">Data Science Tasks &#8211; by Roopam<\/p><\/div>\n<p>As you might have noticed, in this\u00a0case study, we have so far performed 3 tasks from the above list\u00a0i.e. &#8216;Description&#8217; (<a href=\"http:\/\/ucanalytics.com\/blogs\/exploratory-data-analysis-retail-case-study-part-3\/\">exploratory data analysis<\/a>), &#8216;Association&#8217; (<a href=\"http:\/\/ucanalytics.com\/blogs\/association-analysis-retail-case-study-part-4\/\">association analysis<\/a>), and &#8216;Classification&#8217; (decision trees:\u00a0<a href=\"http:\/\/ucanalytics.com\/blogs\/decision-tree-cart-retail-case-part-5\/\">CART<\/a> and <a href=\"http:\/\/ucanalytics.com\/blogs\/decision-tree-entropy-retail-case-part-6\/\">C4.5<\/a>). I must say, exploratory data analysis (EDA) is an integral part of every data science project. EDA is a crucial exercise that\u00a0drives predictive models in the right direction.<\/p>\n<p>In the last few\u00a0parts of this case study, we will do some &#8216;Estimation&#8217; (i.e. regression analysis to estimate revenue\u00a0generated by customers through campaigns). Let&#8217;s go back to model selection for our classification problem.<\/p>\n<h2><span style=\"color: #3366ff;\">Model Selection &#8211; Retail Case Study Example<\/span><\/h2>\n<p>Back to our retail case study example, where you are\u00a0the Chief Analytics Officer &amp; Business Strategy Head at an online shopping store called DresSMart Inc. that specializes in apparel and clothing.\u00a0Through\u00a0your rigorous exploratory data analysis you have found several factors\u00a0that play crucial roles in marketing campaigns&#8217;\u00a0response\u00a0for customers, some of these factors are:<\/p>\n<ul>\n<li>Recency: # recent visits to the company&#8217;s website and purchases<\/li>\n<li>Frequency: time lag between purchases in the last 6 months<\/li>\n<li>Payment mode used: cash on delivery, credit card, internet banking etc.<\/li>\n<li>Marketing data aggregator&#8217;s:\u00a0life-stage segmentations (i.e. luxury buffs, up-scale ageing, first-time earners etc.)<\/li>\n<li>Last year&#8217;s expenditure trend: amount spent last year<\/li>\n<li>Coupon usage pattern of customer<\/li>\n<\/ul>\n<p>You have tried several multivariate models mentioned above (i.e. logistic regression, SVM, decision trees etc.) to model customers&#8217; behaviour and generate purchase propensity scores. The choice of right model selection depends on the following 2\u00a0factors i.e.<\/p>\n<ol>\n<li>Predictive power of models<\/li>\n<li>Business &amp;\u00a0operations integration<\/li>\n<\/ol>\n<h4><span style=\"color: #3366ff;\">1 Predictive power of models<\/span><\/h4>\n<p>The first factor for model selection\u00a0is the overall predictive power that the model has in comparison to other models. For this classification problem, the area under receiver operating curve (AUROC) is possibly the best way to assess the predictive power of models (<a href=\"http:\/\/ucanalytics.com\/blogs\/credit-scorecards-model-validation-part-6\/\">read more about AUROC<\/a>). Sometimes people also use Gini coefficient for assessing predictive power of models, Gini is another variant\u00a0of AUROC and mathematically represented as:<\/p>\n<pre><img decoding=\"async\" src=\"https:\/\/s0.wp.com\/latex.php?latex=Gini%3D2%5Ctimes+AUROC-1+&#038;bg=ffffff&#038;fg=000&#038;s=0&#038;c=20201002\" alt=\"Gini=2&#92;times AUROC-1 \" class=\"latex\" \/><\/pre>\n<div id=\"attachment_4033\" style=\"width: 354px\" class=\"wp-caption alignright\"><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/image001.png\"><img aria-describedby=\"caption-attachment-4033\" data-attachment-id=\"4033\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/image001\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/image001.png?fit=660%2C622&amp;ssl=1\" data-orig-size=\"660,622\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Area Under ROC for Different Models\" data-image-description=\"\" data-image-caption=\"&lt;p&gt;Area Under ROC for Different Models&lt;\/p&gt;\n\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/image001.png?fit=300%2C282&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/image001.png?fit=640%2C603&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"wp-image-4033\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/image001.png?resize=344%2C324\" alt=\"Area Under ROC for Different Models\" width=\"344\" height=\"324\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/image001.png?w=660&amp;ssl=1 660w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/image001.png?resize=250%2C235&amp;ssl=1 250w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/image001.png?resize=300%2C282&amp;ssl=1 300w\" sizes=\"(max-width: 344px) 100vw, 344px\" data-recalc-dims=\"1\" \/><\/a><p id=\"caption-attachment-4033\" class=\"wp-caption-text\">Area Under ROC for Different Models<\/p><\/div>\n<p>In the adjacent plot, AUROC is displayed for artificial neural networks, logistic regression, and CART decision tree. Notice, the perfect model curve (in green) here is with 100% predictive power, and random model (in red) represents prediction through the flip of a coin. The AUROC values for the\u00a0test sample for the 3 models are:<\/p>\n<table>\n<tbody>\n<tr>\n<td>Model<\/td>\n<td>AUROC<\/td>\n<\/tr>\n<tr>\n<td>Decision Tree<\/td>\n<td>72%<\/td>\n<\/tr>\n<tr>\n<td>Logistic Regression<\/td>\n<td>76%<\/td>\n<\/tr>\n<tr>\n<td>Artificial Neural Networks<\/td>\n<td>77%<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Decision tree here is performing much below the other two models. This is often the case with decision trees, but they are still very\u00a0useful and popular because of their highly intuitive and easy-to-explain solutions. Artificial neural networks\u00a0are performing a notch above logistic regression in this case with a slightly higher\u00a0area under ROC. Hence by the first criteria, artificial neural networks\u00a0offer the best model among the 3 models.<\/p>\n<h4><span style=\"color: #3366ff;\">2 \u00a0Business &amp;\u00a0operations integration:<\/span><\/h4>\n<p>This aspect of model selection is equally important as the above factor if not more. The model selection must be based on productization\u00a0of the model for business usage in the long run. The following factors\u00a0are useful to keep in mind at the beginning of modeling process:<\/p>\n<p><strong>1)<\/strong> <strong>Consistent availability of data for all predictor variables<\/strong>: many times models are developed by\u00a0predictor variables that are hard to procure regularly and consistently. Keeping such variables in the model is not advisable even if they contribute to high predictive power. This is especially true for third party data which is purchased once in a while.<\/p>\n<p><strong>2) The model should be\u00a0simple enough to calibrate<\/strong>: this factor is really important if the model will be used for a long duration i.e. more than 2 years. Certain models are relatively easy to calibrate or alter according to changes in market environment. This way analysts don&#8217;t need to rebuild a new model every so often.<\/p>\n<p><strong>3) Integration with information system and business process<\/strong>: the goal of any model is to integrate well with IT systems used by business users. Analysts must\u00a0think of productionization of the model for business process integration at the beginning of the project to avoid unnecessary rework at the completion of the project.<\/p>\n<p><strong>4) Business users&#8217; commitment for regular usage of models<\/strong>: data science is not just an intellectual exercise. The most important aspect of data science&#8217;s success is the generation of business value through actionable insights, and business users&#8217; commitment to act on these insights. This commitment by business users come from their involvement in, and understanding of model building process. Data scientists need to communicate well with business users throughout to gain their trust.<\/p>\n<h4><span style=\"color: #3366ff;\">Sign-off Note<\/span><\/h4>\n<p>In this article, we have noticed that\u00a0artificial neural networks performed slightly better than logistic regression and decision tree algorithms for our dataset. We will discuss neural networks in the next article before continuing with the next part of this case study i.e.\u00a0estimations through regression. See you soon!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Model Selection This is a continuation of our retail case study example for campaign and marketing analytics. In the previous two parts, we discussed a couple of decision tree algorithms (CART and C4.5)\u00a0for classification. Recall a previous case study example on\u00a0banking and risk management where we discussed logistic regression\u00a0which is another approach to solving classification<\/p>\n<p><a class=\"excerpt-more blog-excerpt\" href=\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/\">Read More&#8230;<\/a><\/p>\n","protected":false},"author":1,"featured_media":3974,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_newsletter_tier_id":0,"jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false}}},"categories":[1,59],"tags":[7,42,6,72,10],"jetpack_publicize_connections":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v17.4 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Model Selection - Retail Case Study Example (Part 7) &ndash; YOU CANalytics |<\/title>\n<meta name=\"description\" content=\"Case study example for an online retail store: Model selection is an important data science process depends on both analytics and business needs.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Model Selection - Retail Case Study Example (Part 7) &ndash; YOU CANalytics |\" \/>\n<meta property=\"og:description\" content=\"Case study example for an online retail store: Model selection is an important data science process depends on both analytics and business needs.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/\" \/>\n<meta property=\"og:site_name\" content=\"YOU CANalytics |\" \/>\n<meta property=\"article:author\" content=\"roopam\" \/>\n<meta property=\"article:published_time\" content=\"2014-09-22T10:09:24+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2016-10-13T08:49:34+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1654%2C1418&#038;ssl=1\" \/>\n\t<meta property=\"og:image:width\" content=\"1654\" \/>\n\t<meta property=\"og:image:height\" content=\"1418\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Roopam Upadhyay\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Organization\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\",\"name\":\"YOU CANalytics\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/\",\"sameAs\":[],\"logo\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#logo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120\",\"contentUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120\",\"width\":607,\"height\":120,\"caption\":\"YOU CANalytics\"},\"image\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#logo\"}},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#website\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/\",\"name\":\"YOU CANalytics |\",\"description\":\"Explore the Power of Data Science\",\"publisher\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/ucanalytics.com\/blogs\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#primaryimage\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1654%2C1418&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1654%2C1418&ssl=1\",\"width\":1654,\"height\":1418,\"caption\":\"Model Selection - by Roopam\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#webpage\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/\",\"name\":\"Model Selection - Retail Case Study Example (Part 7) &ndash; YOU CANalytics |\",\"isPartOf\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#primaryimage\"},\"datePublished\":\"2014-09-22T10:09:24+00:00\",\"dateModified\":\"2016-10-13T08:49:34+00:00\",\"description\":\"Case study example for an online retail store: Model selection is an important data science process depends on both analytics and business needs.\",\"breadcrumb\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/ucanalytics.com\/blogs\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Model Selection &#8211; Retail Case Study Example (Part 7)\"}]},{\"@type\":\"Article\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#webpage\"},\"author\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6\"},\"headline\":\"Model Selection &#8211; Retail Case Study Example (Part 7)\",\"datePublished\":\"2014-09-22T10:09:24+00:00\",\"dateModified\":\"2016-10-13T08:49:34+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#webpage\"},\"wordCount\":1177,\"commentCount\":11,\"publisher\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\"},\"image\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1654%2C1418&ssl=1\",\"keywords\":[\"Business Analytics\",\"Marketing Analytics\",\"Predictive Analytics\",\"Retail Analytics\",\"Roopam Upadhyay\"],\"articleSection\":[\"Marketing Analytics\",\"Retail Case Study Example\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#respond\"]}]},{\"@type\":\"Person\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6\",\"name\":\"Roopam Upadhyay\",\"image\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#personlogo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g\",\"caption\":\"Roopam Upadhyay\"},\"description\":\"This blog contains my personal views and thoughts on predictive Analytics and big data. - Roopam Upadhyay\",\"sameAs\":[\"roopam\"],\"url\":\"https:\/\/ucanalytics.com\/blogs\/author\/roopam\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Model Selection - Retail Case Study Example (Part 7) &ndash; YOU CANalytics |","description":"Case study example for an online retail store: Model selection is an important data science process depends on both analytics and business needs.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/","og_locale":"en_US","og_type":"article","og_title":"Model Selection - Retail Case Study Example (Part 7) &ndash; YOU CANalytics |","og_description":"Case study example for an online retail store: Model selection is an important data science process depends on both analytics and business needs.","og_url":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/","og_site_name":"YOU CANalytics |","article_author":"roopam","article_published_time":"2014-09-22T10:09:24+00:00","article_modified_time":"2016-10-13T08:49:34+00:00","og_image":[{"width":1654,"height":1418,"url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1654%2C1418&ssl=1","type":"image\/jpeg"}],"twitter_misc":{"Written by":"Roopam Upadhyay","Est. reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Organization","@id":"https:\/\/ucanalytics.com\/blogs\/#organization","name":"YOU CANalytics","url":"https:\/\/ucanalytics.com\/blogs\/","sameAs":[],"logo":{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/#logo","inLanguage":"en-US","url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120","contentUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120","width":607,"height":120,"caption":"YOU CANalytics"},"image":{"@id":"https:\/\/ucanalytics.com\/blogs\/#logo"}},{"@type":"WebSite","@id":"https:\/\/ucanalytics.com\/blogs\/#website","url":"https:\/\/ucanalytics.com\/blogs\/","name":"YOU CANalytics |","description":"Explore the Power of Data Science","publisher":{"@id":"https:\/\/ucanalytics.com\/blogs\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/ucanalytics.com\/blogs\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#primaryimage","inLanguage":"en-US","url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1654%2C1418&ssl=1","contentUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1654%2C1418&ssl=1","width":1654,"height":1418,"caption":"Model Selection - by Roopam"},{"@type":"WebPage","@id":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#webpage","url":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/","name":"Model Selection - Retail Case Study Example (Part 7) &ndash; YOU CANalytics |","isPartOf":{"@id":"https:\/\/ucanalytics.com\/blogs\/#website"},"primaryImageOfPage":{"@id":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#primaryimage"},"datePublished":"2014-09-22T10:09:24+00:00","dateModified":"2016-10-13T08:49:34+00:00","description":"Case study example for an online retail store: Model selection is an important data science process depends on both analytics and business needs.","breadcrumb":{"@id":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/ucanalytics.com\/blogs\/"},{"@type":"ListItem","position":2,"name":"Model Selection &#8211; Retail Case Study Example (Part 7)"}]},{"@type":"Article","@id":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#article","isPartOf":{"@id":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#webpage"},"author":{"@id":"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6"},"headline":"Model Selection &#8211; Retail Case Study Example (Part 7)","datePublished":"2014-09-22T10:09:24+00:00","dateModified":"2016-10-13T08:49:34+00:00","mainEntityOfPage":{"@id":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#webpage"},"wordCount":1177,"commentCount":11,"publisher":{"@id":"https:\/\/ucanalytics.com\/blogs\/#organization"},"image":{"@id":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1654%2C1418&ssl=1","keywords":["Business Analytics","Marketing Analytics","Predictive Analytics","Retail Analytics","Roopam Upadhyay"],"articleSection":["Marketing Analytics","Retail Case Study Example"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/#respond"]}]},{"@type":"Person","@id":"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6","name":"Roopam Upadhyay","image":{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/#personlogo","inLanguage":"en-US","url":"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g","caption":"Roopam Upadhyay"},"description":"This blog contains my personal views and thoughts on predictive Analytics and big data. - Roopam Upadhyay","sameAs":["roopam"],"url":"https:\/\/ucanalytics.com\/blogs\/author\/roopam\/"}]}},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1654%2C1418&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p3L0jT-125","jetpack-related-posts":[{"id":4357,"url":"https:\/\/ucanalytics.com\/blogs\/next-best-action-retail-case-study-example-part-11\/","url_meta":{"origin":3973,"position":0},"title":"Next Best Action &#8211; Retail Case Study Example (Epilogue)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"This article is the epilogue to our case study example about campaign and marketing analytics for an online retail company. You could read the previous parts at Problem definition: \u00a0Part 1\u00a0&\u00a0Part 2 Description: Part 3 Association: Part 4 Classification: Part 5, Part 6,\u00a0Part 7,\u00a0Part 8 Estimation: Part 9, Part 10\u2026","rel":"","context":"In &quot;Marketing Analytics&quot;","block_context":{"text":"Marketing Analytics","link":"https:\/\/ucanalytics.com\/blogs\/category\/marketing-analytics\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/11\/Free-Will.jpg?fit=315%2C448&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":4274,"url":"https:\/\/ucanalytics.com\/blogs\/regression-model-retail-case-study-part-10\/","url_meta":{"origin":3973,"position":1},"title":"Regression Model &#8211; Retail Case Study Example (Part 10)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"This article is a continuation of our marketing analytics case study example for campaign management solutions. In this case, we started with the following\u00a0two\u00a0goals to build models to identify Most responsive customers Most revenue generating customers Problem definition: \u00a0Part 1\u00a0&\u00a0Part 2 Description: Part 3 Association: Part 4 Classification: Part 5,\u2026","rel":"","context":"In &quot;Marketing Analytics&quot;","block_context":{"text":"Marketing Analytics","link":"https:\/\/ucanalytics.com\/blogs\/category\/marketing-analytics\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/10\/Regression-Train-by-Roopam.jpg?fit=448%2C320&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":3195,"url":"https:\/\/ucanalytics.com\/blogs\/marketing-analytics-retail-case-study-example-part-2\/","url_meta":{"origin":3973,"position":2},"title":"Marketing Analytics &#8211; Retail Case Study Example (Part 2)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Last week we started a case study from the online retail industry to learn more about marketing analytics (Read Part 1). Before we continue with the same case, let me share a few factors that enhance the quality of analysis for marketing or customer analytics. The obvious factors, of course,\u2026","rel":"","context":"In &quot;Marketing Analytics&quot;","block_context":{"text":"Marketing Analytics","link":"https:\/\/ucanalytics.com\/blogs\/category\/marketing-analytics\/"},"img":{"alt_text":"Noir - by Roopam","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/02\/photo.jpg?fit=713%2C1024&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/02\/photo.jpg?fit=713%2C1024&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/02\/photo.jpg?fit=713%2C1024&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/02\/photo.jpg?fit=713%2C1024&ssl=1&resize=700%2C400 2x"},"classes":[]},{"id":3129,"url":"https:\/\/ucanalytics.com\/blogs\/marketing-analytics-retail-case-study-part-1\/","url_meta":{"origin":3973,"position":3},"title":"Marketing Analytics &#8211; Retail Case Study Example (Part 1)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Today we are going to start a new case study example on YOU CANalytics. In this case study example, we will examine different facets of marketing analytics and customer relationship management (CRM). We will use the example of online retail to explore more about marketing analytics - an area of\u2026","rel":"","context":"In &quot;Marketing Analytics&quot;","block_context":{"text":"Marketing Analytics","link":"https:\/\/ucanalytics.com\/blogs\/category\/marketing-analytics\/"},"img":{"alt_text":"Marketing Analytics Modeling","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/06\/Marketing-Analytics-Modeling-Blog.jpg?resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/06\/Marketing-Analytics-Modeling-Blog.jpg?resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/06\/Marketing-Analytics-Modeling-Blog.jpg?resize=525%2C300 1.5x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/06\/Marketing-Analytics-Modeling-Blog.jpg?resize=700%2C400 2x"},"classes":[]},{"id":4054,"url":"https:\/\/ucanalytics.com\/blogs\/artificial-neural-networks-retail-case-study-example-part-8\/","url_meta":{"origin":3973,"position":4},"title":"Artificial Neural Networks, Made Easy &#8211; Retail Case Study Example (Part 8)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Welcome back to\u00a0our retail case study example\u00a0for campaign and marketing analytics. So far in this case study, we were working on a\u00a0classification problem to identify customers with a higher likelihood to purchase products from\u00a0the campaign catalogues. In the last article on\u00a0model selection, we noticed that artificial neural networks performed better\u2026","rel":"","context":"In &quot;Marketing Analytics&quot;","block_context":{"text":"Marketing Analytics","link":"https:\/\/ucanalytics.com\/blogs\/category\/marketing-analytics\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/Artificial-Neural-Networks.jpg?fit=346%2C336&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":4133,"url":"https:\/\/ucanalytics.com\/blogs\/regression-mother-models-retail-case-study-example-part-9\/","url_meta":{"origin":3973,"position":5},"title":"Regression: the Mother of all Models &#8211; Retail Case Study Example (Part 9)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Welcome back to our retail case study example for marketing analytics. In\u00a0the previous 8 parts, we have covered some of the key tasks of data science such as: Problem definition: \u00a0Part 1\u00a0&\u00a0Part 2 Description: Part 3 Association: Part 4 Classification: Part 5, Part 6,\u00a0\u00a0Part 7\u00a0&\u00a0Part 8 \u00a0 In this part,\u2026","rel":"","context":"In &quot;Marketing Analytics&quot;","block_context":{"text":"Marketing Analytics","link":"https:\/\/ucanalytics.com\/blogs\/category\/marketing-analytics\/"},"img":{"alt_text":"100 Meters Race","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/10\/100-Meters-Race.jpg?resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/10\/100-Meters-Race.jpg?resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/10\/100-Meters-Race.jpg?resize=525%2C300 1.5x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/10\/100-Meters-Race.jpg?resize=700%2C400 2x"},"classes":[]}],"_links":{"self":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts\/3973"}],"collection":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/comments?post=3973"}],"version-history":[{"count":0,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts\/3973\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/media\/3974"}],"wp:attachment":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/media?parent=3973"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/categories?post=3973"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/tags?post=3973"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}