{"id":48,"date":"2013-07-15T17:39:11","date_gmt":"2013-07-15T12:09:11","guid":{"rendered":"http:\/\/ucanalytics.com\/blogs\/?p=48"},"modified":"2017-05-03T12:07:09","modified_gmt":"2017-05-03T06:37:09","slug":"credit-scorecards-variables-selection-part-3","status":"publish","type":"post","link":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/","title":{"rendered":"Credit Scorecards &#8211; Variables Selection (part 3 of 7)"},"content":{"rendered":"<hr \/>\n<h2><span style=\"font-family: georgia, palatino; font-size: 18px; color: #3366ff;\">Variables Selection in Predictive Analytics<\/span><\/h2>\n<div id=\"attachment_49\" style=\"width: 226px\" class=\"wp-caption alignright\"><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-masks1.jpg\"><img aria-describedby=\"caption-attachment-49\" data-attachment-id=\"49\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/3-masks\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-masks1-e1375194528935.jpg?fit=150%2C139&amp;ssl=1\" data-orig-size=\"150,139\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}\" data-image-title=\"Masks\" data-image-description=\"\" data-image-caption=\"&lt;p&gt;Theatre &#8211; by Roopam&lt;\/p&gt;\n\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-masks1-e1375194528935.jpg?fit=300%2C277&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-masks1-e1375194528935.jpg?fit=150%2C139&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\" wp-image-49 \" title=\"Predictive Analytics: Variables Selection\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-masks1.jpg?resize=216%2C200\" alt=\"Predictive Analytics: Variables Selection - by Roopam Upadhyay\" width=\"216\" height=\"200\" data-recalc-dims=\"1\" \/><\/a><p id=\"caption-attachment-49\" class=\"wp-caption-text\">Predictive Analytics: Variables Selection &#8211; by Roopam<\/p><\/div>\n<p>The following story goes back to the time when I just started my transition from physics to business. I met this investment banker* in his mid-thirties during a Friday night party. After gulping down a few pints of beer, his mood became a bit somber and he told me how he hates his job. However, he had a plan of working his ass off until he retires at 45. Then he will do everything that makes him happy. I was thoroughly confused, how could someone debar himself from an emotion \u2013 happiness \u2013 for so many years and rediscover it later? I was wondering about the recipe for happiness \u2013 <em>raindrops on roses and whiskers on kittens<\/em>. An individual\u2019s happiness is a tricky thing; however, I shall attempt to tackle this issue in my later article on logistic regression. For now, let us try to explore how states measure the collective well-being of their people. I shall use this topic of population well-being to explore an interesting topic in analytical scorecard development: variables selection.<\/p>\n<h2><span style=\"color: #3366ff; font-size: 18px;\">Variables Selection &#8211; Lessons from GDP &amp; GNH<\/span><\/h2>\n<p>The most popular measure for national prosperity, unanimously projected by economists and TV channels, is Gross Domestic Product (GDP). The equation for measuring GDP as taught in macroeconomics 101 is:<\/p>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/GDP-Equation1.jpg\"><img data-attachment-id=\"50\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/gdp-equation\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/GDP-Equation1.jpg?fit=370%2C161&amp;ssl=1\" data-orig-size=\"370,161\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}\" data-image-title=\"GDP Equation\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/GDP-Equation1.jpg?fit=300%2C130&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/GDP-Equation1.jpg?fit=370%2C161&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"aligncenter size-full wp-image-50\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/GDP-Equation1.jpg?resize=370%2C161\" alt=\"GDP Equation\" width=\"370\" height=\"161\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/GDP-Equation1.jpg?w=370&amp;ssl=1 370w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/GDP-Equation1.jpg?resize=300%2C130&amp;ssl=1 300w\" sizes=\"(max-width: 370px) 100vw, 370px\" data-recalc-dims=\"1\" \/><\/a><\/p>\n<p>Clearly, there are 5 factors\/variables that govern GDP according to this equation. The first look at GDP as a measure for national well-being seemed incomplete to me. All the variables for GDP were from commerce. They are important but cannot be the only factors for country\u2019s well-being, more so in a highly diverse &amp; complicated country like India.<\/p>\n<h2><span style=\"color: #3396e6; font-size: 16px;\">Gross National Happiness Index &#8211; The Story of Bhutan Naresh<\/span><\/h2>\n<div id=\"attachment_454\" style=\"width: 311px\" class=\"wp-caption alignright\"><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-Tree2.jpg\"><img aria-describedby=\"caption-attachment-454\" data-attachment-id=\"454\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-part-1\/3-tree-2\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-Tree2.jpg?fit=768%2C1024&amp;ssl=1\" data-orig-size=\"768,1024\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}\" data-image-title=\"Variables Selection &#8211; by Roopam\" data-image-description=\"&lt;p&gt;Variables Selection &#8211; by Roopam&lt;\/p&gt;\n\" data-image-caption=\"&lt;p&gt;Variables Selection &#8211; by Roopam&lt;\/p&gt;\n\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-Tree2.jpg?fit=225%2C300&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-Tree2.jpg?fit=640%2C853&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\" wp-image-454\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-Tree2.jpg?resize=301%2C401\" alt=\"\" width=\"301\" height=\"401\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-Tree2.jpg?w=768&amp;ssl=1 768w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-Tree2.jpg?resize=225%2C300&amp;ssl=1 225w\" sizes=\"(max-width: 301px) 100vw, 301px\" data-recalc-dims=\"1\" \/><\/a><p id=\"caption-attachment-454\" class=\"wp-caption-text\">Variables Selection &#8211; by Roopam<\/p><\/div>\n<p>Ok, so what else do we have? A lesser-known index is Gross National Happiness (GNH). The origins of GNH are in Bhutan. They measure their country\u2019s progress through GNH. The term was coined and implemented by Jigme Singye Wangchuck. This name immediately takes me back to the early nineties live telecast of the SAARC summit by India\u2019s national broadcaster Doordarshan (DD). The old-timer Hindi commentators were referring to a modest man in a bathrobe-like-attire as \u2018Bhutan Naresh\u2019 \u2013 King of Bhutan. At first glance, he did not fit well with the power horses of the south Asian region. Nevertheless, he seems to have devised a more holistic metric to\u00a0measure his country\u2019s well-being. GNH is a combination of the following broad categories:<\/p>\n<p>1. Living standard &amp; income<br \/>\n2. Health coverage<br \/>\n3. Physiological well-being<br \/>\n4. Time spent at work and relaxing<br \/>\n5. Good governance<br \/>\n6. Schooling &amp; education<br \/>\n7. Cultural diversity<br \/>\n8. Community vitality<br \/>\n9. Environmentalism and conservatism<\/p>\n<p>There are 72 total variables in GNH measured on a scale of 0 to 1, such as daily hours of sleep and trust in media; hmmm, not a bad start! You could do your own research on GNH and let me know what you feel about it. Actually, we can work out our own formula for a GNH like metric. The idea is to select the right variables to build your model!<\/p>\n<h2><span style=\"color: #3366ff; font-size: 16px;\">Variables Selection in Credit Scoring<\/span><\/h2>\n<p>In data mining and statistical model building exercises, similar to credit scoring, variables selection process is performed through statistical significance \u2013 a reasonably automated process through advanced software. However, the variables are still created and measured by humans. High impact analyses in businesses are still driven by hunches. Human intelligence is not obsolete yet.<\/p>\n<p>In one of the projects I did with a financial organization, the result of credit risk analysis and scoring led to redesigning of the application form. Application forms are a major source of data collection regarding the borrower. However, nobody wants to fill a lengthy form hence an optimal size of the form ensures accurate information provided by the borrower. The idea is to select the right variable and ensure accurate measurement.<\/p>\n<p>There are several aspects regarding variables but I will mention just one of them here (coarse classing).<\/p>\n<h2><span style=\"font-size: 18px; color: #3396e6;\">Coarse Classing in Credit Scoring<\/span><\/h2>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-Shoe-measure1.jpg\"><img data-attachment-id=\"56\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/3-shoe-measure\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-Shoe-measure1.jpg?fit=358%2C282&amp;ssl=1\" data-orig-size=\"358,282\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}\" data-image-title=\"3 Shoe measure\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-Shoe-measure1.jpg?fit=300%2C236&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-Shoe-measure1.jpg?fit=358%2C282&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"alignright wp-image-56\" title=\"Coarse Classing in Credit Scoring\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-Shoe-measure1.jpg?resize=192%2C151\" alt=\"3 Shoe measure\" width=\"192\" height=\"151\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-Shoe-measure1.jpg?resize=300%2C236&amp;ssl=1 300w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-Shoe-measure1.jpg?w=358&amp;ssl=1 358w\" sizes=\"(max-width: 192px) 100vw, 192px\" data-recalc-dims=\"1\" \/><\/a>One of my favorite activities as a kid was going to a shoe store and getting my feet measured every summer before the school started. The shoe shops had a strange, miniature, slide-like device to measure foot size. It was fun to see my feet grow from one size to another every year or two. The growth was quantized i.e you are size-2 or 3 never 2.5 or 2.7. This aspect of converting measure such as 2.5 &amp; 2.7 to 3 is called grouping, bucketing or classing. This is an integral part of creating scorecards that you will find in all the books I have listed in the first part of this blog series.<\/p>\n<p>I have been a part of several heated discussions on the relevance of coarse class in scorecard development throughout my career. In most, if not all academic articles you will rarely see coarse classing as a technique during model development. Quite a few academicians &amp; practitioners for a good reason believe that coarse classing results in loss of information. However, in my opinion, coarse classing has the following advantage over using raw measurement for a variable.<\/p>\n<p>1. It reduces random noise that exists in raw variables \u2013 similar to averaging and yes, you lose some information here.<br \/>\n2. It handles extreme events \u2013 on two extremes of a variable \u2013 much better where you have thin data.<br \/>\n3. It handles the non-linear relationship between dependent and independent variable without a lot of effort of variable transformation from the analyst.<\/p>\n<h4><span style=\"color: #3396e6; font-size: 16px;\">Sign-off Note\u00a0<\/span><\/h4>\n<p>We are half way through this series on \u2018Analytical Scorecard Development\u2019 and I am enjoying writing this thoroughly. I hope as a reader you are on the same page. Scorecard building is highly technical and I have tried to discuss some aspects with easy to understand examples. However, to manage the length of the article, I am not able to get into the details. I must say that I love the details! So, if you have any queries, doubts, points-of-view or recommendations please write back on the discussion board or on my email: roopam.up@gmail.com<\/p>\n<pre style=\"text-align: justify;\">*I do not remember correctly if he was an investment banker but it fits the description better \u2013 I should also not refrain from treating the community like a punching bag (going by the popular emotions).<\/pre>\n","protected":false},"excerpt":{"rendered":"<p>Variables Selection in Predictive Analytics The following story goes back to the time when I just started my transition from physics to business. I met this investment banker* in his mid-thirties during a Friday night party. After gulping down a few pints of beer, his mood became a bit somber and he told me how<\/p>\n<p><a class=\"excerpt-more blog-excerpt\" href=\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/\">Read More&#8230;<\/a><\/p>\n","protected":false},"author":1,"featured_media":49,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_newsletter_tier_id":0,"jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false}}},"categories":[55,54],"tags":[8,7,69,6,10],"jetpack_publicize_connections":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v17.4 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Credit Scorecards : Variables Selection - YOU CANalytics<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Credit Scorecards : Variables Selection - YOU CANalytics\" \/>\n<meta property=\"og:description\" content=\"Variables Selection in Predictive Analytics The following story goes back to the time when I just started my transition from physics to business. I met this investment banker* in his mid-thirties during a Friday night party. After gulping down a few pints of beer, his mood became a bit somber and he told me howRead More...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/\" \/>\n<meta property=\"og:site_name\" content=\"YOU CANalytics |\" \/>\n<meta property=\"article:author\" content=\"roopam\" \/>\n<meta property=\"article:published_time\" content=\"2013-07-15T12:09:11+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2017-05-03T06:37:09+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-masks1-e1375194528935.jpg?fit=150%2C139&#038;ssl=1\" \/>\n\t<meta property=\"og:image:width\" content=\"150\" \/>\n\t<meta property=\"og:image:height\" content=\"139\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Roopam Upadhyay\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Organization\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\",\"name\":\"YOU CANalytics\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/\",\"sameAs\":[],\"logo\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#logo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120\",\"contentUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120\",\"width\":607,\"height\":120,\"caption\":\"YOU CANalytics\"},\"image\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#logo\"}},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#website\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/\",\"name\":\"YOU CANalytics |\",\"description\":\"Explore the Power of Data Science\",\"publisher\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/ucanalytics.com\/blogs\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#primaryimage\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-masks1-e1375194528935.jpg?fit=150%2C139&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-masks1-e1375194528935.jpg?fit=150%2C139&ssl=1\",\"width\":150,\"height\":139,\"caption\":\"Theatre - by Roopam\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#webpage\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/\",\"name\":\"Credit Scorecards : Variables Selection - YOU CANalytics\",\"isPartOf\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#primaryimage\"},\"datePublished\":\"2013-07-15T12:09:11+00:00\",\"dateModified\":\"2017-05-03T06:37:09+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/ucanalytics.com\/blogs\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Credit Scorecards &#8211; Variables Selection (part 3 of 7)\"}]},{\"@type\":\"Article\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#webpage\"},\"author\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6\"},\"headline\":\"Credit Scorecards &#8211; Variables Selection (part 3 of 7)\",\"datePublished\":\"2013-07-15T12:09:11+00:00\",\"dateModified\":\"2017-05-03T06:37:09+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#webpage\"},\"wordCount\":1027,\"commentCount\":8,\"publisher\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\"},\"image\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-masks1-e1375194528935.jpg?fit=150%2C139&ssl=1\",\"keywords\":[\"Banking and Insurance Analytics\",\"Business Analytics\",\"Credit Risk\",\"Predictive Analytics\",\"Roopam Upadhyay\"],\"articleSection\":[\"Credit Risk Analytics Series\",\"Risk Analytics\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#respond\"]}]},{\"@type\":\"Person\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6\",\"name\":\"Roopam Upadhyay\",\"image\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#personlogo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g\",\"caption\":\"Roopam Upadhyay\"},\"description\":\"This blog contains my personal views and thoughts on predictive Analytics and big data. - Roopam Upadhyay\",\"sameAs\":[\"roopam\"],\"url\":\"https:\/\/ucanalytics.com\/blogs\/author\/roopam\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Credit Scorecards : Variables Selection - YOU CANalytics","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/","og_locale":"en_US","og_type":"article","og_title":"Credit Scorecards : Variables Selection - YOU CANalytics","og_description":"Variables Selection in Predictive Analytics The following story goes back to the time when I just started my transition from physics to business. I met this investment banker* in his mid-thirties during a Friday night party. After gulping down a few pints of beer, his mood became a bit somber and he told me howRead More...","og_url":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/","og_site_name":"YOU CANalytics |","article_author":"roopam","article_published_time":"2013-07-15T12:09:11+00:00","article_modified_time":"2017-05-03T06:37:09+00:00","og_image":[{"width":150,"height":139,"url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-masks1-e1375194528935.jpg?fit=150%2C139&ssl=1","type":"image\/jpeg"}],"twitter_misc":{"Written by":"Roopam Upadhyay","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Organization","@id":"https:\/\/ucanalytics.com\/blogs\/#organization","name":"YOU CANalytics","url":"https:\/\/ucanalytics.com\/blogs\/","sameAs":[],"logo":{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/#logo","inLanguage":"en-US","url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120","contentUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120","width":607,"height":120,"caption":"YOU CANalytics"},"image":{"@id":"https:\/\/ucanalytics.com\/blogs\/#logo"}},{"@type":"WebSite","@id":"https:\/\/ucanalytics.com\/blogs\/#website","url":"https:\/\/ucanalytics.com\/blogs\/","name":"YOU CANalytics |","description":"Explore the Power of Data Science","publisher":{"@id":"https:\/\/ucanalytics.com\/blogs\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/ucanalytics.com\/blogs\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#primaryimage","inLanguage":"en-US","url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-masks1-e1375194528935.jpg?fit=150%2C139&ssl=1","contentUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-masks1-e1375194528935.jpg?fit=150%2C139&ssl=1","width":150,"height":139,"caption":"Theatre - by Roopam"},{"@type":"WebPage","@id":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#webpage","url":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/","name":"Credit Scorecards : Variables Selection - YOU CANalytics","isPartOf":{"@id":"https:\/\/ucanalytics.com\/blogs\/#website"},"primaryImageOfPage":{"@id":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#primaryimage"},"datePublished":"2013-07-15T12:09:11+00:00","dateModified":"2017-05-03T06:37:09+00:00","breadcrumb":{"@id":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/ucanalytics.com\/blogs\/"},{"@type":"ListItem","position":2,"name":"Credit Scorecards &#8211; Variables Selection (part 3 of 7)"}]},{"@type":"Article","@id":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#article","isPartOf":{"@id":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#webpage"},"author":{"@id":"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6"},"headline":"Credit Scorecards &#8211; Variables Selection (part 3 of 7)","datePublished":"2013-07-15T12:09:11+00:00","dateModified":"2017-05-03T06:37:09+00:00","mainEntityOfPage":{"@id":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#webpage"},"wordCount":1027,"commentCount":8,"publisher":{"@id":"https:\/\/ucanalytics.com\/blogs\/#organization"},"image":{"@id":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-masks1-e1375194528935.jpg?fit=150%2C139&ssl=1","keywords":["Banking and Insurance Analytics","Business Analytics","Credit Risk","Predictive Analytics","Roopam Upadhyay"],"articleSection":["Credit Risk Analytics Series","Risk Analytics"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/ucanalytics.com\/blogs\/credit-scorecards-variables-selection-part-3\/#respond"]}]},{"@type":"Person","@id":"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6","name":"Roopam Upadhyay","image":{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/#personlogo","inLanguage":"en-US","url":"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g","caption":"Roopam Upadhyay"},"description":"This blog contains my personal views and thoughts on predictive Analytics and big data. - Roopam Upadhyay","sameAs":["roopam"],"url":"https:\/\/ucanalytics.com\/blogs\/author\/roopam\/"}]}},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/3-masks1-e1375194528935.jpg?fit=150%2C139&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p3L0jT-M","jetpack-related-posts":[{"id":55,"url":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-advanced-analytics-part-4\/","url_meta":{"origin":48,"position":0},"title":"Credit Scorecards &#8211; Advanced Analytics (part 4 of 7)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Modeling in Advanced Analytics The room, full of Analysts, erupts with a loud round of laughter when a young business analyst narrates to us an incident from his recent trip back home. A distant aunt inquired about his new profession. His response \u2013 I am into modeling. She got all\u2026","rel":"","context":"In &quot;Credit Risk Analytics Series&quot;","block_context":{"text":"Credit Risk Analytics Series","link":"https:\/\/ucanalytics.com\/blogs\/category\/risk-analytics\/credit-risk-analytics-series\/"},"img":{"alt_text":"4. Scorecard Simple","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/4.-Scorecard-Simple1.jpg?resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/4.-Scorecard-Simple1.jpg?resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/4.-Scorecard-Simple1.jpg?resize=525%2C300 1.5x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/4.-Scorecard-Simple1.jpg?resize=700%2C400 2x"},"classes":[]},{"id":8,"url":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-part-1\/","url_meta":{"origin":48,"position":1},"title":"Credit Scorecards &#8211; Introduction (part 1 of 7)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Credit Scorecards in the Age of Credit Crisis This incident took place at a friend\u2019s party circa 2009, in the backdrop of the worst financial crisis the planet has seen for a long time. The average Joe on the street was aware of terms such as mortgaged-backed securities (MBS), sub-prime\u2026","rel":"","context":"In &quot;Credit Risk Analytics Series&quot;","block_context":{"text":"Credit Risk Analytics Series","link":"https:\/\/ucanalytics.com\/blogs\/category\/risk-analytics\/credit-risk-analytics-series\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/12\/Slide7.jpg?fit=290%2C210&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":281,"url":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-predictive-analytics-part-7\/","url_meta":{"origin":48,"position":2},"title":"Credit Scorecards &#8211;  Business Integration of Predictive Analytics (part 7 of 7)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Columbus - A lesson in Leadership Christopher Columbus \u2013 I have adored this man for various reasons at various stages of my life. At seven, I adored him because his mistakes were applauded and became part of history \u2013 Columbus mistook Native Americans for Indians because he thought he had\u2026","rel":"","context":"In &quot;Credit Risk Analytics Series&quot;","block_context":{"text":"Credit Risk Analytics Series","link":"https:\/\/ucanalytics.com\/blogs\/category\/risk-analytics\/credit-risk-analytics-series\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/7-Leader-225x300.jpg?resize=350%2C200","width":350,"height":200},"classes":[]},{"id":35,"url":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-classification-problem-part-2\/","url_meta":{"origin":48,"position":3},"title":"Credit Scorecards &#8211; Classification Problem (part 2 of 7)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Classification Problem in Statistics & Data Mining I must say I was shocked when Amishi, a girl little over three years old, announced that going forward she is only friends with my wife and not me. Her reason for the breakup was that I am a boy and girls can\u2026","rel":"","context":"In &quot;Credit Risk Analytics Series&quot;","block_context":{"text":"Credit Risk Analytics Series","link":"https:\/\/ucanalytics.com\/blogs\/category\/risk-analytics\/credit-risk-analytics-series\/"},"img":{"alt_text":"2 sample window","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/2-sample-window1.jpg?resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/2-sample-window1.jpg?resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/2-sample-window1.jpg?resize=525%2C300 1.5x"},"classes":[]},{"id":3973,"url":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/","url_meta":{"origin":48,"position":4},"title":"Model Selection &#8211; Retail Case Study Example (Part 7)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Model Selection This is a continuation of our retail case study example for campaign and marketing analytics. In the previous two parts, we discussed a couple of decision tree algorithms (CART and C4.5)\u00a0for classification. Recall a previous case study example on\u00a0banking and risk management where we discussed logistic regression\u00a0which is\u2026","rel":"","context":"In &quot;Marketing Analytics&quot;","block_context":{"text":"Marketing Analytics","link":"https:\/\/ucanalytics.com\/blogs\/category\/marketing-analytics\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1200%2C1029&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1200%2C1029&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1200%2C1029&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1200%2C1029&ssl=1&resize=700%2C400 2x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1200%2C1029&ssl=1&resize=1050%2C600 3x"},"classes":[]},{"id":2783,"url":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-eric-siegel-author-predictive-analytics\/","url_meta":{"origin":48,"position":5},"title":"In Conversation with Eric Siegel: Author &#8216;Predictive Analytics&#8217;","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"In Conversation with.. Today we are starting a new series on YOU CANalytics called 'in conversation with'. In this series we will talk to the leaders and experts of predictive analytics and big data to gain deeper insight into the field. Dr. Eric Siegel Our first guest for the series\u2026","rel":"","context":"In &quot;Events &amp; Interviews&quot;","block_context":{"text":"Events &amp; Interviews","link":"https:\/\/ucanalytics.com\/blogs\/category\/events-and-interviews\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/12\/Slide15.jpg?fit=290%2C210&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]}],"_links":{"self":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts\/48"}],"collection":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/comments?post=48"}],"version-history":[{"count":0,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts\/48\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/media\/49"}],"wp:attachment":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/media?parent=48"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/categories?post=48"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/tags?post=48"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}