{"id":8159,"date":"2016-05-02T22:04:04","date_gmt":"2016-05-02T16:34:04","guid":{"rendered":"http:\/\/ucanalytics.com\/blogs\/?p=8159"},"modified":"2018-05-19T11:18:49","modified_gmt":"2018-05-19T05:48:49","slug":"5-mistakes-for-analytics-projects","status":"publish","type":"post","link":"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/","title":{"rendered":"5 Mistakes at the Beginning of Analytics Projects, and Ways to Avoid Them"},"content":{"rendered":"<div id=\"attachment_8162\" style=\"width: 325px\" class=\"wp-caption alignright\"><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg\"><img aria-describedby=\"caption-attachment-8162\" data-attachment-id=\"8162\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/data-thinking-for-survival-of-analytics-projects\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?fit=440%2C625&amp;ssl=1\" data-orig-size=\"440,625\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Data Thinking for Survival of Analytics Projects\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?fit=211%2C300&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?fit=440%2C625&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"wp-image-8162\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?resize=315%2C448\" alt=\"Data Thinking for Survival of Analytics Projects\" width=\"315\" height=\"448\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?w=440&amp;ssl=1 440w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?resize=176%2C250&amp;ssl=1 176w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?resize=211%2C300&amp;ssl=1 211w\" sizes=\"(max-width: 315px) 100vw, 315px\" data-recalc-dims=\"1\" \/><\/a><p id=\"caption-attachment-8162\" class=\"wp-caption-text\">Struggle of Analytics Projects\u00a0&#8211; by Roopam<\/p><\/div>\n<hr \/>\n<p>Why do data science and analytics projects fail? At what stage of the project life-cycle are they most vulnerable to failure? Like any living creature, the probability of analytics projects to fail is the highest either in their infancy or at the final stages of their life cycle. A successful analytics project, like a successful life-form, leaves a legacy for the next generations to follow.<\/p>\n<p>Thinking about\u00a0data in a meticulous and scientific way is at the core of a successful analytics project that produces a competitive edge for the organization. In this article, we will discuss some of the mistakes while working with data that invariably lead analytics projects to failure. In the absence of scientific and rational thought process while working with data most analytics projects experience infant mortality.\u00a0A good way for us to understand this struggle is through correlating it with the..<\/p>\n<h2><span style=\"color: #3366ff;\">Struggle of a New Born Wildebeest<\/span><\/h2>\n<p>National Geographic Channel is certainly\u00a0the most important source for many of us\u00a0to experience the wild. While I was growing up, on Sunday mornings the national television in India used to broadcast an hour-long show by the National Geographic Society. I vividly remember one of the episodes where a wildebeest mother gave birth to a baby. Wildebeest, by the way, got the name because of\u00a0their resemblance to wild cattle i.e wild-ox or wildebeest.<\/p>\n<p>I found\u00a0the birth of a baby wildebeest both a wonderful\u00a0and grotesque event at the same time. The baby wildebeest covered in slimy discharge slowly dropped out of the mother. Yuck! that was gross to me when I was 13 years old. However, what followed after that\u00a0were the visuals\u00a0of a great struggle and triumph. The baby had to immediately stand up after the birth and suckle milk from its mother. This was absolutely important for the baby&#8217;s survival otherwise, it had become a meal for the lurking predators. The baby\u00a0struggled for a while and fell hard on the ground several times during this effort.\u00a0The baby had no one to help, including the mother, while it rose from the ground and reached for the mother. Finally, we witnessed a great triumph of nature when the baby crossed the first hurdle for its survival.<\/p>\n<p>Analytics projects also need to cross this initial hurdle for their survival. Sound thinking about data and the problem statement is at the core of crossing this hurdle.<\/p>\n<h2><span style=\"color: #3366ff;\">5 Mistakes\u00a0at the Initial Stages of Analytics Projects<\/span><\/h2>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/mistake.jpg\"><img data-attachment-id=\"8296\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/mistake\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/mistake.jpg?fit=455%2C575&amp;ssl=1\" data-orig-size=\"455,575\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"mistake\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/mistake.jpg?fit=237%2C300&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/mistake.jpg?fit=455%2C575&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\" wp-image-8296 alignright\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/mistake.jpg?resize=301%2C381\" alt=\"mistake\" width=\"301\" height=\"381\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/mistake.jpg?w=455&amp;ssl=1 455w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/mistake.jpg?resize=198%2C250&amp;ssl=1 198w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/mistake.jpg?resize=237%2C300&amp;ssl=1 237w\" sizes=\"(max-width: 301px) 100vw, 301px\" data-recalc-dims=\"1\" \/><\/a>There are several reasons why analytics projects fail to generate beneficial outcomes for an organization. In this article, I\u00a0will focus purely on the initial hurdles for analytics projects. Moreover, our\u00a0complete attention will be on good practices while\u00a0thinking about data and the business problem statement.<\/p>\n<p>This is my list of 5 mistakes that one wants to avoid at the beginning of analytics projects.<\/p>\n<ol>\n<li>Eagerness to solve problems<\/li>\n<li>Failure to identify\u00a0the right variables<\/li>\n<li>Treatment of missing data<\/li>\n<li>Beating down the outliers<\/li>\n<li>Not being careful about reproducibility of results<\/li>\n<\/ol>\n<p>Let me discuss these mistakes and ways to avoid them in some detail in the next sections.<\/p>\n<h4><span style=\"color: #3366ff;\">Mistake 1. Eagerness to Solve Problems<\/span><\/h4>\n<p>On both Linkedin and Facebook you must have seen people posting problems like the ones shown below:<\/p>\n<table>\n<tbody>\n<tr>\n<td style=\"background-color: #e8e8e8;\">\n<p style=\"padding-left: 30px;\">i) Tell a word that starts and ends with the letter &#8216;R&#8217;<\/p>\n<\/td>\n<\/tr>\n<tr>\n<td style=\"background-color: #e8e8e8;\">\n<p style=\"padding-left: 30px;\">ii)\u00a095% people will fail this simple mathematics problem : solve 36\u00f73\u22124\u00d79\u00f73<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Almost always users on these social media sites immediately start answering these questions. For instance, for the first problem, you will notice answers such as rear, roar, render, rejoinder etc. The answers tend to get much more sophisticated and complicated with more users pooling in. You will invariably find hundreds and thousands of responses to every such problem. Interestingly you will rarely find anybody retorting with : why is this an important question? If someone does ask this, he\/she is\u00a0considered a spoilsport.<\/p>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/questions.jpg\"><img data-attachment-id=\"8233\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/questions-2\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/questions.jpg?fit=220%2C148&amp;ssl=1\" data-orig-size=\"220,148\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"questions\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/questions.jpg?fit=220%2C148&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/questions.jpg?fit=220%2C148&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"alignleft wp-image-8233\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/questions.jpg?resize=294%2C198\" alt=\"questions\" width=\"294\" height=\"198\" data-recalc-dims=\"1\" \/><\/a>There is something extremely interesting happening up there. Humans are wired, both by nature and nurture (read schooling), to answer questions without questioning the question. We see a problem and we need to solve it. This is a dangerous strategy for analytics projects and often results in quick\u00a0mortality for the projects.<\/p>\n<p>Identification of the right business problem is at the core of successful analytics projects. Moreover, estimation of\u00a0business benefit, both financial and intangible, is the foremost task for the project team. Not every business problem is equality important, and trust me several problems are not even worth putting any effort into. Always ask why the problem you are solving is important and don&#8217;t start your project till you have a satisfactory answer.<\/p>\n<p>As for the second problem posted at the top of this section, the solution is in the BODMAS rule we learned in the primary school. The answer is &#8216;zer0&#8217;\u00a0but again why is this problem important? You could type this equation in an Excel cell and get the answer in no time.<\/p>\n<h4><span style=\"color: #3366ff;\">Mistake 2.\u00a0Failure to Identify\u00a0the Right Variables<\/span><\/h4>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/job-satisfaction.jpg\"><img data-attachment-id=\"8274\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/job-satisfaction\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/job-satisfaction.jpg?fit=802%2C562&amp;ssl=1\" data-orig-size=\"802,562\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"job satisfaction\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/job-satisfaction.jpg?fit=300%2C210&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/job-satisfaction.jpg?fit=640%2C448&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"wp-image-8274 alignright\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/job-satisfaction.jpg?resize=309%2C217\" alt=\"job satisfaction\" width=\"309\" height=\"217\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/job-satisfaction.jpg?w=802&amp;ssl=1 802w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/job-satisfaction.jpg?resize=250%2C175&amp;ssl=1 250w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/job-satisfaction.jpg?resize=300%2C210&amp;ssl=1 300w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/job-satisfaction.jpg?resize=768%2C538&amp;ssl=1 768w\" sizes=\"(max-width: 309px) 100vw, 309px\" data-recalc-dims=\"1\" \/><\/a>After\u00a0identification of the right question(s), the second step is to identify the right data and variables to work with. Assume you want to build a model to predict job satisfaction for employees. In any human resources system, the easily available and highly quantifiable metrics are income, bonus, designation, increments etc. But we all know from our experience that job satisfaction is a highly complicated phenomenon\u00a0and can barely be predicted with just these variables. However, when one builds this model there is a greater temptation to just use the easily available variables. The ability to identify the right set of variables at the beginning of the project differentiates a good analyst from the rest. Identification of variables requires a good understanding of the domain and lots of creativity. Creativity helps in generating derived variables from the available data\u00a0in the business systems.<\/p>\n<p>Once the right set of variables are identified and prepared, the next step is diagnostic or exploratory data analysis (EDA) of these variables. The next two mistakes are linked to EDA and they happen while handling missing data and outliers.<\/p>\n<h4><span style=\"color: #3366ff;\">Mistake 3. Treatment of Missing data<\/span><\/h4>\n<blockquote><p>&#8220;Is there any point to which you would wish to draw my attention [Mr. Sherlock Holmes]?&#8221;<br \/>\n&#8220;To the curious incident of the dog in the night-time.&#8221;<br \/>\n&#8220;The dog did nothing in the night-time.&#8221;<br \/>\n&#8220;That was the curious incident, &#8221; remarked Sherlock Holmes.<\/p>\n<p style=\"text-align: right;\">\u2015 form\u00a0<span id=\"quote_book_link_6224895\">Silver Blaze in\u00a0The Memoirs of Sherlock Holmes<\/span><\/p>\n<\/blockquote>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/missing-data.jpg\"><img data-attachment-id=\"8236\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/missing-data\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/missing-data.jpg?fit=698%2C400&amp;ssl=1\" data-orig-size=\"698,400\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"missing data\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/missing-data.jpg?fit=300%2C172&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/missing-data.jpg?fit=640%2C367&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"wp-image-8236 alignright\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/missing-data.jpg?resize=297%2C170\" alt=\"missing data\" width=\"297\" height=\"170\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/missing-data.jpg?w=698&amp;ssl=1 698w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/missing-data.jpg?resize=250%2C143&amp;ssl=1 250w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/missing-data.jpg?resize=300%2C172&amp;ssl=1 300w\" sizes=\"(max-width: 297px) 100vw, 297px\" data-recalc-dims=\"1\" \/><\/a>Missing data is a reality of virtually every business data-set. In\u00a0statistics classes, it is taught that missing data is the biggest enemy of analysis. You are told to replace missing data with either the average or some other sophisticated values generate through regression or other fancy techniques. At times, this process of replacing missing values becomes so mechanical that the analysts tend to forget that there could be a reason why data is missing.<\/p>\n<p>Sherlock Holmes, in the above dialogue, enunciated that absence of something is also evidence as in the case of the dog, not barking. This signified that someone familiar to the dog had entered the barn at the night time. This helped Sherlock Holmes solve the mystery of a lost horse named Silver Blaze.<\/p>\n<p>Similarly missing data or absence of something\u00a0in certain cases can be a strong evidence in itself. This is particularly true in risk and fraud analytics. At the beginning of the analytics projects, it is a good idea to scrutinize missing data and identify if there are compelling clues hiding within them.<\/p>\n<h4><span style=\"color: #3366ff;\">Mistake 4. Beating down the Outliers<\/span><\/h4>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/outliers.jpg\"><img data-attachment-id=\"8255\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/outliers\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/outliers.jpg?fit=276%2C183&amp;ssl=1\" data-orig-size=\"276,183\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"outliers\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/outliers.jpg?fit=276%2C183&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/outliers.jpg?fit=276%2C183&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-8255 alignright\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/outliers.jpg?resize=276%2C183\" alt=\"outliers\" width=\"276\" height=\"183\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/outliers.jpg?w=276&amp;ssl=1 276w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/outliers.jpg?resize=250%2C166&amp;ssl=1 250w\" sizes=\"(max-width: 276px) 100vw, 276px\" data-recalc-dims=\"1\" \/><\/a>Another problem for analysis, highlighted by every statistics textbook, is outliers. Outliers are the observations that are extremely dissimilar to the studied population. For instance, if you are studying the net wealth of individuals on the planet then Bill Gates and the Sultan of Brunei are complete outliers. One of the strategies to deal with outliers is data transformation i.e. taking the log or the square root of all the observations. This beats the data down to normal range. Again, this is a good strategy in many cases but is equally ineffective in several others. For example in several marketing analytics applications, it is a good idea to create different segments of the population and create a separate model for each segment.\u00a0Including Bill Gates and Sultan of Brunei in the same model as for the majority of world&#8217;s population does not make sense.<\/p>\n<p>I have used missing data and outliers\u00a0as a way to highlight\u00a0that analysts need to be careful about blindly using any statistical technique. In the next segment, we will discuss a serious problem that plagues many scientific investigations.<\/p>\n<h4><span style=\"color: #3366ff;\">Mistake 5. Not being Careful about Reproducibility of Results<\/span><\/h4>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/reproduce-1.jpg\"><img data-attachment-id=\"8240\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/reproduce-1\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/reproduce-1.jpg?fit=334%2C321&amp;ssl=1\" data-orig-size=\"334,321\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"reproduce 1\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/reproduce-1.jpg?fit=300%2C288&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/reproduce-1.jpg?fit=334%2C321&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"alignleft wp-image-8240\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/reproduce-1.jpg?resize=261%2C251\" alt=\"reproduce 1\" width=\"261\" height=\"251\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/reproduce-1.jpg?w=334&amp;ssl=1 334w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/reproduce-1.jpg?resize=250%2C240&amp;ssl=1 250w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/05\/reproduce-1.jpg?resize=300%2C288&amp;ssl=1 300w\" sizes=\"(max-width: 261px) 100vw, 261px\" data-recalc-dims=\"1\" \/><\/a>A few years ago, Amgen, a biotech company, decided to repeat over 50 landmark cancer biology studies published in the topmost scientific journals. They could only reproduce results for\u00a06 out of 53 studies. That is a success rate of a little\u00a0over 10 percent. In another effort of a similar sort, a group of qualified researchers tried to repeat\u00a0studies from\u00a0three prestigious psychology journals. They could reproduce just 39 out of 100 studies.\u00a0Reproducibility is a fundamental tenet of any scientific investigation. Any result that you get today must be reproducible tomorrow in somewhat similar conditions. Analytics or business analysis is no different.<\/p>\n<p>On the brighter side, recently scientists have confirmed that Albert Einstein&#8217;s General Theory of Relativity holds true for a galaxy 13 billion light-years from Earth. Now if you talk about reproducibility, Einstein&#8217;s theory takes it to the new height or distance. Analytics projects need not be as generalizable as the<em> General Theory of Relativity\u00a0<\/em>but they still need to be reproducible\u00a0in a localized boundary region\u00a0and time.<\/p>\n<p>Predictive models are built with the idea that the model built today will be good in the future. If the results are not reproducible than the predictive models are completely worthless. It is essential for the project team to identify reasons why their models won&#8217;t work in the future.<\/p>\n<h4><span style=\"color: #3366ff;\">Define Segments and Boundaries to Make Your Models Robust and Reproducible<\/span><\/h4>\n<p>Moreover, it is also a good idea to define boundaries and segments within which the model will operate properly. For instance, consider this fictitious model to estimate work experience for professionals<\/p>\n<pre><span style=\"font-family: 'andale mono', monospace; font-size: 14pt;\"><em>Work Expreience = Age - 21\r\n<\/em><\/span><\/pre>\n<p>This mathematical equation says that if someone who is just born will have -21 years of work experience. We know this is incorrect. However, most models in business systems are implemented without defining the boundaries of predictor variables and the surrounding environment. This will make the model behave erratically for a new segment. The above\u00a0model for salary is possibly correct in the boundary of age between 21 to 60 years. Outside these boundaries, this model will make no sense.<\/p>\n<h4><span style=\"color: #3366ff;\">Sign-off Note<\/span><\/h4>\n<p>The struggle and triumph of a baby wildebeest have an important lesson for teams involved in analytics projects. The baby had to stand up without any help from anyone including the mother.\u00a0Similarly, analytics teams need to rely on their own scientific logic and knowledge of numbers to travel the journey because for these aspects of the project they won&#8217;t get any help from the champions or the sponsors of the project.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Why do data science and analytics projects fail? At what stage of the project life-cycle are they most vulnerable to failure? Like any living creature, the probability of analytics projects to fail is the highest either in their infancy or at the final stages of their life cycle. A successful analytics project, like a successful<\/p>\n<p><a class=\"excerpt-more blog-excerpt\" href=\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/\">Read More&#8230;<\/a><\/p>\n","protected":false},"author":1,"featured_media":8162,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_newsletter_tier_id":0,"jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false}}},"categories":[62,78],"tags":[],"jetpack_publicize_connections":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v17.4 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>5 Mistakes at the Beginning of Analytics Projects, and Ways to Avoid Them &ndash; YOU CANalytics |<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"5 Mistakes at the Beginning of Analytics Projects, and Ways to Avoid Them &ndash; YOU CANalytics |\" \/>\n<meta property=\"og:description\" content=\"Why do data science and analytics projects fail? At what stage of the project life-cycle are they most vulnerable to failure? Like any living creature, the probability of analytics projects to fail is the highest either in their infancy or at the final stages of their life cycle. A successful analytics project, like a successfulRead More...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/\" \/>\n<meta property=\"og:site_name\" content=\"YOU CANalytics |\" \/>\n<meta property=\"article:author\" content=\"roopam\" \/>\n<meta property=\"article:published_time\" content=\"2016-05-02T16:34:04+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2018-05-19T05:48:49+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?fit=440%2C625&#038;ssl=1\" \/>\n\t<meta property=\"og:image:width\" content=\"440\" \/>\n\t<meta property=\"og:image:height\" content=\"625\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Roopam Upadhyay\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"9 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Organization\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\",\"name\":\"YOU CANalytics\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/\",\"sameAs\":[],\"logo\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#logo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120\",\"contentUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120\",\"width\":607,\"height\":120,\"caption\":\"YOU CANalytics\"},\"image\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#logo\"}},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#website\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/\",\"name\":\"YOU CANalytics |\",\"description\":\"Explore the Power of Data Science\",\"publisher\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/ucanalytics.com\/blogs\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#primaryimage\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?fit=440%2C625&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?fit=440%2C625&ssl=1\",\"width\":440,\"height\":625},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#webpage\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/\",\"name\":\"5 Mistakes at the Beginning of Analytics Projects, and Ways to Avoid Them &ndash; YOU CANalytics |\",\"isPartOf\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#primaryimage\"},\"datePublished\":\"2016-05-02T16:34:04+00:00\",\"dateModified\":\"2018-05-19T05:48:49+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/ucanalytics.com\/blogs\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"5 Mistakes at the Beginning of Analytics Projects, and Ways to Avoid Them\"}]},{\"@type\":\"Article\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#webpage\"},\"author\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6\"},\"headline\":\"5 Mistakes at the Beginning of Analytics Projects, and Ways to Avoid Them\",\"datePublished\":\"2016-05-02T16:34:04+00:00\",\"dateModified\":\"2018-05-19T05:48:49+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#webpage\"},\"wordCount\":1877,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\"},\"image\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?fit=440%2C625&ssl=1\",\"articleSection\":[\"Analytics Tips and Tricks\",\"Data Science Career\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#respond\"]}]},{\"@type\":\"Person\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6\",\"name\":\"Roopam Upadhyay\",\"image\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#personlogo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g\",\"caption\":\"Roopam Upadhyay\"},\"description\":\"This blog contains my personal views and thoughts on predictive Analytics and big data. - Roopam Upadhyay\",\"sameAs\":[\"roopam\"],\"url\":\"https:\/\/ucanalytics.com\/blogs\/author\/roopam\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"5 Mistakes at the Beginning of Analytics Projects, and Ways to Avoid Them &ndash; YOU CANalytics |","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/","og_locale":"en_US","og_type":"article","og_title":"5 Mistakes at the Beginning of Analytics Projects, and Ways to Avoid Them &ndash; YOU CANalytics |","og_description":"Why do data science and analytics projects fail? At what stage of the project life-cycle are they most vulnerable to failure? Like any living creature, the probability of analytics projects to fail is the highest either in their infancy or at the final stages of their life cycle. A successful analytics project, like a successfulRead More...","og_url":"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/","og_site_name":"YOU CANalytics |","article_author":"roopam","article_published_time":"2016-05-02T16:34:04+00:00","article_modified_time":"2018-05-19T05:48:49+00:00","og_image":[{"width":440,"height":625,"url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?fit=440%2C625&ssl=1","type":"image\/jpeg"}],"twitter_misc":{"Written by":"Roopam Upadhyay","Est. reading time":"9 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Organization","@id":"https:\/\/ucanalytics.com\/blogs\/#organization","name":"YOU CANalytics","url":"https:\/\/ucanalytics.com\/blogs\/","sameAs":[],"logo":{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/#logo","inLanguage":"en-US","url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120","contentUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120","width":607,"height":120,"caption":"YOU CANalytics"},"image":{"@id":"https:\/\/ucanalytics.com\/blogs\/#logo"}},{"@type":"WebSite","@id":"https:\/\/ucanalytics.com\/blogs\/#website","url":"https:\/\/ucanalytics.com\/blogs\/","name":"YOU CANalytics |","description":"Explore the Power of Data Science","publisher":{"@id":"https:\/\/ucanalytics.com\/blogs\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/ucanalytics.com\/blogs\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#primaryimage","inLanguage":"en-US","url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?fit=440%2C625&ssl=1","contentUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?fit=440%2C625&ssl=1","width":440,"height":625},{"@type":"WebPage","@id":"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#webpage","url":"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/","name":"5 Mistakes at the Beginning of Analytics Projects, and Ways to Avoid Them &ndash; YOU CANalytics |","isPartOf":{"@id":"https:\/\/ucanalytics.com\/blogs\/#website"},"primaryImageOfPage":{"@id":"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#primaryimage"},"datePublished":"2016-05-02T16:34:04+00:00","dateModified":"2018-05-19T05:48:49+00:00","breadcrumb":{"@id":"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/ucanalytics.com\/blogs\/"},{"@type":"ListItem","position":2,"name":"5 Mistakes at the Beginning of Analytics Projects, and Ways to Avoid Them"}]},{"@type":"Article","@id":"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#article","isPartOf":{"@id":"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#webpage"},"author":{"@id":"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6"},"headline":"5 Mistakes at the Beginning of Analytics Projects, and Ways to Avoid Them","datePublished":"2016-05-02T16:34:04+00:00","dateModified":"2018-05-19T05:48:49+00:00","mainEntityOfPage":{"@id":"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#webpage"},"wordCount":1877,"commentCount":0,"publisher":{"@id":"https:\/\/ucanalytics.com\/blogs\/#organization"},"image":{"@id":"https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?fit=440%2C625&ssl=1","articleSection":["Analytics Tips and Tricks","Data Science Career"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/ucanalytics.com\/blogs\/5-mistakes-for-analytics-projects\/#respond"]}]},{"@type":"Person","@id":"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6","name":"Roopam Upadhyay","image":{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/#personlogo","inLanguage":"en-US","url":"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g","caption":"Roopam Upadhyay"},"description":"This blog contains my personal views and thoughts on predictive Analytics and big data. - Roopam Upadhyay","sameAs":["roopam"],"url":"https:\/\/ucanalytics.com\/blogs\/author\/roopam\/"}]}},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/04\/Data-Thinking-for-Survival-of-Analytics-Projects.jpg?fit=440%2C625&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p3L0jT-27B","jetpack-related-posts":[{"id":2783,"url":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-eric-siegel-author-predictive-analytics\/","url_meta":{"origin":8159,"position":0},"title":"In Conversation with Eric Siegel: Author &#8216;Predictive Analytics&#8217;","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"In Conversation with.. Today we are starting a new series on YOU CANalytics called 'in conversation with'. In this series we will talk to the leaders and experts of predictive analytics and big data to gain deeper insight into the field. Dr. Eric Siegel Our first guest for the series\u2026","rel":"","context":"In &quot;Events &amp; Interviews&quot;","block_context":{"text":"Events &amp; Interviews","link":"https:\/\/ucanalytics.com\/blogs\/category\/events-and-interviews\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/12\/Slide15.jpg?fit=290%2C210&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":3768,"url":"https:\/\/ucanalytics.com\/blogs\/indian-institute-management-iim-lucknow\/","url_meta":{"origin":8159,"position":1},"title":"@ Indian Institute of Management (IIM), Lucknow","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"At IIM Lucknow For the last few days, I was at IIM Lucknow as a guest faculty to give\u00a0lecture sessions on 'Data Science, and\u00a0Marketing Analytics' to their\u00a0MBA students. It was an interesting\u00a0experience to interact with young\u00a0minds that will influence the future course of the business world. It was equally interesting\u00a0for\u2026","rel":"","context":"In &quot;Events &amp; Interviews&quot;","block_context":{"text":"Events &amp; Interviews","link":"https:\/\/ucanalytics.com\/blogs\/category\/events-and-interviews\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/08\/IIM-Lucknow.jpg?fit=639%2C462&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/08\/IIM-Lucknow.jpg?fit=639%2C462&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/08\/IIM-Lucknow.jpg?fit=639%2C462&ssl=1&resize=525%2C300 1.5x"},"classes":[]},{"id":6241,"url":"https:\/\/ucanalytics.com\/blogs\/4-ps-to-bring-data-science-to-boardroom-the-economic-times-business-analytics-summit\/","url_meta":{"origin":8159,"position":2},"title":"4 Ps to Bring Data Science to Boardroom @ The Economic Times Business Analytics Summit","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"A couple of\u00a0weeks ago I got an\u00a0opportunity to be a\u00a0part of a\u00a0panel discussion at 'The Economic Times Business Analytics Summit'. The topic of the\u00a0discussion was\u00a0'overcoming the challenges of bringing data science to the boardroom'.\u00a0The panel had a well-balanced representation from both industry and academia. It was an interesting and thought-provoking\u2026","rel":"","context":"In &quot;Events &amp; Interviews&quot;","block_context":{"text":"Events &amp; Interviews","link":"https:\/\/ucanalytics.com\/blogs\/category\/events-and-interviews\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/10\/The-Economic-Times-Business-Analytics-Summit.jpg?fit=678%2C395&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/10\/The-Economic-Times-Business-Analytics-Summit.jpg?fit=678%2C395&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/10\/The-Economic-Times-Business-Analytics-Summit.jpg?fit=678%2C395&ssl=1&resize=525%2C300 1.5x"},"classes":[]},{"id":7083,"url":"https:\/\/ucanalytics.com\/blogs\/career-transition-to-data-science-business-analytics-isb-hydrabad-and-bocconi\/","url_meta":{"origin":8159,"position":3},"title":"3 Suggestions for Career Transition to Data Science and Business Analytics for Experienced Professionals","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"A couple of\u00a0weeks ago I was at two different business schools as a guest speaker. Both ISB, Hyderabad, and MISB Bocconi have specialized programs in business analytics and data science for working professionals. I gave talks about 'career in data science & industry's expectations from data scientists'. I got the\u2026","rel":"","context":"In &quot;Analytics Tips and Tricks&quot;","block_context":{"text":"Analytics Tips and Tricks","link":"https:\/\/ucanalytics.com\/blogs\/category\/analytics-tips\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/12\/3-Sky-and-water.jpeg?fit=480%2C480&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":4519,"url":"https:\/\/ucanalytics.com\/blogs\/healthcare-analytics-next-frontier\/","url_meta":{"origin":8159,"position":4},"title":"Healthcare Analytics &#8211; The Next Frontier","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Healthcare Analytics In 2012, Larry Smarr concludes his TEDMED talk (a TED conference for medicine, healthcare, and biology) with the phrase: \u2018Because of big data and because of our ability to analyze it \u2013 we got\u00a0hope\u2019 Larry is no medical professional or biologist rather he is a physicist (astrophysicist), and\u2026","rel":"","context":"In &quot;Healthcare Analytics&quot;","block_context":{"text":"Healthcare Analytics","link":"https:\/\/ucanalytics.com\/blogs\/category\/healthcare-analytics\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/12\/Florence.jpg?fit=573%2C1024&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/12\/Florence.jpg?fit=573%2C1024&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/12\/Florence.jpg?fit=573%2C1024&ssl=1&resize=525%2C300 1.5x"},"classes":[]},{"id":281,"url":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-predictive-analytics-part-7\/","url_meta":{"origin":8159,"position":5},"title":"Credit Scorecards &#8211;  Business Integration of Predictive Analytics (part 7 of 7)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Columbus - A lesson in Leadership Christopher Columbus \u2013 I have adored this man for various reasons at various stages of my life. At seven, I adored him because his mistakes were applauded and became part of history \u2013 Columbus mistook Native Americans for Indians because he thought he had\u2026","rel":"","context":"In &quot;Credit Risk Analytics Series&quot;","block_context":{"text":"Credit Risk Analytics Series","link":"https:\/\/ucanalytics.com\/blogs\/category\/risk-analytics\/credit-risk-analytics-series\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/7-Leader-225x300.jpg?resize=350%2C200","width":350,"height":200},"classes":[]}],"_links":{"self":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts\/8159"}],"collection":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/comments?post=8159"}],"version-history":[{"count":0,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts\/8159\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/media\/8162"}],"wp:attachment":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/media?parent=8159"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/categories?post=8159"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/tags?post=8159"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}