{"id":8649,"date":"2016-09-04T11:21:47","date_gmt":"2016-09-04T05:51:47","guid":{"rendered":"http:\/\/ucanalytics.com\/blogs\/?p=8649"},"modified":"2016-09-04T11:21:47","modified_gmt":"2016-09-04T05:51:47","slug":"bivariate-analysis-leverage-regression-case-study-example-part-3","status":"publish","type":"post","link":"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/","title":{"rendered":"Bivariate Analysis &#038; Leverage &#8211; Regression Case Study Example (Part 3)"},"content":{"rendered":"<hr \/>\n<div id=\"attachment_8654\" style=\"width: 646px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg\"><img aria-describedby=\"caption-attachment-8654\" data-attachment-id=\"8654\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/regression-case-study-example\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&amp;ssl=1\" data-orig-size=\"1156,720\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Regression Case Study Example\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=300%2C187&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=640%2C399&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"wp-image-8654\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?resize=636%2C396\" alt=\"Regression Case Study Example\" width=\"636\" height=\"396\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?w=1156&amp;ssl=1 1156w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?resize=250%2C156&amp;ssl=1 250w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?resize=300%2C187&amp;ssl=1 300w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?resize=768%2C478&amp;ssl=1 768w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?resize=1024%2C638&amp;ssl=1 1024w\" sizes=\"(max-width: 636px) 100vw, 636px\" data-recalc-dims=\"1\" \/><\/a><p id=\"caption-attachment-8654\" class=\"wp-caption-text\">Leverage &amp; Regression Case Study Example &#8211; by Roopam<\/p><\/div>\n<p>Welcome back to the\u00a0case study example for regression analysis where you are helping an investment firm make money through property price arbitrage. In the last two parts (<a href=\"http:\/\/ucanalytics.com\/blogs\/regression-analysis-pricing-case-study-example-part-1\/\" target=\"_blank\"><strong>Part 1<\/strong> <\/a>&amp; <strong><a href=\"http:\/\/ucanalytics.com\/blogs\/data-preparation-regression-pricing-case-study-example-part-2\/\" target=\"_blank\">Part 2<\/a><\/strong>) you started with the univariate analysis to identify patterns in the data including missing data and outliers. In the discussion section of the last part, Katya, Chetan, Abhishek and VC started an interesting discussion about the pros and cons of removal of missing data. I have a few opinions on missing data but let me reveal them later, for now I am really enjoying reading your ideas. Thanks, and please keep sharing your ideas.<\/p>\n<p>In this part you will further your investigation through\u00a0bivariate analysis. Bivariate analysis will eventually help you develop multivariate regression models in the latter\u00a0parts of this case study example. Through the bivariate analysis you will also identify how outliers can play havoc for your analysis. However, before that let&#8217;s discuss:<\/p>\n<h2><span style=\"color: #3366ff;\">Archimedes and Leverage in Regression<\/span><\/h2>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-and-leverage.jpg\"><img data-attachment-id=\"8670\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/regression-and-leverage\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-and-leverage.jpg?fit=387%2C781&amp;ssl=1\" data-orig-size=\"387,781\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Regression and leverage\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-and-leverage.jpg?fit=149%2C300&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-and-leverage.jpg?fit=387%2C781&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"wp-image-8670 alignright\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-and-leverage.jpg?resize=188%2C379\" alt=\"Regression and leverage\" width=\"188\" height=\"379\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-and-leverage.jpg?w=387&amp;ssl=1 387w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-and-leverage.jpg?resize=124%2C250&amp;ssl=1 124w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-and-leverage.jpg?resize=149%2C300&amp;ssl=1 149w\" sizes=\"(max-width: 188px) 100vw, 188px\" data-recalc-dims=\"1\" \/><\/a>We have discussed Archimedes in an <a href=\"http:\/\/ucanalytics.com\/blogs\/the-beauty-of-%CF%80-iterative-calculation\/\" target=\"_blank\">earlier article<\/a>\u00a0on YOU CANalytics. Yes, he is the famous <em>Eureka!<\/em> guy. Archimedes was a mathematician, physicist, engineer, inventor, and astronomer. He used to say :\u00a0\u201c<em>\u03a0\u0391 \u0392\u03a9 \u039a\u0391\u0399 \u03a7\u0391\u03a1\u0399\u03a3\u03a4\u0399\u03a9\u039d\u0399 \u03a4\u0391\u039d \u0393\u0391\u039d \u039a\u0399\u039d\u0397\u03a3\u03a9 \u03a0\u0391\u03a3\u0391\u039d<\/em>.\u201d OK if that&#8217;s Greek to you then you are right. The literal translation of this Greek sentence is :\u00a0\u201c<em>Give me a place to stand and with a lever I will move the whole world.<\/em>\u201d.<\/p>\n<p>Archimedes in his famous quote was referring to the phenomenon called leverage. It is easy to experience leverage\u00a0if you try to open a door from the three different points (A, B, and C) displayed in the adjacent picture. You will notice that it requires much lesser effort or force to open the door the further you move away from the door hinges. This is the reason a door knob is placed as far away from the hinges to reduce your effort every time you open or close the door.<\/p>\n<p>Leverage plays an important role in regression models as we will notice in the next sections.<\/p>\n<h2><span style=\"color: #3366ff;\">Bivariate Analysis &#8211; Regression Case Study Example<\/span><\/h2>\n<p>Let&#8217;s come back to our case study example where you are a data science consultant for an investment firm. You are working with <strong><a href=\"http:\/\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/07\/Regression-Analysis-Data.csv\">this\u00a0data set<\/a>\u00a0<\/strong>to estimate property price through regression modeling. In this part of the case study, you will do bivariate analysis between the numeric response\u00a0variable (house_price) and the remaining prospective predictor variables in this data set. The\u00a0bivariate analysis has different approaches based the nature of\u00a0predictor variables i.e. numeric or categorical. Before we continue with our analysis let&#8217;s revisit some core concepts of correlation analysis and scatter plots to analyse numeric predictor variables.<\/p>\n<h2><span style=\"color: #3366ff;\"><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Correlation-Coefficient-and-Scatter-Plot.jpg\"><img data-attachment-id=\"8714\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/correlation-coefficient-and-scatter-plot\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Correlation-Coefficient-and-Scatter-Plot.jpg?fit=464%2C1109&amp;ssl=1\" data-orig-size=\"464,1109\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Correlation Coefficient and Scatter Plot\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Correlation-Coefficient-and-Scatter-Plot.jpg?fit=126%2C300&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Correlation-Coefficient-and-Scatter-Plot.jpg?fit=428%2C1024&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"wp-image-8714 alignright\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Correlation-Coefficient-and-Scatter-Plot.jpg?resize=252%2C602\" alt=\"Correlation Coefficient and Scatter Plot\" width=\"252\" height=\"602\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Correlation-Coefficient-and-Scatter-Plot.jpg?w=464&amp;ssl=1 464w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Correlation-Coefficient-and-Scatter-Plot.jpg?resize=105%2C250&amp;ssl=1 105w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Correlation-Coefficient-and-Scatter-Plot.jpg?resize=126%2C300&amp;ssl=1 126w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Correlation-Coefficient-and-Scatter-Plot.jpg?resize=428%2C1024&amp;ssl=1 428w\" sizes=\"(max-width: 252px) 100vw, 252px\" data-recalc-dims=\"1\" \/><\/a>Correlation Analysis &amp; Scatter Plot<\/span><\/h2>\n<p>One of the important measures while we perform bivariate analysis between 2 numeric variables is the correlation coefficient. Relation is the operative\u00a0word here. This coefficient represents relationship between the 2 variables. The range for correlation coefficient is between -1 and 1. The closer the correlation coefficient to 1 (i.e. 0.9 or 0.85) the higher positive relationship\u00a0the two variables have. Positive relationship means if the first variable grows than the second variable also grows. An example for positive correlation is age and height of teenagers.<\/p>\n<p>Similarly, the closer the correlation coefficient to \u00a0-1 (i.e -0.95 or &#8211; 0.8) the higher negative relationship the two variables have. As you must have guessed for the variables with negative correlation coefficient if the first variable grows than the second variable shrinks. An example for negative\u00a0correlation is your expenses and savings with a fixed income.<\/p>\n<p>Moreover, when the value of correlation coefficient is close to 0 (i.e. 0.01 or 0.05) then it represent there no or very little relationship between the 2 variables.<\/p>\n<p>In the next segment we will revisit our case study example and see how correlation coefficient\u00a0can be misleading in the presence of outliers. We will also connect this to leverage, the concept we discussed in the previous segment.<\/p>\n<h2><span style=\"color: #3366ff;\">Correlation and Leverage &#8211; Regression Case Study Example<\/span><\/h2>\n<p>Now let&#8217;s get back to the\u00a0<a href=\"http:\/\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/07\/Regression-Analysis-Data.csv\">dataset<\/a>\u00a0you are analyzing for your client. \u00a0You had\u00a0\u00a0started with\u00a0scatter plots of carpet area and house price with and without the outliers. You are particularly interested in\u00a0the correlation coefficients which you have placed at the top of these plots.<\/p>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example-Leverage.gif\"><img data-attachment-id=\"8650\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/regression-case-study-example-leverage\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example-Leverage.gif?fit=619%2C533&amp;ssl=1\" data-orig-size=\"619,533\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Regression Case Study Example Leverage\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example-Leverage.gif?fit=300%2C258&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example-Leverage.gif?fit=619%2C533&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-8650 aligncenter\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example-Leverage.gif?resize=619%2C533\" alt=\"Regression Case Study Example Leverage\" width=\"619\" height=\"533\" data-recalc-dims=\"1\" \/><\/a><\/p>\n<p>You noticed that the\u00a0extreme outlier (a mansion among the middle class houses) has a massive\u00a0leverage over the other observations. This has a huge impact on the correlation coefficient as well. Essentially, in the presence of this outlier you are drawing\u00a0the regression line between two dots on the scattered plot\u00a0: the extreme outlier and the whole bunch of data clubbed together (Exhibit 1). This is the reason the correction coefficient in this case is very close to perfect correlation i.e. 1. In Exhibit 2, when this outlier is removed the bunched up data looks like a more realistic scattered plot.<\/p>\n<p>Now, you want to\u00a0analyse all the numeric predictor variables against the response variable (house price) all at once. You will create scatter\u00a0plots and correlation coefficeints in the matrix format. Keep an eye for correlation coefficient and scatter plot for carpet area and house price you had already analysed in the above animation.<\/p>\n<p><em>R-code to create :\u00a0<a href=\"http:\/\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/Matrix-plot-scatter-plot-and-correlation.txt\">matrix plot with both scatter plot and correlation coefficient<\/a>.<\/em><\/p>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-with-outliers.jpg\"><img data-attachment-id=\"8652\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/regression-case-study-example-correlation-with-outliers\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-with-outliers.jpg?fit=1918%2C975&amp;ssl=1\" data-orig-size=\"1918,975\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Regression case study example &#8211; correlation with outliers\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-with-outliers.jpg?fit=300%2C153&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-with-outliers.jpg?fit=640%2C326&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-8652 aligncenter\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-with-outliers.jpg?resize=640%2C325\" alt=\"Regression case study example - correlation with outliers\" width=\"640\" height=\"325\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-with-outliers.jpg?w=1918&amp;ssl=1 1918w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-with-outliers.jpg?resize=250%2C127&amp;ssl=1 250w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-with-outliers.jpg?resize=300%2C153&amp;ssl=1 300w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-with-outliers.jpg?resize=768%2C390&amp;ssl=1 768w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-with-outliers.jpg?resize=1024%2C521&amp;ssl=1 1024w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-with-outliers.jpg?w=1280 1280w\" sizes=\"(max-width: 640px) 100vw, 640px\" data-recalc-dims=\"1\" \/><\/a><\/p>\n<p>You\u00a0have opted to remove the observation for the large mansion\u00a0from our dataset before the development of regression models. The matrix scattered plot without the outlier\u00a0looks like this.<\/p>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-without-outliers.jpg\"><img data-attachment-id=\"8651\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/regression-case-study-example-correlation-without-outliers\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-without-outliers.jpg?fit=1919%2C972&amp;ssl=1\" data-orig-size=\"1919,972\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Regression case study example &#8211; correlation without outliers\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-without-outliers.jpg?fit=300%2C152&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-without-outliers.jpg?fit=640%2C324&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-8651 aligncenter\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-without-outliers.jpg?resize=640%2C324\" alt=\"Regression case study example - correlation without outliers\" width=\"640\" height=\"324\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-without-outliers.jpg?w=1919&amp;ssl=1 1919w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-without-outliers.jpg?resize=250%2C127&amp;ssl=1 250w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-without-outliers.jpg?resize=300%2C152&amp;ssl=1 300w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-without-outliers.jpg?resize=768%2C389&amp;ssl=1 768w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-without-outliers.jpg?resize=1024%2C519&amp;ssl=1 1024w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-case-study-example-correlation-without-outliers.jpg?w=1280 1280w\" sizes=\"(max-width: 640px) 100vw, 640px\" data-recalc-dims=\"1\" \/><\/a><\/p>\n<p>Check out the histograms for all the numeric variables at the diagonal panels of this matrix plot. They all look well centered and nicely distributed. This\u00a0gives you a good confidence to move further with our analysis.<\/p>\n<p>If you look at the above matrix plot carefully you will also notice that distance between taxi stand, market, and hospital have significantly high correlations. Moreover, carpet and built-up area has almost perfect correlation. This indicates that in our data-set the build-up area is a derived field from the carper area because such a high correlation is almost impossible for a natural phenomenon. This is all pointing towards high correlation between predictor variables &#8211; the phenomenon is also known as multicollinearity. \u00a0We will explore more about multicollinearity in the next part when we will discuss principal component analysis. For now, let&#8217;s move further with bivariate analysis between categorical predictor variable and house price.<\/p>\n<h2><span style=\"color: #3366ff;\">Bivariate analysis &#8211; Categorical Predictor Variables<\/span><\/h2>\n<p>You have 2 categorical variables in our data-set: city category and parking availability. A good way to analyse categorical predictor variables\u00a0and numeric response variable is through a box plot. We can clearly see that there is a significant difference between the average price of houses based on the category of cities. The average prices are shown as in the middle of the boxes. R code :\u00a0<a href=\"http:\/\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/bivariate-analysis-categorical-variable.txt\">bivariate analysis &#8211; categorical variable<\/a>.<\/p>\n<p><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/Bivariate-analysis.jpeg\"><img data-attachment-id=\"8728\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/bivariate-analysis\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/Bivariate-analysis.jpeg?fit=935%2C533&amp;ssl=1\" data-orig-size=\"935,533\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"Bivariate analysis\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/Bivariate-analysis.jpeg?fit=300%2C171&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/Bivariate-analysis.jpeg?fit=640%2C365&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-8728 aligncenter\" src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/Bivariate-analysis.jpeg?resize=640%2C365\" alt=\"Bivariate analysis\" width=\"640\" height=\"365\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/Bivariate-analysis.jpeg?w=935&amp;ssl=1 935w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/Bivariate-analysis.jpeg?resize=250%2C143&amp;ssl=1 250w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/Bivariate-analysis.jpeg?resize=300%2C171&amp;ssl=1 300w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/Bivariate-analysis.jpeg?resize=768%2C438&amp;ssl=1 768w\" sizes=\"(max-width: 640px) 100vw, 640px\" data-recalc-dims=\"1\" \/><\/a><\/p>\n<p>Moreover to validate what you see in the box plot, you have performed pair-wise t test for each category. The results for pair-wise t-test shown at the top of the box plot in red. You noticed that\u00a0P(A=B)~0 means that there are almost 0% chances that average price of houses in cat A city is equal to cat B city.\u00a0Go ahead and plot a similar chart for parking and house prices. Also, let us know what you see.<\/p>\n<h4><span style=\"color: #3366ff;\">Sign-off Note<\/span><\/h4>\n<p>Archimedes did not find an enormously long lever and a place far-far away to move the earth. However, he did shake the earth with his ideas and had a leverage over the scientific thinking.<\/p>\n<p>In the next article, we will progress with multivariate regression model. The bivariate analysis in this part has already offered clues about the structure of our final model.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Welcome back to the\u00a0case study example for regression analysis where you are helping an investment firm make money through property price arbitrage. In the last two parts (Part 1 &amp; Part 2) you started with the univariate analysis to identify patterns in the data including missing data and outliers. In the discussion section of the<\/p>\n<p><a class=\"excerpt-more blog-excerpt\" href=\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/\">Read More&#8230;<\/a><\/p>\n","protected":false},"author":1,"featured_media":8654,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_newsletter_tier_id":0,"jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false}}},"categories":[80],"tags":[],"jetpack_publicize_connections":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v17.4 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Bivariate Analysis &amp; Leverage - Regression Case Study Example (Part 3) &ndash; YOU CANalytics |<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Bivariate Analysis &amp; Leverage - Regression Case Study Example (Part 3) &ndash; YOU CANalytics |\" \/>\n<meta property=\"og:description\" content=\"Welcome back to the\u00a0case study example for regression analysis where you are helping an investment firm make money through property price arbitrage. In the last two parts (Part 1 &amp; Part 2) you started with the univariate analysis to identify patterns in the data including missing data and outliers. In the discussion section of theRead More...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/\" \/>\n<meta property=\"og:site_name\" content=\"YOU CANalytics |\" \/>\n<meta property=\"article:author\" content=\"roopam\" \/>\n<meta property=\"article:published_time\" content=\"2016-09-04T05:51:47+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&#038;ssl=1\" \/>\n\t<meta property=\"og:image:width\" content=\"1156\" \/>\n\t<meta property=\"og:image:height\" content=\"720\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Roopam Upadhyay\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Organization\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\",\"name\":\"YOU CANalytics\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/\",\"sameAs\":[],\"logo\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#logo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120\",\"contentUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120\",\"width\":607,\"height\":120,\"caption\":\"YOU CANalytics\"},\"image\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#logo\"}},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#website\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/\",\"name\":\"YOU CANalytics |\",\"description\":\"Explore the Power of Data Science\",\"publisher\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/ucanalytics.com\/blogs\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#primaryimage\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&ssl=1\",\"width\":1156,\"height\":720},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#webpage\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/\",\"name\":\"Bivariate Analysis & Leverage - Regression Case Study Example (Part 3) &ndash; YOU CANalytics |\",\"isPartOf\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#primaryimage\"},\"datePublished\":\"2016-09-04T05:51:47+00:00\",\"dateModified\":\"2016-09-04T05:51:47+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/ucanalytics.com\/blogs\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Bivariate Analysis &#038; Leverage &#8211; Regression Case Study Example (Part 3)\"}]},{\"@type\":\"Article\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#webpage\"},\"author\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6\"},\"headline\":\"Bivariate Analysis &#038; Leverage &#8211; Regression Case Study Example (Part 3)\",\"datePublished\":\"2016-09-04T05:51:47+00:00\",\"dateModified\":\"2016-09-04T05:51:47+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#webpage\"},\"wordCount\":1294,\"commentCount\":6,\"publisher\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\"},\"image\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&ssl=1\",\"articleSection\":[\"Pricing Case Study Example\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#respond\"]}]},{\"@type\":\"Person\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6\",\"name\":\"Roopam Upadhyay\",\"image\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#personlogo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g\",\"caption\":\"Roopam Upadhyay\"},\"description\":\"This blog contains my personal views and thoughts on predictive Analytics and big data. - Roopam Upadhyay\",\"sameAs\":[\"roopam\"],\"url\":\"https:\/\/ucanalytics.com\/blogs\/author\/roopam\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Bivariate Analysis & Leverage - Regression Case Study Example (Part 3) &ndash; YOU CANalytics |","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/","og_locale":"en_US","og_type":"article","og_title":"Bivariate Analysis & Leverage - Regression Case Study Example (Part 3) &ndash; YOU CANalytics |","og_description":"Welcome back to the\u00a0case study example for regression analysis where you are helping an investment firm make money through property price arbitrage. In the last two parts (Part 1 &amp; Part 2) you started with the univariate analysis to identify patterns in the data including missing data and outliers. In the discussion section of theRead More...","og_url":"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/","og_site_name":"YOU CANalytics |","article_author":"roopam","article_published_time":"2016-09-04T05:51:47+00:00","og_image":[{"width":1156,"height":720,"url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&ssl=1","type":"image\/jpeg"}],"twitter_misc":{"Written by":"Roopam Upadhyay","Est. reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Organization","@id":"https:\/\/ucanalytics.com\/blogs\/#organization","name":"YOU CANalytics","url":"https:\/\/ucanalytics.com\/blogs\/","sameAs":[],"logo":{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/#logo","inLanguage":"en-US","url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120","contentUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120","width":607,"height":120,"caption":"YOU CANalytics"},"image":{"@id":"https:\/\/ucanalytics.com\/blogs\/#logo"}},{"@type":"WebSite","@id":"https:\/\/ucanalytics.com\/blogs\/#website","url":"https:\/\/ucanalytics.com\/blogs\/","name":"YOU CANalytics |","description":"Explore the Power of Data Science","publisher":{"@id":"https:\/\/ucanalytics.com\/blogs\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/ucanalytics.com\/blogs\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#primaryimage","inLanguage":"en-US","url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&ssl=1","contentUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&ssl=1","width":1156,"height":720},{"@type":"WebPage","@id":"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#webpage","url":"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/","name":"Bivariate Analysis & Leverage - Regression Case Study Example (Part 3) &ndash; YOU CANalytics |","isPartOf":{"@id":"https:\/\/ucanalytics.com\/blogs\/#website"},"primaryImageOfPage":{"@id":"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#primaryimage"},"datePublished":"2016-09-04T05:51:47+00:00","dateModified":"2016-09-04T05:51:47+00:00","breadcrumb":{"@id":"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/ucanalytics.com\/blogs\/"},{"@type":"ListItem","position":2,"name":"Bivariate Analysis &#038; Leverage &#8211; Regression Case Study Example (Part 3)"}]},{"@type":"Article","@id":"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#article","isPartOf":{"@id":"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#webpage"},"author":{"@id":"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6"},"headline":"Bivariate Analysis &#038; Leverage &#8211; Regression Case Study Example (Part 3)","datePublished":"2016-09-04T05:51:47+00:00","dateModified":"2016-09-04T05:51:47+00:00","mainEntityOfPage":{"@id":"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#webpage"},"wordCount":1294,"commentCount":6,"publisher":{"@id":"https:\/\/ucanalytics.com\/blogs\/#organization"},"image":{"@id":"https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&ssl=1","articleSection":["Pricing Case Study Example"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/ucanalytics.com\/blogs\/bivariate-analysis-leverage-regression-case-study-example-part-3\/#respond"]}]},{"@type":"Person","@id":"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6","name":"Roopam Upadhyay","image":{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/#personlogo","inLanguage":"en-US","url":"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g","caption":"Roopam Upadhyay"},"description":"This blog contains my personal views and thoughts on predictive Analytics and big data. - Roopam Upadhyay","sameAs":["roopam"],"url":"https:\/\/ucanalytics.com\/blogs\/author\/roopam\/"}]}},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-Case-Study-Example.jpg?fit=1156%2C720&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p3L0jT-2fv","jetpack-related-posts":[{"id":8388,"url":"https:\/\/ucanalytics.com\/blogs\/regression-analysis-pricing-case-study-example-part-1\/","url_meta":{"origin":8649,"position":0},"title":"Regression Analysis &#8211; Pricing Case Study Example (Part 1)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"How to figure out if you are paying the right price for the property you are about to purchase? Welcome to a new data science case study example on YOU CANalytics to identify the right housing price. Pricing is a highly important and\u00a0specialized function for any business. A right price\u2026","rel":"","context":"In &quot;Pricing Case Study Example&quot;","block_context":{"text":"Pricing Case Study Example","link":"https:\/\/ucanalytics.com\/blogs\/category\/pricing-case-study-example\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/07\/Connect-the-Dots.jpg?fit=397%2C603&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":8488,"url":"https:\/\/ucanalytics.com\/blogs\/data-preparation-regression-pricing-case-study-example-part-2\/","url_meta":{"origin":8649,"position":1},"title":"Data Preparation for Regression &#8211; Pricing Case Study Example (Part 2)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"In the last post we had started a case study example for regression analysis to help an investment firm make money through property price arbitrage\u00a0(read part 1 :\u00a0regression case study example).\u00a0This is an interactive case study example and required your help to move forward. These are some of your observations\u2026","rel":"","context":"In &quot;Analytics Labs&quot;","block_context":{"text":"Analytics Labs","link":"https:\/\/ucanalytics.com\/blogs\/category\/analytics-labs\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/Regression-analysis.jpg?fit=448%2C528&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":9018,"url":"https:\/\/ucanalytics.com\/blogs\/step-step-regression-models-pricing-case-study-example-part-5\/","url_meta":{"origin":8649,"position":2},"title":"Step by Step Regression Modeling Using Principal Component Analysis &#8211; Case Study Example (Part 5)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"This is a continuation of our case study example to estimate property pricing. In this part, you will learn nuances of regression modeling by building three different regression models and compare their results.\u00a0We will also use results of the principal component analysis, discussed in the last part, to develop a\u2026","rel":"","context":"In &quot;Pricing Case Study Example&quot;","block_context":{"text":"Pricing Case Study Example","link":"https:\/\/ucanalytics.com\/blogs\/category\/pricing-case-study-example\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/Sumo-and-Regression-Model.jpg?fit=918%2C384&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/Sumo-and-Regression-Model.jpg?fit=918%2C384&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/Sumo-and-Regression-Model.jpg?fit=918%2C384&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/09\/Sumo-and-Regression-Model.jpg?fit=918%2C384&ssl=1&resize=700%2C400 2x"},"classes":[]},{"id":5782,"url":"https:\/\/ucanalytics.com\/blogs\/how-effective-is-my-marketing-budget-regression-with-arima-errors-arimax-case-study-example-part-5\/","url_meta":{"origin":8649,"position":3},"title":"How Effective is My Marketing Budget? &#8211; Regression with ARIMA Errors, Case Study Example (Part 5)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"So far we have covered the following topics in this case study example\u00a0on time series forecasting and ARIMA models: Part 1\u00a0: Introduction to time series modeling & forecasting Part 2: Time series decomposition to decipher patterns and trends before forecasting Part 3: Introduction to ARIMA models for forecasting Part 4:\u2026","rel":"","context":"In &quot;Manufacturing Case Study Example&quot;","block_context":{"text":"Manufacturing Case Study Example","link":"https:\/\/ucanalytics.com\/blogs\/category\/manufacturing-case-study-example\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/07\/rope-walk.jpg?fit=480%2C640&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":3973,"url":"https:\/\/ucanalytics.com\/blogs\/model-selection-retail-case-study-example-part-7\/","url_meta":{"origin":8649,"position":4},"title":"Model Selection &#8211; Retail Case Study Example (Part 7)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Model Selection This is a continuation of our retail case study example for campaign and marketing analytics. In the previous two parts, we discussed a couple of decision tree algorithms (CART and C4.5)\u00a0for classification. Recall a previous case study example on\u00a0banking and risk management where we discussed logistic regression\u00a0which is\u2026","rel":"","context":"In &quot;Marketing Analytics&quot;","block_context":{"text":"Marketing Analytics","link":"https:\/\/ucanalytics.com\/blogs\/category\/marketing-analytics\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1200%2C1029&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1200%2C1029&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1200%2C1029&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1200%2C1029&ssl=1&resize=700%2C400 2x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/09\/photo.jpg?fit=1200%2C1029&ssl=1&resize=1050%2C600 3x"},"classes":[]},{"id":8700,"url":"https:\/\/ucanalytics.com\/blogs\/principal-component-analysis-step-step-guide-r-regression-case-study-example-part-4\/","url_meta":{"origin":8649,"position":5},"title":"Principal Component Analysis: Step-by-Step Guide using R- Regression Case Study Example (Part 4)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Principal component analysis is a wonderful technique for data reduction without losing critical information. Yes, you could reduce the size of 2GB data to a few MBs without losing a lot of information. This is like a mp3 version of music. Many, including some experienced data scientists, find principal component\u2026","rel":"","context":"In &quot;Pricing Case Study Example&quot;","block_context":{"text":"Pricing Case Study Example","link":"https:\/\/ucanalytics.com\/blogs\/category\/pricing-case-study-example\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2016\/08\/principal-component-analysis-Death-Profile.jpg?fit=495%2C329&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]}],"_links":{"self":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts\/8649"}],"collection":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/comments?post=8649"}],"version-history":[{"count":0,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts\/8649\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/media\/8654"}],"wp:attachment":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/media?parent=8649"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/categories?post=8649"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/tags?post=8649"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}