{"id":3499,"date":"2014-07-19T21:50:22","date_gmt":"2014-07-19T16:20:22","guid":{"rendered":"http:\/\/ucanalytics.com\/blogs\/?p=3499"},"modified":"2016-10-13T14:36:11","modified_gmt":"2016-10-13T09:06:11","slug":"in-conversation-with-michael-berthold-knime","status":"publish","type":"post","link":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/","title":{"rendered":"In Conversation with Michael Berthold &#8211; Founder KNIME"},"content":{"rendered":"<hr \/>\n<div id=\"attachment_3526\" style=\"width: 283px\" class=\"wp-caption alignright\"><a href=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg\"><img aria-describedby=\"caption-attachment-3526\" data-attachment-id=\"3526\" data-permalink=\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/knime-2\/\" data-orig-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?fit=422%2C198&amp;ssl=1\" data-orig-size=\"422,198\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;}\" data-image-title=\"Michael Berthold &#8211; Founder KNIME\" data-image-description=\"\" data-image-caption=\"&lt;p&gt;Michael Berthold &#8211; Founder KNIME&lt;\/p&gt;\n\" data-medium-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?fit=300%2C140&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?fit=422%2C198&amp;ssl=1\" decoding=\"async\" loading=\"lazy\" class=\" wp-image-3526\" 
src=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?resize=273%2C128\" alt=\"Michael Berthold - Founder KNIME\" width=\"273\" height=\"128\" srcset=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?w=422&amp;ssl=1 422w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?resize=250%2C117&amp;ssl=1 250w, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?resize=300%2C140&amp;ssl=1 300w\" sizes=\"(max-width: 273px) 100vw, 273px\" data-recalc-dims=\"1\" \/><\/a><p id=\"caption-attachment-3526\" class=\"wp-caption-text\">Michael Berthold &#8211; Founder KNIME<\/p><\/div>\n<h2><span style=\"color: #3366ff;\">KNIME<\/span><\/h2>\n<p>I am a huge fan of open source software because I believe it levels the playing field and promotes creativity. I am always on the lookout for open source software for predictive analytics and data mining. KNIME,\u00a0the Konstanz Information Miner, is one of the best open source tools for data mining, analytics, reporting and data integration. KNIME has a wonderful, easy-to-use graphical user interface\u00a0that competes with commercial software like SAS and IBM SPSS. KNIME has most of the state-of-the-art data mining and statistical algorithms built in. Additionally, it provides features to integrate other open source software such as Weka and R.<\/p>\n<p>Today we have the president and one of the founders of KNIME,\u00a0Prof. Dr. Michael Berthold, with us at YOU CANalytics. He will talk to us about different facets of KNIME, including its inception, development process, use cases, and future plans. \u00a0Michael has promised us that KNIME will <span style=\"text-decoration: underline;\">always<\/span> be open source software! So\u00a0I recommend that after reading this interview you download and try KNIME for yourself without worrying about the license fee. 
The following is my conversation with Michael:<\/p>\n<hr \/>\n<p><span style=\"color: #3366ff;\"><strong>Roopam Upadhyay:<\/strong><\/span> Hi Michael, Thanks for talking to YOU CANalytics! I have read that you are the author of the first line of code for KNIME.<\/p>\n<blockquote><p><strong><span style=\"color: #3366ff;\">Michael Berthold:<\/span>\u00a0<\/strong>Well, let me first say that I am _arguably_ the author of the first line of KNIME code \u2013 there are three other founders who claim the same. Bernd claims that he may not have written the first line but that his first line is at least still part of the current code base.<\/p><\/blockquote>\n<p><strong><span style=\"color: #3366ff;\">Roopam:<\/span> <\/strong>It is slightly more than 10 years since you started working on KNIME in January 2004. This was just a few months after you joined the University of Konstanz. What motivated you to work on KNIME?<\/p>\n<blockquote><p><strong><span style=\"color: #3366ff;\"><strong>Michael Berthold<\/strong>:<\/span>\u00a0<\/strong>We decided to start KNIME because three of us had worked together at a software company where we actually worked on a similar type of architecture \u2013 I still remember heated discussions in 2002\/2003 when we argued over moving data in buckets or tables or\u2026 So when we started the group in Konstanz, we knew what we wanted to build and we actually had done what many software projects sorely miss: build a prototype, trash it, and start from scratch. KNIME, in a way, is really the second generation of its kind already.<\/p>\n<p>It is interesting to go back and check what we aimed for when we started with KNIME: we wanted to build an open environment that was intuitive, easily extensible, and application agnostic. 
We wanted to build a professional and scalable architecture from the start, too, because some of the applications that drove the development, such as pharmaceutical data analysis, were already doing \u201cbig data\u201d before that term became hype. So, in contrast to many other open source projects, KNIME is _not_ a commercialized version of a PhD student\u2019s project. It was meant to be professional software from day one. It was also clear that the platform needed to be open source so that others could deploy their cool algorithms using it as well.<\/p><\/blockquote>\n<p><span style=\"color: #3366ff;\"><strong>Roopam: <\/strong><\/span>These 10 years must have been a great journey; what motivates you now to work on KNIME?<\/p>\n<blockquote><p><strong><span style=\"color: #3366ff;\"><strong><strong>Michael Berthold<\/strong>:<\/strong><\/span>\u00a0<\/strong>It\u2019s fun to see how \u2013 even without excessive\/aggressive marketing \u2013 lots of people use and love KNIME. Actually we are quite proud of the latter: in surveys about analytics tools, KNIME users are often the most satisfied ones. That\u2019s what still drives us: talking to happy users, sometimes meeting people who\u2019ve been using KNIME for years and are doing truly powerful things.<\/p><\/blockquote>\n<p><span style=\"color: #3366ff;\"><strong>Roopam:<\/strong><\/span>\u00a0Have things changed since you started?<\/p>\n<blockquote><p><strong><strong><span style=\"color: #3366ff;\"><strong><strong>Michael Berthold<\/strong>:<\/strong><\/span><\/strong>\u00a0<\/strong>Have things changed? Absolutely and in many ways. We now have a professional organization in place, next to the research group in Konstanz still feeding cool new technology into the open source platform. But the key vision is still the same: we are building an integrative, transparent, flexible, collaborative and open platform. 
And through its openness it is actually more powerful than many of the other, closed solutions out there.<\/p><\/blockquote>\n<p><span style=\"color: #0000ff;\"><strong><span style=\"color: #3366ff;\">Roopam:<\/span>\u00a0<\/strong><\/span>KNIME has a great graphical user interface that is really easy to use, though it still requires a little time to get used to. What learning resources can you suggest for new users of KNIME to get used to the interface and the incredible list of data-mining algorithms it has?<\/p>\n<blockquote><p><strong><span style=\"color: #3366ff;\"><strong><strong>Michael Berthold<\/strong>:<\/strong><\/span>\u00a0<\/strong>We have a lot of resources out there \u2013 our <a href=\"http:\/\/www.knime.com\/learning-hub\">learning hub<\/a> is the best place to get started with tutorials, white papers, YouTube videos, and more. And KNIME connects directly to our example workflow server, which hosts hundreds of nice examples as well. Then there are the KNIME Press books, written by Rosaria Silipo, one of them co-authored by Mike Mazanetz; both are long-time KNIME experts. And last, but definitely not least: use our forum! The KNIME community is super active and very helpful 24\/7!<\/p><\/blockquote>\n<p><span style=\"color: #3366ff;\"><strong>Roopam: <\/strong><\/span>Could you describe one of your favorite data-mining projects on KNIME, including the problem statement, analysis, and insights?<\/p>\n<blockquote><p><span style=\"color: #3366ff;\"><strong><strong>Michael Berthold<\/strong>:<\/strong><\/span>\u00a0Oh, there are many \u2013 but the coolest one was probably a presentation at our User Group Meeting in 2013: someone had analyzed hundreds of KNIME workflows across the company and pumped them through KNIME\u2019s frequent subgraph mining algorithm to identify pieces of workflows that appear often. It was using lots of KNIME technology to mine KNIME \u2013 and it actually had quite a few surprises for us, too. 
We expanded a number of nodes to include functionality that people seemed to often use together.<\/p>\n<p>There are many other real-world examples out there, too \u2013 I am not quite sure which ones to pick. The ones I like are those where people use the integrative powers of KNIME, pulling together various different data resources, integrating them in KNIME, and then running joint analyses. Maybe three examples can serve to give a bit of an impression of how widely KNIME is being used:<\/p>\n<p>Triggered by a large telco that uses KNIME for online discussion analysis, we built our own workflow to analyse the KNIME forum, combining text, network, and \u201cclassical\u201d data mining to find out who influences the forum. It\u2019s interesting (although maybe not surprising) to see that positive sentiment tends to result in longer, more in-depth discussions, and for us it was really nice to see that we now have many more external experts being active on our forum than just a few years ago. And that\u2019s not because we are not posting\/replying anymore but because the KNIME community is really taking over the forum.<\/p>\n<p>A second, more machine-learning-oriented example was presented by a group\u00a0at our last User Group Meeting \u2013 they are using KNIME to train literally thousands of models for price prediction. The level of automation is helped by infrastructure built by one of our partners, Dymatrix. Their Dynamine environment connects to the KNIME Server and handles automatic training and re-training and, of course, continuous evaluation of those models.<\/p>\n<p>On the complete opposite end of the spectrum are applications such as the ones by a local bank \u2013 all they use KNIME for is to integrate their diverse data sources in a well-documented, reproducible way. They used to do that in Excel once a quarter and spent days every month manually putting all of this together. Now they literally click on a button on the KNIME WebPortal and the final report is ready. 
Even we find it interesting to see how often KNIME is being used to replace pretty sophisticated Excel spreadsheets that are hard to understand or to use in a reproducible way.<\/p>\n<p>There are many more interesting applications in predictive maintenance, finance, health care, customer intelligence, pharma, games, online shopping \u2013 even casinos are using KNIME to better understand their customers. We have a number of white papers on our web site that describe some of these applications (or similar ones where the data is not publicly available) in much more detail (<a href=\"http:\/\/www.knime.com\/applications\">link<\/a>). For all white papers you can download the corresponding workflow and modify it for your own data\/analyses.<\/p><\/blockquote>\n<p><strong><span style=\"color: #3366ff;\">Roopam:<\/span>\u00a0<\/strong>KNIME has an exhaustive list of data management, statistics, and data mining algorithms. What is your process for research, selection, and integration of new algorithms into KNIME? Also, how often do you modify the existing program?<\/p>\n<blockquote><p><strong><span style=\"color: #3366ff;\"><strong><strong>Michael Berthold<\/strong>:<\/strong><\/span>\u00a0<\/strong>We include what we believe are standard\/often used algorithms and constantly grow that list \u2013 and we also pay attention to what people are using via some of our integrations (most notably R or Weka). 
If we see some functionality pop up more frequently, we consider adding it to the library of native KNIME nodes \u2013 those scale better (KNIME nodes can, but are not forced to, run in memory) and we can process more complex data types.<\/p>\n<p>But we do not aim to replace R (or Weka) \u2013 quite a few of the more bleeding-edge algorithms are available there and it\u2019s a great asset for people to be able to reach out to those libraries and use whatever they want to use.<\/p>\n<p>Note that we never modify algorithms in place \u2013 if we do make changes to an algorithm that alter its behavior, we deprecate the previous node so that existing workflows still use that version and produce the same results as before. That way, KNIME workflows stay 100% backwards compatible and only newly created workflows make use of those modifications. Users are, of course, free to upgrade their workflows to the newer implementations.<\/p><\/blockquote>\n<p><span style=\"color: #3366ff;\"><strong>Roopam:<\/strong><\/span>\u00a0I think it\u2019s great that the entire \u2018Open Source\u2019 community is leveraging each other\u2019s strengths. Text mining and natural language processing (NLP) are quickly becoming essential tools in the data scientist\u2019s toolkit. Could you describe a few applications of text mining and NLP and explain how KNIME assists in solving these business problems?<\/p>\n<blockquote><p><span style=\"color: #3366ff;\"><strong><strong><strong>Michael Berthold<\/strong>:<\/strong>\u00a0<\/strong><\/span>We have integrations with a few packages for this type of data; the Stanford package comes to mind, and the Palladian extensions. So you can combine textual data with other sources right in KNIME. The forum analysis that I mentioned above is such an example: it provides a sentiment score on posts and extracts a network of authors from their interactions. 
In the bioinformatics domain, people have used the text processing extensions to look at research articles and other types of scientific information. And one of our partners is using these tools to do language detection for automatic email sorting.<\/p><\/blockquote>\n<p><span style=\"color: #3366ff;\"><strong>Roopam:<\/strong><\/span>\u00a0Personally, I am a fan of KNIME and occasionally try to replicate analyses I have done with other software in KNIME for fun. It works great on most occasions, but there are still times when KNIME struggles. What is a good forum for analysts to reach out to you? Is there a suggested configuration for the computer (personal machines and servers) for working with KNIME seamlessly?<\/p>\n<blockquote><p><span style=\"color: #3366ff;\"><strong><strong>Michael Berthold<\/strong>:<\/strong><\/span>\u00a0Let us know about it! We are always curious to hear about problems people encounter. We do have a very extensive test setup but there are always types of data and weird setups that cause problems and we are eager to fix that asap \u2013 but we need to know about it, of course.<\/p><\/blockquote>\n<p><strong><span style=\"color: #3366ff;\">Roopam:<\/span>\u00a0<\/strong>KNIME provides extension nodes for other open source packages such as R and Weka. What is the purpose of these extensions? And what is the best way for a new user to get comfortable with these extensions?<\/p>\n<blockquote><p><span style=\"color: #3366ff;\"><strong><strong>Michael Berthold<\/strong>:<\/strong>\u00a0<\/span>See also my comments above \u2013 we don\u2019t believe in closed analytics packages. No single vendor can really claim to provide all of the functionality out there. If at all, then the R community can make that claim \u2013 but it is a community effort and, let\u2019s face it, if you aren\u2019t a programmer it\u2019s hard to use! 
We believe in the power of R for analytics and in the power of an intuitive and easy-to-use environment to get analytics onto people\u2019s desktops. So being able to do lots of ETL and analytics in KNIME and being able to reach out to more advanced (R) or specialized (Weka) packages for other routines when needed gives you both: simplicity and transparency, and a connection to bleeding-edge methods when you want it. Note that in KNIME you can even wrap R scripts so nicely that your non-programming neighbor never needs to touch R code\u2026<\/p><\/blockquote>\n<p><strong><span style=\"color: #3366ff;\">Roopam:<\/span>\u00a0<\/strong>About a year ago RapidMiner, another open source product, decided to cease its open source development and launched the latest version with a commercial license. Personally, I would be really sad if KNIME went the same way, though I understand there is a huge cost and effort involved in software development which needs to be compensated. Does KNIME also have any such plans?<\/p>\n<blockquote><p><span style=\"color: #3366ff;\"><strong><strong>Michael Berthold<\/strong>:<\/strong><\/span>\u00a0KNIME will always be open source; we believe strongly in the value of our community \u2013 see also above. KNIME is powerful as-is but it\u2019s really powerful when you reach out to the cool extensions provided by the community or our partners. This is a real asset and we do not plan to abandon it.<\/p>\n<p>Plus I personally believe that the platform for data (analysis) has become a commodity \u2013 it\u2019s more like an operating system that you need in order to combine the data and tools that you want to use. You cannot afford to be limited by a vendor\u2019s choice of tools to really trigger data-driven innovation \u2013 that\u2019s also why we coined the \u201copen for innovation\u201d tagline. 
In order to be open for innovation you need to have an open platform that allows you to do what _you_ want to do, not what your vendor thinks you should be able to do.<\/p><\/blockquote>\n<p><span style=\"color: #3366ff;\"><strong>Roopam:<\/strong><\/span> Are there creative ways in\u00a0which the business analytics community can support KNIME (through finance, ideas and work)?<\/p>\n<blockquote><p><span style=\"color: #3366ff;\"><strong><strong>Michael Berthold<\/strong>:<\/strong><\/span>\u00a0Well, you can always donate extensions adding to the wealth of analytics functionality in the KNIME ecosystem. We won\u2019t return checks either, but you can help us most by spreading the word. If you use and like KNIME \u2013 talk about it!<\/p><\/blockquote>\n<p><span style=\"color: #3366ff;\"><strong>Roopam:<\/strong><\/span>\u00a0What are some of the exciting new things the team at KNIME is working on? What can users expect in the near future?<\/p>\n<blockquote><p><span style=\"color: #3366ff;\"><strong><strong>Michael Berthold<\/strong>:<\/strong>\u00a0<\/span>We are working on a lot of new things \u2013 with KNIME v2.10 (July 2014) we have started to move over the entire view framework to JavaScript so that we can deploy views on the web, too. We have also added more support for Big Data \/ Hadoop (that\u2019s actually work sponsored by Siemens!) and we will continue to grow the analytics capabilities of KNIME further. And, if you have checked out our web site recently, we have also evolved our identity a bit\u2026<\/p><\/blockquote>\n<p><span style=\"color: #3366ff;\"><strong>Roopam:<\/strong>\u00a0<\/span>Thanks so much, Michael, for talking to us and sharing your ideas and views. 
I am really excited about the latest release of\u00a0KNIME!<\/p>\n<pre><span style=\"font-family: verdana, geneva;\"><span style=\"font-size: 12px; line-height: 18px;\">- Download KNIME from this <\/span><a style=\"font-family: Consolas, Monaco, monospace; font-size: 12px; line-height: 18px;\" href=\"http:\/\/www.knime.com\/\">link<\/a>\r\n<span style=\"font-size: 12px; line-height: 18px;\">- Read an earlier post\u00a0about comparing <\/span><a style=\"font-family: Consolas, Monaco, monospace; font-size: 12px; line-height: 18px;\" href=\"http:\/\/ucanalytics.com\/blogs\/choose-your-data-mining-statistics-software\/\">statistical &amp; data mining software<\/a><\/span><\/pre>\n","protected":false},"excerpt":{"rendered":"<p>KNIME I am a huge fan of open source software because I believe they democratize the playing field and promote creativity. I am always on a lookout for open source software for predictive analytics and data mining. KNIME,\u00a0the Konstanz Information Miner, is one of the best open source software for data mining, analytics, reporting and<\/p>\n<p><a class=\"excerpt-more blog-excerpt\" href=\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/\">Read More&#8230;<\/a><\/p>\n","protected":false},"author":1,"featured_media":3526,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_newsletter_tier_id":0,"jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false}}},"categories":[60],"tags":[6,72,10],"jetpack_publicize_connections":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v17.4 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Interview with Michael Berthold - Founder 
KNIME<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Interview with Michael Berthold - Founder KNIME\" \/>\n<meta property=\"og:description\" content=\"KNIME I am a huge fan of open source software because I believe they democratize the playing field and promote creativity. I am always on a lookout for open source software for predictive analytics and data mining. KNIME,\u00a0the Konstanz Information Miner, is one of the best open source software for data mining, analytics, reporting andRead More...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/\" \/>\n<meta property=\"og:site_name\" content=\"YOU CANalytics |\" \/>\n<meta property=\"article:author\" content=\"roopam\" \/>\n<meta property=\"article:published_time\" content=\"2014-07-19T16:20:22+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2016-10-13T09:06:11+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?fit=422%2C198&#038;ssl=1\" \/>\n\t<meta property=\"og:image:width\" content=\"422\" \/>\n\t<meta property=\"og:image:height\" content=\"198\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Roopam Upadhyay\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"13 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Organization\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\",\"name\":\"YOU CANalytics\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/\",\"sameAs\":[],\"logo\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#logo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120\",\"contentUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120\",\"width\":607,\"height\":120,\"caption\":\"YOU CANalytics\"},\"image\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#logo\"}},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#website\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/\",\"name\":\"YOU CANalytics |\",\"description\":\"Explore the Power of Data Science\",\"publisher\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/ucanalytics.com\/blogs\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#primaryimage\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?fit=422%2C198&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?fit=422%2C198&ssl=1\",\"width\":422,\"height\":198,\"caption\":\"Micheal Berthold - Founder 
KNIME\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#webpage\",\"url\":\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/\",\"name\":\"Interview with Michael Berthold - Founder KNIME\",\"isPartOf\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#primaryimage\"},\"datePublished\":\"2014-07-19T16:20:22+00:00\",\"dateModified\":\"2016-10-13T09:06:11+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/ucanalytics.com\/blogs\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"In Conversation with Michael Berthold &#8211; Founder KNIME\"}]},{\"@type\":\"Article\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#webpage\"},\"author\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6\"},\"headline\":\"In Conversation with Michael Berthold &#8211; Founder 
KNIME\",\"datePublished\":\"2014-07-19T16:20:22+00:00\",\"dateModified\":\"2016-10-13T09:06:11+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#webpage\"},\"wordCount\":2494,\"commentCount\":3,\"publisher\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#organization\"},\"image\":{\"@id\":\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?fit=422%2C198&ssl=1\",\"keywords\":[\"Predictive Analytics\",\"Retail Analytics\",\"Roopam Upadhyay\"],\"articleSection\":[\"Events &amp; Interviews\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#respond\"]}]},{\"@type\":\"Person\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6\",\"name\":\"Roopam Upadhyay\",\"image\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/ucanalytics.com\/blogs\/#personlogo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g\",\"caption\":\"Roopam Upadhyay\"},\"description\":\"This blog contains my personal views and thoughts on predictive Analytics and big data. - Roopam Upadhyay\",\"sameAs\":[\"roopam\"],\"url\":\"https:\/\/ucanalytics.com\/blogs\/author\/roopam\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Interview with Michael Berthold - Founder KNIME","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/","og_locale":"en_US","og_type":"article","og_title":"Interview with Michael Berthold - Founder KNIME","og_description":"KNIME I am a huge fan of open source software because I believe they democratize the playing field and promote creativity. I am always on a lookout for open source software for predictive analytics and data mining. KNIME,\u00a0the Konstanz Information Miner, is one of the best open source software for data mining, analytics, reporting andRead More...","og_url":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/","og_site_name":"YOU CANalytics |","article_author":"roopam","article_published_time":"2014-07-19T16:20:22+00:00","article_modified_time":"2016-10-13T09:06:11+00:00","og_image":[{"width":422,"height":198,"url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?fit=422%2C198&ssl=1","type":"image\/jpeg"}],"twitter_misc":{"Written by":"Roopam Upadhyay","Est. 
reading time":"13 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Organization","@id":"https:\/\/ucanalytics.com\/blogs\/#organization","name":"YOU CANalytics","url":"https:\/\/ucanalytics.com\/blogs\/","sameAs":[],"logo":{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/#logo","inLanguage":"en-US","url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120","contentUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/11\/YOU-CANalytics-Logo.jpg?fit=607%2C120","width":607,"height":120,"caption":"YOU CANalytics"},"image":{"@id":"https:\/\/ucanalytics.com\/blogs\/#logo"}},{"@type":"WebSite","@id":"https:\/\/ucanalytics.com\/blogs\/#website","url":"https:\/\/ucanalytics.com\/blogs\/","name":"YOU CANalytics |","description":"Explore the Power of Data Science","publisher":{"@id":"https:\/\/ucanalytics.com\/blogs\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/ucanalytics.com\/blogs\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#primaryimage","inLanguage":"en-US","url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?fit=422%2C198&ssl=1","contentUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?fit=422%2C198&ssl=1","width":422,"height":198,"caption":"Micheal Berthold - Founder KNIME"},{"@type":"WebPage","@id":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#webpage","url":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/","name":"Interview with Michael Berthold - Founder 
KNIME","isPartOf":{"@id":"https:\/\/ucanalytics.com\/blogs\/#website"},"primaryImageOfPage":{"@id":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#primaryimage"},"datePublished":"2014-07-19T16:20:22+00:00","dateModified":"2016-10-13T09:06:11+00:00","breadcrumb":{"@id":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/ucanalytics.com\/blogs\/"},{"@type":"ListItem","position":2,"name":"In Conversation with Michael Berthold &#8211; Founder KNIME"}]},{"@type":"Article","@id":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#article","isPartOf":{"@id":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#webpage"},"author":{"@id":"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6"},"headline":"In Conversation with Michael Berthold &#8211; Founder KNIME","datePublished":"2014-07-19T16:20:22+00:00","dateModified":"2016-10-13T09:06:11+00:00","mainEntityOfPage":{"@id":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#webpage"},"wordCount":2494,"commentCount":3,"publisher":{"@id":"https:\/\/ucanalytics.com\/blogs\/#organization"},"image":{"@id":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?fit=422%2C198&ssl=1","keywords":["Predictive Analytics","Retail Analytics","Roopam Upadhyay"],"articleSection":["Events &amp; 
Interviews"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/ucanalytics.com\/blogs\/in-conversation-with-michael-berthold-knime\/#respond"]}]},{"@type":"Person","@id":"https:\/\/ucanalytics.com\/blogs\/#\/schema\/person\/55961a1cea272ecdf290cb387be069b6","name":"Roopam Upadhyay","image":{"@type":"ImageObject","@id":"https:\/\/ucanalytics.com\/blogs\/#personlogo","inLanguage":"en-US","url":"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/dd1aa0b0e813f7639800bcfad6a554f1?s=96&d=mm&r=g","caption":"Roopam Upadhyay"},"description":"This blog contains my personal views and thoughts on predictive Analytics and big data. - Roopam Upadhyay","sameAs":["roopam"],"url":"https:\/\/ucanalytics.com\/blogs\/author\/roopam\/"}]}},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/07\/KNIME.jpg?fit=422%2C198&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p3L0jT-Ur","jetpack-related-posts":[{"id":2577,"url":"https:\/\/ucanalytics.com\/blogs\/choose-your-data-mining-statistics-software\/","url_meta":{"origin":3499,"position":0},"title":"Choose Your Data Mining &#038; Statistics Software \/ Language","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"One of the crucial decisions while doing data analysis is an appropriate choice of statistics software and language. In this article, I am going to analyze and help you choose the right data mining and statistics software for your purpose. 
I have used the following 12 software\u00a0 \/\u00a0 languages (tools)\u2026","rel":"","context":"In &quot;Analytics Tips and Tricks&quot;","block_context":{"text":"Analytics Tips and Tricks","link":"https:\/\/ucanalytics.com\/blogs\/category\/analytics-tips\/"},"img":{"alt_text":"Slide3","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/04\/Slide3.jpg?resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/04\/Slide3.jpg?resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2014\/04\/Slide3.jpg?resize=525%2C300 1.5x"},"classes":[]},{"id":2783,"url":"https:\/\/ucanalytics.com\/blogs\/in-conversation-with-eric-siegel-author-predictive-analytics\/","url_meta":{"origin":3499,"position":1},"title":"In Conversation with Eric Siegel: Author &#8216;Predictive Analytics&#8217;","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"In Conversation with.. Today we are starting a new series on YOU CANalytics called 'in conversation with'. In this series we will talk to the leaders and experts of predictive analytics and big data to gain deeper insight into the field. Dr. 
Eric Siegel Our first guest for the series\u2026","rel":"","context":"In &quot;Events &amp; Interviews&quot;","block_context":{"text":"Events &amp; Interviews","link":"https:\/\/ucanalytics.com\/blogs\/category\/events-and-interviews\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/12\/Slide15.jpg?fit=290%2C210&ssl=1&resize=350%2C200","width":350,"height":200},"classes":[]},{"id":55,"url":"https:\/\/ucanalytics.com\/blogs\/credit-scorecards-advanced-analytics-part-4\/","url_meta":{"origin":3499,"position":2},"title":"Credit Scorecards &#8211; Advanced Analytics (part 4 of 7)","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Modeling in Advanced Analytics The room, full of Analysts, erupts with a loud round of laughter when a young business analyst narrates to us an incident from his recent trip back home. A distant aunt inquired about his new profession. His response \u2013 I am into modeling. She got all\u2026","rel":"","context":"In &quot;Credit Risk Analytics Series&quot;","block_context":{"text":"Credit Risk Analytics Series","link":"https:\/\/ucanalytics.com\/blogs\/category\/risk-analytics\/credit-risk-analytics-series\/"},"img":{"alt_text":"4. 
Scorecard Simple","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/4.-Scorecard-Simple1.jpg?resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/4.-Scorecard-Simple1.jpg?resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/4.-Scorecard-Simple1.jpg?resize=525%2C300 1.5x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/4.-Scorecard-Simple1.jpg?resize=700%2C400 2x"},"classes":[]},{"id":4590,"url":"https:\/\/ucanalytics.com\/blogs\/master-art-data-preparation-data-science\/","url_meta":{"origin":3499,"position":3},"title":"Master the Art of Data Preparation for Data Science","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Every data scientist knows that in any business analytics and data science exercise 70-80% of the time is consumed in data preparation and data preprocessing. This is usually considered a drudgery in\u00a0comparison to the actual statistical modeling, machine learning, and business insights part. 
However, every good data scientist understands that\u2026","rel":"","context":"In &quot;Analytics Tips and Tricks&quot;","block_context":{"text":"Analytics Tips and Tricks","link":"https:\/\/ucanalytics.com\/blogs\/category\/analytics-tips\/"},"img":{"alt_text":"A Simple Schematic of Banking Datasets","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/01\/Banking-Databases.jpg?resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/01\/Banking-Databases.jpg?resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/01\/Banking-Databases.jpg?resize=525%2C300 1.5x"},"classes":[]},{"id":2374,"url":"https:\/\/ucanalytics.com\/blogs\/learn-r-12-books-and-online-resources\/","url_meta":{"origin":3499,"position":4},"title":"Learn R : 12 Free Books and Online Resources","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"Please read the disclaimer about the Free PDF Books in this article at the bottom R, an open-source statistical and data mining programming language, is slowly but surely catching up in its race with commercial software like SAS & SPSS. 
I believe R will eventually replace SAS as the language\u2026","rel":"","context":"In &quot;Analytics Book Club&quot;","block_context":{"text":"Analytics Book Club","link":"https:\/\/ucanalytics.com\/blogs\/category\/analytics-book-club\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/5-Swordsmith1.jpg?fit=768%2C1024&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/5-Swordsmith1.jpg?fit=768%2C1024&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/5-Swordsmith1.jpg?fit=768%2C1024&ssl=1&resize=525%2C300 1.5x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2013\/07\/5-Swordsmith1.jpg?fit=768%2C1024&ssl=1&resize=700%2C400 2x"},"classes":[]},{"id":4632,"url":"https:\/\/ucanalytics.com\/blogs\/customer-churn-data-science-to-understand-customer-behaviour\/","url_meta":{"origin":3499,"position":5},"title":"Customer Churn &#8211; Data Science to Understand Customer Behaviour","author":"Roopam Upadhyay","date":false,"format":false,"excerpt":"When they leave, they leave footprints\u00a0for others to follow \u00a0 Customer Churn A couple of months ago, I closed down my savings\u00a0account with a leading bank in India. I had this account for more than a decade. 
Since I have worked with the banking and financial services industry, I know\u2026","rel":"","context":"In &quot;Marketing Analytics&quot;","block_context":{"text":"Marketing Analytics","link":"https:\/\/ucanalytics.com\/blogs\/category\/marketing-analytics\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/02\/Customer-Churn.jpg?fit=640%2C448&ssl=1&resize=350%2C200","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/02\/Customer-Churn.jpg?fit=640%2C448&ssl=1&resize=350%2C200 1x, https:\/\/i0.wp.com\/ucanalytics.com\/blogs\/wp-content\/uploads\/2015\/02\/Customer-Churn.jpg?fit=640%2C448&ssl=1&resize=525%2C300 1.5x"},"classes":[]}],"_links":{"self":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts\/3499"}],"collection":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/comments?post=3499"}],"version-history":[{"count":0,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/posts\/3499\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/media\/3526"}],"wp:attachment":[{"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/media?parent=3499"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/categories?post=3499"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ucanalytics.com\/blogs\/wp-json\/wp\/v2\/tags?post=3499"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}