{"id":285,"date":"2012-03-28T10:26:46","date_gmt":"2012-03-28T10:26:46","guid":{"rendered":"http:\/\/217.76.133.156\/?p=285"},"modified":"2012-03-28T21:27:11","modified_gmt":"2012-03-28T21:27:11","slug":"r-and-big-data","status":"publish","type":"post","link":"https:\/\/www.josemalvarez.es\/?p=285","title":{"rendered":"R &#038; Big Data Intro"},"content":{"rendered":"<p>I am very committed to improving my know-how in delivering solutions that deal with Big Data in a high-performance fashion. I am continuously looking for tools, algorithms, recipes (e.g. <a title=\"Data Science e-Book\" href=\"http:\/\/www.datashaping.com\/ABbook5.pdf\">Data Science e-book<\/a>), papers and technology that enable this kind of processing, because it is often said that it will become relevant in the coming years, when in fact it is already a reality!<\/p>\n<p>Last week I resumed using <a title=\"R\" href=\"http:\/\/www.r-project.org\/\">R<\/a> and <a title=\"R commander\" href=\"http:\/\/socserv.mcmaster.ca\/jfox\/Misc\/Rcmdr\/\">rcmdr<\/a> to analyze and extract statistics from my PhD experiments using the <a title=\"Wilcoxon test\" href=\"http:\/\/en.wikipedia.org\/wiki\/Wilcoxon_signed-rank_test\">Wilcoxon test<\/a>. I started with R three years ago, when I developed a simple graphical interface in Visual Studio to input data and send operations to the R interpreter. The motivation for this work was to help a colleague with his final degree project, and the experience was very rewarding.<\/p>\n<p>What is the relationship between Big Data and R?<\/p>\n<p>The explanation is simple: a key enabler for providing value-added services is managing and learning from historical logs, so by combining an excellent statistics suite with the Big Data realm it is possible to meet the requirements of a wide variety of services in domains such as NLP, recommendation, business intelligence, etc. 
For instance, there are approaches that combine R with Hadoop, such as <a title=\"RICARDO\" href=\"http:\/\/www.cs.ucsb.edu\/~sudipto\/papers\/sigmod2010-das.pdf\">RICARDO<\/a> or <a title=\"Parallel R\" href=\"http:\/\/shop.oreilly.com\/product\/0636920021421.do\">Parallel R<\/a>, and new companies such as <a title=\"Revolution Analytics\" href=\"http:\/\/www.revolutionanalytics.com\/products\/enterprise-big-data.php\">Revolution Analytics<\/a> are emerging to offer R-based services for processing Big Data.<\/p>\n<p>This post was a short introduction to R as a tool for exploiting Big Data. If you\u2019re interested in this kind of approach, please take a look at the following presentation by Lee Edlefsen:<\/p>\n<div id=\"__ss_9914141\" style=\"width: 425px;\"><strong style=\"display: block; margin: 12px 0 4px;\"><a title=\"Scalable Data Analysis in R Webinar Presentation\" href=\"http:\/\/www.slideshare.net\/RevolutionAnalytics\/scalable-data-analysis-in-r-webinar-presentation\" target=\"_blank\">Scalable Data Analysis in R Webinar Presentation<\/a><\/strong> <iframe loading=\"lazy\" src=\"http:\/\/www.slideshare.net\/slideshow\/embed_code\/9914141\" frameborder=\"0\" marginwidth=\"0\" marginheight=\"0\" scrolling=\"no\" width=\"425\" height=\"355\"><\/iframe><\/p>\n<div style=\"padding: 5px 0 12px;\">View more <a href=\"http:\/\/www.slideshare.net\/\" target=\"_blank\">presentations<\/a> from <a href=\"http:\/\/www.slideshare.net\/RevolutionAnalytics\" target=\"_blank\">Revolution Analytics<\/a><\/div>\n<\/div>\n<p>Keep on researching and learning!<\/p>\n","protected":false},"excerpt":{"rendered":"<a href=\"https:\/\/www.josemalvarez.es\/?p=285\" rel=\"bookmark\" title=\"Permalink to R &#038; Big Data Intro\"><p>I am very committed to improving my know-how in delivering solutions that deal with Big Data in a high-performance fashion. I am continuously looking for tools, algorithms, recipes (e.g. 
Data Science e-book), papers and technology that enable this kind of processing, because it is often said that it will become relevant in the coming years, when in fact it is already a reality! Last [&hellip;]<\/p>\n<\/a>","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[72],"tags":[79,78,77,49],"class_list":{"0":"post-285","1":"post","2":"type-post","3":"status-publish","4":"format-standard","6":"category-research-blog","7":"tag-algorithms","8":"tag-big-data","9":"tag-r","10":"tag-statistics","11":"h-entry","12":"hentry"},"_links":{"self":[{"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=\/wp\/v2\/posts\/285","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=285"}],"version-history":[{"count":6,"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=\/wp\/v2\/posts\/285\/revisions"}],"predecessor-version":[{"id":292,"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=\/wp\/v2\/posts\/285\/revisions\/292"}],"wp:attachment":[{"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=285"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=285"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.josemalvarez.es\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=285"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}