As of June 2015 SparkR is integrated in Spark-1.4.0. However this is still work in progress: in the original version, no Spark MLlib machine learning algorithms were accessible via R. In Spark-1.5.0 it is already possible to create generalized linear models (glm).
In this one-day SparkR course, you will understand how Spark is working under the hood (MapReduce paradigm, lazy evaluation, …) and learn how to use SparkR. You will start setting up a local Spark cluster and access it via R. Next up you will learn basic data transformations in SparkR, either via R code or via SparkSql. Finally we will use SparkR’s glm and compare it to R’s glm and we will implement our own machine learning algorithm.
This training event is organise in collaboration with Oak3 (http://www.oak3.be). The Oak3 Academy is an IT Learning Center providing hands-on, intensive training and coaching to help students develop the skills they need for a successful career as an Information Technology Professional or as a knowledge worker (end-user of software). Our goal is to provide the highest quality training and knowledge transfer that enables a person to start or enhance his or her career as an IT professional or knowledge worker, in a short period of time. We therefore offer knowledge assimilation, facilitate expertise transfer and provide a rewarding learning experience. Our training solutions are designed to help students learn faster, master the latest information technologies and perform smarter.
Prerequisites: Previous experience with R is required, notions of Apache Spark are useful but not required.
When: Tuesday, November 24, 2015 from 9:00 AM to 5:00 PM (CET)
Where: European Data Innovation Hub – 23 Vorstlaan Watermaal-Bosvoorde, Brussel 1170 BE