What
- Learn how to apply data science techniques using parallel programming in Apache Spark to explore big (and small) data.
- Study online but work in group
- Get help from a local expert
Why we coach MOOCs
The European Data Innovation Hub is partnering with top experts to offer MOOC participants the possibility to do these online courses in group. During the duration of the Mooc participants will be welcome to come to the Hub in Brussels to work and to go through exercises with other participants. On specific days one or more domain expert will be present to coach the students.
Planning
About this course
Organizations use their data for decision support and to build data-intensive products and services, such as recommendation, prediction, and diagnostic systems. The collection of skills required by organizations to support these functions has been grouped under the term Data Science. This course will attempt to articulate the expected output of Data Scientists and then teach students how to use PySpark (part of Apache Spark) to deliver against these expectations. The course assignments include Log Mining, Textual Entity Recognition, Collaborative Filtering exercises that teach students how to manipulate data sets using parallel processing with PySpark.
This course covers advanced undergraduate-level material. It requires a programming background and experience with Python (or the ability to learn it quickly). All exercises will use PySpark (part of Apache Spark), but previous experience with Spark or distributed computing is NOT required. Students should take this Python mini-quiz before the course and take this Python mini-course if they need to learn Python or refresh their Python knowledge.
What you’ll learn
- Learn how to use Apache Spark to perform data analysis
- How to use parallel programming to explore data sets
- Apply Log Mining, Textual Entity Recognition and Collaborative Filtering to real world data questions
- Prepare for the Spark Certified Developer exam
Meet the online instructor:
Anthony D. Joseph
Meet the coach:
Kris Peeters from Dataminded
Certificate
Pursue a Verified Certificate to highlight the knowledge and skills you gain ($50)
-
Official and Verified
Receive a credential signed by the instructor, with the institution logo to verify your achievement and increase your job prospects
-
Easily Shareable
Add the certificate to your CV, resume or post it directly on LinkedIn
-
Proven Motivator
Get the credential as an incentive for your successful course completion
Job opportunities ?
Click here for Data related job offers.
Join our community on linkedin and attend our meetups.
Follow our twitter account: @datajobsbe
Have you been to our Meetups yet ?
Each month we organize a Meetup in Brussels focused on a specific DataScience topic.
Pingback: Opening Registrations for Data Science and Big Data training | The Brussels Data Science Community
Pingback: Free Training – Python for Data Science – Brussels | The Brussels Data Science Community