Summer Data Science activities in Belgium.

summer edition

We wish you happy holidays, in case you get bored check out our educational channel on youtube.

The European Data Innovation Hub is active during the summer.
Here is a short update from what to expect in the coming weeks:

Thank you for supporting the European Data Innovation Hub, we had a great academic year.

Philippe Van Impe
@pvanimpe
www.di-academy.com

Please forward the information about the data science bootcamp to your peers and friends.

Job – Junior Data Scientist

Screenshot 2016-07-01 12.02.02

Are you pursuing a career in data science?

We have a great opportunity for you: an intensive training program combined with interesting job opportunities!

Interested? Check out http://di-academy.com/bootcamp/ follow the link to our datascience survey and send your cv to training@di-academy.com

Once selected, you’ll be invited for the intake event that will take place in Brussels this summer.

Hope to see you there,

Nele & Philippe

Join us – Silicon Valley Inspiration Tour – March 26, 2016

 

20110531_014212_OBRIEN-valleys-052911-e1307311368351

Visit the Silicon Valley ecosystem:

Just after the Data Innovation Summit that will be held in Brussels on March 23, the Data Science Community is organising a Low Budget innovation tour in Silicon Valley.

Silicon Valley has a variety of business organizations and institutions that create a business environment that has proved to be highly conducive to the successful creation of startup firms, disruptive business models, and leadership in a variety of high-tech areas.

With our team we want to get a first hand impression on what are the key components of Silicon Valley, how do they work, and how do they fit together?

Key Characteristics of the Silicon Valley Ecosystem that we will focus on:

  • Dual ecosystem of large firms and startups
  • High financial returns for successful entrepreneurs and startups’ early employees
  • Global top-level human resources for all stages of startups
  • Business infrastructure (law firms, accounting firms, mentors, etc.)
  • Venture capital – most competitive market
  • Globally top class universities (Stanford, UC Berkeley, UCSF)
  • Human resource clusters anchored around top universities
  • Extensive government role in shaping technological trajectories and basic science
  • Highly competitive industries, balance between “open innovation” and secret protection
  • Balance of “open innovation” and intellectual property protection
  • “Technology Pump” of top human resources from all over the world
  • High labor mobility at all levels of management and talent
  • Culture of accepting failures (effective evaluation and monitoring)

We plan to arrive on Saturday, we will rent a house for our community members in San Francisco from Saturday 26/3 to Tuesday 29 then we will move to a house in San Jose on Wednesday.

Agenda of our Visit:

We have lined up a few companies that we want to meet. If you can help us and facilitate the access to an interesting company please let us know.

We have already received some nice invitations from:

We plan to attend a few meetups too, like to one from SF DataScience on Monday about  ‘Kafka and Data Science’ , …

On arrival the ‘Sons of Analytics’ will rent a bike to attend the LAUGHLIN RIVER RUN .

And of course we will travel to San-Jose to attend the Strata Hadoop event. Please always use our discount code at any O’Reilly activity: our 20% discount code UGBDSC

Strata + Hadoop World is the leading event on how big data and ubiquitous, real-time computing is shaping the course of business and society. It brings together the world’s best data scientists and business leaders to share hard-won knowledge and innovations in technology and strategy.

Check out the impressive program and make plans to join Strata + Hadoop World in San Jose March 28-31, 2016.

Save 20% on most passes with discount code UGBDSC.

“The best opportunity to learn about the technologies that are transforming big data and data science.”

Costs

We will travel on a budget and share cars and houses, make sure you take a sleeping bag. Our first estimation is that you should be able to do this for less than 3000.

Flight 800
Accomodation 6 300
Food and drinks 600
Car rental 120
Peto cash 200
Ticket Strata 1000
3020

Coordinator

  • scouting to find interesting meetings: all community members + Philippe
  • practical organisation: Laurence
  • if you are interested just put your name on the list

 

The ABC of Datascience blogs – collaborative update

abc-letters-on-white-sandra-cunningham

A – ACID – Atomicity, Consistency, Isolation and Durability

B – Big Data – Volume, Velocity, Variety

C – Columnar (or Column-Oriented) Database

  • CoolData By Kevin MacDonell on Analytics, predictive modeling and related cool data stuff for fund-raising in higher education.
  • Cloud of data blog By Paul Miller, aims to help clients understand the implications of taking data and more to the Cloud.
  • Calculated Risk, Finance and Economics

D – Data Warehousing – Relevant and very useful

E – ETL – Extract, transform and load

F – Flume – A framework for populating Hadoop with data

  • Facebook Data Science Blog, the official blog of interesting insights presented by Facebook data scientists.
  • FiveThirtyEight, by Nate Silver and his team, gives a statistical view of everything from politics to science to sports with the help of graphs and pie charts.
  • Freakonometrics Charpentier, a professor of mathematics, offers a nice mix of generally accessible and more challenging posts on statistics related subjects, all with a good sense of humor.
  • Freakonomics blog, by Steven Levitt and Stephen J. Dubner.
  • FastML, covering practical applications of machine learning and data science.
  • FlowingData, the visualization and statistics site of Nathan Yau.

G – Geospatial Analysis – A picture worth 1,000 words or more

H – Hadoop, HDFS, HBASE

  • Harvard Data Science, thoughts on Statistical Computing and Visualization.
  • Hyndsight by Rob Hyndman, on fore­cast­ing, data visu­al­iza­tion and func­tional data.

I – In-Memory Database – A new definition of superfast access

  • IBM Big Data Hub Blogs, blogs from IBM thought leaders.
  • Insight Data Science Blog on latest trends and topics in data science by Alumnus of Insight Data Science Fellows Program.
  • Information is Beautiful, by Independent data journalist and information designer David McCandless who is also the author of his book ‘Information is Beautiful’.
  • Information Aesthetics designed and maintained by Andrew Vande Moere, an Associate Professor at KU Leuven university, Belgium. It explores the symbiotic relationship between creative design and the field of information visualization.
  • Inductio ex Machina by Mark Reid’s research blog on machine learning & statistics.

J – Java – Hadoop gave it a nice push

  • Jonathan Manton’s blog by Jonathan Manton, Tutorial-style articles in the general areas of mathematics, electrical engineering and neuroscience.
  • JT on EDM, James Taylor on Everything Decision Management
  • Justin Domke blog, on machine learning and computer vision, particularly probabilistic graphical models.
  • Juice Analytics on analytics and visualization.

K – Kafka – High-throughput, distributed messaging system originally developed at LinkedIn

L – Latency – Low Latency and High Latency

  • Love Stats Blog By Annie, a market research methodologist who blogs about sampling, surveys, statistics, charts, and more
  • Learning Lover on programming, algorithms with some flashcards for learning.
  • Large Scale ML & other Animals, by Danny Bickson, started the GraphLab, an award winning large scale open source project

M – Map/Reduce – MapReduce

N – NoSQL Databases – No SQL Database or Not Only SQL

O – Oozie – Open-source workflow engine managing Hadoop job processing

  • Occam’s Razor by Avinash Kaushik, examining web analytics and Digital Marketing.
  • OpenGardens, Data Science for Internet of Things (IoT), by Ajit Jaokar.
  • O’reilly Radar O’Reilly Radar, a wide range of research topics and books.
  • Oracle Data Mining Blog, Everything about Oracle Data Mining – News, Technical Information, Opinions, Tips & Tricks. All in One Place.
  • Observational Epidemiology A college professor and a statistical consultant offer their comments, observations and thoughts on applied statistics, higher education and epidemiology.
  • Overcoming bias By Robin Hanson and Eliezer Yudkowsky. Present Statistical analysis in reflections on honesty, signaling, disagreement, forecasting and the far future.

P – Pig – Platform for analyzing huge data sets

  • Probability & Statistics Blog By Matt Asher, statistics grad student at the University of Toronto. Check out Asher’s Statistics Manifesto.
  • Perpetual Enigma by Prateek Joshi, a computer vision enthusiast writes question-style compelling story reads on machine learning.
  • PracticalLearning by Diego Marinho de Oliveira on Machine Learning, Data Science and Big Data.
  • Predictive Analytics World blog, by Eric Siegel, founder of Predictive Analytics World and Text Analytics World, and Executive Editor of the Predictive Analytics Times, makes the how and why of predictive analytics understandable and captivating.

Q – Quantitative Data Analysis

R – Relational Database – Still relevant and will be for some time

  • R-bloggers , best blogs from the rich community of R, with code, examples, and visualizations
  • R chart A blog about the R language written by a web application/database developer.
  • R Statistics By Tal Galili, a PhD student in Statistics at the Tel Aviv University who also works as a teaching assistant for several statistics courses in the university.
  • Revolution Analytics hosted, and maintained by Revolution Analytics.
  • Rick Sherman: The Data Doghouse on business and technology of performance management, business intelligence and datawarehousing.
  • Random Ponderings by Yisong Yue, on artificial intelligence, machine learning & statistics.

S – Sharding (Database Partitioning)  and Sqoop (SQL Database to Hadoop)

  • Salford Systems Data Mining and Predictive Analytics Blog, by Dan Steinberg.
  • Sabermetric Research By Phil Burnbaum blogs about statistics in baseball, the stock market, sports predictors and a variety of subjects.
  • Statisfaction A blog by jointly written by PhD students and post-docs from Paris (Université Paris-Dauphine, CREST). Mainly tips and tricks useful in everyday jobs, links to various interesting pages, articles, seminars, etc.
  • Statistically Funny True to its name, epidemiologist Hilda Bastian’s blog is a hilarious account of the science of unbiased health research with the added bonus of cartoons.
  • SAS Analysis, a weekly technical blog about data analysis in SAS.
  • SAS blog on text mining on text mining, voice mining and unstructured data by SAS experts.
  • SAS Programming for Data Mining Applications, by LX, Senior Statistician in Hartford, CT.
  • Shape of Data, presents an intuitive introduction to data analysis algorithms from the perspective of geometry, by Jesse Johnson.
  • Simply Statistics By three biostatistics professors (Jeff Leek, Roger Peng, and Rafa Irizarry) who are fired up about the new era where data are abundant and statisticians are scientists.
  • Smart Data Collective, an aggregation of blogs from many interesting data science people
  • Statistical Modeling, Causal Inference, and Social Science by Andrew Gelman
  • Stats with Cats By Charlie Kufs has been crunching numbers for over thirty years, first as a hydrogeologist and since the 1990s, as a statistician. His tagline is- when you can’t solve life’s problems with statistics alone.
  • StatsBlog, a blog aggregator focused on statistics-related content, and syndicates posts from contributing blogs via RSS feeds.
  • Steve Miller BI blog, at Information management.

T – Text Analysis – Larger the information, more needed analysis

U – Unstructured Data – Growing faster than speed of thoughts

V – Visualization – Important to keep the information relevant

  • Vincent Granville blog. Vincent, the founder of AnalyticBridge and Data Science Central, regularly posts interesting topics on Data Science and Data Mining

W – Whirr – Big Data Cloud Services i.e. Hadoop distributions by cloud vendors

X – XML – Still eXtensible and no Introduction needed

  • Xi’an’s Og Blog A blog written by a professor of Statistics at Université Paris Dauphine, mainly centred on computational and Bayesian topics.

Y – Yottabyte – Equal to 1,000 exabytes, 1 million petabytes and 1 billion terabytes

Z – Zookeeper – Help managing Hadoop nodes across a distributed network

Feel free to add your preferred blog in the comment bellow.

Other resources:

Nice video channels:

More Jobs ?

hidden-jobs1

Click here for more Data related job offers.
Join our community on linkedin and attend our meetups.
Follow our twitter account: @datajobsbe

Improve your skills:

Why don’t you join one of our  #datascience trainings in order to sharpen your skills.

Special rates apply if you are a job seeker.

Here are some training highlights for the coming months:

Check out the full agenda here.

Join the experts at our Meetups:

Each month we organize a Meetup in Brussels focused on a specific DataScience topic.

Brussels Data Science Meetup

Brussels, BE
1,417 Business & Data Science pro’s

The Brussels Data Science Community:Mission:  Our mission is to educate, inspire and empower scholars and professionals to apply data sciences to address humanity’s grand cha…

Next Meetup

DATA UNIFICATION IN CORPORATE ENVIRONMENTS

Wednesday, Oct 14, 2015, 6:30 PM
57 Attending

Check out this Meetup Group →

Job – Sentiance – Marketing Data Scientist – Antwerp

sentiance_logo_72dpi

Hi Philippe,

Given the topic of the meetup next Thursday, I think the following job opportunity might be relevant to post on your blog 🙂
At Sentiance we’re looking for a data scientist with experience in market segmentation:
http://www.sentiance.com/team/marketing-data-scientist/
However, we always welcome applications of junior candidates too!
http://www.sentiance.com/team/junior-data-scientist/

Thanks, and hope to see you thursday!
Vincent Spruyt
twitter id: @sentiance

As an experienced data analyst, you are ready to kick-off a new adventure in a fast-paced environment where you can work with the latest machine learning technologies and data science tools.

Job description

  1. You will be part of our Data Science Team and you are passionate about machine learning and data analysis.
  2. Using advanced data analytics, you will form hypotheses and draw meaningful insights about user behavior and user segmentation. As a marketing data scientist, you will explore relations between users and their preferences, discover interesting segments, perform advanced clustering and dimensionality reduction techniques.
  3. You will carry out research that will improve our general understanding of our users, and communicate your findings to other team members in order to initiate new platform development cycles.
  4. You will apply your statistical and mathematical background to real-life big-data problems, and use your machine learning knowledge on a day to day basis.
  5. You will work closely & interact with our Data Engineering Team as your work is used to improve our models and is pushed through our release process.
  6. Your main objectives will be the design and implementation of data mining and analysis algorithms and the communication of reports and quality metrics for current production processes.

Desired Skills & Experience:

  1. You have a masters degree or PhD in computer science or related field.
  2. You are an expert in advanced analytics and are experienced in hypothesis testing.
  3. You possess a deep understanding of clustering, manifold learning and predictive modeling techniques.
  4. You have good knowledge of and experience with any of Python, Matlab or R.
  5. You have a strong mathematical background and analytical mindset.
  6. You are fluent in English. Dutch is a plus.
  7. You can work independently and take matters into your own hands.
  8. The ability to quickly learn new technologies and successfully implement them is essential.

Bonus

Experience with any of the following is considered a plus:

  • Advanced Python knowledge and experience
  • Scikit-learn, Pandas, Numpy, Matplotlib
  • Experience with Spark or the Hadoop eco-system
  • Machine learning, data mining, data visualization

Apply:

Make sure that you are a member of the Brussels Data Science Community linkedin group before you apply. Join  here.

Please note that we also manage other vacancies that are not public, if you want us to bring you in contact with them too, just send your CV to datasciencebe@gmail.com .

Send your job application today! 

Please send Sentiance your resume and a strong motivation with reference sentiance/2015/MDS or apply on LinkedIn.

Job – Infofarm – DataScientist

Infofarm

InfoFarm breidt uit en is op zoek naar een nieuwe Data Scientist!

BEDRIJFSPROFIEL

InfoFarm is een Data Science bedrijf dat zich toespitst in het opleveren van kwaliteitsvolle Data Science en Big Data oplossingen aan haar klanten. Onze naam danken we aan één van de vele informele brainstormsessies onder collega’s die spontaan tijdens de middagpauze ontstaan. Een gezellige sessie later hadden we de hele analogie met het boerderijleven op poten: we planten ideeën, we ploegen door onze klant zijn data, laten deze groeien met andere data of inzichten en oogsten business waarde door er verschillende (machine learning) technieken op toe te passen.

We hebben een uniek team met verscheidene talenten en verschillende achtergronden: Data Scientists (mensen met een onderzoek achtergrond uit een kwantitatieve richting, Big Data Developers (sterk technische Java programmeurs) en Infrastructuurmensen (de bits-and-bytes mensen). Wij ontwikkelen samen geweldige oplossingen voor onze klanten uit verschillende sectoren. Om ons team te versterken zijn we op zoek naar een Big Data Developer. 

FUNCTIEOMSCHRIJVING

Als Data Scientist exploreer je datasets, verschaf je inzichten en help je klanten actie te ondernemen gebaseerd op deze inzichten. Je werkt zelfstandig of in een gemengd team, ofwel in onze kantoren ofwel in detachering bij de klant. Je bent niet bang om met creatieve oplossingen voor complexe problemen naar voren te treden. Je gidst onze Big Data Developers in het bouwen van Big Data applicaties gebaseerd op de inzichten die jij verkregen hebt. Je zal in verschillende sectoren en omgevingen belanden. De ene dag werk je voor een telecom bedrijf, om de dag nadien het waterzuivering systeem van België beter te leren kennen en ten slotte ook nog een Big Data applicatie in de logistieke sector te bouwen. Bij InfoFarm zijn geen twee projecten gelijkaardig, maar dat schrikt je niet af. Je kijkt er naar uit om bij te leren over verschillende businessen en om nieuwe ontwikkelingen en technologieën op de markt te volgen, alsook om  deze opgedane kennis uit te dragen naar onze klanten en binnen het team. 

FUNCTIEVEREISTEN

  • Je hebt een master diploma in een kwantitatieve richting (wiskunde, ingenieur, …). Een doctoraat is een pluspunt.
  • Kennis van een data analyse taal (R, Python, …) geeft je een voorsprong. Bereidheid om een van deze talen te leren is een vereiste.
  • Kennis van SQL is een voordeel.
  • Het leren begrijpen van Big Data tools (Hadoop, Hive, Pig, Spark, Spark MLlib, …) schrikt je niet af.
  • Kennis van Java en Scala vormen een meerwaarde.

Apply:

Make sure that you are a member of the Brussels Data Science Community linkedin group before you apply. Join  here.

Please note that we also manage other vacancies that are not public, if you want us to bring you in contact with them too, just send your CV to datasciencebe@gmail.com .

Bekijk de volledige job informatie hieronder en stuur als antwoord je CV naar jobs@infofarm.be!

(An English version can be requested via jobs@infofarm.be)

check out the original post: http://www.infofarm.be/articles/were-hiring-data-scientist

Job – Infofarm – Big Data Developer

Infofarm

InfoFarm breidt uit en is op zoek naar een nieuwe Big Data Developer!

BEDRIJFSPROFIEL

InfoFarm is een Data Science bedrijf dat zich toespitst in het opleveren van kwaliteitsvolle Data Science en Big Data oplossingen aan haar klanten. Onze naam danken we aan één van de vele informele brainstormsessies onder collega’s die spontaan tijdens de middagpauze ontstaan. Een gezellige sessie later hadden we de hele analogie met het boerderijleven op poten: we planten ideeën, we ploegen door onze klant zijn data, laten deze groeien met andere data of inzichten en oogsten business waarde door er verschillende (machine learning) technieken op toe te passen.

We hebben een uniek team met verscheidene talenten en verschillende achtergronden: Data Scientists (mensen met een onderzoek achtergrond uit een kwantitatieve richting, Big Data Developers (sterk technische Java programmeurs) en Infrastructuurmensen (de bits-and-bytes mensen). Wij ontwikkelen samen geweldige oplossingen voor onze klanten uit verschillende sectoren. Om ons team te versterken zijn we op zoek naar een Big Data Developer. 

FUNCTIEOMSCHRIJVING

Als Big Data Developer ontwikkel je voornamelijk Big Data applicaties op het Apache Hadoop of Apache Spark platform. Je werkt zelfstandig of in een gemengd team, ofwel in onze kantoren ofwel in detachering bij de klant. Je bent niet bang om met creatieve oplossingen voor complexe problemen naar voren te treden. De ene dag werk je voor een telecom bedrijf, om de dag nadien het waterzuivering systeem van België beter te leren kennen en ten slotte ook nog een Big Data applicatie in de logistieke sector te bouwen. Bij InfoFarm zijn geen twee projecten gelijkaardig, maar dat schrikt je niet af. Je kijkt er naar uit om bij te leren over verschillende businessen en om nieuwe ontwikkelingen en technologieën op de markt te volgen, alsook om  deze opgedane kennis uit te dragen naar onze klanten en binnen het team. 

FUNCTIEVEREISTEN

Je hebt minstens 2-3 jaar ervaring met Java ontwikkeling. Certificaties vormen een meerwaarde.

Je kan werken met Maven, Spring of EJB en één of meer RDBMS.

Kennis van Hadoop, Hive en Pig zijn een pluspunt, net als kennis van Spark en Spark MLlib. Bereidheid om je te certifiëren in een van deze domeinen is noodzakelijk.

Kennis van R en Scala zijn een voordeel.

Je hebt op zijn minst een Bachelor in Applied Computer Sciences. 

Apply:

Make sure that you are a member of the Brussels Data Science Community linkedin group before you apply. Join  here.

Please note that we also manage other vacancies that are not public, if you want us to bring you in contact with them too, just send your CV to datasciencebe@gmail.com .

Bekijk de volledige job informatie hieronder en stuur als antwoord je CV naar jobs@infofarm.be!

(An English version can be requested via jobs@infofarm.be)

check out the original post: http://www.infofarm.be/articles/were-hiring-big-data-developer-0

Job – NG-Data – Big Data Scientist – US and Belgium

NGDATA

Jo Buyl shared this job opportunity with us for a Big Data Scientist.

Job Description

In the era of Big Data, data is not useful until we identify patterns, apply context and intelligence. The data scientist, as an emerging career path, is at the core of organizational success with Big Data and for humanizing the data to help businesses better understands their consumer.

As a data scientist, you sift through the explosion of data to discover what the data is telling you. You figure out “what questions to ask” so that relevant information hidden in the large volumes and varieties of data can be extracted. The Data Scientist will be responsible for designing and implementing processes and layouts for complex, large-scale data sets used for modeling, data mining, and research purposes.

Opportunities

  • Be a true partner in defining the solutions, have and develop business acumen and bring technical perspective in furthering the product and business;
  • Aggregate data from various sources;
  • Help define, design, and build projects that leverage our data;
  • Develop computational algorithms and statistical methods that find patterns and relationships in large volumes of data;
  • Determine and implement mechanisms to improve our data quality;
  • Deliver clear, well-communicated and complete design documents;
  • Ability to work in a team as well as independently and deliver on aggressive goals;
  • Exhibit Creativity and resourcefulness at problem solving while collaborating and working effectively with best in class designers, engineers of different technical backgrounds, architects and product managers.

Personal Skills

  • You have a logical approach to the solution of problems and good conceptual ability and skills in analysis;
  • You have the ability to integrate research and best practices into problem avoidance and continuous improvement
  • You possess good interpersonal skills;
  • You are self reliant and capable of both independent work and as member of a team;
  • You are persistent, accurate, imaginative;
  • You are able and have the discipline to document and record results;
  • Be customer service oriented;
  • Be open minded and solution oriented;
  • You enjoy constantly expanding your knowledge base;
  • You are willing to travel up to five days per month.

Technical Background

The successful candidate should have 5+ years experience in large-scale software development, with at least 3 years in Hadoop. Have a strong cross-functional technical background, excellent written/oral communication skills, and a willingness and capacity to expand their leadership and technical skills.

  • BS / MS in computer Science;
  • Strong understanding of data mining and machine learning algorithms, data structures and related core software engineering concepts;
  • Understanding the concepts of Hadoop, HBase and other big data technologies; Understanding of marketing processes in the financial and or retail market;
  • Have a sound knowledge of SPSS and SQL

Apply:

Make sure that you are a member of the Brussels Data Science Community linkedin group before you apply. Join  here.

Please note that we also manage other vacancies that are not public, if you want us to bring you in contact with them too, just send your CV to datasciencebe@gmail.com .

Apply Today!

Upload your resume or send it to jobs@ngdata.com. We look forward to your application!

Job – Big Industries – Hadoop Developer

Big-Industries-stamp-logo

Matthias Vallaey      Matthias Vallaey, Partner at Big Industries asked us to post following vacancy

Big Industries (a Cronos Company) works together with you to translate your ideas into workable Big Data solutions that will create measurable value for your organisation.
Implementing the solution using proven big data technologies from industry leading vendors, integrating only the most appropriate, effective and sustainable technologies to deliver best-in-class products and services.
Big Industries helps to assess, identify and integrate effective refinements in order to increase the value that big data solutions bring.
We are fulfillment partners for Cloudera and MapR, the premiere Hadoop distributions, for BeLux and offer expert consulting, systems integration and tailored application development with knowledge and experience across a broad range of industries.

Specialties

Hadoop, Big Data, Systems Integration, Consulting, HBase, Spark, MapReduce, SolrCloud, Impala, Kafka

Job Description

As a Big Data Developer you will work in a team building big data solutions. You will be developing, maintaining, testing and evaluating big data solutions within organisations. Generally you will be working on implementing complex and large scale big data projects with a focus on collecting, parsing, managing, analysing and visualizing large datasets to turn raw data into insights using multiple toolsets, techniques and platforms.

Soft skills

Team player – embraces change, able to adapt to working in varied software delivery environments. Can-do attitude, pragmatic, results-oriented – lateral thinker.

Mandatory experience & skills

  • Computing or Mathematics diploma, or 4 years experience active work experience within systems integration teams.
  • Thorough understanding of Java, and solid grasp of software development best practises.
  • Experience using hadoop and related technologies (eg. pig, hive, spark, impala), ideally with popular hadoop data processing pipeline patterns and technologies (cascading, crunch, oozie).
  • Willing to work to become Cloudera Developer certified.
  • Development exposure on both cloud and classic compute environments.
  • Very good Linux systems and Linux shell scripting knowledge.

Apply:

Make sure that you are a member of the Brussels Data Science Community linkedin group before you apply. Join  here.

Here is the original jobpost .

Contact Matthias Vallaey matthias.vallaey@bigindustries.be (+32 496 57 66 27).

Job – ING – Hadoop Specialist

 ing hadoop

Here is an other request for a Hadoop expert, over the past months we have seen many similar requests. We will organize a 3 days  Hadoop introduction training mid March. If you are interested to attend this training please send an email to pvanimpe@gmail.com .

 

ING Belgium is, as the first universal direct bank, a strong player on the Belgian market. The bank provides its customers with a wide range of financial products and services via the distribution channel of their choice (click and face). Customers of ING Belgium are as well retail and private banking clients, as well as small, medium and large enterprises. ING IT employs up to 9,000 IT professionals from the Netherlands and Belgium. Within each business unit of ING you will meet our IT people. All these IT specialists have an important role: in a highly complex and dynamic world, the IT people make the bank work.

Are you looking for…

… a position in which you actively take part in the design and innovative usage of a state-of-the art Big Data solution to enable ING’s business lines to gain new insights from the wide variety of data available in the bank?
… a position in which you will experience first-hand that data is the oil of the 21st century, and (advanced) analytics the combustion engine?
… a position in which you will continuously share new experiences and develop yourself further in the Big Data domain?

Your role

• Be part of the dedicated Advanced Predictive Analytics Scrum team consisting of a mix of profiles such as Data Scientists, Data Experts and Big Data Developers
• Take the lead in the design of our evolving Big Data environment
• Develop the flows to bring the data into the Big Data environment by using Flume & Sqoop
• Prepare the data by implementing transformation steps and quality controls thanks to Pig and Spark
• Analyze the data via the usage of distributed machine learning algorithms written in Java & Scala, based on the input of the Data Scientists
• Perform the installation, configuration and maintenance of the Big Data applicative stack
• Collaborate with application owners to define the best approach to fetch the data produced by their application

Profile:

You have …

• …a Master or Bachelor degree in information technology, computer science or equivalent
• … 5 or more years of development experience with Java and/or Scala
• … 5 or more years of experience in Data Warehousing / Business Intelligence
• … working knowledge or hands-on experience with Hadoop Clusters, including setting up, configuring and monitoring Hadoop file cluster
• …the ability to translate Use Cases into Big Data Hadoop solutions
• … working experience with Hadoop data platforms and its ecosystem (Map Reduce, Pig, Hive, Spark…) is a definite plus
• … knowledge of scripting language (OS level, JVM based) is considered as an extra asset
• … a good knowledge of English. A good command of French and/or Dutch is considered as an asset.

And are you also someone who is …

• … result-oriented, analytical and strong in problem solving
• … well organized: you work out detailed action plans how to reach your goal, taking into account the constraints and priorities
• … technical-minded but who can still grasp the bigger picture and the business objectives
• … a team player able to work in an agile and highly collaborative environment
• … able to challenge yourself to cope with the fast-paced technological evolutions
• … eager to learn continuously from and who is active in the Big Data community
• … a strong communicator, able to present concepts and data in a structured way
• … assertive but diplomatic

How to apply?

Make sure you are a member of the Brussels Data Science Community linkedin group before you apply. Join  here.

Please note that we also manage other vacancies that are not public, if you want us to bring you in contact with them too, just send your CV to datasciencebe@gmail.com .

For more information and ammplication please visit Apply now

More Jobs ?

hidden-jobs1

Click here for more Data related job offers.
Join our community on linkedin and attend our meetups.
Follow our twitter account: @datajobsbe

Have you been to our Meetups yet ?

Each month we organize a Meetup in Brussels focused on a specific DataScience topic.

Brussels Data Science Meetup

Brussels, BE
755 Business & Data Science pro’s

The Brussels Data Science Community, is the Belgian voice for data scientists with a conscience.  If you have a desire to stay at the forefront of technological innovation & p…

Next Meetup

Data Sciences in the Telecom Sector

Thursday, Feb 19, 2015, 6:30 PM
140 Attending

Check out this Meetup Group →