Book page

Big Data tools for advanced users 1st edition - 2024

 Big Data tools for advanced users 
Course LeaderMarco Puts
Target GroupIT professionals whose role is to support statisticians with big data infrastructure, either via local big data clusters or via cloud solutions, and the engineering of big data processing. Methodologists and statisticians with a strong IT background who are expected to handle big data infrastructures and unstructured data.
Entry Qualifications
  • Sound command of English. Participants should be able to make short interventions and to actively participate in discussions
  • The participants should be computer literate and able to programme in R and/or Python
  • Learn how to extract relevant information for statistical purposes from huge amounts of data
  • Big data clusters;
  • Cloud computing;
  • Hadoop and MapReduce;
  • Analyzing data in Hadoop with SQL: Hive;
  • Distributed programming with Spark;
  • NoSQL databases;
  • Techniques and tools for extracting data from the web
Expected OutcomeParticipants will have a broad overview of modern state of the art techniques for managing and analyzing big data, its tools and infrastructure.
Training Methods
  • Presentations and lectures
  • Exchange of views/experiences on national practices
  • Exercises
Required ReadingNone
Suggested Reading
Required PreparationParticipants should have at least some basic programming knowledge, especially in Python and R languages. Knowledge of relational databases are strongly suggested.


Marco PUTS (CBS Netherlands)

Martijn Tennekes (CBS Netherlands)

Bjoern Ole Mussmann (eScience Center)



1st edition

Practical Information    
WhenDurationWhereOrganiserAPPLICATION VIA National Contact Point
9 – 11 July 20243 days

The Hague,



Public Sector GmbH

Deadline: 13.05.2024