The course Data science: Data Analysis offers a range of techniques and algorithms from statistics, machine learning and data mining to make predictions about future events and to uncover hidden structures in data. The course has a strong practical focus; participants actively learn how to apply these techniques to real data and how to interpret their results. The course covers both classical and modern topics in data analysis.
What puts former criminals on the right track? How can we prevent heart disease? Can Twitter predict election outcomes? What does a violent brain look like? How many social classes does 21st century society have? Are hospitals spending too much on health care, or too little?
Statistical learning is the art and science of tackling questions like these by analysing data. Just as cartographers make maps to see what a country looks like, data analysts make graphics that reveal hidden structures in the data. And just as doctors diagnose sick patients and advise healthy ones on how to stay healthy, data analysts predict the consequences of actions and/or events so we can act on that knowledge. Methods from statistics, data mining, and machine learning play an important part in this process.
The course has a strong practical character; the focus is not on the mathematics behind the methods but on the principles that make them work. Participants learn how to apply these methods to real data and how to interpret the results. The course covers both classical and modern topics in data analysis.
Basic knowledge of the statistical software program R is required (e.g. of the level of the Summer School Data Science: Statistical Programming with R or the online e-book R for Data Science by Hadley Wickham).
Participants are requested to bring their own laptop computer. Software will be available online.
This course is part of a series of 5 courses in the Summer School Data Science specialisation taught by UU’s department of Methodology & Statistics. Please see here for more information about the full specialisation. This course can also be taken separately.
Summer School Data Science specialisation:
- Data science: Statistical Programming with R (S24: 5 - 9 July)
- Data science: Multiple Imputation in Practice (S28: 12 - 15 July)
- Data science: Introduction to Text Mining with R (S41: 12 - 14 July )
- Data science: Data analysis (this course)
- Data science: Applied Text Mining (S42: 26 - 29 July)
Upon completing 3 out of 5 courses in the specialisation (no more than one text mining course), students can obtain a certificate. Each course may also be taken separately.
Please note that there is always the possibility that we have to change the course pending COVID19-related developments. The exact details, including a day-to-day program, will be communicated 6 weeks prior to the start of the course.
Applied researchers and master students from applied fields such as sociology, psychology, education, political science, public policy, quantitative criminology, human development, marketing, management, biology, medicine, computational linguistics, communication sciences.
A maximum of 60 participants will be allowed in this course. Please note that the selection for this course will be done on a first-come-first-served basis.
Aim of the course
This course aims to provide you with hands-on experience applying classical as well as modern statistical learning techniques, using R.
For an overview of all our summer school courses offered by the Department of Methodology and Statistics please click here.
Five full days. A typical course day starts at 9.00 and ends at 17.00 with breaks for coffee, lunch and tea.
Please note that there are no graded activities included in this course. Therefore, we are not able to provide students with a transcript of grades. You will obtain a certificate upon completion of this course.
You can choose between two options for participating in this course, but please note that there is always the possibility that we have to change the course pending COVID19-related developments:
- If you choose the livestream option, you will get a discount on the course fee since we will not provide lunch then. The lectures will be broadcasted in Central European Summer Time via a livestream (not recorded). Participants can ask questions via the chat which will be moderated by a second lecturer who will either directly answer your questions via the chat or ask your questions to the first lecturer during class. You will also receive online support during the group computer labs from our team. Additionally, Q&A sessions will be organised so you will benefit from our normal high level expertise while enjoying the class from the comfort of your own chair.
- If you choose the campus option, you will be able to attend the lectures and computer labs at our campus. Of course, we will follow all COVID19-guidelines that hold at the time of the start of your course. We will keep you updated about the newest developments (see also https://www.uu.nl/en/information-coronavirus).
If you are interested in the campus option, let us know via a message in the application form under ‘Student Comment’.
The physical course costs €720, but if you participate via the livestream you will get a 100 euro discount.
Tuition fee for PhD students from the Faculty of Social and Behavioural Sciences from Utrecht University will be funded by the Graduate School of Social and Behavioural Sciences.
There are no scholarships available for this course.
Irma Reyersen | E: firstname.lastname@example.org
For frequently asked questions about how we organize our summer school courses during the pandemic, please click here