The course Data science: Data Analysis offers a range of techniques and algorithms from statistics, machine learning and data mining to make predictions about future events and to uncover hidden structures in data. The course has a strong practical focus; participants actively learn how to apply these techniques to real data and how to interpret their results. The course covers both classical and modern topics in data analysis.
Course by tag
R is rapidly becoming the standard platform for data analysis. This course offers an elaborate introduction into statistical programming in R. Students learn to operate R, form pipelines for data analysis, make high quality graphics, fit, assess and interpret a variety of statistical models and do advanced statistical programming. The statistical theory in this course covers t-testing, regression models for linear, dichotomous, ordinal and multivariate data, statistical inference, statistical learning, bootstrapping and Monte Carlo simulation techniques.
If you expect to work with the software Mplus, this course can help you to get started! This course is a compact 1-day workshop on using Mplus to get you started. We will focus on how to get the syntax running; how to avoid common mistakes; how to interpret the output and how to deal with error messages. In the exercises you will run multiple regression and factor analysis models, which are the basis of many structural equation models in Mplus.
This 4-day course teaches you the basics in solving your own missing data problems appropriately. Participants will learn how to form imputation models, how to combine data sets, how to model non-response, how to use diagnostics to inspect the imputed values, how to obtain valid inference on incomplete data and how to avoid many of the pitfalls associated with real-life missing data problems.
This is a four-day course on how to study dynamics in intensive longitudinal data, such as ambulatory assessments (AA), experience sampling method (ESM) data, ecological momentary assessments (EMA), real time data capture, observational data or electronic daily diaries. We provide a tour of diverse modeling approaches for such data and the philosophies behind them, as well as practical experience with these modeling techniques using different software packages (including R and Mplus).
Applications of text mining are everywhere: social media, web search, advertising, emails, customer service, healthcare, marketing, etc. In this course, students will learn how to apply text mining methods on text data and analyse them in a pipeline with statistical learning algorithms. The course has a strongly practical hands-on focus, and students will gain experience in using and interpreting text mining on data examples from humanities, social sciences, and healthcare.
This course introduces the basic and advanced concepts and ideas in text mining and natural language processing. In this course, students will learn how to apply text mining methods on text data and analyse them in a pipeline with machine learning and deep learning algorithms. The course has a strongly practical hands-on focus, and students will gain experience in using text mining on real data from social sciences, humanities, and healthcare and interpreting the results.
This is a five-day course on structural equation modeling (SEM) using Mplus. If you already know how to analyse your data in Mplus but want to learn more about what you are actually doing, and especially if you want to know more about advanced longitudinal analyses, this course is for you. The course consists of in-depth lectures on the fundamentals of Mplus and advanced longitudinal models.
We offer a 5-day course on how to perform basic SEM analyses using Mplus. The main objective of this course is to learn how to analyse several models with Mplus (e.g. path models, multiple group models, mediation and moderation, confirmatory factor analysis, and longitudinal models). No previous knowledge of Mplus is assumed, but prior knowledge of SEM, although not mandatory, will make this course more useful.
This course describes the stages involved in Bayesian analysis: specifying the prior and data models, deriving inference, model checking and refinement. We discuss prior and posterior predictive checking, and selecting a technique for sampling from a probability distribution. Other topics discussed are: approximate measurement invariance (a Bayesian method to assess comparability of data), evaluating hypotheses via the Bayes Factor and information criteria, and combining evidence from multiple studies addressing the same research question. Finally, we propose strategies for reproducibility and reporting standards, outlining the WAMBS-checklist (when to Worry and how to Avoid the Misuse of Bayesian Statistics).
This course will teach you the theoretical basics of multilevel modelling and some important methodological and statistical issues. You will also learn how to analyse multilevel data sets with the HLM and Mplus programs, to interpret the output and to report the results. The benefits of multilevel analysis are discussed both in theory as with empirical examples. This course restricts to a quantitative (i.e. continuous) outcome variable. Categorical outcomes are part of the course Advanced Multilevel.
The course is based on a total survey error perspective and discusses the major sources of survey error. Participants will be presented with tools for detection and adjustment of such errors. Analysis methods are introduced using both SPSS and R. Topics include complex sampling, nonresponse adjustment, measurement error, analysis of incomplete data and advanced use of administrative data. Special attention will be given to the analysis of complex surveys that include weighting, stratification and design effects.