Edwin De Jonge
Statistical consultant / data scientist, Statistics Netherlands

Edwin de Jonge is a statistical consultant and data scientist at Statistics Netherlands, the Dutch government agency responsible for producing official demographic, economic, social and environmental statistics. His expertise is in statistical computing, data visualisation and exploratory techniques. He is well versed in several programming languages, including R and Python. Edwin is the author of several R packages and a book on using RStudio. He is currently writing a book on data cleaning with applications in R.


Government/Open Data
Location: 212
Alex Priem (Statistics Netherlands), Edwin De Jonge (Statistics Netherlands)
Histograms and heatmaps are often used to summarize large data sets. We provide guidelines for using them effectively and efficiently. We illustrate these guidelines using the complete Dutch income tax data, looking at the distributions of wealth and income. Analysis of this data set is complicated by the large number of variables. We use clustering techniques to automatically find relevant patterns.