How to clean up your messy data?

Event date:

3 June 2025

Location: DANS Offices, Anna van Saksenlaan 51, 2593 HW The Hague

OpenRefine is a free desktop application that has been described as 'a power tool for working with messy data'.  OpenRefine is most useful for cleaning and standardising data in a simple tabular format such as a spreadsheet. Therefore, it can be especially useful to researchers in the social sciences and humanities, who are increasingly sharing their research materials and are concerned about others making sense of them.

You can use OpenRefine to learn more about your data, transform it into different formats, and enhance or correct it. Specific functions include, for example, removing duplicate records, standardising date formats, or finding different spellings of the same name and replacing them with a single consistent name.
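OpenRefine does all of this through its interface, with no coding required. For readers curious about the idea behind finding different spellings of the same name, here is a minimal sketch in Python. It is loosely modelled on key-based clustering and is not OpenRefine's own implementation; the function names and the sample names are hypothetical, chosen only for illustration.

```python
import string
from collections import defaultdict


def fingerprint(value: str) -> str:
    """Build a simplified key: lowercase, drop punctuation, sort unique words."""
    cleaned = value.strip().lower()
    cleaned = cleaned.translate(str.maketrans("", "", string.punctuation))
    tokens = sorted(set(cleaned.split()))
    return " ".join(tokens)


def cluster(values):
    """Group values that share a key; keep only groups with more than one spelling."""
    groups = defaultdict(set)
    for value in values:
        groups[fingerprint(value)].add(value)
    return [sorted(group) for group in groups.values() if len(group) > 1]


# Hypothetical sample data: three spellings of one name, plus an unrelated name.
names = ["Smith, Anna", "anna smith", "Anna  Smith.", "Jan de Vries"]
clusters = cluster(names)
print(clusters)
```

Each group returned is a set of spellings that could then be replaced with a single consistent name. A real clustering tool handles more cases (accents, near-misses, typos) than this sketch does.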

One wonderful feature of OpenRefine is that it always uses a copy of your data – so no accidentally modifying or writing over your data! It works completely offline, keeping your data private on your own computer until (where relevant) you decide to export and share it!

In this in-person Data Carpentry training session, OpenRefine for Social Science Data, on 3 June from 10.00 to 12.30, you will learn how to import data into OpenRefine and how to use the program to interrogate, clean and organise it. We will use a sample dataset to show you how you can speed up repetitive tasks by replaying previous actions on multiple datasets. We will also show you how to reverse or undo actions on your data. Finally, we will introduce you to using OpenRefine for importing, filtering, clustering, transforming, and exporting data. These concepts are complex, and you may not have encountered them before, but by the end of the workshop you will have practical experience, gained from working on a range of examples and exercises.

This workshop will cover:

The basics of how to work with OpenRefine, beginning with how to import data into it
An introduction to using the functions of the program to interrogate, clean and organise the data
Exporting and saving the data once you have cleaned it
How to find out more and build upon your knowledge

Questions can be sent to the DANS Training Coordinator and lead instructor for this workshop, Dr Deborah Thorpe. The session will be co-instructed by DANS Research Data Management Specialist Dr Kim Ferguson.

Coffee and tea will be provided, as well as lunch at the end of the workshop.

What you will need to know

The workshop assumes no prior knowledge of the skills or tools.  

The language of instruction will be English. 

What you will need to bring

Data Carpentry’s teaching is hands-on, so you will need to bring your own laptop with OpenRefine pre-installed, as per the instructions on the OpenRefine for Social Science Summary and Setup page.

We will be working with sample data. Download the data file linked from the above page to a place that is easily findable on your computer. 

Registration 

The workshop is full, but please fill out the form to join the waiting list.

Your data is only processed in the context of this event. More information can be found in our Privacy Statement. 


Do you have questions about this event?

Staff member

Deborah Thorpe Ph.D.

Research Data Management Specialist
Staff member

Kim Ferguson Ph.D.

Research Data Management Specialist