Setup

Getting ready

You need to install OpenRefine and download a data file to follow this lesson.

Installing and running OpenRefine

OpenRefine is a free, open-source Java application. You can download OpenRefine from http://openrefine.org/download.html. This lesson has been tested with all versions of OpenRefine up to the latest tested version, 3.5.2

Packages are available on https://openrefine.org/download.html for Windows, MacOS, and Linux. Please download the latest stable version, choosing the “kit” for your operating system. Current versions of the “Windows kit with embedded Java” and “Mac kit” include everything you need to run OpenRefine. The “Linux kit” and traditional “Windows kit” require a “Java Runtime Environment” (JRE) installed on your system (see notes below).

If you are using an older version of OpenRefine, it is recommended you upgrade to the latest tested version.

Please follow the installation instructions in the OpenRefine User Manual: Installation Instructions

Notes:

Downloading the datasets

For this workshop we will be using two datasets. You should download both csv files DOAJ_big and DOAJ_small and make sure to have them available on your Desktop or in a directory you can easily locate on your computer.

Exiting OpenRefine

To exit OpenRefine, close all the browser tabs or windows, then navigate to the command line window. To close this window and ensure OpenRefine exits properly, hold down [control] and press [c] on your keyboard. This will save all changes to your projects.

Getting help

If you encounter problems installing or running OpenRefine, a good source of support is the OpenRefine mailing list and user forum. Include your operating system when searching to find the most relevant answers for your issue, such as threads related to Windows, macOS, or Linux.

You may also want to check the Stack Overflow OpenRefine tag or the OpenRefine Gitter room.

There are also general and specialist tutorials about using OpenRefine available on the web, including: