This section aims to familiarise readers with Article 3 and Article 4 of the DSM Directive, which introduce two mandatory exceptions and limitations to copyright for text and data mining. The section was authored by Benjamin White and Maja Bogataj Jančič. It represents the views of LIBER and Communia on the implementation of those provisions.

For a summary of this guide please see the TL;DR version of this guide: ****

TL;DR: Articles 3-4: Text and data mining

Articles 3-4: Table of Contents

What is at issue in Articles 3 and 4?

🌰 In a nutshell

Article 3 (Text and Data Mining for the Purposes of Scientific Research) requires member states to introduce a mandatory exception to copyright and related rights (including sui generis database rights) in their national laws for the purposes of data analytics. This new exception gives researchers who have legal access to the open web as well as the collections of universities, libraries, archives and other cultural heritage organisations across the EU, the freedom to undertake data analytics without requiring permission from rightsholders. Member states are encouraged to meet with research organisations, cultural heritage organisations and rightsholders to discuss appropriate security measures relating to the exercise of the exception. The right to enjoy this new exception cannot be removed by either contract or by technical protection measures.
Article 4 (Exception or Limitation for Text and Data Mining) requires member states to introduce a mandatory limitation or exception to copyright and related rights (including sui generis database rights) in their national laws for the purposes of data analytics for anyone who wishes to mine materials subject to copyright and related rights (including software as well as sui generis database rights). Rightsholders however can prevent data mining under this exception if they so choose.

The main difference between the two exceptions is that the research exception (Article 3) allows researchers affiliated to public interest organisations to keep a copy of the information they mined, and this cannot be prevented by contract or technical protection measures. The second exception (Article 4), which can be enjoyed by anyone at all, only allows data analytics to be performed on content that rights holders permit mining to be undertaken on.

These new mandatory exceptions have the potential to support Big Data and Artificial Intelligence (AI) in Europe. However a poor implementation in national law could severely hamper the ability of beneficiaries of the exceptions to undertake text and data mining.

Some context

What is text and data mining? Article 2(2) of the DSM Directive defines text and data mining (TDM) as “any automated analytical technique aimed at analysing text and data in digital form in order to generate information which includes but is not limited to patterns, trends and correlations.”

Big Data is increasingly ubiquitous, and is used by many different players from large companies, individuals through to the research sector. The core of Big Data, which is also one of the fundamental facets of Artificial Intelligence (AI), is the ability for computers to analyse and extract information from structured and unstructured datasets in response to a particular algorithm based instruction or query. This is often referred to as “text and data mining” in a legal context, or more widely “data analytics”.

According to the European Commission funded Future TDM report “Trend Analysis, Future Applications and the Economics of TDM”, European based data analytics can be expected to grow to a value of $10.3 billion by 2021. Given the ubiquity of data mining, particularly in the US (which shows significantly higher exploitation levels of data that in Europe), for reasons of international competitiveness the European legislature decided in 2016 to introduce new copyright limitations and exceptions to allow EU-based organisations to benefit lawfully from this new technology.

Initially in 2012 the European Commission intended to introduce a licence-based solution for text and data mining under an initiative known as “Licences for Europe”. However upon further investigation, the view of the Commission changed as it realised that licensing would encumber the growth of Big Data in Europe. In 2016 to support innovation, economic growth and research competitiveness the Commission proposed a limitation and exception for individuals and organisations who wish to undertake text and data mining for the purposes of research.

🛠 Breaking down Articles 3 and 4

Let’s have a look at the Articles in detail:

What is at issue in Articles 3 and 4?

🌰 In a nutshell

Some context

🛠 Breaking down Articles 3 and 4

What is the aim of these exceptions?