David Lillis: A Decade of Legal Argumentation Mining: Datasets and Approaches

A Decade of Legal Argumentation Mining: Datasets and Approaches

Gechuan Zhang, Paul Nulty and David Lillis

In Proceedings of the 27th International Conference on Natural Language & Information Systems (NLDB 2022), Valencia, Spain, June 2022.

Abstract

The growing research field of argumentation mining (AM) in the past ten years has made it a popular topic in Natural Language Processing. However, there are still limited studies focusing on AM in the context of legal text (Legal AM), despite the fact that legal text analysis more generally has received much attention as an interdisciplinary field of traditional humanities and data science. The goal of this work is to provide a critical data-driven analysis of the current situation in Legal AM. After outlining the background of this topic, we explore the availability of annotated datasets and the mechanisms by which these are created. This includes a discussion of how arguments and their relationships can be modelled, as well as a number of different approaches to divide the overall Legal AM task into constituent sub-tasks. Finally we review the dominant approaches that have been applied to this task in the past decade, and outline some future directions for Legal AM research.