The European Union case law corpus (EUCLCORP): a multilingual parallel and comparative corpus of EU court judgments

Aleksandar Trklja; Karen McAuliffe

The European Union case law corpus (EUCLCORP): a multilingual parallel and comparative corpus of EU court judgments

Birmingham Law School

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

The empirical approach to the study of legal language has recently undergone profound development. Corpus linguistics study has, in particular, revealed previously unnoticed features of the legal language at both the lexico grammatical and discourse level. Existing resources such as legal databases, however, do not contain functionalities that enable the application of corpus linguistics methodology. To address this gap in the context of EU law we developed a multilingual corpus of judgments that allows scholars and practitioners to investigate in a systematic way a range of issues such as the history of the meaning(s) of legal term, the migration of terms between legal systems, the use of binominals or the distribution of formulaic expressions in EU legal sub-languages. As well as being the first multilingual corpus of judgments it is also the largest legal multilingual corpus ever created. Since it contains case law from two sources (the Court of Justice of the European Union and EU national courts) it is also the largest comparable corpus of legal texts. The aim of the corpus is to contribute to the further development of the emerging field of language and law.

Original language	English
Title of host publication	Proceedings of the Second Workshop on Corpus-Based Research in the Humanities
Subtitle of host publication	CRH-2
Editors	Andrew U Frank, Christine Ivanovic, Francesco Mambrini, Marco Passarotti, Caroline Sporleder
Publisher	Gerastree Proceedings
Pages	217-226
Volume	1
ISBN (Print)	9783901716430
Publication status	Published - 25 Jan 2018
Event	Second Workshop on Corpus-Based Research in the Humanities: CRH-2 - Austrian Academy of Sciences, Vienna, Austria Duration: 25 Jan 2018 → 26 Jan 2018 https://www.oeaw.ac.at/ac/crh2/

Workshop

Workshop	Second Workshop on Corpus-Based Research in the Humanities
Country/Territory	Austria
City	Vienna
Period	25/01/18 → 26/01/18
Internet address	https://www.oeaw.ac.at/ac/crh2/

Access to Document

https://www.oeaw.ac.at/fileadmin/subsites/academiaecorpora/PDF/CRH2.pdfLicence: None: All rights reserved

H2020_ERC-POC_EUCL-CORP
McAuliffe, K.
European Commission
1/07/16 → 31/12/17
Project: EU

The EU Case Law Corpus
McAuliffe, K. (Creator), University of Birmingham, 1 Sept 2020
http://www.euclcorp.bham.ac.uk/#!/
Dataset

Cite this

Trklja, A., & McAuliffe, K. (2018). The European Union case law corpus (EUCLCORP): a multilingual parallel and comparative corpus of EU court judgments. In A. U. Frank, C. Ivanovic, F. Mambrini, M. Passarotti, & C. Sporleder (Eds.), Proceedings of the Second Workshop on Corpus-Based Research in the Humanities: CRH-2 (Vol. 1, pp. 217-226). Gerastree Proceedings. https://www.oeaw.ac.at/fileadmin/subsites/academiaecorpora/PDF/CRH2.pdf

Trklja, Aleksandar ; McAuliffe, Karen. / The European Union case law corpus (EUCLCORP) : a multilingual parallel and comparative corpus of EU court judgments. Proceedings of the Second Workshop on Corpus-Based Research in the Humanities: CRH-2. editor / Andrew U Frank ; Christine Ivanovic ; Francesco Mambrini ; Marco Passarotti ; Caroline Sporleder. Vol. 1 Gerastree Proceedings, 2018. pp. 217-226

@inproceedings{c86677d6828241d2b724edb4f560b5dd,

title = "The European Union case law corpus (EUCLCORP): a multilingual parallel and comparative corpus of EU court judgments",

abstract = "The empirical approach to the study of legal language has recently undergone profound development. Corpus linguistics study has, in particular, revealed previously unnoticed features of the legal language at both the lexico grammatical and discourse level. Existing resources such as legal databases, however, do not contain functionalities that enable the application of corpus linguistics methodology. To address this gap in the context of EU law we developed a multilingual corpus of judgments that allows scholars and practitioners to investigate in a systematic way a range of issues such as the history of the meaning(s) of legal term, the migration of terms between legal systems, the use of binominals or the distribution of formulaic expressions in EU legal sub-languages. As well as being the first multilingual corpus of judgments it is also the largest legal multilingual corpus ever created. Since it contains case law from two sources (the Court of Justice of the European Union and EU national courts) it is also the largest comparable corpus of legal texts. The aim of the corpus is to contribute to the further development of the emerging field of language and law.",

author = "Aleksandar Trklja and Karen McAuliffe",

year = "2018",

month = jan,

day = "25",

language = "English",

isbn = "9783901716430",

volume = "1",

pages = "217--226",

editor = "Frank, {Andrew U} and Christine Ivanovic and Mambrini, {Francesco } and Marco Passarotti and Caroline Sporleder",

booktitle = "Proceedings of the Second Workshop on Corpus-Based Research in the Humanities",

publisher = "Gerastree Proceedings",

address = "Austria",

note = "Second Workshop on Corpus-Based Research in the Humanities : CRH-2 ; Conference date: 25-01-2018 Through 26-01-2018",

url = "https://www.oeaw.ac.at/ac/crh2/",

}

Trklja, A & McAuliffe, K 2018, The European Union case law corpus (EUCLCORP): a multilingual parallel and comparative corpus of EU court judgments. in AU Frank, C Ivanovic, F Mambrini, M Passarotti & C Sporleder (eds), Proceedings of the Second Workshop on Corpus-Based Research in the Humanities: CRH-2. vol. 1, Gerastree Proceedings, pp. 217-226, Second Workshop on Corpus-Based Research in the Humanities, Vienna, Austria, 25/01/18. <https://www.oeaw.ac.at/fileadmin/subsites/academiaecorpora/PDF/CRH2.pdf>

The European Union case law corpus (EUCLCORP): a multilingual parallel and comparative corpus of EU court judgments. / Trklja, Aleksandar; McAuliffe, Karen.
Proceedings of the Second Workshop on Corpus-Based Research in the Humanities: CRH-2. ed. / Andrew U Frank; Christine Ivanovic; Francesco Mambrini; Marco Passarotti; Caroline Sporleder. Vol. 1 Gerastree Proceedings, 2018. p. 217-226.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

TY - GEN

T1 - The European Union case law corpus (EUCLCORP)

T2 - Second Workshop on Corpus-Based Research in the Humanities

AU - Trklja, Aleksandar

AU - McAuliffe, Karen

PY - 2018/1/25

Y1 - 2018/1/25

N2 - The empirical approach to the study of legal language has recently undergone profound development. Corpus linguistics study has, in particular, revealed previously unnoticed features of the legal language at both the lexico grammatical and discourse level. Existing resources such as legal databases, however, do not contain functionalities that enable the application of corpus linguistics methodology. To address this gap in the context of EU law we developed a multilingual corpus of judgments that allows scholars and practitioners to investigate in a systematic way a range of issues such as the history of the meaning(s) of legal term, the migration of terms between legal systems, the use of binominals or the distribution of formulaic expressions in EU legal sub-languages. As well as being the first multilingual corpus of judgments it is also the largest legal multilingual corpus ever created. Since it contains case law from two sources (the Court of Justice of the European Union and EU national courts) it is also the largest comparable corpus of legal texts. The aim of the corpus is to contribute to the further development of the emerging field of language and law.

AB - The empirical approach to the study of legal language has recently undergone profound development. Corpus linguistics study has, in particular, revealed previously unnoticed features of the legal language at both the lexico grammatical and discourse level. Existing resources such as legal databases, however, do not contain functionalities that enable the application of corpus linguistics methodology. To address this gap in the context of EU law we developed a multilingual corpus of judgments that allows scholars and practitioners to investigate in a systematic way a range of issues such as the history of the meaning(s) of legal term, the migration of terms between legal systems, the use of binominals or the distribution of formulaic expressions in EU legal sub-languages. As well as being the first multilingual corpus of judgments it is also the largest legal multilingual corpus ever created. Since it contains case law from two sources (the Court of Justice of the European Union and EU national courts) it is also the largest comparable corpus of legal texts. The aim of the corpus is to contribute to the further development of the emerging field of language and law.

UR - https://www.oeaw.ac.at/fileadmin/subsites/academiaecorpora/PDF/CRH2.pdf

M3 - Conference contribution

SN - 9783901716430

VL - 1

SP - 217

EP - 226

BT - Proceedings of the Second Workshop on Corpus-Based Research in the Humanities

A2 - Frank, Andrew U

A2 - Ivanovic, Christine

A2 - Mambrini, Francesco

A2 - Passarotti, Marco

A2 - Sporleder, Caroline

PB - Gerastree Proceedings

Y2 - 25 January 2018 through 26 January 2018

ER -

The European Union case law corpus (EUCLCORP): a multilingual parallel and comparative corpus of EU court judgments

Abstract

Workshop

Access to Document

Fingerprint

Projects

H2020_ERC-POC_EUCL-CORP

Datasets

The EU Case Law Corpus

Cite this