ANTONIO: Towards a Systematic Method for Generating NLP Benchmarks for Verification

  • Marco Casadio
  • , Luca Arnaboldi
  • , Matthew L. Daggitt
  • , Omri Isac
  • , Tanvi Dinkar
  • , Daniel Kienitz
  • , Verena Rieser
  • , Ekaterina Komendantskaya

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Verification of machine learning models used in Natural Language Processing (NLP) is known to be a hard problem. In particular, many known neural network verification methods that work for computer vision and other numeric datasets do not work for NLP. Here, we study technical reasons that underlie this problem. Based on this analysis, we propose practical methods and heuristics for preparing NLP datasets and models in a way that renders them amenable to state-of-the-art verification methods. We implement these methods as a Python library called ANTONIO that links to the neural network verifiers ERAN and Marabou. We perform evaluation of the tool using an NLP dataset R-U-A-Robot suggested as a benchmark for verifying legally critical NLP applications. We hope that, thanks to its general applicability, this work will open novel possibilities for including NLP verification problems into neural network verification competitions, and will popularise NLP problems within this community.
Original languageEnglish
Title of host publicationProceedings of the 6th Workshop on Formal Methods for ML Enabled Autonomous Systems
EditorsNina Narodytska, Guy Amir, Guy Katz, Omri Isac
PublisherEasyChair Publications
Pages59-70
Number of pages12
DOIs
Publication statusPublished - 23 Oct 2023
Event6th Workshop on Formal Methods for ML-Enabled Autonomous Systems - Paris, France
Duration: 17 Jul 202318 Jul 2023

Publication series

NameKalpa Publications in Computing
PublisherEasyChair Publications
Volume16
ISSN (Electronic)2515-1762

Conference

Conference6th Workshop on Formal Methods for ML-Enabled Autonomous Systems
Abbreviated titleFoMLAS 2023
Country/TerritoryFrance
CityParis
Period17/07/2318/07/23

Keywords

  • abstract interpretation
  • adversarial training
  • neural network verification
  • nlp

Fingerprint

Dive into the research topics of 'ANTONIO: Towards a Systematic Method for Generating NLP Benchmarks for Verification'. Together they form a unique fingerprint.

Cite this