Towards generating web-accessible STEM documents from PDF

  • Volker Sorge
  • Akashdeep Bansal
  • Neha M. Jadhav
  • Himanshu Garg
  • Ayushi Verma
  • M. Balakrishnan

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

PDF is still a very popular format that is widely used to exchange and archive electronic documents. And although considerable efforts have been made to ensure accessibility of PDF documents, they are still far from ideal when complex content like formulas, diagrams or tables is present. Unfortunately, many publications in scientific subjects are available in PDF format only and are therefore, if at all, only partially accessible. In this paper, we present a fully automated web-based technology to convert PDF documents into an accessible single file format. We concentrate on presenting working solutions for mathematical formulas and tables while also discussing some of the open problems in this context and how we aim to solve them in the future.

Original language: English
Title of host publication: Proceedings of the 17th International Web for All Conference, W4A 2020
Publisher: Association for Computing Machinery
ISBN (Electronic): 9781450370561
Publication status: Published - 20 Apr 2020
Event: 17th International Web for All Conference, W4A 2020 - Taipei, Taiwan, Province of China
Duration: 20 Apr 2020 to 21 Apr 2020

Publication series

Name: Proceedings of the 17th International Web for All Conference, W4A 2020

Conference

Conference: 17th International Web for All Conference, W4A 2020
Country/Territory: Taiwan, Province of China
City: Taipei
Period: 20/04/20 to 21/04/20

Bibliographical note

Publisher Copyright:
© 2020 ACM.

Keywords

  • PDF
  • STEM accessibility
  • web

ASJC Scopus subject areas

  • Computer Networks and Communications
