Skip to main navigation Skip to search Skip to main content

Normalization of fossil plant megafossil databases for diversity and palaeobiogeography analyses by filtering taxonomic duplication: Principles, methods, examples and recommendations

  • Zhen Xu*
  • , Emma Bateson
  • , Christopher Cleal
  • , Reece Hutton
  • , Jianxin Yu
  • , Shi-Jun Wang
  • , Andrew Knoll
  • , Benjamin J.W. Mills
  • , Jason Hilton
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

46 Downloads (Pure)

Abstract

Fossil plants are key to many palaeobiogeography and deep time diversity studies but correctly interpreting them can be fraught with problems due to fragmentation in the fossil record. A typical vascular plant comprises 10–12 separate organs depending on its systematic affinity, but finding them complete is exceptionally rare. Fragmentation can result from multiple processes including ontogeny during the plant’s life-cycle, or from post-mortem taphonomic processes in fluvial systems. In traditional approaches where raw data is amassed directly in the field, from existing physical collections or electronic databases, duplication is inevitable in that different organs of the same plant species may be inadvertently counted independently, skewing results. Here we outlined methods for filtering palaeobotanical data using a technique called normalization to remove taxonomic duplications, with examples are provided from different types of preservation. We use two case studies to highlight normalization and demonstrate the impacts of analysing unfiltered (raw) versus filtered (normalized) data. The first case study evaluates plant data from the late Permian and Triassic compression/impression floras of South China, focussing on species richness/diversity assessments through the Permian-Triassic mass extinction and its recovery. In this case study, normalization reduced the number of taxa but revealed more detailed evolutionary patterns and the magnitude of the floristic turnover, previously obscured by the fragmental preservation typical of plant fossils and nomenclature. The second case study evaluates Carboniferous to Permian anatomically preserved coal-ball floras from Europe, North America and China, focussing on palaeobiogeography and floral provinciality. Normalization reduced the number of coal-ball assemblages when surveyed at both genus and species level but revealed differences in relationships and floristic endemism. We conclude that normalized results should be considered alongside raw data, as they show important and complimentary information which can greatly aid in overall interpretation.
Original languageEnglish
Article number113236
Number of pages12
JournalPalaeogeography Palaeoclimatology Palaeoecology
Volume678
Early online date26 Aug 2025
DOIs
Publication statusPublished - 15 Nov 2025

Bibliographical note

Undergraduate research publication

Fingerprint

Dive into the research topics of 'Normalization of fossil plant megafossil databases for diversity and palaeobiogeography analyses by filtering taxonomic duplication: Principles, methods, examples and recommendations'. Together they form a unique fingerprint.

Cite this