Spatial data quality of herbarium datasets and implications for decision-making on biodiversity conservation in Brazil

Spatial data quality of herbarium datasets and implications for decision-making on biodiversity conservation in Brazil
Barros, F.S.1, Fernandes, R.A.1, Moraes, M.A.1, Pougy, N.M.1, Caram, J.S.1, Dalcin, E.C.2 and Martinelli, G.2

1 Centro Nacional de Conservação da Flora/Jardim Botânico do Rio de Janeiro, Rua Pacheco Leão 915, 22460-030, Rio de Janeiro – Brazil.
2 Instituto de Pesquisa Jardim Botânico do Rio de Janeiro, Rua Pacheco Leão 915, 22460- 030, Rio de Janeiro – Brazil.

Abstract: The present level of biodiversity depletion and loss makes quality datasets important for biodiversity conservation. However poor data quality is still critical and limits the usefulness of these datasets. Thereby, data quality assessments are important to ensure a responsible use of those datasets. The Brazilian National Centre for Flora Conservation was created with the objective of assessing the extinction risk of plant species, enabling conservation action planning. In this context, a dataset was created after the compilation of occurrence records of threatened species. The present study aims to assess quality of the dataset and records, and to test quality improvement after data cleaning efforts. We have used the five-component scheme for assessing dataset quality. Significance of the differences between expected and observed proportions were tested using the degree of confidence between them. The Mann-Whitney test was used to compare errors between the original dataset and the cleaned out one. Results indicate poor quality, not only for dataset (p<0.10) but also for records (p<0,10). Only 54,306 records (22.30%) were considered of good quality. Logical inconsistencies in the dataset were present in 8,237 records (3.37%).

Keywords: biodiversity conservation, data cleaning, data quality, spatial accuracy.

AttachmentSize
BarrosAccuracy2012.pdf95.26 KB