On the impact of Citizen Science-derived data quality on deep learning based classification in marine images.

View/ Open
Average rating
votes
Date
2019Author
Langenkamper, Daniel
Simon-Lledo, Erik
Hosking, Brett
Jones, Daniel O. B.
Nattkemper, Tim W.
Metadata
Show full item recordAbstract
The evaluation of large amounts of digital image data is of growing importance for biology,
including for the exploration and monitoring of marine habitats. However, only a tiny percentage
of the image data collected is evaluated by marine biologists who manually interpret
and annotate the image contents, which can be slow and laborious. In order to overcome
the bottleneck in image annotation, two strategies are increasingly proposed: “citizen science”
and “machine learning”. In this study, we investigated how the combination of citizen
science, to detect objects, and machine learning, to classify megafauna, could be used to
automate annotation of underwater images. For this purpose, multiple large data sets of
citizen science annotations with different degrees of common errors and inaccuracies
observed in citizen science data were simulated by modifying “gold standard” annotations
done by an experienced marine biologist. The parameters of the simulation were determined
on t.....
Resource URL
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0218086https://doi.org/10.5281/zenodo.3236009.
Journal
PLoS ONEVolume
14Issue
6, Article e0218086Page Range
16pp.Document Language
enSustainable Development Goals (SDG)
14.aEssential Ocean Variables (EOV)
N/ADOI Original
https://doi. org/10.1371/journal.pone.0218086Citation
Langenkamper, D., Simon-Lledo, E.,Hosking, B., Jones, D.O.B. and Nattkemper, T.W. (2019) On the impact of Citizen Science-derived data quality on deep learning based classification in marine images. PLoS ONE 14(6): e021808, 16pp. DOI: https://doi. org/10.1371/journal.pone.0218086Collections
The following license files are associated with this item: