Automatic detection and classification of coastal Mediterranean fish from underwater images: good practices for robust training

Show simple item record

dc.contributor.author Catalán, I.A.
dc.contributor.author Álvarez-Ellacuría, A.
dc.contributor.author Lisani, J.L.
dc.contributor.author Sánchez, J.
dc.contributor.author Vizoso, G.
dc.contributor.author Heinrichs-Maquilón, E.
dc.contributor.author Hinz, H.
dc.contributor.author Alós, J.
dc.contributor.author Signarioli, M.
dc.contributor.author Aguzzi, J.
dc.contributor.author Francescangeli, M.
dc.contributor.author Palmer, M.
dc.date.accessioned 2023-10-05T07:32:54Z
dc.date.available 2023-10-05T07:32:54Z
dc.identifier.uri http://hdl.handle.net/11201/161924
dc.description.abstract [eng] Further investigation is needed to improve the identification and classification of fish in underwater images using artificial intelligence, specifically deep learning. Questions that need to be explored include the importance of using diverse backgrounds, the effect of (not) labeling small fish on precision, the number of images needed for successful classification, and whether they should be randomly selected. To address these questions, a new labeled dataset was created with over 18,400 recorded Mediterranean fish from 20 species from over 1,600 underwater images with different backgrounds. Two state-of-the-art object detectors/classifiers, YOLOv5m and Faster RCNN, were compared for the detection of the 'fish' category in different datasets. YOLOv5m performed better and was thus selected for classifying an increasing number of species in six combinations of labeled datasets varying in background types, balanced or unbalanced number of fishes per background, number of labeled fish, and quality of labeling. Results showed that i) it is cost-efficient to work with a reduced labeled set (a few hundred labeled objects per category) if images are carefully selected, ii) the usefulness of the trained model for classifying unseen datasets improves with the use of different backgrounds in the training dataset, and iii) avoiding training with low-quality labels (e.g., small relative size or incomplete silhouettes) yields better classification metrics. These results and dataset will help select and label images in the most effective way to improve the use of deep learning in studying underwater organisms.
dc.format application/pdf
dc.relation.isformatof https://doi.org/10.3389/fmars.2023.1151758
dc.relation.ispartof Frontiers In Marine Science, 2023, vol. 10, p. 1-11
dc.rights , 2023
dc.subject.classification 51 - Matemàtiques
dc.subject.classification 004 - Informàtica
dc.subject.classification 5 - Ciències pures i naturals
dc.subject.other 51 - Mathematics
dc.subject.other 004 - Computer Science and Technology. Computing. Data processing
dc.subject.other 5 - Mathematical and Natural Sciences
dc.title Automatic detection and classification of coastal Mediterranean fish from underwater images: good practices for robust training
dc.type info:eu-repo/semantics/article
dc.date.updated 2023-10-05T07:32:54Z
dc.rights.accessRights info:eu-repo/semantics/openAccess
dc.identifier.doi https://doi.org/10.3389/fmars.2023.1151758


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account

Statistics