Automatic detection and classification of coastal Mediterranean fish from underwater images: good practices for robust training

Catalán, I.A.; Álvarez-Ellacuría, A.; Lisani, J.L.; Sánchez, J.; Vizoso, G.; Heinrichs-Maquilón, E.; Hinz, H.; Alós, J.; Signarioli, M.; Aguzzi, J.; Francescangeli, M.; Palmer, M.

Home
→
Producció científica
→
Articles
→
Multidisciplinar
→
View Item

dc.contributor.author	Catalán, I.A.
dc.contributor.author	Álvarez-Ellacuría, A.
dc.contributor.author	Lisani, J.L.
dc.contributor.author	Sánchez, J.
dc.contributor.author	Vizoso, G.
dc.contributor.author	Heinrichs-Maquilón, E.
dc.contributor.author	Hinz, H.
dc.contributor.author	Alós, J.
dc.contributor.author	Signarioli, M.
dc.contributor.author	Aguzzi, J.
dc.contributor.author	Francescangeli, M.
dc.contributor.author	Palmer, M.
dc.date.accessioned	2023-10-05T07:32:54Z
dc.date.available	2023-10-05T07:32:54Z
dc.identifier.uri	http://hdl.handle.net/11201/161924
dc.description.abstract	[eng] Further investigation is needed to improve the identification and classification of fish in underwater images using artificial intelligence, specifically deep learning. Questions that need to be explored include the importance of using diverse backgrounds, the effect of (not) labeling small fish on precision, the number of images needed for successful classification, and whether they should be randomly selected. To address these questions, a new labeled dataset was created with over 18,400 recorded Mediterranean fish from 20 species from over 1,600 underwater images with different backgrounds. Two state-of-the-art object detectors/classifiers, YOLOv5m and Faster RCNN, were compared for the detection of the 'fish' category in different datasets. YOLOv5m performed better and was thus selected for classifying an increasing number of species in six combinations of labeled datasets varying in background types, balanced or unbalanced number of fishes per background, number of labeled fish, and quality of labeling. Results showed that i) it is cost-efficient to work with a reduced labeled set (a few hundred labeled objects per category) if images are carefully selected, ii) the usefulness of the trained model for classifying unseen datasets improves with the use of different backgrounds in the training dataset, and iii) avoiding training with low-quality labels (e.g., small relative size or incomplete silhouettes) yields better classification metrics. These results and dataset will help select and label images in the most effective way to improve the use of deep learning in studying underwater organisms.
dc.format	application/pdf
dc.relation.isformatof	https://doi.org/10.3389/fmars.2023.1151758
dc.relation.ispartof	Frontiers In Marine Science, 2023, vol. 10, p. 1-11
dc.rights	, 2023
dc.subject.classification	51 - Matemàtiques
dc.subject.classification	004 - Informàtica
dc.subject.classification	5 - Ciències pures i naturals
dc.subject.other	51 - Mathematics
dc.subject.other	004 - Computer Science and Technology. Computing. Data processing
dc.subject.other	5 - Mathematical and Natural Sciences
dc.title	Automatic detection and classification of coastal Mediterranean fish from underwater images: good practices for robust training
dc.type	info:eu-repo/semantics/article
dc.date.updated	2023-10-05T07:32:54Z
dc.rights.accessRights	info:eu-repo/semantics/openAccess
dc.identifier.doi	https://doi.org/10.3389/fmars.2023.1151758