Deep learning and computer vision algorithms for detection and classification of bearded seal vocalizations in the Arctic Ocean

Author(s)Escobar-Amado, Christian David
Date Accessioned2022-10-31T11:48:37Z
Date Available2022-10-31T11:48:37Z
Publication Date2022
SWORD Update2022-08-10T19:09:46Z
AbstractYear-round recordings of bearded seal calls were collected in the northeastern edge of the Chukchi Continental Slope (Alaska) in 2016-2017, 2018-2019, and 2019-2020. While the underwater vocalizations of bearded seals are often analyzed manually or using automatic detections manually validated, in this work, two detection and classification systems (DCS) based on deep learning techniques are proposed. ☐ The first system is divided in two sections. First, regions of interest (ROI) containing possible bearded seal vocalizations are found by the spectrogram 2D normalized cross-correlation of the measured signal and a representative template of each of two main calls of interest. Second, convolutional neural networks (CNN) are used to validate and classify the ROIs among several possible classes. The CNNs are trained on 80% of the ROIs manually labeled from one of the recorders. When validating on the remaining 20%, the CNNs show an accuracy above 95.5%. To assess the generalization performance of the networks, the CNNs are tested on the remaining recorders, located at different positions and deployed at different years, with a precision above 89.2% for the main class of the two types of calls. ☐ The second proposed DCS is based on the You Only Look Once (YOLO) algorithm on its latest version, YOLOV5 where the network learns how to detect and classify bearded seal vocalizations by using the principle of computer vision for object detection in images where bounding boxes enclose the object of interest. With this method the detection and classification are carried out by the deep learning models without the need for knowing specifics of the signal, meaning no ROIs or masks are needed. Another advantage of using YOLOV5 over other typical DCS is that the predicted bounding boxes have embedded statistical information about the vocalization such as the duration, bandwidth, and center frequency of the signals. In the generalization stage, YOLOV5 achieved an accuracy of 93.87% with a precision and recall above 94.9% and 90.6%, respectively, for the eight proposed classes. Furthermore, an analysis of the vocal behavior of the bearded seals showed that there exists a geographical dependence where this species prefers shallower water depths in the Chukchi Continental Slope.en_US
AdvisorBadiey, Mohsen
AdvisorWan, Lin
DegreeM.S.
DepartmentUniversity of Delaware, Department of Electrical and Computer Engineering
Unique Identifier1349338879
URLhttps://udspace.udel.edu/handle/19716/31545
Languageen
PublisherUniversity of Delawareen_US
URIhttps://login.udel.idm.oclc.org/login?url=https://www.proquest.com/dissertations-theses/deep-learning-computer-vision-algorithms/docview/2700742872/se-2?accountid=10457
KeywordsBearded sealsen_US
KeywordsComputer visionen_US
KeywordsDeep learningen_US
KeywordsMarine mammalsen_US
KeywordsOcean acousticsen_US
KeywordsYOLOen_US
TitleDeep learning and computer vision algorithms for detection and classification of bearded seal vocalizations in the Arctic Oceanen_US
TypeThesisen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
EscobarAmado_udel_0060M_14996.pdf
Size:
30.69 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.22 KB
Format:
Item-specific license agreed upon to submission
Description: