Here you can find all computer vision-related projects in Orcasound! Ongoing projects:
- Real-Time Automated Vessel Detection System Using Side View Images
Author: Ze Cui, Samantha King, Scott Veirs, Val Veirs
Here is an intro video to help you quickly go through the project!
We propose to create an open object detection model for real-time marine vessel monitoring. This project will involve two phases and associated deliverables. First, we will collaborate with Protected Seas and Beam Reach to build an 11-class side-view vessel data set using the Roboflow annotation app. Then, using this data set, we will develop a vessel detection model using the YOLO algorithm.
Create an open-access side-view vessel image dataset using images from the M2 system located in the Orcasound Lab (here are some images of M2 at Orcasound Lab)and publish it under a Creative Commons license. We envision a dataset of 10,000 samples governed by CC BY-SA 4.0 license. Easy access and discovery of this training set will be accomplished by serving it as part of Orcasound’s open data registry within Amazon’s open data sponsored S3 bucket called the “Acoustic Sandbox”(for free under AWS open data sponsorship through 2024). The marine vessels are classified into 11 classes (as of 08/21/2023), including:
- non-commercial small
- non-commercial medium
- non-commercial large
- non-commercial sailing
- commercial small
- commercial large fishing
- commercial large passenger
- commercial cargo
- commercial tug
- other
- unknown
We manually labeled a portion of the images, and during the labeling process we found that the amount of images was huge, but the quality varied. Some samples do not contain ships or are very blurry. On one hand, we would like the dataset to be of high quality: the outlines and colors of the vessels can be clearly seen, and the bounding box with a minimum dimension of at least 50 pixels. On the other hand, we also wanted to include the most information about the actual scene, so we added three additional qualifiers to further describe each vessel class: distant/ blurry/backlit. The three most common causes of poor samples in the actual scene were listed as:
- Distant. The vessel is too small or too far away. Result in a bounding box with a dimension smaller than 50 pixels.
- Blurry. Blurring of the vessel's outline.
- Backlit. Overexposure causes loss of the ship's color information.
Using the qualifier, we categorized and incorporated some of the otherwise problematic samples into the dataset, which helped to increase the diversity of the dataset as well as balance the classes. But despite this, we still face the problem of an unbalanced dataset. For this reason, we intend to quickly filter the raw data of m2 and extract as many minority-class images as possible. We next intend to use the currently available dataset to train a model for detecting the presence of vessels in the images, which will be used to filter the raw data in m2, reducing the time and effort of manual annotation.
Explicit definitions of the classification of vessels and usage of qualifiers can be found through Vessel Classification Dictionary.
-
added on Jul/13/23: vesselDetection_071023
note: This dataset includes data from July and October 2022. Includes 2641 vessel images across all classes.All datasets are located in Orcasound's S3 bucket:
s3://visual-sandbox/orca-eye-aye/data/
Open data archive could be accessed via e.g. --aws --no-sign-request s3 sync s3://visual-sandbox/orca-eye-aye/ .
Create two automated real-time vessel object detection systems. One is for detecting the existence of vessels in the image and is also used as a filter for better-quality data, and the other is for vessel classification.
For detecting the existence of vessels, the quality of the training data is not required to be very high, but the outlines still need to be discernible. Since there is no open access dataset for vessels, our project extends to task 2: compile a high-quality dataset for vessels ( looking for a license for the dataset). This dataset is a compilation and classification of the accumulated data from the M2 system and will be crucial for the future training of deep learning models for vessel detection and classification.
Produce an open object detection model for real-time marine vessel monitoring. Retrain Yolo v5. First, we will try to transfer learning to the pre-trained Yolo model. The code used to process the data sets from the Roboflow annotation app, train the model, assess its performance, and possibly retrain the model will be shared via a public repository within the Orcasound Github organization under this particular permissive open-source software license: MIT. Publish the model along with the performance assessment. The model will be stored alongside the training data within the Acoustic Sandbox bucket or could be published online in another forum (e.g. Kaggle or Zenodo?). The model will be shared under OpenRAIL-M and the source code will be shared under OpenRAIL-S.
Marine Monitor (M2) is a shore-based, multi-sensor platform that integrates X-band marine radar, optical cameras, and other sensors with custom software to autonomously track and report on vessel activity in nearshore areas. By using radar, vessels of all types are tracked by the system, including smaller boats that are typically not required to participate in common tracking systems. M2 also receives and documents vessel information from the Automatic Identification System (AIS) which is primarily used by larger commercial vessels. The camera is dynamically directed to vessel locations, provided by both radar and AIS, throughout its transit, so that images are captured while vessels are within range of the M2 system. M2 was designed by ProtectedSeas as a tool for marine managers to more effectively monitor and document human activities in sensitive marine areas.
I would like to extend my deepest gratitude to Scott Veris, Samantha King, and Val Veris for their invaluable mentorship throughout this project. Their insights and expertise have been instrumental in shaping both the direction and outcomes of this work. I extend my appreciation to them for defining the true spirit of dedication in academic research and practice.
methodology: process of generating data, training adding license