Improved YOLOV7-TINY Network For Sea Bream Detection
Abstract: Accurate identification of underwater fish species is of great scientific and economic significance in aquaculture, as it can provide a scientific basis for aquaculture production and promote related research. However, the underwater environment is complex and affected by various factors such as light, water quality, and mutual occlusion among fish. As a result, underwater fish images are often not clear enough, which limits the accurate identification of underwater targets. In this paper, an improved YOLOV7-TINY model for sea bream detection is proposed. We employ FasterNet to replace the backbone network of the YOLOV7-TINY model, further reducing model parameters and computational complexity without compromising accuracy. By leveraging cascaded feature fusion in the backbone network, we effectively address the challenges posed by multi-scale datasets and insufficient information extraction. Additionally, the RESNETCBAM attention mechanism is incorporated into the feature maps at three different scales, allowing the network to better capture relevant information from complex underwater environments while minimizing unnecessary interference. Finally, the ECIOU loss function is adopted to optimize bounding-box adjustment and reduce the training time of the model.
Keywords: Cascaded feature fusion; Attention mechanism; Improved YOLOV7-TINY network; ECIOU.
have adopted this idea to construct object detection algorithms. Currently, popular object detection algorithms can be categorized into two types: two-stage and one-stage. The former is based on the principle of coarse positioning followed by fine classification, first identifying candidate regions containing objects and then performing classification; examples include R-CNN, Fast-RCNN, Faster-RCNN, and Mask-RCNN, which are relatively slower in detection speed than the latter. One-stage object detection algorithms directly predict object classification and localization through convolutional neural networks, achieving a better balance between accuracy and speed. Representative algorithms in this category include SSD and the YOLO series (YOLOV3, YOLOV4, YOLOV5, YOLOV6, YOLOV7). Zhao et al. proposed the YOLO-UOD underwater detection algorithm based on Yolov4-tiny. Li et al. introduced a triplet attention mechanism into YOLOV5 to improve underwater biological feature extraction. Zhai et al. added CBAM to Yolov5s to improve recognition accuracy and efficiency and introduced a multi-scale algorithm to enhance image contrast.
Due to the complex underwater environment, simply applying the above models to sea bream recognition still poses some problems:
(1) Underwater environments are affected by factors such as lighting and water quality, and the background of the captured images also poses difficulties for detection, leading to inaccurate detection results.
(2) During feature extraction and fusion, the models may not fully extract multi-scale information from underwater fish schools.
(3) The original model has long training times and is large in size, making it inconvenient to deploy on mobile devices.
Our work aims to address the aforementioned issues.

2. Methods

2.1. FasterNet

Because the backbone network of YOLOV7-TINY contains significant redundant computation, FasterNet is adopted to reduce redundant computation and memory access, which further lightens the model without compromising the accuracy of the original baseline.
Lightweight networks such as MobileNet, ShuffleNet, and GhostNet utilize depthwise convolution (DWConv) or group convolution (GConv) to extract spatial features. However, while reducing floating-point operations (FLOPs), as indicated by Equation (1), detection latency is also influenced by the number of floating-point operations per second (FLOPS), and these operators often suffer from the side effect of increased memory access.

$\mathrm{Latency} = \dfrac{\mathrm{FLOPs}}{\mathrm{FLOPS}}$ (1)

To achieve a high number of floating-point operations per second (FLOPS) while reducing FLOPs, partial convolution (PConv) is used to decrease memory access and computational redundancy. Essentially, the FLOPs of PConv are lower than those of regular convolution, while its FLOPS are higher than those of DWConv/GConv. In other words, PConv better utilizes the computational power of the device while remaining effective at extracting spatial features.
As shown in Figure 1, PConv applies a regular convolution to extract spatial features from only some of the input channels while keeping the rest unchanged. For contiguous or regular memory access, the first or last contiguous $c_p$ channels are taken as representatives of the entire feature map for computation. Without loss of generality, it is assumed that the input and output feature maps have the same number of channels.

Figure 1. PConv structure

The FLOPs (floating-point operations) of PConv can be expressed as $h \times w \times k^2 \times c_p^2$, where $h$ and $w$ represent the height and width of the feature map, $k$ denotes the size of the convolution kernel, and $c_p$ is the number of convolved channels; that is, the $c_{in}$ of conventional convolution is replaced by $c_p$ in PConv. In practical applications, $r = c_p / c = 1/4$ is typically assumed. Consequently, the FLOPs of PConv are only 1/16 of those of conventional convolution.
The memory access of PConv can be represented as $h \times w \times 2c_p + k^2 \times c_p^2$, which is approximately $h \times w \times 2c_p$. The memory access of PConv is therefore only a quarter of that of conventional convolution, so no additional memory access is required.
In order to fully and effectively utilize information from all channels, a pointwise convolution (PWConv) is added after the PConv. The FLOPs of the decoupled PConv and PWConv can be calculated as $h \times w \times (k^2 \times c_p^2 + c \times c_p)$. Compared with regular convolutions, this reduces the computational workload. Each FasterNet module consists of one PConv layer, two PWConv layers, batch normalization, and a ReLU activation function, as shown in Figure 2.

Figure 2. FasterNet Block
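To make the mechanism concrete, the following is a minimal PyTorch sketch of a PConv layer and a FasterNet-style block as described above. It is not the authors' implementation; the split ratio r = 1/4 follows the text, while the expansion factor of the pointwise convolutions and the residual shortcut are assumptions.

```python
import torch
import torch.nn as nn

class PConv(nn.Module):
    """Partial convolution: a regular k x k convolution is applied to the first
    c_p = r * c channels only; the remaining channels pass through untouched."""
    def __init__(self, channels: int, kernel_size: int = 3, ratio: float = 0.25):
        super().__init__()
        self.cp = max(1, int(channels * ratio))   # convolved channels (r = 1/4 here)
        self.conv = nn.Conv2d(self.cp, self.cp, kernel_size,
                              padding=kernel_size // 2, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x1, x2 = x[:, :self.cp], x[:, self.cp:]   # split along the channel axis
        return torch.cat([self.conv(x1), x2], dim=1)

class FasterNetBlock(nn.Module):
    """One PConv followed by two pointwise (1x1) convolutions with batch
    normalization and ReLU, matching the module composition described in the
    text; the expansion factor and the residual shortcut are assumptions."""
    def __init__(self, channels: int, expansion: int = 2):
        super().__init__()
        hidden = channels * expansion
        self.pconv = PConv(channels)
        self.pwconv = nn.Sequential(
            nn.Conv2d(channels, hidden, 1, bias=False),
            nn.BatchNorm2d(hidden),
            nn.ReLU(inplace=True),
            nn.Conv2d(hidden, channels, 1, bias=False),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.pwconv(self.pconv(x))

if __name__ == "__main__":
    y = FasterNetBlock(64)(torch.randn(1, 64, 80, 80))
    print(y.shape)  # torch.Size([1, 64, 80, 80])
```

Because only $c_p = c/4$ channels enter the k x k convolution, its FLOPs scale with $c_p^2 = c^2/16$, which is where the 1/16 figure quoted above comes from.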
2.2. Fusion Block

The Fusion Block achieves more accurate object detection and localization by integrating feature maps from different levels. Specifically, it consists of two essential components: the feature fusion module and the multi-scale feature fusion module. The feature fusion module merges feature maps from different levels through a feature pyramid network, thereby enhancing the model's capability for object detection. The multi-scale feature fusion module, on the other hand, combines feature maps from different scales, enabling the model to better adapt to objects of varying sizes and proportions. The combination of these two modules leads to improved handling of complex environmental conditions and other challenges.
Specifically, the dimensionality of feature maps at the same scale is reduced using 1x1 convolutions. For larger-scale feature maps, dimensionality reduction is performed first with a 1x1 convolution, followed by downsampling with a 3x3 convolution with a stride of 2. For smaller-scale feature maps, upsampling is conducted using 2x2 transpose convolutions. Finally, the resulting feature maps from these three parts are concatenated, followed by dimensionality reduction using a 1x1 convolution again, as shown in Figure 3.
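As a rough illustration only (the exact channel widths and layer ordering inside the authors' Fusion Block are not spelled out here, so the choices below are assumptions), the three-scale fusion step described above can be sketched as:

```python
import torch
import torch.nn as nn

class FusionBlock(nn.Module):
    """Cascaded feature fusion sketch: align three adjacent scales with 1x1
    convolutions, downsample the larger map (3x3, stride 2), upsample the
    smaller map (2x2 transpose convolution), concatenate, then reduce the
    concatenation with another 1x1 convolution."""
    def __init__(self, c_large: int, c_mid: int, c_small: int, c_out: int):
        super().__init__()
        self.reduce_large = nn.Conv2d(c_large, c_out, 1)           # 1x1 reduction
        self.down = nn.Conv2d(c_out, c_out, 3, stride=2, padding=1) # 3x3, stride 2
        self.reduce_mid = nn.Conv2d(c_mid, c_out, 1)                # 1x1 reduction
        self.up = nn.ConvTranspose2d(c_small, c_out, 2, stride=2)   # 2x2 transpose conv
        self.fuse = nn.Conv2d(3 * c_out, c_out, 1)                  # final 1x1 reduction

    def forward(self, f_large, f_mid, f_small):
        a = self.down(self.reduce_large(f_large))  # larger scale: reduce then downsample
        b = self.reduce_mid(f_mid)                 # same scale: reduce only
        c = self.up(f_small)                       # smaller scale: upsample by 2
        return self.fuse(torch.cat([a, b, c], dim=1))

if __name__ == "__main__":
    f1 = torch.randn(1, 128, 80, 80)   # larger-scale map
    f2 = torch.randn(1, 256, 40, 40)   # same-scale map
    f3 = torch.randn(1, 512, 20, 20)   # smaller-scale map
    out = FusionBlock(128, 256, 512, 256)(f1, f2, f3)
    print(out.shape)  # torch.Size([1, 256, 40, 40])
```

In this sketch the fused output keeps the spatial size of the middle scale, so it can be passed on to the corresponding detection branch.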
Figure 4. Spatial Attention Module
Figure 6. ResNetCBAM
2.4. Loss Function

Object detection can typically be divided into two stages: localization and classification. The accuracy of the localization stage is primarily influenced by regression loss functions, which has led to the emergence of various new regression loss functions.
To measure the similarity between predicted boxes and ground truth boxes and to select appropriate positive and negative samples, Intersection over Union (IoU) has become the most popular metric in bounding box regression. To further optimize the IoU metric, the IoU loss function was proposed.
However, the IoU loss function fails when there is no overlap between the predicted box and the ground truth box. To address these issues, many IoU-based evaluation schemes have been derived, which improve the shortcomings of the original IoU loss function from different perspectives and significantly enhance its robustness.
Among them, the Generalized Intersection over Union (GIoU), Distance Intersection over Union (DIoU), and Complete Intersection over Union (CIoU) loss functions are the most representative methods. They have made significant progress in the field of object detection, but there is still considerable room for optimization.
Among these methods, CIoU is considered the most effective boundary regression loss function because it considers three key geometric factors: overlap area, center point distance, and aspect ratio. CIoU uses IoU, the Euclidean distance between box centers, and the corresponding aspect ratio to measure the agreement between the predicted box and the ground truth box.
In the regression stage, it is not suitable for both the width and height of the predicted box to increase or decrease simultaneously, because the term αv cannot accurately represent the true difference in width and height. Therefore, when the model converges to a linear ratio between the width and height of the predicted box and the ground truth box, it may sometimes hinder the effective optimization of similarity. The EIOU_Loss function decomposes the aspect ratio factor αv of CIOU_Loss into separate penalties on the widths and heights of the predicted box and the ground truth box, thus addressing this problem of CIOU_Loss.
When only distant edges need to be adjusted, the calculation of EIOU_Loss can become slow and may fail to converge. To address this issue, a new enhanced loss function called ECIOU has been proposed, which facilitates adjustment of the predicted box and improves the box regression rate.
The foundation of ECIOU is the combination of the CIOU and EIOU loss functions. Initially, the aspect ratio of the predicted box is adjusted by CIOU until it converges to an appropriate range. Then, each side is finely tuned by EIOU until it converges to the correct value. ECIOU_Loss is
computed using Equation (5).

$\mathcal{R}_{CIoU} = \dfrac{\rho^2(\mathbf{b}, \mathbf{b}^{gt})}{c^2} + \alpha v$ (2)

$v = \dfrac{4}{\pi^2}\left(\arctan\dfrac{w^{gt}}{h^{gt}} - \arctan\dfrac{w}{h}\right)^2$ (3)

$\mathcal{L}_{CIoU} = 1 - IoU + \dfrac{\rho^2(\mathbf{b}, \mathbf{b}^{gt})}{c^2} + \alpha v$ (4)

$\mathcal{L}_{ECIoU} = 1 - IoU + \alpha v + \dfrac{\rho^2(\mathbf{b}^{gt}, \mathbf{b})}{c^2} + \dfrac{\rho^2(h^{gt}, h)}{c_h^2} + \dfrac{\rho^2(w^{gt}, w)}{c_w^2}$ (5)

where $\rho(\cdot)$ denotes the Euclidean distance, $\mathbf{b}$ and $\mathbf{b}^{gt}$ are the centers of the predicted and ground truth boxes, $w$, $h$, $w^{gt}$, and $h^{gt}$ are their widths and heights, and $c$, $c_w$, and $c_h$ are the diagonal length, width, and height of the smallest box enclosing both.
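A minimal PyTorch sketch of Equation (5) is given below. Boxes are assumed to be in (x1, y1, x2, y2) format, and the trade-off weight α is taken to be the usual CIoU definition α = v / ((1 − IoU) + v), which this excerpt does not restate, so treat that choice as an assumption rather than the authors' exact formulation.

```python
import math
import torch

def eciou_loss(pred: torch.Tensor, target: torch.Tensor, eps: float = 1e-7) -> torch.Tensor:
    """ECIoU loss per Equation (5); pred and target have shape (N, 4) as (x1, y1, x2, y2)."""
    # Intersection area and IoU
    x1 = torch.max(pred[:, 0], target[:, 0])
    y1 = torch.max(pred[:, 1], target[:, 1])
    x2 = torch.min(pred[:, 2], target[:, 2])
    y2 = torch.min(pred[:, 3], target[:, 3])
    inter = (x2 - x1).clamp(0) * (y2 - y1).clamp(0)
    w_p, h_p = pred[:, 2] - pred[:, 0], pred[:, 3] - pred[:, 1]
    w_g, h_g = target[:, 2] - target[:, 0], target[:, 3] - target[:, 1]
    iou = inter / (w_p * h_p + w_g * h_g - inter + eps)

    # Smallest enclosing box: width c_w, height c_h, squared diagonal c^2
    cw = torch.max(pred[:, 2], target[:, 2]) - torch.min(pred[:, 0], target[:, 0])
    ch = torch.max(pred[:, 3], target[:, 3]) - torch.min(pred[:, 1], target[:, 1])
    c2 = cw ** 2 + ch ** 2 + eps

    # Squared center distance rho^2(b, b^gt)
    cx_p, cy_p = (pred[:, 0] + pred[:, 2]) / 2, (pred[:, 1] + pred[:, 3]) / 2
    cx_g, cy_g = (target[:, 0] + target[:, 2]) / 2, (target[:, 1] + target[:, 3]) / 2
    rho2 = (cx_p - cx_g) ** 2 + (cy_p - cy_g) ** 2

    # CIoU aspect-ratio term alpha * v, Equations (2)-(4)
    v = (4 / math.pi ** 2) * (torch.atan(w_g / (h_g + eps)) - torch.atan(w_p / (h_p + eps))) ** 2
    with torch.no_grad():
        alpha = v / ((1 - iou) + v + eps)   # assumed standard CIoU weighting

    # EIoU width/height terms plus center and IoU terms, Equation (5)
    loss = (1 - iou) + alpha * v + rho2 / c2 \
        + (w_g - w_p) ** 2 / (cw ** 2 + eps) \
        + (h_g - h_p) ** 2 / (ch ** 2 + eps)
    return loss.mean()
```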
2.5. The improved YOLOv7-TINY network architecture

The YOLO (You Only Look Once) network is a popular real-time object detection algorithm. Its main idea is to treat object detection as a single regression problem, directly predicting the positions and classes of bounding boxes in the neural network. Due to the simple and efficient design of the YOLO series, it has become one of the preferred algorithms for real-time object detection tasks.
Using YOLOv7-TINY as the baseline, we replaced its backbone network with FasterNet, aiming to reduce redundant computation without sacrificing accuracy. Subsequently, we employed the multi-scale BIFUSSION module to better integrate contextual information. Before the prediction head, we separately added the RESNETCBAM attention mechanism after the ELAN module, allowing the network to better capture and represent important features, thus enhancing the model's performance and generalization ability. The YOLOv7-TINY (baseline) and the improved ResNetCBAM-FUSSION-YOLO network architectures are illustrated in Figures 7 and 8, respectively.
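Since the attention module itself is only named here, the following is a hedged sketch of a standard CBAM block (channel attention followed by spatial attention, as in the cited CBAM work) wrapped in a residual connection, which is what the name ResNetCBAM and Figure 6 suggest; it should not be read as the authors' exact module.

```python
import torch
import torch.nn as nn

class ChannelAttention(nn.Module):
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Conv2d(channels, channels // reduction, 1, bias=False),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, 1, bias=False),
        )

    def forward(self, x):
        avg = self.mlp(x.mean(dim=(2, 3), keepdim=True))   # global average pooling branch
        mx = self.mlp(x.amax(dim=(2, 3), keepdim=True))    # global max pooling branch
        return torch.sigmoid(avg + mx)

class SpatialAttention(nn.Module):
    def __init__(self, kernel_size: int = 7):
        super().__init__()
        self.conv = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2, bias=False)

    def forward(self, x):
        avg = x.mean(dim=1, keepdim=True)                  # channel-wise average map
        mx = x.amax(dim=1, keepdim=True)                   # channel-wise max map
        return torch.sigmoid(self.conv(torch.cat([avg, mx], dim=1)))

class ResNetCBAM(nn.Module):
    """CBAM (channel then spatial attention) applied to a feature map, with a
    residual connection around the attention path (assumed from the name)."""
    def __init__(self, channels: int):
        super().__init__()
        self.ca = ChannelAttention(channels)
        self.sa = SpatialAttention()

    def forward(self, x):
        out = x * self.ca(x)
        out = out * self.sa(out)
        return x + out   # residual add, assumed

if __name__ == "__main__":
    print(ResNetCBAM(256)(torch.randn(1, 256, 40, 40)).shape)
```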
Figure 7. YOLOV7-TINY
Figure 8. ResNetCBAM-FUSSION-YOLO
3. Results and Analysis

3.1. Environment configuration

The model was trained on a LINUX operating system with the PyTorch 1.12.0 framework. The server configuration included an Intel(R) Xeon(R) Silver 4110 CPU @ 2.10GHz, 188GB of memory, and an NVIDIA A100 PCIe GPU with 80GB of memory, running CUDA driver version 12.2. The input image size was set to 640x640, with a batch size of 16 and 200 training epochs. The learning rate was set to 0.01, momentum to 0.937, and the weight decay for stochastic gradient descent (SGD) to 0.0005. For training and testing, PyCharm was installed on a local WINDOWS system and communicated with the server via a remote connection. The original images were divided into training, validation, and test sets in an 8:1:1 ratio. Additionally, data augmentation was applied to the training set to enhance the robustness of the experiment.
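The optimizer settings listed above can be reproduced with a few lines of PyTorch; the model object below is only a stand-in, since the full network is defined by Figures 7 and 8 rather than given as code.

```python
import torch
import torch.nn as nn

# Training hyperparameters as stated in Section 3.1.
IMG_SIZE, BATCH_SIZE, EPOCHS = 640, 16, 200  # would parameterize the data loader and loop

# Stand-in module; in the experiments this would be the improved
# YOLOv7-TINY (ResNetCBAM-FUSSION-YOLO) network.
model = nn.Conv2d(3, 16, 3)

optimizer = torch.optim.SGD(
    model.parameters(),
    lr=0.01,              # initial learning rate
    momentum=0.937,       # SGD momentum
    weight_decay=0.0005,  # weight decay
)
```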
3.2. Model evaluation criteria

In this experiment, a comprehensive set of metrics was adopted to evaluate the performance of fish object detection.
Precision (P) represents the proportion of true positives among the samples predicted as positive. The definition is shown in Equation (6):

$P = \dfrac{TP}{TP + FP}$ (6)

Recall (R) represents the proportion of correctly predicted positive samples among all actual positive samples. The calculation formula is as follows:

$R = \dfrac{TP}{TP + FN}$ (7)

AP (average precision) refers to the average accuracy in object detection. It combines the model's performance under different precision and recall conditions and reflects the balance between precision and recall. AP is calculated as the area under the precision–recall (PR) curve, as shown in Equation (8):

$AP = \int_{0}^{1} P(R)\,dR$ (8)

The mAP metric evaluates the overall performance of the model across all categories. It is calculated by averaging the AP values of the different categories, as shown in Equation (9):

$mAP = \dfrac{\sum_{i=1}^{n} AP(i)}{n}$ (9)
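For reference, Equations (6)–(9) can be computed directly; the sketch below uses toy counts, a toy precision–recall curve, and trapezoidal integration for the area under the PR curve (the paper does not specify the numerical integration scheme, so that choice is an assumption).

```python
import numpy as np

def precision_recall(tp: int, fp: int, fn: int) -> tuple:
    """Equations (6) and (7): P = TP/(TP+FP), R = TP/(TP+FN)."""
    return tp / (tp + fp), tp / (tp + fn)

def average_precision(recall: np.ndarray, precision: np.ndarray) -> float:
    """Equation (8): area under the precision-recall curve (trapezoidal rule)."""
    order = np.argsort(recall)
    return float(np.trapz(precision[order], recall[order]))

def mean_average_precision(ap_per_class: list) -> float:
    """Equation (9): mean of the per-class AP values."""
    return float(np.mean(ap_per_class))

if __name__ == "__main__":
    p, r = precision_recall(tp=86, fp=6, fn=14)      # toy counts, not the paper's data
    print(f"P={p:.3f}, R={r:.3f}")
    rec = np.linspace(0, 1, 11)
    prec = np.clip(1.0 - 0.3 * rec, 0, 1)            # toy PR curve
    print("AP =", average_precision(rec, prec))
    print("mAP =", mean_average_precision([0.92]))   # single-class case
```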
3.3. Experiment

3.3.1. Data Preparation

The data images were collected from Időkép, and 1430 images with a resolution of 1920x1080 were manually annotated using the LabelImg software. To ensure the diversity and completeness of the data, we selected images captured during multiple different time periods. Through manual selection, we ensured that the images have complete boundaries and the necessary clarity to guarantee the quality of the data. Data augmentation was applied to the training set to enhance the robustness of the experiment.

3.3.2. Analysis of the results of the ResNetCBAM-FUSSION-YOLO experiment

In this experiment, a batch size of 16 was used, meaning that 16 images were processed in each training iteration. The YOLOV7 network provides good visualization support: after model training is completed, test_batch_label.jpg is generated to show the ground-truth bounding boxes of a validation batch, and test_batch_pred.jpg is generated to show the predicted bounding boxes for that batch. The predicted results on the validation set during model training are shown in Figure 9.
3.3.3. Comparison of results for different IoU loss functions

According to Table 1, we employed four loss functions, namely SIOU, EIOU, CIOU, and ECIOU, on the same model and compared their training times. As indicated by the table, the training time for ECIOU is approximately 1.1 hours for 200 epochs, which is notably shorter than that of the other three methods and significantly enhances practical training efficiency. Additionally, as shown in Table 2, with the ECIOU loss function the ResNetCBAM-FUSSION-YOLO model achieves an mAP@0.5 that is 1.2%, 2.6%, and 1.1% higher than with the SIOU, CIOU, and EIOU loss functions, respectively, and an mAP@0.5:0.95 that is 3.3%, 3%, and 1.7% higher, respectively. Both precision and recall also perform best with the ECIOU loss function among the four. Consequently, we selected ECIOU as the loss function for the ResNetCBAM-FUSSION-YOLO network model.

Table 1. Loss function and training time

IOU      Training Time/h
SIOU     1.896
EIOU     1.61
CIOU     1.893
ECIOU    1.113

Table 2. Test results for different loss functions

MODEL    mAP@0.5/%   mAP@0.5:0.95/%   P/%    R/%
SIOU     91.4        49.5             91.7   85.9
CIOU     90          49.8             92.1   85.5
EIOU     91.5        51.1             92.1   85.7
ECIOU    92.6        52.8             94.1   86.2
In summary, the improved YOLOv7-TINY model achieves better detection accuracy than the original model, with the mAP@0.5 increasing from 90.6% to 92.6%, indicating that the proposed network model offers higher detection precision. The computational complexity is reduced by 16.7%, the number of parameters decreases by 11.5%, and the model training time is shortened by 49.2%. This reduction in hardware resource requirements facilitates model deployment on mobile devices and provides a reference for intelligent aquaculture applications.

3.3.4. Comparative Experiments with Various Mainstream Models

To demonstrate the superior performance of the proposed model on lightweight devices for the detection of fish schools, Table 3 compares the improved network proposed in this paper with mainstream object recognition networks. In Table 3, our model's P and R are both superior to those of the other algorithms listed, indicating good model performance. In terms of mAP@0.5, our method shows improvements of 3.5% and 9.3% over SSD and Faster-RCNN, respectively, and surpasses YOLOV3, YOLOV4, and YOLOV5s by 1.8%, 3.4%, and 2.3%, respectively. In terms of mAP@0.5:0.95, it surpasses SSD and Faster-RCNN by 2.3% and 6.2%, respectively, and YOLOV4 and YOLOV5s by 1% and 0.8%, respectively. Our model's computation and parameter counts are the smallest in the table, at only 11 GFLOPs and 5.32M, respectively, meaning that it consumes minimal hardware resources when deployed while maintaining high detection accuracy. Additionally, in terms of detection speed, our model achieves 85.9 FPS, surpassing the inference speeds of the other mainstream networks. Figure 10 compares mAP@0.5 and FPS for the different models.
Table 3. Comparative Experiment

MODEL         mAP@0.5/%   mAP@0.5:0.95/%   FPS     P/%    R/%    Parameters/M   GFLOPs/G
YOLOV5s       90.3        52               58.62   93.2   85.5   7.01           15.8
YOLOV4        89.2        51.8             40      92.6   84.4   9.11           20.6
YOLOV3        90.8        53.8             49.4    93.3   85.3   6.15           154.5
Faster-RCNN   83.3        46.6             38.7    67     85.3   28.47          470.1
SSD           89.1        50.5             47.1    90     83.8   26.15          31.4
Our model     92.6        52.8             85.9    94.1   86.2   5.32           11
Figure 10. Comparison of mAP@0.5 and FPS for the different models
3.3.6. ResNetCBAM-FUSSION-YOLO test results

After multiple iterations of deep learning network training and fine-tuning on the validation set, we selected the best models and loaded them into the computer to obtain inference results on the test set. Among them, we selected three models, namely ResNetCBAM-FUSSION-YOLO, YOLOV7, and YOLOV7-Tiny, to detect fish schools, as shown in Figures 11, 12, and 13. It can be observed that, compared to YOLOV7 and YOLOV7-Tiny, ResNetCBAM-FUSSION-YOLO can detect all fish schools in the images, regardless of multi-scale or dense regions, even under turbid water and lighting conditions, reducing the missed detection rate. Overall, the ResNetCBAM-FUSSION-YOLO network can rapidly, accurately, and comprehensively detect fish schools in harsh environments. This is of significant importance for understanding the growth status of aquaculture fish and achieving precise feeding.
populations that need to be detected in aquaculture and different scenarios of aquaculture environments. We aim to establish a digital fishery system by integrating sensors such as dissolved oxygen and pH meters, to achieve a more accurate and comprehensive assessment of fish growth conditions. The goal is to improve fisheries production efficiency and optimize resource utilization. Additionally, we will enhance the corresponding detection algorithms to find a better balance between speed and accuracy.
Overall, the proposed sea bream group object detection model demonstrates high efficiency and is suitable for deployment on mobile devices, providing strong technical support for aquaculture technology.

Acknowledgment

Funding: This work was supported in part by the National Natural Science Foundation of China under Grant 62175037, in part by the Huzhou Key R&D Program Agricultural “Double Strong” Special Project (No. 2022ZD2060), in part by the Zhejiang-French Digital Monitoring Lab for Aquatic Resources and Environment, Department of Science and Technology of Zhejiang Province, and in part by the Huzhou Key Laboratory of Waters Robotics Technology (2022-3), Huzhou Science and Technology Bureau.
References

[1] Xu Hai, Xie Hongtao, and Zhang Yongdong, "Advances in Visual Domain Generalization Techniques and Research," Journal of Guangzhou University (Natural Science Edition), vol. 21, no. 2, pp. 42–59, 2022.
[2] N. J. C. Strachan, P. Nesvadba, and A. R. Allen, "Fish species recognition by shape analysis of images," Pattern Recognition, vol. 23, no. 5, pp. 539–544, 1990, doi: 10.1016/0031-3203(90)90074-U.
[3] N. Castignolles, M. Cattoen, and M. Larinier, "Identification and counting of live fish by image analysis," presented at IS&T/SPIE 1994 International Symposium on Electronic Imaging: Science and Technology, S. A. Rajala and R. L. Stevenson, eds., San Jose, CA, March 1994, pp. 200–209, doi: 10.1117/12.171067.
[4] D.-J. Lee, R. B. Schoenberger, D. Shiozawa, X. Xu, and P. Zhan, "Contour matching for a fish recognition and migration-monitoring system," presented at Optics East, K. G. Harding, ed., Philadelphia, PA, December 2004, p. 37, doi: 10.1117/12.571789.
[5] Ding Shunrong and Xiao Ke, "Research on fish classification method based on particle swarm optimization SVM and multi-feature fusion," Chinese Journal of Agricultural Mechanization, vol. 41, no. 11, pp. 113–118, 170, 2020, doi: 10.13733/j.jcam.issn.2095-5553.2020.11.018.
[6] Yao Runlu, Gui Yongwen, and Huang Qiugui, "Freshwater fish species recognition based on machine vision," Journal of Microcomputers and Applications, vol. 36, no. 24, pp. 37–39, 2017, doi: 10.19358/j.issn.1674-7720.2017.24.011.
[7] P. Cisar, D. Bekkozhayeva, O. Movchan, M. Saberioon, and R. Schraml, "Computer vision based individual fish identification using skin dot pattern," Sci. Rep., vol. 11, no. 1, p. 16904, August 2021, doi: 10.1038/s41598-021-96476-4.
[8] R. B. Dala-Corte, J. B. Moschetta, and F. G. Becker, "Photo-identification as a technique for recognition of individual fish: a test with the freshwater armored catfish Rineloricaria aequalicuspis Reis & Cardoso, 2001 (Siluriformes: Loricariidae)," Neotrop. Ichthyol., vol. 14, no. 1, 2016, doi: 10.1590/1982-0224-20150074.
[9] Chen Feifen, "Research and application of water meter reading recognition based on deep learning," Master's thesis, Guilin University of Electronic Technology, 2022, doi: 10.27049/d.cnki.ggldc.2021.000020.
[10] A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," Commun. ACM, vol. 60, no. 6, pp. 84–90, May 2017, doi: 10.1145/3065386.
[11] K. Simonyan and A. Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition," arXiv, 2015. [Online]. Available: http://arxiv.org/abs/1409.1556
[12] W. Zaremba, I. Sutskever, and O. Vinyals, "Recurrent Neural Network Regularization," arXiv, 2015. [Online]. Available: http://arxiv.org/abs/1409.2329
[13] K. Greff, R. K. Srivastava, J. Koutnik, B. R. Steunebrink, and J. Schmidhuber, "LSTM: A Search Space Odyssey," IEEE Trans. Neural Netw. Learning Syst., vol. 28, no. 10, pp. 2222–2232, October 2017, doi: 10.1109/TNNLS.2016.2582924.
[14] I. Goodfellow et al., "Generative Adversarial Nets."
[15] C. Szegedy et al., "Going deeper with convolutions," in 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, June 2015, pp. 1–9, doi: 10.1109/CVPR.2015.7298594.
[16] K. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, June 2016, pp. 770–778, doi: 10.1109/CVPR.2016.90.
[17] A. G. Howard et al., "MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications," arXiv, 2017. [Online]. Available: http://arxiv.org/abs/1704.04861
[18] M. Sandler, A. Howard, M. Zhu, A. Zhmoginov, and L.-C. Chen, "MobileNetV2: Inverted Residuals and Linear Bottlenecks," in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, June 2018, pp. 4510–4520, doi: 10.1109/CVPR.2018.00474.
[19] A. Howard et al., "Searching for MobileNetV3," in 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), October 2019, pp. 1314–1324, doi: 10.1109/ICCV.2019.00140.
[20] G. Huang, Z. Liu, L. Van Der Maaten, and K. Q. Weinberger, "Densely Connected Convolutional Networks," in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
[21] R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation."
[22] S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 6, pp. 1137–1149, June 2017, doi: 10.1109/TPAMI.2016.2577031.
[23] K. He, G. Gkioxari, P. Dollar, and R. Girshick, "Mask R-CNN," in Proceedings of the IEEE International Conference on Computer Vision, 2017, pp. 2961–2969. [Online]. Available: https://openaccess.thecvf.com/content_iccv_2017/html/He_Mask_R-CNN_ICCV_2017_paper.html
[24] W. Liu et al., "SSD: Single Shot MultiBox Detector," in Computer Vision – ECCV 2016, Lecture Notes in Computer Science, vol. 9905, Cham: Springer International Publishing, 2016, pp. 21–37, doi: 10.1007/978-3-319-46448-0_2.
[25] J. Redmon and A. Farhadi, "YOLOv3: An Incremental Improvement," arXiv, 2018, doi: 10.48550/arXiv.1804.02767.
[26] A. Bochkovskiy, C.-Y. Wang, and H.-Y. M. Liao, "YOLOv4: Optimal Speed and Accuracy of Object Detection," arXiv, 2020, doi: 10.48550/arXiv.2004.10934.
[27] X. Zhu, S. Lyu, X. Wang, and Q. Zhao, "TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-captured Scenarios," in 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Montreal, BC, Canada, October 2021, pp. 2778–2788, doi: 10.1109/ICCVW54120.2021.00312.
[28] C. Li et al., "YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications," arXiv, 2022, doi: 10.48550/arXiv.2209.02976.
[29] C.-Y. Wang, A. Bochkovskiy, and H.-Y. M. Liao, "YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors," in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, June 2023, pp. 7464–7475, doi: 10.1109/CVPR52729.2023.00721.
[30] S. Zhao, J. Zheng, S. Sun, and L. Zhang, "An Improved YOLO Algorithm for Fast and Accurate Underwater Object Detection," Symmetry, vol. 14, no. 8, art. no. 8, August 2022, doi: 10.3390/sym14081669.
[31] Y. Li, X. Bai, and C. Xia, "An Improved YOLOV5 Based on Triplet Attention and Prediction Head Optimization for Marine Organism Detection on Underwater Mobile Platforms," JMSE, vol. 10, no. 9, p. 1230, September 2022, doi: 10.3390/jmse10091230.
[32] X. Zhai, H. Wei, Y. He, Y. Shang, and C. Liu, "Underwater Sea Cucumber Identification Based on Improved YOLOv5," Applied Sciences, vol. 12, no. 18, p. 9105, September 2022, doi: 10.3390/app12189105.
[33] A. Markus, G. Kecskemeti, and A. Kertesz, "Flexible Representation of IoT Sensors for Cloud Simulators," in 2017 25th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP), March 2017, pp. 199–203, doi: 10.1109/PDP.2017.87.
[34] J. Chen et al., "Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks," arXiv. [Online]. Available: https://arxiv.org/abs/2303.03667v3
[35] X. Zhang, X. Zhou, M. Lin, and J. Sun, "ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 6848–6856. [Online]. Available: https://openaccess.thecvf.com/content_cvpr_2018/html/Zhang_ShuffleNet_An_Extremely_CVPR_2018_paper.html
[36] K. Han, Y. Wang, Q. Tian, J. Guo, C. Xu, and C. Xu, "GhostNet: More Features From Cheap Operations," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 1580–1589. [Online]. Available: https://openaccess.thecvf.com/content_CVPR_2020/html/Han_GhostNet_More_Features_From_Cheap_Operations_CVPR_2020_paper.html
[37] X. Glorot, A. Bordes, and Y. Bengio, "Deep Sparse Rectifier Neural Networks," in Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, JMLR Workshop and Conference Proceedings, June 2011, pp. 315–323. [Online]. Available: https://proceedings.mlr.press/v15/glorot11a.html
[38] S. Woo, J. Park, J.-Y. Lee, and I. S. Kweon, "CBAM: Convolutional Block Attention Module," in Proceedings of the European Conference on Computer Vision (ECCV), 2018, pp. 3–19. [Online]. Available: https://openaccess.thecvf.com/content_ECCV_2018/html/Sanghyun_Woo_Convolutional_Block_Attention_ECCV_2018_paper.html
[39] D. E. Rumelhart, G. E. Hinton, and R. J. Williams, "Learning representations by back-propagating errors," Nature, vol. 323, no. 6088, pp. 533–536, October 1986, doi: 10.1038/323533a0.
[40] H. Rezatofighi, N. Tsoi, J. Gwak, A. Sadeghian, I. Reid, and S. Savarese, "Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 658–666. [Online]. Available: https://openaccess.thecvf.com/content_CVPR_2019/html/Rezatofighi_Generalized_Intersection_Over_Union_A_Metric_and_a_Loss_for_CVPR_2019_paper.html
[41] Z. Zheng, P. Wang, W. Liu, J. Li, R. Ye, and D. Ren, "Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression," Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 07, April 2020, doi: 10.1609/aaai.v34i07.6999.
[42] Z. Zheng et al., "Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance Segmentation," IEEE Transactions on Cybernetics, vol. 52, no. 8, pp. 8574–8586, August 2022, doi: 10.1109/TCYB.2021.3095305.