1 s2.0 S2352146520307754 Main
1 s2.0 S2352146520307754 Main
1 s2.0 S2352146520307754 Main
com
ScienceDirect
Available online at www.sciencedirect.com
Available online
Transportation at www.sciencedirect.com
Research Procedia 00 (2019) 000–000
www.elsevier.com/locate/procedia
ScienceDirect
ScienceDirect
Transportation Research Procedia 50 (2020) 234–241
Transportation Research Procedia 00 (2019) 000–000
www.elsevier.com/locate/procedia
XIV International Conference 2020 SPbGASU “Organization and safety of traffic in large cities”
network
South Ural State University, 76 Lenina Prosp., Chelyabinsk, 454080, Russia
2352-1465 © 2020 Georgii Kataev, Vitalii Varkentin, Kseniia Nikolskaia. Published by ELSEVIER B.V.
*
Corresponding
This author.article
is an open access Tel: +7-999-589-72-37
under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0)
E-mail address:
Peer-review nikolskaya174@gmail.com
under responsibility of the scientific committee of the XIV International Conference 2020 SPbGASU “Organization and safety of
traffic in large cities”
2352-1465 ©
2352-1465 © 2020
2020Georgii
The Authors.
Kataev, Published by ELSEVIER
Vitalii Varkentin, B.V.
Kseniia Nikolskaia. Published by ELSEVIER B.V.
Thisisisananopen
This open access
access article
article under
under the BY-NC-ND
the CC CC BY-NC-NDlicenselicense (https://creativecommons.org/licenses/by-nc-nd/4.0)
(https://creativecommons.org/licenses/by-nc-nd/4.0)
Peer-review under responsibility of the scientific committee of the XIV International Conference 2020 SPbGASU “Organization and
Peer-review under responsibility of the scientific committee of the XIV International Conference 2020 SPbGASU “Organization and safety of
safety of traffic in large cities”
traffic in large cities”
10.1016/j.trpro.2020.10.029
Georgii Kataev et al. / Transportation Research Procedia 50 (2020) 234–241 235
2 Georgii Kataev, Vitalii Varkentin, Kseniia Nikolskaia / Transportation Research Procedia 00 (2019) 000–000
transport in problem areas of the city (Goryaev et al. 2018). Installing CCTV cameras at every stop is a very expensive
solution. Therefore, it was decided to develop an application for calculating pedestrian traffic based on the existing
infrastructure. At the moment, Chelyabinsk has 37 CCTV cameras that cover stops at the busiest intersections of the
city. To develop the application, we used a data set from one video surveillance camera.
2. Related studies
Lin et al. (2017) describe an algorithm for analyzing video traffic received using UAV (Unmanned Aerial Vehicle).
The algorithm consists of four parts: the first two parts deal with vehicle detection, and the last two parts estimate
traffic flow parameters. The first part of the algorithm is a Haar cascade classifier trained using randomly generated
Haar-like features, performing primary detection of regions with vehicles. The second part of the algorithm is a
convolutional neural network developed by the authors. Using the regions selected in the previous stage, it determines
the exact location of vehicles. The network training data set consists of 20,000 image samples. It is available for public
use (Manzoor et al. 2019). The authors managed to achieve 99.55% accuracy on a test set over 100 training epochs.
The third part of the algorithm is vehicle tracking based on the KLT (Kanade–Lucas–Tomasi) method. Finally, the
fourth part of the algorithm is the estimation of traffic flow parameters.
Fedorov et al. (2019) address the problem of traffic flow estimation based on the data from video surveillance
cameras. They propose a system based on Faster-CNN two-stage detector, whose performance is enhanced with
several modifications: focal loss, adaptive feature pooling, additional mask branch, and anchors optimization. The
system also includes SORT (simple online and real time tracking) tracker that helps to solve the problem of multiple
object tracking. The system is able to operate with a maximum relative error of less than 10%. The data set for neural
network training included 982 frames.
Wei et al. (2019) developed a traffic tracking system for CCTV cameras with a low frame rate (0.3–1 Hz). A pre-
trained SSD-Mobilenet network was used for recognition. The network was trained on a CityCam data set and a
custom labeled data set comprising 2000 images. The value of the average absolute error during system operation is
8 times less compared with the optical flow algorithm.
Asha and Narasimhadhan (2018) propose a traffic management system that captures data from hand-held video
cameras. Hand-held video cameras were chosen by the authors to create various types of interference: camera
“shaking”, a more complex environment, a lot of shadows, etc. The system operates in three stages: recognition and
classification by the YOLO neural network, multiple object tracking with the use of a correlation filter, and vehicle
counting based on the tracked trajectory. The YOLO neural network was trained on the PASCAL VOC data set
(Varkentin et al. 2019a).
Cao et al. (2019) describe an intelligent transportation system based on the modified YOLO neural network. The
authors needed increased accuracy of detection under different weather conditions and at different times of day. One
of the main improvements was a modification of the formula for IoU (Intersection over Union) determination
(Varkentin et al. 2019b). Thus, bounding boxes were generated 10% more accurately than with the standard YOLO.
The authors trained the network on the VOC2007 data set with normalization of pixel values in images to a range [-
1, 1]. For testing, they used their own UA-CAR data set based on 26,000 images taken from the UA-DETRAC data
set. With its use, the developed neural network showed a 10–20% increase in its accuracy.
Song et al. (2019) describe a vehicle detection and counting system, the main element of which is the YOLOv3
neural network. The algorithm of the system is quite simple: first, the received roadway video data is segmented; then,
the YOLOv3 neural network is used to detect vehicles; and then, features are extracted by the ORB algorithm to track
the movement of vehicles. The neural network was trained on a set of data collected from many other data sets. It
includes 11,000 images from surveillance cameras, dashboard cameras, and cameras that are not intended for
monitoring.
A set of data for training and testing the neural network was provided by Intersvyaz company. Image labeling was
performed in the COCO Annotator web tool (Varkentin et al. 2019c), which provides a user-friendly interface and a
236 Georgii Kataev et al. / Transportation Research Procedia 50 (2020) 234–241
Georgii Kataev, Vitalii Varkentin, Kseniia Nikolskaia / Transportation Research Procedia 00 (2019) 000–000 3
wide range of functions. The COCO format for the data set was chosen because the labeling is saved in a .json file,
which is supported by all modern programming languages. The entire data set contains 750 images.
During the design of the application architecture, a component diagram was developed that breaks down the
software system into structural components and relationships between them. The diagram is shown in Fig. 1.
The developed application is based on a convolutional neural network of the YOLOv3 system. Its basic architecture
consists of three main parts: a feature extraction algorithm (backbone), a detector, and a classifier.
Backbone: it is the feature extraction algorithm named DarkNet-53 by its authors. It consists of 53 convolutional
layers, each of which includes a normalization layer. The activation function for each layer is Leaky ReLU. Between
certain layers, the dimension of the feature map decreases by 2 times. In total, this algorithm decreases the dimension
by 32 times. For further work of YOLO, the algorithm should output three feature maps with dimensions decreased
from the size of the original image by 8, 16, and 32 times, respectively. The algorithm operation scheme is shown in
Fig. 2.
4 Georgii Kataev, Vitalii Varkentin, Kseniia
Georgii Kataev Nikolskaia
et al. / Transportation
/ Transportation ResearchResearch
ProcediaProcedia
50 (2020)00234–241
(2019) 000–000 237
Detector: it is a convolutional neural network consisting of 200 layers. Layers with a convolutional kernel of
dimensions 1 × 1 and 3 × 3 alternate. The last layer must have a 1 × 1 convolutional kernel. For a feature map of the
smallest dimension, the processing is performed only by the neural network. For the remaining two maps,
concatenation with a lower-dimension map that the network has already processed is applied before it is sent to the
neural network. This process is shown in Fig. 3.
Classifier: it is a fully connected neural network consisting of three layers with the number of neurons 2048, 2048,
and 13, respectively. The number of neurons in the last layer is equal to the number of classes from which the forecast
is made.
Yolo_video.py script was implemented to run recognition and classification in images, whose input parameters are
shown in Fig. 4.
238 Georgii Kataev et al. / Transportation Research Procedia 50 (2020) 234–241
Georgii Kataev, Vitalii Varkentin, Kseniia Nikolskaia / Transportation Research Procedia 00 (2019) 000–000 5
1) model — the path to the file with weights of the used neural network;
2) anchors — the path to the file with anchors of the used neural network;
3) classes — the path to the file with the classes that need to be recognized;
4) gpu_num — the number of GPUs that will be used during application operation;
5) image — the boolean flag that enables image recognition mode;
6) input — the path to the folder with the initial data to work with;
7) output — the path to the folder where the results are to be saved.
1) graphics accelerator NVIDIA GEFORCE GTX 1650 SUPER (1725 MHz, 4 GB GDDR6, 12 Gbps, 1280 CUDA
Cores);
2) CPU AMD Ryzen 5 2600X (6 cores, 12 threads, 3.6 GHz);
3) 16 GB RAM;
4) ОС Linux Mint 19.3 Tricia.
The entire data set contained 750 images. The training sample contained 80% of the total sample, which amounted
to 600 images. The training lasted for 350 epochs and 6 hours.
Train.py script was implemented to train the neural network. When training a neural network, the script requires
train.txt file, containing information about labeling. The string template in this file is shown in Fig. 5, where x1, y1 —
the coordinates of the lower-left corner of the labeled object, x2, y2 — the coordinates of the upper-right corner of the
labeled object, and class — integer representation of the class name. The contents of train.txt file are shown in Figs.
5 and 6.
The main result of this work is the development of a prototype application for calculating pedestrian traffic. Fig. 7
shows an example of application operation. Each pedestrian is highlighted in a rectangular frame.
Georgii Kataev et al. / Transportation Research Procedia 50 (2020) 234–241 239
6 Georgii Kataev, Vitalii Varkentin, Kseniia Nikolskaia / Transportation Research Procedia 00 (2019) 000–000
To determine the effectiveness of the trained neural network, a test sample was created containing 20% of the total
sample, which is equal to 150 images. Three tests were performed with different values of the IoU and Min_score
parameters: for the first test, both parameters were 0.4, for the second test, IoU = 0.3 and Min_score = 0.5, and for the
third test, IoU = 0.4 and Min_score = 0.15. In the first test, the accuracy of the neural network was 54%, in the second
test — 67%, in the third test — 79%. Accuracy refers to the ratio between the correctly recognized vehicles and the
total number of vehicles, expressed as a percentage.
5. Conclusions
In this work, the problem of calculating pedestrian traffic was solved. As an example, we chose one busy
intersection in Chelyabinsk, where two stops and a pedestrian crossing are clearly visible. Following the purpose of
the study, the designed topology of an artificial neural network was implemented, the neural network was trained on
a training sample, and an application was developed. The developed neural network was tested. As part of the term
project, an application was developed for recognizing motor transport with the use of a convolutional neural network
with the YOLOv3 architecture.
In the future, it is planned to implement the solution at all intersections of Chelyabinsk equipped with video
cameras. To do this, we will increase the data set and modify the topology for operation in real time.
Acknowledgments
The authors express their gratitude to Intersvyaz company for providing access to the video stream for scientific
research.
Funding
The work was supported by Act 211 of the Government of the Russian Federation, contract 02.A03.21.0011.
References
Asha, C.S., Narasimhadhan, A.V., 2018. Vehicle counting for traffic management system using YOLO and correlation filter. 2018 IEEE
International Conference on Electronics, Computing and Communication Technologies (CONECCT). Bangalore, India. 1–6. DOI:
10.1109/CONECCT.2018.8482380.
Brylev, I., Evtiukov, S., Evtiukov, S., 2018. Problems of calculating the speed of two-wheeled motor vehicles in an accident. Transportation
Research Procedia 36, 84–89. DOI: 10.1016/j.trpro.2018.12.047.
Cao, C.-Y., Zheng, J.-C., Huang, Y.-Q., Liu, J., Yang, C.-F., 2019. Investigation of a promoted You Only Look Once algorithm and its application
in traffic flow monitoring. Applied Sciences 9 (17), 3619. DOI: 10.3390/app9173619.
Danilov, I.K., Marusin, A.V., Marusin, A.V., Danilov, S.I., Andryushchenko, I.S., 2018. Diagnosis of the fuel equipment of diesel engines with
multicylinder high pressure fuel injection pump for the movement of the injector valve for the diagnostic device. ICFET'18: Proceedings of the
4th International Conference on Frontiers of Educational Technologies, 157–160. DOI: 10.1145/3233347.3233363.
240 Georgii Kataev et al. / Transportation Research Procedia 50 (2020) 234–241
Georgii Kataev, Vitalii Varkentin, Kseniia Nikolskaia / Transportation Research Procedia 00 (2019) 000–000 7
Danilov, I., Marusin, A., Mikhlik, M., Uspensky, I., 2020. Development of the mathematical model of fuel equipment and justification for
diagnosing diesel engines by injector needle displacement. Transport Problems 15 (1), 93–104. DOI: 10.21307/tp-2020-009.
De Luca, G., Gallo, M., 2020. The use of Artificial Neural Networks for extending road traffic monitoring data spatially: an application to the
neighbourhoods of Benevento. Transportation Research Procedia 45, 635–642. DOI: 10.1016/j.trpro.2020.03.047.
Evtiukov, S., Golov, E., Ginzburg, G., 2018a. Finite element method for reconstruction of road traffic accidents. Transportation Research Procedia
36, 157–165. DOI: 10.1016/j.trpro.2018.12.058.
Evtiukov, S., Karelina, M., Terentyev, A., 2018b. A method for multi-criteria evaluation of the complex safety characteristic of a road vehicle.
Transportation Research Procedia 36, 149–156. DOI: 10.1016/j.trpro.2018.12.057.
Fedorov, A., Nikolskaia, K., Ivanov, S., Shepelev, V., Minbaleev. A., 2019. Traffic flow estimation with data from a video surveillance camera.
Journal of Big Data 6, 73.
Ginzburg, G., Evtiukov, S., Brylev, I., Volkov, S., 2017. Reconstruction of road accidents based on braking parameters of category L3 vehicles.
Transportation Research Procedia 20, 212–218. DOI: 10.1016/j.trpro.2017.01.054.
Goryaev, N., Myachkov, K., Larin, O., 2018. Optimization of “green wave” mode to ensure priority of fixed-route public transport. Transportation
Research Procedia 36, 231–236. DOI: 10.1016/j.trpro.2018.12.072.
Kerimov, M., Safiullin, R., Marusin, A., Marusin, A., 2017. Evaluation of functional efficiency of automated traffic enforcement systems.
Transportation Research Procedia 20, 288–294. DOI: 10.1016/j.trpro.2017.01.025.
Kurakina, E., Evtiukov, S., Rajczyk, J., 2018. Forecasting of road accident in the DVRE system. Transportation Research Procedia 36, 380–385.
DOI: 10.1016/j.trpro.2018.12.111.
Lin, T.-Y., Goyal, P., Girshick, R., He, K., Dollár, P., 2017. Focal loss for dense object detection. 2017 IEEE International Conference on Computer
Vision, 2980–2988.
Manzoor, M.A., Morgan, Y., Bais, A., 2019. Real-time vehicle make and model recognition system. Machine Learning and Knowledge Extraction
1 (2), 611–629.
Marusin, A.V., 2017a. A method of assessing the efficiency of systems of automatic recording of traffic violations. PhD Thesis in Engineering.
Saint Petersburg State University of Architecture and Civil Engineering, Saint Petersburg.
Marusin, A.V., 2017b. Improving the diagnostics of plunger pairs in high-pressure fuel pumps of motor and tractor diesel engines. PhD Thesis in
Engineering. Kostychev Ryazan State Agrotechnological University, Ryazan.
Marusin, A.V., Abliazov, T.Kh., 2019. Public-private partnership as a mechanism for development of automated digital systems. Transport of the
Russian Federation, 3 (82), 23–25.
Marusin, A.V., Danilov, I.K., Khlopkov, S.V., Marusin, A.V., Uspenskiy, I.A., 2020. Development of a mathematical model of fuel equipment and
the rationale for diagnosing diesel engines by moving the injector needle. IOP Conference Series: Earth and Environmental Science 422, 012126.
DOI: 10.1088/1755-1315/422/1/012126.
Marusin, A., Marusin, A., Ablyazov, T., 2019. Transport infrastructure safety improvement based on digital technology implementation. Atlantis
Highlights in Computer Sciences, Vol. 1. International Conference on Digital Transformation in Logistics and Infrastructure (ICDTLI 2019),
353–357. DOI: 10.2991/icdtli-19.2019.61.
Marusin, A., Marusin, A., Danilov, I., 2018. A method for assessing the influence of automated traffic enforcement system parameters on traffic
safety. Transportation Research Procedia 36, 500–506. DOI: 10.1016/j.trpro.2018.12.136.
Repin, S., Evtiukov, S., Maksimov, S., 2018. A method for quantitative assessment of vehicle reliability impact on road safety. Transportation
Research Procedia 36, 661–668. DOI: 10.1016/j.trpro.2018.12.128.
Safiullin, R., Kerimov, M., Afanasyev, A., Marusin, A., 2018. A model for justification of the number of traffic enforcement facilities in the region.
Transportation Research Procedia 36, 493–499. DOI: 10.1016/j.trpro.2018.12.135.
Safiullin, R., Marusin, A., Safiullin, R., Ablyazov, T., 2019. Methodical approaches for creation of intelligent management information systems by
means of energy resources of technical facilities. E3S Web of Conferences 140, 10008. DOI: 10.1051/e3sconf/201914010008.
Song, H., Liang, H., Li, H., Dai, Z., Yun, X., 2019. Vision-based vehicle detection and counting system using deep learning in highway scenes.
European Transport Research Review 11, 51. DOI: 10.1186/s12544-019-0390-4.
Soo, S., Abdel Sater, K.I., Khodyakov, A.A., Marusin, A.V., Danilov, I.K., Khlopkov, S.V., Andryushenko, I.S., 2020. The ways of effectiveness
increase of liquid fuel with organic addition appliance in aerospace equipment. Advances in the Astronautical Sciences 170, 833–838.
Varkentin, V., Kolobanov, D., 2019. Development of an application for vehicle classification using neural networks technologies. 2019
International Conference “Quality Management, Transport and Information Security, Information Technologies”. Sochi, Russia, 215–219. DOI:
10.1109/ITQMIS.2019.8928359
Varkentin, V., Shadehin, V., 2019a. Development of an application for localization and recognition of road signs using neural network technology.
2019 International Conference “Quality Management, Transport and Information Security, Information Technologies”. Sochi, Russia, 209–214.
DOI: 10.1109/ITQMIS.2019.8928336.
Varkentin, V., Schukin, M., 2019b. Development of an application for car license plates recognition using neural network technologies. 2019
International Conference “Quality Management, Transport and Information Security, Information Technologies”. Sochi, Russia, 203–208. DOI:
10.1109/ITQMIS.2019.8928373.
Vorozheikin, I., Marusin, A., Brylev, I., Vinogradova, V., 2019c. Digital technologies and complexes for provision of vehicular traffic safety.
Atlantis Highlights in Computer Sciences, Vol. 1. International Conference on Digital Transformation in Logistics and Infrastructure (ICDTLI
2019), 385–389. DOI: 10.2991/icdtli-19.2019.67.
Georgii Kataev et al. / Transportation Research Procedia 50 (2020) 234–241 241
8 Georgii Kataev, Vitalii Varkentin, Kseniia Nikolskaia / Transportation Research Procedia 00 (2019) 000–000
Wei, P., Shi, H., Yang, J., Qian, J., Ji, Y., Jiang, X., 2019. City-scale vehicle tracking and traffic flow estimation using low frame-rate traffic
cameras. UbiComp/ISWC '19 Adjunct: Adjunct Proceedings of the 2019 ACM International Joint Conference on Pervasive and Ubiquitous
Computing and Proceedings of the 2019 ACM International Symposium on Wearable Computers, 602–610. DOI:
https://doi.org/10.1145/3341162.3349336.