Pothole Research Paper
Pothole Research Paper
GOOGLE MAPS
SETTING
PRIORITY
Labelling: The features of the data are timestamp, 3-axes
MySQL Database coordinates of accelerometer and gyroscope sensors. By taking
ISSUE
into consideration the z-axis of the accelerometer, the thresh-
old value was found which was used to identify potholes on
ANDROID APPLICATION the journeys recorded.
DETECTION
Real-Time Detection
Geo-Location with Upvoting
Verifying Status of Issues
ALGORITHMS
To detect potholes in real-time, consecutive samples are
examined to find road anomalies by comparing with the
ACCELEROMETER
threshold and the resultant down-sampled points are stored
IMAGE DATA
GYROSCOPE
DATA separately. When a pothole is detected, the z-axis of the
accelerometer sensor has a pattern by which there is a sudden
Fig. 2: Architecture increase followed by a decrease in its value. Therefore, a
vertical and a horizontal limit were set to create a window and
As illustrated in Figure 2, the complaints are sent to the were obtained by using the consecutive down sampling points.
Django server and stored in the MySQL database. An algo- Since the z-axis of the accelerometer sensor was used for
rithm is developed to set a priority to each complaint which is detection, the horizontal limit was used to find the “h-up” and
displayed on the portal using Google Maps API, for the civic “h-dn” values which created a window as shown in Figure 3.
authorities to verify and facilitate the redressal of them. A Here, the blue lines denote the sample data that was collected
status is added for each complaint which can be edited only for training and the red marker denotes the downsampled data
by the civic authorities based on the actions taken for each that was obtained after comparing with the threshold value.
complaint, which is displayed on mobile. Once the pothole is The X-axis denotes the time series data converted to integer
resolved, the complaint is removed from the database, which for convenience and the Y-axis denotes the acceleration values
is thus reflected in the application. in the z-direction for each sample. A window is also created
whose upper limit is “h-up” and the lower limit is “h-dn”.
B. Mobile Application Once these values are detected, the downsampled data
The solution deals with the detection of potholes using is compared with the window and a label is assigned for
various methods which are done using Android OS based each sample which gives the detail of the sample whether
smartphones. The popularity of smartphones makes the it is a pothole or not. The label ‘1’ is assigned if it is a
system very user-friendly. It enables strong participation, and pothole, otherwise ‘0’, For a continuous value of ‘1’ in the
interaction of users to solve the problem. Several features sample data, it denotes one pothole. In Figure 4, the green
have been added to make the system reliable. Some of the marker denotes the label ‘1’ that is, potholes are detected at
features are explained below. those points and the other markers meaning the same as earlier.
1) Using the Accelerometer sensor in mobiles: Training Classifier Model: The dataset was divided into
Dataset: The smartphone was mounted on the dashboard train and test data such that 30% of the data was used in
of the vehicle which recorded the time and accelerometer test data in classification models to calculate accuracy. It
and gyroscope readings for training. The data was collected is trained on the training data to detect potholes. Because
camera setup or through pedestrian participation. Citizens
can participate by drawing the authorities’ attention to such
problems. Generally, it is over social media platforms, which
often gets overlooked in the heap of new information. This is
where a dedicated complaint platform can come in handy.
Prediction: The system records the real-time data from Dataset: The dataset used for this work is obtained from
the sensors and preprocesses it to remove the unnecessary Google Images of bad roads. It consists of 254 images and a
high-frequency data for prediction. Afterwards, the prediction total of 1757 samples of potholes. The potholes are manually
of potholes on this data is done based on the trained classifier annotated in the XML format, containing the vertices, Xmin ,
model. Alerts of the locations of the predicted potholes are Ymin , Xmax and Ymax of the bounding boxes. The dataset is
sent to the users on their smartphones. Finally, these locations trained for 150 epochs with a batch size of 4 and 400 steps
are sent to the server which maintains the database of all per epoch.
recorded pothole locations.
Detection: For detection, the input image is provided in the
2) Image-based pothole identification: BGR format. It is normalized by subtracting the ImageNet
Using the accelerometer sensors to detect potholes provides mean of all three channels. It was found that a threshold of
automatic complaint redressal by the passengers of the 0.4 detects the maximum true positive boundary boxes for
vehicle. However, it requires driving into the pothole, which the potholes, throughout multiple tests.
causes great inconvenience to the passengers. Hence, an
image-based pothole detector is also considered, which 3) Geotagging locations of the complaints:
can be utilized in two ways – as a dash-mounted phone The complaints of the users are tagged with the GPS location.
For getting the GPS location, the fused location provider API by the authorities.
is used. The API uses technologies like mobile GPS, Wi-Fi and
cellular connection to accurately determine the exact position
of the user. Hence, to achieve the best results in determining
the location, the API is set to the highest accuracy. When
the system detects a pothole or a user clicks a photo of it,
the GPS coordinates of the last known location of the device
are recorded. These values are sent to the server with the
complaint, hence marking its location on the map.
4) Complaint Feed and Upvoting similar complaints:
The feed is added to the system to allow the users to view
complaints raised by other users within their proximity. The
default minimum range is 100 meter which can be increased
using the slider by the users. This helps to display all the posts
within that range for the user. The complaints recorded by the
system are geofenced with a radius of fifty meters around each Fig. 6: Complaints per Ward
pothole location using Google Services API. Whenever a user
passes over this region, a notification is sent to them which Dataset: During the testing of the application, the data
warns them about the nearby potholes beforehand. reported potholes in Mumbai was collected. The data consists
In case a user finds any complaint critical, he can upvote of the timestamp, latitudes and longitudes of various locations
that complaint to give it a higher priority, so that immediate reported by the users, the number of people facing the
action is taken on it. Similarly, upvoting is also used to problem and the date when the issue got resolved. The data
consolidate repetitive complaints from the same location. is from the time period of 1st October 2019 to 28th February
The priority of complaints increases as the votes increase. 2020 consisting of 1995 complaints in total.
Hence, a complaint with a higher number of votes is given
the maximum priority which is considered as one of the SARIMA Model: As one of the important functionalities of
important parameters for setting the priority of the complaints. the system is forecasting the number of potholes for the future
on the dashboard, the Seasonal Autoregressive Integrated
Moving Average (SARIMA) model was used on our time
C. Web Portal series dataset which consisted of five months of data. To
make the data stationary for efficient prediction, an Augmented
The solution also deals with the redressal of pothole
Dickey-Fuller test was performed. While using univariate
complaints by the users. This feature is solely handled by
time data, some aspects like autoregression, differencing and
civic authorities using a web portal which helps them get
moving average were considered. To smoothen the data, in the
information on the complaints and forecasts the expected
moving average model, three different window sizes 5 days, 30
number of complaints arising in the future. The portal is
days and 90 days were used to check for a trend in the curve.
only accessible by the authorities who can perform several
By increasing the window size, finding a trend was easier.
tasks for the resolution of the complaints. For making it a
While dealing with this model, seven important parameters
competent system, several novel functionalities have been
need to be considered: trend autoregression order(p), trend dif-
added to the system which is explained below.
ference order(d), the trend moving average order(q), seasonal
autoregressive order(P), seasonal difference order(D), seasonal
1) Dashboard:
moving average order(Q) and the number of time steps for
The civic authorities have access to the web portal with a
a single seasonal period(s). The equation for the SARIMA
dashboard that serves the authorities to view and analyze all
model is as follows:
the complaints posted by the mobile users.
The dashboard helps in maintaining a count of active SARIM A(p, d, q)(P, D, Q)s (1)
complaints, being worked on or those which are resolved.
By using this data, we analyze and generate various insights The parameter values were set as follows: p = range(0, 5),
which can be used by the Government authorities to plan d = 1, q = range(0, 5), P = range(0, 5), D = 1, Q = range(0,
better strategies for the resolution of the pothole problem. 5), s = 5. By fitting the model on the data, the duration of
The data shows various wards which are facing the issue as predictive maintenance of the predicted potholes prone roads
shown in Figure 6. Information of the authority which was is suggested to the authorities.
responsible for maintaining the roads in a particular ward can
be found out. The statistics also show the time taken by the 2) Prioritization:
authorities to resolve the complaints and those areas where Existing systems lack the insight the officials have over the
the problem kept on reappearing even after getting resolved complaints which are of utmost importance. It so happens
that the most important complaints are spammed by those
which can be dealt with later. A unique way to deal with
such a problem is by setting the priority to all the complaints
reported by the users and sort them accordingly. Four key
parameters significant to road conditions are identified and for
setting the priority the sum of these parameters is considered.
IV. R ESULTS
A. Pothole Detection Using Sensors
Sample accelerometer and gyroscope sensor readings are (a) Potholes with patches of (b) Dry-textured pothole with-
smooth road out puddles
divided into train and test data. Cross-validation is performed
on the test data that has been scrutinized from the sample Fig. 10: Results of tested images
dataset. Several evaluation measures have been taken into
consideration for choosing the most appropriate model. The progression of precision and recall over the epochs are
given in Figure 11. After testing with around 25 images, the
final model gives a precision of 0.83 and a recall of 0.72 which
gives realistic performance for the model.