0% found this document useful (0 votes)

80 views7 pages

Artificial Intelligence Based Optical Character Recognition Using Visual Impaired People Shopping Trolley Technology

This document describes a system to help visually impaired people shop independently using technology. The system uses a webcam to capture images, optical character recognition (OCR) to convert images to text, and text-to-speech to audibly output text for the user. It detects product names and details using RFID tags and an RFID reader. An ultrasonic sensor is used to detect obstacles. The system is designed to allow visually impaired people to shop freely without assistance by reading aloud product names and providing spatial awareness through obstacle detection.

Uploaded by

Pavan kumar Kotapally

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

80 views7 pages

Artificial Intelligence Based Optical Character Recognition Using Visual Impaired People Shopping Trolley Technology

Uploaded by

Pavan kumar Kotapally

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

Artificial Intelligence Based Optical Character Recognition

Using Visual Impaired People Shopping Trolley Technology

K. Geeth Apuroop B. Rakeshwar Reddy T. Chiranjeevi Rao
Department of ECE Department of ECE Department of ECE
SRMIST, Chennai SRMIST, Chennai SRMIST, Chennai
kb3675@srmist.edu.in bb9387@srmist.edu.in tt5367@srmist.edu.in

Abstract Literature Survey

Disability may be a thing where people need to 2.1 An Electromagnetic Sensor Prototype to
depend upon others for his or her works. One help Blind People in Autonomous Walking
among those disabilities is blindness. So far we
We can use electromagnetic sensor to help in
have N number of methods proposed which
walking for blind people. And also helpful for
makes life easy for visually impaired people.
the people who are affected by some visual
Purchasing a product independently is one of the
diseases. It has a radar on the cane making
challenge they are facing on daily basis. To
aware that user is presence in front of the
overcome this problem we came with a solution
obstacle in a wide range and safer range. This
where we use a webcam which captures the
system exhibits noise tolerance and small in
image and using tesseract algorithm text is
size. We used this paper for the survey and
extracted from the image and the text will be
learning of sensor which helps us in distance
converted into an audio which they could hear
calculating. -
through the headphones. On implementing this
technique, during this shopping trolley
technology to spot the merchandise placed 2.2 Reading Aid for the Blind people using
thanks to machine learning and accuracy OCR and Open CV
location find him.
Optical character recognition helps in
Introduction identifying the characters using the camera.
Using this technique blind peoples can Those images are converted to audio output.
move freely future without the assistance of This is the machine translation, text to speech.
other persons, this will be employed AI and computer vision is the field. The
by reading the name of the merchandise and image files will be processed to tesseract and
merchandise details. So that, visual impaired will be converted to text from image and this
people even be ready to shop like normal entire process is using Raspberry pi. This
people. This prototype works during a thanks research is focused on OCR automatic reader
to assist the blind people not only by reading for blinds and people with eye diseases. It
out the but also it calculates the uses python programming as main
space ahead of them. programming language.

2.3 Smart Stick for Blind People with Live

Video Feed
In our life vision plays vital role. It ability
and capability are important. For people who
are visually impaired, they need the help of
others. Instead, the smart system that helps
the blind people to detect obstacle with the
help of blind stick. Raspberry Pi:

2.4 Object Detection Methodologies for blind It's the Microprocessor

people or alittle computer. It consists of RAM,
Processor unit, input ports, output ports,
Vision is that the most vital sense. Image graphics card and also SD
plays crucial role within the human perception card. it's designed to figure on linux also as
of the encompassing environment. Digital on windows platform. It are often operated
image processing is that the field of which it with SSH, or use FTP to transfer the
processes the digital image. the files. it's 1GB RAM. It uses python language
thing identification is that the difficult task for coding. It also provides camera port, USB
for visually impaired people. There are still connections etc.
limitations that need more improving.
It provides the survey and an analysis of OCR:
varied evaluations for the technologies OCR is an application to convert printed
that utilized in the thing identification task. text on the image into machine encoded text
2.5 Text to Speech for the Visually Impaired format.

People who are visually impaired or suffering It is the technique to edit, search, stored the
with eye disease cannot be able to read printed text. OCR uses techniques like as
newspaper or book. So, this system can help “pattern matching”
in reading those by converting text to speech. And “features extraction” to process the printed
Components text on the image captured.

RFID Tags: ESPEAK:

Tags are identified by using It is an algorithm which synthesize

electromagnetic field. There are two sorts speech which works in all platforms like
of tags (i.e. active tag and passive tag) Active windows , Linux and so on . Can change the
tag has its own power supply but the passive sound of the voice by varying characteristics of
tag required for the facility . Passive a text file into pitch range and disturbance to the
tag consists of antenna, chip and means to voice. It uses an ASCII representation. within
gather DC power for reader. In proposed the proposed
system, we've used passive tags system it’s used to convert the document file
because that's less costly and ready to detect obtain because the OCR output into audio file
for area within the super market. for the VIP.

RFID Reader:
RFID reader is employed to Ultrasonic Sensor:
read also as write the info onto the tags. Tag Ultrasonic sensor measure distance by
needed are available the range of reader to using ultrasonic wave and receives the wave
retrieve data from it. reflected back from the thing opposite there
to. It measures the time required during the
emission and reception of the waves to
calculate the space. The device itself work as
emitter and receptor. In proposed
system it's wont to detect the thing or the
obstacles face by blind man.
D. C. Motors:
DC motor contains rotatory electrical motors to
convert DC electrical energy into mechanical Earphone: As an output device
energy. We can control the speed of the motor
Ultrasonic sensor: Detect obstacles
by changing the supply voltage and also by
varying the strength of current in the field.
When we place a conductor, which carries
current in a field containing magnet we can
experience a force. The design of this motor is
far better more when compared to brushed
motors. The complication of transferring energy
RFID Reader: To read the RFID tags
to spinning rotor from outside the motor is
eliminated. It also increases the lifetime and
works with high efficiency without any
maintenance but it contains more risky motor
speed controllers with high cost.

System design:

OCR: To convert image file into text file

Block Diagram:
Module
Raspberry pi: It is used for processing
Image Capturing and Processing
Capturing the image
The respective Webcam can capture an image up
Camera: To capture the image to 30 frames per second. As we are using a USB
type camera, separate driver software’s is not
required and it can be used as play and play. The based on gradient (Sobel – first order
captured image is converted to text and can be derivatives) and two based on Laplacian
heard through a headphone or a speaker which (second order derivative so sensitive to
has 3.5mm jack. Webcam used consists of noise). First, the derivative of image is
features like good resolution, night vision calculated first followed by pointing peak
moreover with a continuous Autofocus. points. Which consists of larger values than
adjacent points the set of extreme points
collectively is an edge. Every image
captured is induced with at least one type of
noise, for increment of efficiency and in
order to reduce the errors the noises are to
Processing the image be filters with suitable filter. For example,
OpenCV offers a wide range of filters for
The image captured introduces noise or poor
reduction of noise. Depending upon nature
quality of page has to be cleared before further
of noise certain filter is chosen to reduce all
processing. This is achieved by processing the
the noises. Edge detection with OpenCV
image. By image processing the pixel density
example is Gaussian blur function.
and quality can be adjusted and corrected.
Appropriate threshold is applied to remove the c) Background Separation
unwanted noise in the image. The number of
pixels are added to objects depend on the size Background separation or subtraction is
and shape of structuring element to process the major step in most of vision-based
image captured. applications. Consider the cases like details
regarding the vehicle is extracted from
Image processing consists of several steps, as traffic camera or a static camera captures the
follows: number of visitors entering the counter etc.
In all above cases necessary step is to extract
a) Filtering
the person or vehicle alone.
It a neighborhood process, in which
Automatic Text Extraction
applying some algorithm to values of pixels
in the adjacent then the pixels of the output The intention of Optical Character Recognition
image corresponding to input is determined. (OCR) is to distinguish optical patterns
This technique helps in modifying or (commonly contained in a virtual photo) with
enhancing an image filter. For example, to respective to alphanumeric or different
remove unwanted features or to emphasize characters. The technique of OCR entails steps
certain features of an image, the only way to like segmentation, characteristic extraction and
achieve those modifications by this process. classification. Considering authentic digital files
in repository, the extraction of signature is
Filtering includes smoothing, sharpening
simple the PDF or PowerPoint shape of
and edge enhancement of the image and
authentic digital files are transformed into
implemented under image processing. suitable high-resolution photo (TIFF, JPEG,
b) Edge Detection etc.) on which signature is computed.

One of the fundamental operations in

performing image processing is edge
detection. This process is useful to reduce
data (pixel) amount to process as well as
structural aspect of image is maintained.
This technique is of two schemes –one
•        The output of the technique is the binary
photo, there the textual content pixels and
historical past pixels may be visible in
distinctive binary levels.
ü Character Recognition
•        This is ultimate level of the technique this
is person reputation.
•        Binary textual content to ASCII textual
content conversion is being completed here.

Text extraction and reputation technique has 5

steps particularly textual content detection,
textual content localization, textual content
tracking, binarization, and person reputation.
ü Text Detection
•        It takes enter as photo or video component
and test it has textual content or not Tesseract is an open supply textual
content reputation (OCR) Engine. It is well
•        It identifies the textual content having matched with many programming languages and
vicinity withinside the photo frameworks. It analyzes the whole present
format to discover textual content in a large
ü Text Localization document, in any other case it could be utilized
•        Combining the areas containing texts to in conjunction to textual content detector for
assess textual content items and study the spotting textual content from the photo.
bounds across the items is the primary technique
of textual content localization.
ü Text Tracking
•        Only video statistics makes use of the
technique of textual content tracking.
•        It rectifies the output/end result of the
textual content localization and textual content
detection.
•        For the clarity purpose, textual content
embedded withinside the video seems in extra
than thirty consecutive frames.
ü Text Binarization The process of organizing text lines into
blocks, then the lines and the entire region will
•        It is used to split the textual content item
be analyzed for the fixed pitch. In this way word
from historical past in a segmental manner.
finding will be done. Text lines will be done into and obstacle detection are often designed. We
segmental words by the way of character used RFID as well as Raspberry-pi
spacing. It is a two-pass process recognition. technologies to make a smarter environment
Recognizing of each word is made in an attempt for blind people. To unwind the solution to
in the first pass. It will be passed to an adaptive detect the object of interest we’ve used a
classifier as training data when the word is motion based recognition. We use distribution
satisfactory. It recognizes text lower down the of pixel edges and stroke orientation to
page which gets more accurately is the adaptive extract text and Optical Character
classifier. Recognition to recognize text characters
which are converted as audio file and they can
hear through headphones.
Text recognition and output
Off-the-shelf OCR perform
Future Enhancement
text recognition prior to output from
Our future work will extend the text
the localized text regions which are localization algorithm will further more features
informative words. For the an we’ll address the human interface issues
related to text reading by the blind user.
accommodation of characters, the text
region denotes the required rectangular Results

area inside it. The edge boundary of By providing this technology people that
are all suffered from vision impairment may
the required text characters contacts
desire like normal persons and training about
with the border of the text. Proper this device to those people can use this device
margin areas and binary to segment efficiently. This scheme is modernized and
elongated to new form by modernizing software.
text characters are entirely done by the
OCR which generates good
performance. In script files, recognized
text codes will be recorded. Word
recognition is performed by Off the
shelf OCR and converts into audio
output for the visual impaired people.

Conclusion
The proposed system will enable visually
impaired people to buy without others help in Fig.1. image taken by camera
supermarket. Being specific gadget for fig.1.2 Text conversion
product identification, Section information

Smart Guides Princeton Review-Word Smart 6th Edit
No ratings yet
Smart Guides Princeton Review-Word Smart 6th Edit
5 pages
Smart Walking Cane for the Specially Abled People Presentation
No ratings yet
Smart Walking Cane for the Specially Abled People Presentation
22 pages
A Report On Existing AI Work For Visually Impaired People: Ayesha Tariq
No ratings yet
A Report On Existing AI Work For Visually Impaired People: Ayesha Tariq
51 pages
project ppt of low cost ventilation
No ratings yet
project ppt of low cost ventilation
16 pages
Project Group1
No ratings yet
Project Group1
38 pages
report
No ratings yet
report
25 pages
"Text Recognition and Face Detection Aid For Visually Impaired Person Using Raspberry Pi
No ratings yet
"Text Recognition and Face Detection Aid For Visually Impaired Person Using Raspberry Pi
62 pages
B.tech Major Project Abstract 2022 - 23
No ratings yet
B.tech Major Project Abstract 2022 - 23
2 pages
First_review_1MS21LVS06
No ratings yet
First_review_1MS21LVS06
12 pages
Text Reader For Blind
No ratings yet
Text Reader For Blind
6 pages
chapter1+2
No ratings yet
chapter1+2
6 pages
Blind
No ratings yet
Blind
24 pages
Text Reader For Visually Impaired Person Using Image Processing Open-CV
No ratings yet
Text Reader For Visually Impaired Person Using Image Processing Open-CV
8 pages
Open Source Computer Vision
No ratings yet
Open Source Computer Vision
79 pages
Smartglasses for visually impaired
No ratings yet
Smartglasses for visually impaired
7 pages
Last Edited
No ratings yet
Last Edited
8 pages
Final Invision
No ratings yet
Final Invision
6 pages
PROJECT SYNOPSIS
No ratings yet
PROJECT SYNOPSIS
8 pages
Title Name - PPTX REAL TIME ASSISTANT - PPTX Monday
No ratings yet
Title Name - PPTX REAL TIME ASSISTANT - PPTX Monday
11 pages
Visual OCR
No ratings yet
Visual OCR
17 pages
Artificial Vision1
No ratings yet
Artificial Vision1
9 pages
Smart AI Cane Project
No ratings yet
Smart AI Cane Project
9 pages
1.1 Motivation: 1 A Prototype of Camera Based Assisstive Text Reader Using Rpi
No ratings yet
1.1 Motivation: 1 A Prototype of Camera Based Assisstive Text Reader Using Rpi
51 pages
Object detection research paper
No ratings yet
Object detection research paper
4 pages
Intelligent_Glasses_for_the_Visually_Impaired_with_Google_Cloud_API
No ratings yet
Intelligent_Glasses_for_the_Visually_Impaired_with_Google_Cloud_API
4 pages
Visual Assistance for Blind Using Image Processing
No ratings yet
Visual Assistance for Blind Using Image Processing
5 pages
Ultrasonic Smart Goggles for Blind People
No ratings yet
Ultrasonic Smart Goggles for Blind People
4 pages
IJCRT2405288
No ratings yet
IJCRT2405288
4 pages
Iarjset 2022 9420
No ratings yet
Iarjset 2022 9420
5 pages
Survey Paper Image Reader For Blind Pers
No ratings yet
Survey Paper Image Reader For Blind Pers
3 pages
Blind Stick
No ratings yet
Blind Stick
4 pages
A Novel Based Intelligent Spectacles For Visually Impaired
No ratings yet
A Novel Based Intelligent Spectacles For Visually Impaired
9 pages
Raspberry Pi Based Smart Reader For Visually Impaired People
50% (2)
Raspberry Pi Based Smart Reader For Visually Impaired People
12 pages
DOC-20250312-WA0015.
No ratings yet
DOC-20250312-WA0015.
10 pages
Visual Based Product Identification For Blind: Project Report On
No ratings yet
Visual Based Product Identification For Blind: Project Report On
23 pages
Third Eye An Aid For Visually Impaired 1
No ratings yet
Third Eye An Aid For Visually Impaired 1
6 pages
Research Paper 1
No ratings yet
Research Paper 1
8 pages
Smart Stick For Visually Impaired
No ratings yet
Smart Stick For Visually Impaired
6 pages
Raspberry Pi Based Reader For Blind People
No ratings yet
Raspberry Pi Based Reader For Blind People
4 pages
Ijcrt July Student 2022
No ratings yet
Ijcrt July Student 2022
5 pages
Smart Reader For Blind People
No ratings yet
Smart Reader For Blind People
3 pages
Recognizing of Text and Product Label From Hand Held Entity Intended For Visionless Persons
No ratings yet
Recognizing of Text and Product Label From Hand Held Entity Intended For Visionless Persons
3 pages
A Smart Reader For Visually Impaired People Using Raspberry PI
No ratings yet
A Smart Reader For Visually Impaired People Using Raspberry PI
5 pages
Roll No: In-House: Fig. 1 Block Diagram of Microcontroller Unit
No ratings yet
Roll No: In-House: Fig. 1 Block Diagram of Microcontroller Unit
2 pages
Portable Camera Based Assitive Product Label Reading For Blind and Visually Impaired Individuals
No ratings yet
Portable Camera Based Assitive Product Label Reading For Blind and Visually Impaired Individuals
3 pages
Electronic Eye For Visually Challenged People
No ratings yet
Electronic Eye For Visually Challenged People
4 pages
IEEE Template
No ratings yet
IEEE Template
5 pages
Ijireeice 2023 11408
No ratings yet
Ijireeice 2023 11408
4 pages
Application Form - Domain Academy Latest Version V03052023a
No ratings yet
Application Form - Domain Academy Latest Version V03052023a
9 pages
2303 07451 PDF
No ratings yet
2303 07451 PDF
6 pages
IJCRT2207295
No ratings yet
IJCRT2207295
4 pages
AI Powered Glasses for Visually Impaired Person
No ratings yet
AI Powered Glasses for Visually Impaired Person
6 pages
Assistive Technology For Visually Impaired Using Tensor Flow Object Detection in Raspberry Pi and Coral USB Accelerator
No ratings yet
Assistive Technology For Visually Impaired Using Tensor Flow Object Detection in Raspberry Pi and Coral USB Accelerator
4 pages
Usha Mittal Institute of Technology S.N.D.T. Women's University Electronics & Communication Engg. / Electronics Engg. Department
No ratings yet
Usha Mittal Institute of Technology S.N.D.T. Women's University Electronics & Communication Engg. / Electronics Engg. Department
1 page
09160169
No ratings yet
09160169
6 pages
Massachusetts Eye and Ear Infirmary Illustrated Manual of Ophthalmology, 4th Edition Instant Reading Access
100% (12)
Massachusetts Eye and Ear Infirmary Illustrated Manual of Ophthalmology, 4th Edition Instant Reading Access
14 pages
The Effect of Poor Eyesight on Student Performance
No ratings yet
The Effect of Poor Eyesight on Student Performance
5 pages
2000 Census of Population and Housing
No ratings yet
2000 Census of Population and Housing
127 pages
Voice Assisted Text Reading System For Visually Impaired Persons
No ratings yet
Voice Assisted Text Reading System For Visually Impaired Persons
6 pages
Ai Glass 1
No ratings yet
Ai Glass 1
6 pages
Student Manual Reading
No ratings yet
Student Manual Reading
365 pages
Federal Benefits For Veterans, Dependents, & Survivors 2010 Edition (VA Pamphlet 80-10-01)
100% (3)
Federal Benefits For Veterans, Dependents, & Survivors 2010 Edition (VA Pamphlet 80-10-01)
192 pages
DJI FPV Goggles Disclaimer and Safety Guidelines
No ratings yet
DJI FPV Goggles Disclaimer and Safety Guidelines
54 pages
Foundation of Special and Inclusive Education
No ratings yet
Foundation of Special and Inclusive Education
6 pages
Foundation of Education - Notes
No ratings yet
Foundation of Education - Notes
69 pages
Appendix RRL
No ratings yet
Appendix RRL
27 pages
Iep Plan
No ratings yet
Iep Plan
2 pages
2 - Sudden and Gradual Vision Loss
No ratings yet
2 - Sudden and Gradual Vision Loss
31 pages
SESSION 2 Difficulty in Performing Adaptive Skills Deaf Blindness
No ratings yet
SESSION 2 Difficulty in Performing Adaptive Skills Deaf Blindness
52 pages
Medical Terminology For Medical Transcription Trainees
No ratings yet
Medical Terminology For Medical Transcription Trainees
74 pages
1.1. Purpose: Software Requirements Specification For Blind Voice Mail
50% (2)
1.1. Purpose: Software Requirements Specification For Blind Voice Mail
15 pages
NCS Glaucoma
No ratings yet
NCS Glaucoma
47 pages
JCAPCPL - FY24 - Annual - SABAL First Draft New
No ratings yet
JCAPCPL - FY24 - Annual - SABAL First Draft New
13 pages
Education (IDC)
No ratings yet
Education (IDC)
16 pages
Review On Nagarjun Varti
No ratings yet
Review On Nagarjun Varti
5 pages
Voice Over Script 2
No ratings yet
Voice Over Script 2
3 pages
Primary Eye Care and Community Participation
No ratings yet
Primary Eye Care and Community Participation
4 pages
IOBadv
No ratings yet
IOBadv
11 pages
Wayfinding Design Logic, Application and Some Thoughts On Universality
No ratings yet
Wayfinding Design Logic, Application and Some Thoughts On Universality
13 pages
Typology of Children With Special Needs
No ratings yet
Typology of Children With Special Needs
1 page
Safety Guard For Blind Using 8051: Saneesh C T, Deepashree A V, Gagana Divya, Trupti Agrawal
No ratings yet
Safety Guard For Blind Using 8051: Saneesh C T, Deepashree A V, Gagana Divya, Trupti Agrawal
2 pages
(In Pursuance To The Icsi Guidelines For Scribe (Writer) And/Or Extra Time-2019)
No ratings yet
(In Pursuance To The Icsi Guidelines For Scribe (Writer) And/Or Extra Time-2019)
4 pages
Social Distancing Detector Using Yolo
No ratings yet
Social Distancing Detector Using Yolo
4 pages
Section A. Household Characteristics
No ratings yet
Section A. Household Characteristics
6 pages
6 General Types of Disabilities: Here Are 10 of The Most Common Conditions That Are Considered Disabilities
No ratings yet
6 General Types of Disabilities: Here Are 10 of The Most Common Conditions That Are Considered Disabilities
8 pages
IRJMETS Sri Ram.j
No ratings yet
IRJMETS Sri Ram.j
3 pages
Smart Vision For The Blind People: R.Mohanapriya, U.Nirmala, C.Pearlin Priscilla
No ratings yet
Smart Vision For The Blind People: R.Mohanapriya, U.Nirmala, C.Pearlin Priscilla
4 pages
Day 4 - Assignment
No ratings yet
Day 4 - Assignment
2 pages
Alzheimer's and Eyesight-Alzheimer's Disease - 1
No ratings yet
Alzheimer's and Eyesight-Alzheimer's Disease - 1
4 pages
Narrative Report Lexi
No ratings yet
Narrative Report Lexi
3 pages
Optical Character Recognition: Unlocking the Power of Computer Vision for Optical Character Recognition
From Everand
Optical Character Recognition: Unlocking the Power of Computer Vision for Optical Character Recognition
Fouad Sabry
No ratings yet
Smart Camera: Revolutionizing Visual Perception with Computer Vision
From Everand
Smart Camera: Revolutionizing Visual Perception with Computer Vision
Fouad Sabry
No ratings yet
Optical Character Recognition: Fundamentals and Applications
From Everand
Optical Character Recognition: Fundamentals and Applications
Fouad Sabry
No ratings yet
Digital Photography Essentials
From Everand
Digital Photography Essentials
Duncan Evans
4/5 (2)

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Artificial Intelligence Based Optical Character Recognition Using Visual Impaired People Shopping Trolley Technology

Uploaded by

Artificial Intelligence Based Optical Character Recognition Using Visual Impaired People Shopping Trolley Technology

Uploaded by

Artificial Intelligence Based Optical Character Recognition

Using Visual Impaired People Shopping Trolley Technology

Abstract Literature Survey

2.3 Smart Stick for Blind People with Live

2.4 Object Detection Methodologies for blind It's the Microprocessor

RFID Tags: ESPEAK:

Tags are identified by using It is an algorithm which synthesize

OCR: To convert image file into text file

One of the fundamental operations in

Text extraction and reputation technique has 5

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.