100% found this document useful (2 votes)

590 views4 pages

Final Assignment

The document describes a final project that involves processing images within a ZIP file to search for keywords and faces. The task is to write Python code to extract images from a ZIP file of newspaper pages, detect faces within each image using OpenCV, perform optical character recognition (OCR) on the text using Tesseract, and generate contact sheets of faces found on pages mentioning the searched keywords. Example output is provided showing contact sheets of faces detected on pages containing the words "Christopher" and "Mark" when searching the small and large ZIP files, respectively. Code is provided to implement the necessary classes and functions to complete these tasks, including loading the images from the ZIP, detecting faces, recognizing text, and generating the contact sheets.

Uploaded by

Anonymous 9RVX45Lk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (2 votes)

590 views4 pages

Final Assignment

Uploaded by

Anonymous 9RVX45Lk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

final_assignment file:///C:/Users/gsiddharth/Desktop/final_assignment.

html

The Project
1. This is a project with minimal scaffolding. Expect to use the the discussion forums to gain insights! It’s not cheating to ask
others for opinions or perspectives!
2. Be inquisitive, try out new things.
3. Use the previous modules for insights into how to complete the functions! You'll have to combine Pillow, OpenCV, and
Pytesseract
4. There are 4 functions you need to complete to have a working project. These functions are described using the RE
formula, which stands for Requires and Effects. Each function will have its own RE located directly above the function
definition. The Requires section describes what is needed in the function argument (inbetween the function definition
parenthesis). The Effects portion outlines what the function is supposed to do.
5. There are hints provided in Coursera, feel free to explore the hints if needed. Each hint provide progressively more details
on how to solve the issue. This project is intended to be comprehensive and difficult if you do it without the hints.

The Assignment
Take a ZIP file (https://en.wikipedia.org/wiki/Zip_(file_format)) of images and process them, using a library built into python
(https://docs.python.org/3/library/zipfile.html) that you need to learn how to use. A ZIP file takes several different files and
compresses them, thus saving space, into one single file. The files in the ZIP file we provide are newspaper images (like you
saw in week 3). Your task is to write python code which allows one to search through the images looking for the occurrences
of keywords and faces. E.g. if you search for "pizza" it will return a contact sheet of all of the faces which were located on the
newspaper page which mentions "pizza". This will test your ability to learn a new (library (https://docs.python.org/3/library
/zipfile.html)), your ability to use OpenCV to detect faces, your ability to use tesseract to do optical character recognition, and
your ability to use PIL to composite images together into contact sheets.

Each page of the newspapers is saved as a single PNG image in a file called images.zip (./readonly/images.zip). These
newspapers are in english, and contain a variety of stories, advertisements and images. Note: This file is fairly large (~200
MB) and may take some time to work with, I would encourage you to use small_img.zip (./readonly/small_img.zip) for testing.

Here's an example of the output expected. Using the small_img.zip (./readonly/small_img.zip) file, if I search for the string
"Christopher" I should see the following image:

Christopher Search
If I were to use the images.zip (./readonly/images.zip) file and search for "Mark" I should see the following image (note that
there are times when there are no faces on a page, but a word is found!):

Mark Search

Note: That big file can take some time to process - for me it took nearly ten minutes! Use the small one for testing.

1 of 4 4/29/2019, 2:49 PM
final_assignment file:///C:/Users/gsiddharth/Desktop/final_assignment.html

In [1]: import zipfile

import math
from PIL import Image
import pytesseract
import cv2 as cv
import numpy as np

class scanned_pages:
# loading the face detection classifier
face_cascade = cv.CascadeClassifier('readonly/haarcascade_frontalface_default.x
ml')
thumbnail_width = 128
thumbnail_height= 128
def __init__(self, zip_file_name):
self.zip_file = zipfile.ZipFile(zip_file_name, 'r')
self.archive_member_names = self.zip_file.namelist()
self.image_files = {x.filename:Image.open(self.zip_file.open(x)) for x in s
elf.zip_file.infolist() }
self.image_nparrs = {key: np.asarray(value) for key, value in self.image_fi
les.items()}
self.face_coods_in_files = {key: self.get_face_cood_list(value) for key, va
lue in self.image_nparrs.items()}
self.faces_in_files = {key: self.cut_faces(value, key) for key, value in se
lf.face_coods_in_files.items()}
self.contact_sheets = {key:self.generate_contact_sheets(value) for key, val
ue in self.faces_in_files.items()}
self.text_in_scanned = {key:self.ocr(value) for key, value in self.image_fi
les.items() }
for key,value in self.image_files.items():
value.close()
self.zip_file.close()

def get_face_cood_list(self, image_nparr):

return self.face_cascade.detectMultiScale(image_nparr,scaleFactor=1.3,minNe
ighbors=5, minSize=(30,30))

def cut_faces(self, cood_list, filename):

faces = [Image.fromarray(self.image_nparrs[filename][x[1]:x[1]+x[2], x[0]:x
[0]+x[3],:])
for x in cood_list]
return faces

def generate_contact_sheets(self, faces_list):

if len(faces_list)==0:
return None
for face in faces_list:
face.thumbnail([self.thumbnail_height,self.thumbnail_width])
first_image=faces_list[0]
num_rows = math.ceil(len(faces_list)/5.0)
contact_sheet=Image.new(first_image.mode, (self.thumbnail_width*5,self.thum
bnail_height*num_rows))
x=0
y=0
for img in faces_list:
contact_sheet.paste(img, (x, y) )
if x+first_image.width == contact_sheet.width:
x=0
y=y+first_image.height
else:
x=x+first_image.width
return contact_sheet

def ocr(self, image):

text = pytesseract.image_to_string(image)

2 of 4 4/29/2019, 2:49 PM
final_assignment file:///C:/Users/gsiddharth/Desktop/final_assignment.html

Testing for Chris in small file

Results found in file a-0.png

Results found in file a-3.png

Testing for Mark in large file

Results found in file a-0.png

Results found in file a-1.png

Results found in file a-10.png

But there were no faces in that file
Results found in file a-13.png

3 of 4 4/29/2019, 2:49 PM
final_assignment file:///C:/Users/gsiddharth/Desktop/final_assignment.html

Results found in file a-2.png

Results found in file a-3.png

Results found in file a-8.png

But there were no faces in that file

In [ ]:

4 of 4 4/29/2019, 2:49 PM

Programming For Engineers in Python: Recitation 12
No ratings yet
Programming For Engineers in Python: Recitation 12
39 pages
Face Recognition Project Report
50% (2)
Face Recognition Project Report
13 pages
Task
No ratings yet
Task
69 pages
Report On Facial Recognition System
No ratings yet
Report On Facial Recognition System
19 pages
Nur Syarizanie Lab Csc583
No ratings yet
Nur Syarizanie Lab Csc583
14 pages
Bit 22034
No ratings yet
Bit 22034
18 pages
Numpy
No ratings yet
Numpy
13 pages
Ip Lab Programs
No ratings yet
Ip Lab Programs
34 pages
Search Creators CG LAB Program-12
No ratings yet
Search Creators CG LAB Program-12
4 pages
CV Practical Record Editted - PDF
No ratings yet
CV Practical Record Editted - PDF
36 pages
Nguyen Le Duy - Project - Introcomp-1
No ratings yet
Nguyen Le Duy - Project - Introcomp-1
18 pages
Summary of Lecture 1: "Let There Be Light"
No ratings yet
Summary of Lecture 1: "Let There Be Light"
32 pages
From Pi
No ratings yet
From Pi
8 pages
Final Project
No ratings yet
Final Project
4 pages
Last One
No ratings yet
Last One
10 pages
Cvrlabmanual
No ratings yet
Cvrlabmanual
30 pages
Face Detection
No ratings yet
Face Detection
4 pages
Tres Bien
No ratings yet
Tres Bien
6 pages
International University: Term Project Report Topic: Face Stream Verification System
No ratings yet
International University: Term Project Report Topic: Face Stream Verification System
10 pages
Absolutely
No ratings yet
Absolutely
6 pages
Face Login System Using Python - Technozune
No ratings yet
Face Login System Using Python - Technozune
8 pages
Face Recognisation (Image)
No ratings yet
Face Recognisation (Image)
4 pages
Solo Rfid
No ratings yet
Solo Rfid
5 pages
Bien
No ratings yet
Bien
5 pages
Fps +% +track+rec
No ratings yet
Fps +% +track+rec
4 pages
Afternoon - Quiz 2 Key
No ratings yet
Afternoon - Quiz 2 Key
6 pages
Subhash Arun Dwivedi - CV - Lab Report!
No ratings yet
Subhash Arun Dwivedi - CV - Lab Report!
26 pages
ALCANTARAuLaboratory 6 Image Processing Student - 031006
No ratings yet
ALCANTARAuLaboratory 6 Image Processing Student - 031006
9 pages
Lab Program 12
No ratings yet
Lab Program 12
11 pages
Track + Rec+ Database
No ratings yet
Track + Rec+ Database
3 pages
Appendix D
No ratings yet
Appendix D
13 pages
Recognizing Handwritten Digits With Scikit-Learn: Punam Seal
No ratings yet
Recognizing Handwritten Digits With Scikit-Learn: Punam Seal
21 pages
Code
No ratings yet
Code
4 pages
Lab Report 2
No ratings yet
Lab Report 2
9 pages
Prac 2 ACV-merged
No ratings yet
Prac 2 ACV-merged
8 pages
Creat His
No ratings yet
Creat His
8 pages
Topic
No ratings yet
Topic
13 pages
AI Project
No ratings yet
AI Project
9 pages
Object Oriented122
No ratings yet
Object Oriented122
8 pages
Import cv2
No ratings yet
Import cv2
6 pages
Built-In Wavelet Families and Wavelets: Objective
No ratings yet
Built-In Wavelet Families and Wavelets: Objective
5 pages
7a 47 49 61 Dip Code
No ratings yet
7a 47 49 61 Dip Code
8 pages
Word Extraction-1
No ratings yet
Word Extraction-1
2 pages
Digital Image Processing Lab Manual# 2
No ratings yet
Digital Image Processing Lab Manual# 2
6 pages
Assignment-4 NumPy Applications
No ratings yet
Assignment-4 NumPy Applications
2 pages
Project Review - Final B187
No ratings yet
Project Review - Final B187
15 pages
CV Lab File
No ratings yet
CV Lab File
39 pages
Open CV
No ratings yet
Open CV
4 pages
Nishu Project
No ratings yet
Nishu Project
12 pages
Software Project Face Detection System Using Open CV Final Report
No ratings yet
Software Project Face Detection System Using Open CV Final Report
2 pages
Mini Project
No ratings yet
Mini Project
9 pages
Building An Image Mosaic: SPI2 Final Project 2020
No ratings yet
Building An Image Mosaic: SPI2 Final Project 2020
2 pages
English Worksheet Set-1 PDF
No ratings yet
English Worksheet Set-1 PDF
37 pages
Best Face Rec PDF
No ratings yet
Best Face Rec PDF
1 page
25 Awesome Python Scripts
No ratings yet
25 Awesome Python Scripts
26 pages
Face - Recognition - Face - Recognition - Cli - Py at Master Ageitgey - Face - Recognition GitHub PDF
No ratings yet
Face - Recognition - Face - Recognition - Cli - Py at Master Ageitgey - Face - Recognition GitHub PDF
1 page
Python Mini Report PDF
100% (2)
Python Mini Report PDF
13 pages
TDC Barbarian 5e
No ratings yet
TDC Barbarian 5e
17 pages
Babok Review l0 Upd v1.0
No ratings yet
Babok Review l0 Upd v1.0
27 pages
ComputerGraphicsNotesWeek9 01 0418
No ratings yet
ComputerGraphicsNotesWeek9 01 0418
6 pages
Manibog v. People G.R. No. 211214 March 20 2019
No ratings yet
Manibog v. People G.R. No. 211214 March 20 2019
11 pages
Resolution No. 22 s2024 (Adopting The Health Action Plan)
No ratings yet
Resolution No. 22 s2024 (Adopting The Health Action Plan)
2 pages
Part-B Important MCQs
No ratings yet
Part-B Important MCQs
24 pages
Software Distribution Agreement
No ratings yet
Software Distribution Agreement
10 pages
Making It Happen G3 Pavement and Specification
No ratings yet
Making It Happen G3 Pavement and Specification
18 pages
Finance Mo1-Mo8
No ratings yet
Finance Mo1-Mo8
50 pages
Summer 2020 P1
No ratings yet
Summer 2020 P1
20 pages
OWASP Top Ten Web Application Vulnerabilities in J2EE
No ratings yet
OWASP Top Ten Web Application Vulnerabilities in J2EE
41 pages
Class Ix Study Material
No ratings yet
Class Ix Study Material
74 pages
ANP Midterm #1 Review Answer Key
No ratings yet
ANP Midterm #1 Review Answer Key
7 pages
2.leon Kleyn CV
No ratings yet
2.leon Kleyn CV
5 pages
Transmission Line Theory
No ratings yet
Transmission Line Theory
25 pages
Mec 201 26 Aug 20
No ratings yet
Mec 201 26 Aug 20
21 pages
Surgery Observation Paper
No ratings yet
Surgery Observation Paper
3 pages
14 G.R. No. 142773 People V Delim
No ratings yet
14 G.R. No. 142773 People V Delim
14 pages
Logical Fallacies - Study Notes Based On Fallacies - NTA UGC NET PAPER 1
No ratings yet
Logical Fallacies - Study Notes Based On Fallacies - NTA UGC NET PAPER 1
6 pages
PIG Paper: Dress Code in Herricks High School
100% (1)
PIG Paper: Dress Code in Herricks High School
14 pages
RK500-02 PH Sensor: Features
No ratings yet
RK500-02 PH Sensor: Features
3 pages
Arthur Schopenhauer Quotes
No ratings yet
Arthur Schopenhauer Quotes
4 pages
Mathematics 10 - Fourth Quarter Summative Test 2
100% (3)
Mathematics 10 - Fourth Quarter Summative Test 2
2 pages
(A) The Original Debtor Is Freed of Liability Since Novation Took Place and This Relieved Him of His Obligation
No ratings yet
(A) The Original Debtor Is Freed of Liability Since Novation Took Place and This Relieved Him of His Obligation
5 pages
Behavioral Counseling For STD/HIV Risk Reduction: Learning Objectives
No ratings yet
Behavioral Counseling For STD/HIV Risk Reduction: Learning Objectives
10 pages
Đề thi thử vào 10 môn tiếng anh 2022
No ratings yet
Đề thi thử vào 10 môn tiếng anh 2022
2 pages
Hanuman Chalisa With Meaning
100% (5)
Hanuman Chalisa With Meaning
4 pages
FY2019 IV On Demand Bio Data
No ratings yet
FY2019 IV On Demand Bio Data
3 pages
Persuasion Map
No ratings yet
Persuasion Map
1 page
Lect 9-10 Choosing Brand Elements To Build Brand Equity
No ratings yet
Lect 9-10 Choosing Brand Elements To Build Brand Equity
36 pages
TensorFlow深度学习项目实战: Chinese Edition
From Everand
TensorFlow深度学习项目实战: Chinese Edition
Posts & Telecom Press
No ratings yet
Angular Generative AI: Building an intelligent CV enhancer with Google Gemini
From Everand
Angular Generative AI: Building an intelligent CV enhancer with Google Gemini
Abdelfattah Ragab
No ratings yet
Firebase Storage for Angular: A reliable file upload solution for your applications
From Everand
Firebase Storage for Angular: A reliable file upload solution for your applications
Abdelfattah Ragab
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Final Assignment

Uploaded by

Final Assignment

Uploaded by

final_assignment file:///C:/Users/gsiddharth/Desktop/final_assignment.

In [1]: import zipfile

def get_face_cood_list(self, image_nparr):

def cut_faces(self, cood_list, filename):

def generate_contact_sheets(self, faces_list):

def ocr(self, image):

Testing for Chris in small file

Results found in file a-3.png

Testing for Mark in large file

Results found in file a-1.png

Results found in file a-10.png

Results found in file a-2.png

Results found in file a-3.png

Results found in file a-8.png

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Final Assignment

Uploaded by

Final Assignment

Uploaded by

final_assignment file:///C:/Users/gsiddharth/Desktop/final_assignment.

In [1]: import zipfile

def get_face_cood_list(self, image_nparr):

def cut_faces(self, cood_list, filename):

def generate_contact_sheets(self, faces_list):

def ocr(self, image):

*****Testing for Chris in small file*****

Results found in file a-3.png

*****Testing for Mark in large file*****

Results found in file a-1.png

Results found in file a-10.png

Results found in file a-2.png

Results found in file a-3.png

Results found in file a-8.png

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Testing for Chris in small file

Testing for Mark in large file