Content-Length: 257333 | pFad | http://github.com/Smartproxy/python-scrapy-amazon/blob/main/README.md

10 python-scrapy-amazon/README.md at main · Smartproxy/python-scrapy-amazon · GitHub
Skip to content

Latest commit

 

History

History
35 lines (23 loc) · 1.25 KB

README.md

File metadata and controls

35 lines (23 loc) · 1.25 KB

Python Scrapy Amazon Scraper

Scrape Amazon product listings utilising scrapy & residential proxies

Prerequisites

To get started with Scrapy you will first need to install it using methods provided in their documentation. Check here for more information

Authentication & Proxy setup

Once you have an active subscription you can find your credentials & proxy addresses in Dashboard > Residential > Proxy Setup

Navigate to settings.py in /amazon/amazon/ folder and modify the following lines to authenticate.

SMARTPROXY_USER = 'SPusername' ## Smartproxy Username (Sub-user)
SMARTPROXY_PASSWORD = 'SPpassword' ## Password for your user
SMARTPROXY_ENDPOINT = 'gate.smartproxy.com' ## Endpoint you'd like to use
SMARTPROXY_PORT = '7000' ## Port of the endpoint you are using.

Running the scraper

Navigate to the project folder and run the following command

scrapy crawl amazon_search

Results

Amazon search results will be saved in /amazon/data folder in a .csv format









ApplySandwichStrip

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier!      Saves Data!


--- a PPN by Garber Painting Akron. With Image Size Reduction included!

Fetched URL: http://github.com/Smartproxy/python-scrapy-amazon/blob/main/README.md

Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy