
Reddit Scraper

Scrape Reddit utilising Smartproxy's Web Scraping API

Dependencies

BeautifulSoup

Authentication

Once you have an active Web Scraping API subscription, you can send a test request straight from the dashboard: open the Web Scraping API > API playground tab and click Send Request. An example curl request is generated on the right.

This Python code example uses Base64-encoded user:pass authentication.
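As a rough illustration of what Base64-encoded user:pass authentication means, the sketch below builds a standard HTTP Basic `Authorization` header using only the Python standard library. The helper name `basic_auth_header` and the placeholder credentials are hypothetical and are not taken from `reddit_python_scraper.py`; check the dashboard's generated curl request for the exact headers the API expects.

```python
import base64


def basic_auth_header(username, password):
    """Return an HTTP Basic auth header for the given credentials.

    Hypothetical helper for illustration; not part of the original script.
    """
    # Join the credentials as "user:pass", then Base64-encode the bytes.
    raw = f"{username}:{password}".encode("utf-8")
    token = base64.b64encode(raw).decode("ascii")
    return {"Authorization": f"Basic {token}"}


# Placeholder credentials; substitute your own API username and password.
headers = basic_auth_header("user", "pass")
print(headers)
```

Note that Base64 is an encoding, not encryption, so the resulting token should be treated as carefully as the plain password.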

| Parser type | Example location | Download |
| --- | --- | --- |
| HTML to JSON | reddit_python_scraper.py | `curl https://raw.githubusercontent.com/Smartproxy/reddit-python-scraper/blob/main/reddit_python_scraper.py > reddit_python_scraper.py` |

HTML to JSON

This Python script extracts subreddit details, post data, and comments straight from the HTML of a Reddit post page and saves them to a JSON file.
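The general HTML-to-JSON flow can be sketched as follows, assuming BeautifulSoup is installed as `bs4`. Everything specific here is an assumption made for illustration: the sample markup, the CSS selectors, the output field names, and the helpers `parse_post` and `save_json` are hypothetical. Reddit's real page structure differs, and the actual script may work differently.

```python
import json

from bs4 import BeautifulSoup

# Hypothetical sample markup; Reddit's real HTML uses different tags and classes.
SAMPLE_HTML = """
<html><body>
  <span class="subreddit">r/example</span>
  <h1 class="post-title">Example post</h1>
  <div class="comment">First comment</div>
  <div class="comment">Second comment</div>
</body></html>
"""


def parse_post(html):
    """Extract subreddit, title, and comments from a post page (illustrative selectors)."""
    soup = BeautifulSoup(html, "html.parser")
    subreddit = soup.select_one("span.subreddit")
    title = soup.select_one("h1.post-title")
    return {
        # Fall back to None when an element is missing instead of raising.
        "subreddit": subreddit.get_text(strip=True) if subreddit else None,
        "title": title.get_text(strip=True) if title else None,
        "comments": [c.get_text(strip=True) for c in soup.select("div.comment")],
    }


def save_json(data, path):
    """Write the parsed data to a JSON file."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(data, f, ensure_ascii=False, indent=2)


post = parse_post(SAMPLE_HTML)
```

Calling `save_json(post, "post.json")` would then write the parsed dictionary to disk. In practice the HTML would come from the API response rather than a hard-coded string.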








