Skip to content

giuseppegambino/Scraping-TripAdvisor-with-Python-2020

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Scraping TripAdvisor with Python 2020 *

Python implementation of web scraping of TripAdvisor with Selenium in a new 2020 website.

There are two scripts:

  • "restaurants_scraper.py" to scrape restaurant
  • "things_to_do_scraper.py" to scrape hotels, attraction and monuments.

The python function is commented, write me if you have doubts. If you have a slow connection and you encounter code problems, try increasing the seconds of time.sleep () function

Features implemented:

  • The click function to open the "more" button of the reviews
  • The click function to change the page
  • Csv file with the date, the score, the title and the full review!

How to use:

  • First approach: download the python file, open it and edit the default fields (csv file path, number of pages, tripadvisor url)

  • Second approach: download the file and launch it directly from the terminal, passing:

    • the path of your csv file where the reviews will be stored
    • the number of pages of the desired website that you want to scrape
    • the url of tripadvisor website that you want to scrape

Code to paste into terminal: python3 path_to_downloaded_script/things_to_do_scraper.py desktop/reviews.csv 50 https://www.tripadvisor.com/Attraction_Review-g187791-d192285-Reviews-Colosseum-Rome_Lazio.html

What I used:

  • Python 3.8.2
  • Selenium 3.141.0
  • Safari 14.0.1
  • Visual Studio Code 1.51.1
  • Macbook Pro 13" M1 2020 with macOS Big Sur 11.0.1

*This activity has been supported by a grant from the Project IDEHA - PON "Ricerca e Innovazione" 2014-2020 - Innovation for Data Elaboration in Heritage Areas, Azione II

About

Python implementation of web scraping of TripAdvisor with Selenium in a new 2019 website

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy