This dataset is taken from
- To prune the data by eliminating duplicates, removing stopwords, and performing stemming, in order to obtain the positive and negative reviews, which can then be used for applying models.
This dataset consists of reviews of fine foods from Amazon.
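The cleaning steps named above could be sketched roughly as follows. This is a minimal illustration in plain Python, not the actual pipeline. Several parts are assumptions: the duplicate key (`UserId`, `Time`, `Text`), the rule that scores above 3 are positive, scores below 3 are negative, and a score of 3 is skipped, the tiny `STOPWORDS` set, and the `crude_stem` helper, which is a hypothetical suffix stripper standing in for a real stemmer such as Porter's.

```python
import re

# Tiny illustrative stopword list (an assumption; a real run would use a fuller list).
STOPWORDS = {"a", "an", "and", "the", "is", "it", "this", "of", "to", "i", "was"}


def crude_stem(word):
    """Strip a few common suffixes. A crude stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def clean_text(text):
    """Lowercase, keep alphabetic tokens, drop stopwords, then stem."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return " ".join(crude_stem(t) for t in tokens if t not in STOPWORDS)


def preprocess(reviews):
    """Drop duplicates and neutral reviews, then label and clean the rest."""
    seen = set()
    result = []
    for r in reviews:
        key = (r["UserId"], r["Time"], r["Text"])  # assumed duplicate key
        if key in seen or r["Score"] == 3:  # assumed: neutral reviews are skipped
            continue
        seen.add(key)
        label = "positive" if r["Score"] > 3 else "negative"  # assumed threshold
        result.append({"label": label, "text": clean_text(r["Text"])})
    return result


# Made-up sample rows, for illustration only.
sample = [
    {"UserId": "U1", "Time": 1300000000, "Score": 5, "Text": "I loved this tasty coffee"},
    {"UserId": "U1", "Time": 1300000000, "Score": 5, "Text": "I loved this tasty coffee"},
    {"UserId": "U2", "Time": 1310000000, "Score": 1, "Text": "The cookies were stale"},
    {"UserId": "U3", "Time": 1320000000, "Score": 3, "Text": "It was okay"},
]
cleaned = preprocess(sample)
```

With the sample rows, the exact duplicate and the score-3 review are dropped, leaving one positive and one negative review with cleaned text.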
The data span a period of more than 10 years, including all ~500,000 reviews up to October 2012.
Reviews include product and user information, ratings, and a plain text review.
It also includes reviews from all other Amazon categories.
Dataset link
Number of reviews: 568,454
Number of users: 256,059
Number of products: 74,258
Timespan: Oct 1999 - Oct 2012
Number of attributes/columns in data: 10
Number of users with more than 50 reviews: 260
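Summary figures of this kind can be recomputed from the rows themselves. The sketch below shows one way to do it on a handful of made-up rows. The `summarize` function and its `heavy_threshold` parameter are hypothetical names, and the code assumes that `Time` holds Unix epoch seconds.

```python
from collections import Counter
from datetime import datetime, timezone


def month_year(epoch_seconds):
    """Format an epoch timestamp (assumed to be in seconds) as e.g. 'Oct 1999'."""
    return datetime.fromtimestamp(epoch_seconds, tz=timezone.utc).strftime("%b %Y")


def summarize(rows, heavy_threshold=50):
    """Count reviews, distinct users and products, timespan, and heavy reviewers."""
    per_user = Counter(r["UserId"] for r in rows)
    times = [int(r["Time"]) for r in rows]
    return {
        "reviews": len(rows),
        "users": len(per_user),
        "products": len({r["ProductId"] for r in rows}),
        "timespan": (month_year(min(times)), month_year(max(times))),
        # Users with strictly more than heavy_threshold reviews.
        "heavy_users": sum(1 for n in per_user.values() if n > heavy_threshold),
    }


# Made-up rows, for illustration only.
rows = [
    {"UserId": "U1", "ProductId": "P1", "Time": 939340800},
    {"UserId": "U1", "ProductId": "P2", "Time": 1000000000},
    {"UserId": "U1", "ProductId": "P1", "Time": 1100000000},
    {"UserId": "U2", "ProductId": "P3", "Time": 1351209600},
]
stats = summarize(rows, heavy_threshold=2)
```

On the full dataset the default `heavy_threshold=50` would correspond to the "more than 50 reviews" figure. The lower threshold here only keeps the toy example small.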
Id: Row ID
ProductId: Unique identifier for the product
UserId: Unique identifier for the user
ProfileName: Profile name of the user
HelpfulnessNumerator: Number of users who found the review helpful
HelpfulnessDenominator: Number of users who indicated whether they found the review helpful
Score: Rating between 1 and 5
Time: Timestamp for the review
Summary: Brief summary of the review
Text: Text of the review
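To make the ten columns concrete, the following sketch reads rows into a typed record and derives a helpfulness ratio from the two helpfulness columns. It assumes a CSV export whose header uses the column names above. The `Review` class, `parse_reviews`, `helpfulness_ratio`, and the sample rows are all hypothetical and exist only for illustration.

```python
import csv
import io
from dataclasses import dataclass
from typing import Optional


@dataclass
class Review:
    id: int
    product_id: str
    user_id: str
    profile_name: str
    helpfulness_numerator: int
    helpfulness_denominator: int
    score: int
    time: int
    summary: str
    text: str

    def helpfulness_ratio(self) -> Optional[float]:
        """Share of voters who found the review helpful, or None if nobody voted."""
        if self.helpfulness_denominator == 0:
            return None
        return self.helpfulness_numerator / self.helpfulness_denominator


def parse_reviews(handle):
    """Read CSV rows (header assumed to match the column names) into Review objects."""
    return [
        Review(
            id=int(row["Id"]),
            product_id=row["ProductId"],
            user_id=row["UserId"],
            profile_name=row["ProfileName"],
            helpfulness_numerator=int(row["HelpfulnessNumerator"]),
            helpfulness_denominator=int(row["HelpfulnessDenominator"]),
            score=int(row["Score"]),
            time=int(row["Time"]),
            summary=row["Summary"],
            text=row["Text"],
        )
        for row in csv.DictReader(handle)
    ]


# Made-up sample data, for illustration only.
SAMPLE_CSV = "\n".join([
    "Id,ProductId,UserId,ProfileName,HelpfulnessNumerator,"
    "HelpfulnessDenominator,Score,Time,Summary,Text",
    '1,P001,U001,Sample User,3,4,5,1300000000,Great,"Tasty, would buy again"',
    "2,P002,U002,Another User,0,0,2,1310000000,Meh,Too sweet",
])
reviews = parse_reviews(io.StringIO(SAMPLE_CSV))
```

Returning `None` when the denominator is zero keeps reviews with no votes distinguishable from reviews that every voter found unhelpful.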