Amazon Review Scraper With Keyword Frequency Analysis

A client needed a simple Amazon review scraping script for market research. The goal was to collect all customer reviews from a specific Amazon product page and analyze the language customers used to describe that product.

The client wanted to enter either an Amazon product URL or an ASIN and have the script scrape every review for that product, not just the first page of reviews. After collecting the reviews, the tool needed to calculate the most common words and phrases used by customers.

Project Requirements

The client needed a lightweight script that could:

  • Accept an Amazon product URL or ASIN
  • Extract the product title
  • Scrape all customer review text
  • Handle review pagination
  • Collect hundreds or thousands of reviews from one product
  • Analyze keyword frequency
  • Find common 1-word, 2-word, 3-word and 4-word phrases
  • Export results as a text file or HTML table

The main purpose was to understand how real customers described a product in their own words.

Main Challenge

The key challenge was collecting all reviews from the product, not just the visible reviews on the first page. Amazon review pages are paginated, so the scraper needed to move through each review page until all available reviews were collected.
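The pagination logic can be illustrated with a minimal sketch. It is not the delivered script: fetch_page is a hypothetical callable that returns the review texts found on a given page number, and the sketch assumes that an empty page marks the end of the available reviews. The max_pages limit is likewise an assumed safety guard.

```python
from typing import Callable, Iterator, List


def iter_all_reviews(
    fetch_page: Callable[[int], List[str]],
    max_pages: int = 500,
) -> Iterator[str]:
    """Yield review texts page by page until a page comes back empty."""
    for page in range(1, max_pages + 1):
        reviews = fetch_page(page)
        if not reviews:  # assumption: an empty page means no more reviews
            break
        yield from reviews


# Stand-in for a real page fetcher: two fake pages of reviews, then an empty one.
fake_pages = {1: ["Great blender", "Too loud"], 2: ["Works well"], 3: []}
all_reviews = list(iter_all_reviews(lambda page: fake_pages.get(page, [])))
```

Passing the fetcher in as a parameter keeps the loop independent of how a page is actually downloaded and parsed, which also makes it easy to test without network access.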

Another important part was cleaning the review text before analysis. Customer reviews often contain repeated words, punctuation, short filler words and inconsistent formatting. Without proper cleaning, the keyword frequency output would not be useful.
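As a rough illustration of this cleaning step, the sketch below lowercases the text, strips punctuation and optionally drops filler words. The function name clean_review and the short STOPWORDS set are assumptions made for the example; a real word list would be longer and tuned to the product category.

```python
import re
from typing import List

# Illustrative filler-word list only; not the list used in the project.
STOPWORDS = {"a", "an", "and", "the", "it", "is", "to", "of", "i", "this", "was"}


def clean_review(text: str, drop_stopwords: bool = True) -> List[str]:
    """Lowercase a review, strip punctuation and return its word tokens."""
    # Keep runs of letters, digits and apostrophes; everything else is a separator.
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    # Remove stray leading/trailing apostrophes and discard empty leftovers.
    tokens = [t.strip("'") for t in tokens]
    tokens = [t for t in tokens if t]
    if drop_stopwords:
        tokens = [t for t in tokens if t not in STOPWORDS]
    return tokens


words = clean_review("This blender is GREAT!!! It's the best.")
```

For multi-word phrases it can be useful to call the function with drop_stopwords=False, since phrases such as "easy to clean" lose their meaning once the filler words are removed.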

Our Solution

We built a custom Amazon review scraper that accepted either a direct Amazon URL or an ASIN as input. The script extracted the product title, visited the review pages, handled pagination and collected review text from each page.
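Accepting either input form comes down to normalizing it to an ASIN first. The sketch below assumes an ASIN is a 10-character alphanumeric code and that it appears in product URLs after /dp/, /gp/product/ or /product-reviews/. The function name to_asin and the sample ASIN B0EXAMPLE1 are made up for illustration.

```python
import re

# Assumption: an ASIN is exactly 10 uppercase letters or digits.
ASIN_RE = re.compile(r"^[A-Z0-9]{10}$")
# Assumption: the ASIN follows one of these path segments in a product URL.
URL_ASIN_RE = re.compile(
    r"/(?:dp|gp/product|product-reviews)/([A-Z0-9]{10})(?:[/?#]|$)"
)


def to_asin(user_input: str) -> str:
    """Return the ASIN for a bare ASIN or an Amazon product URL."""
    value = user_input.strip()
    if ASIN_RE.match(value.upper()):
        return value.upper()
    match = URL_ASIN_RE.search(value)
    if match:
        return match.group(1)
    raise ValueError(f"Could not find an ASIN in: {user_input!r}")


asin_from_url = to_asin("https://www.amazon.com/Some-Product/dp/B0EXAMPLE1/ref=sr_1_1")
asin_from_code = to_asin(" b0example1 ")
```

Both calls produce the same ASIN, so the rest of the script only ever has to deal with one input format.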

Once the reviews were scraped, the tool processed the review text and generated keyword frequency results. It counted the most frequent single words as well as common 2-word, 3-word and 4-word phrases.
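Counting 1-word to 4-word phrases is a standard n-gram count. The sketch below shows one way to do it with collections.Counter, assuming each review has already been split into a list of lowercase word tokens. The name count_ngrams and the sample reviews are illustrative only.

```python
from collections import Counter
from typing import Dict, Iterable, List, Tuple


def count_ngrams(
    token_lists: Iterable[List[str]],
    max_n: int = 4,
    top_k: int = 10,
) -> Dict[int, List[Tuple[str, int]]]:
    """Return the top_k most common phrases for each length from 1 to max_n."""
    counters = {n: Counter() for n in range(1, max_n + 1)}
    for tokens in token_lists:
        for n in range(1, max_n + 1):
            # Slide a window of n words across this review only, so that
            # phrases never span the boundary between two reviews.
            for i in range(len(tokens) - n + 1):
                counters[n][" ".join(tokens[i:i + n])] += 1
    return {n: counter.most_common(top_k) for n, counter in counters.items()}


sample_reviews = [
    ["easy", "to", "clean"],
    ["very", "easy", "to", "clean"],
    ["easy", "to", "use"],
]
top_phrases = count_ngrams(sample_reviews)
```

In this sample, "easy to" is the most common 2-word phrase (3 occurrences) and "easy to clean" is the most common 3-word phrase (2 occurrences).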

Data Extracted

The scraper collected:

  • Product ASIN
  • Product title
  • Review title
  • Review text
  • Review page URL
  • Total reviews processed

Keyword Analysis Output

The analysis generated:

  • Most common words
  • Most common 2-word phrases
  • Most common 3-word phrases
  • Most common 4-word phrases
  • Occurrence count for each keyword or phrase

This helped the client identify repeated customer language, common product benefits, complaints and buying triggers.
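For the HTML table export listed in the requirements, a minimal sketch could look like the following. The function name to_html_table and the column labels are assumptions for the example; the input is a list of (phrase, count) pairs.

```python
from html import escape
from typing import List, Tuple


def to_html_table(rows: List[Tuple[str, int]], heading: str) -> str:
    """Render (phrase, count) pairs as a simple HTML table."""
    lines = [
        "<table>",
        f"  <caption>{escape(heading)}</caption>",
        "  <tr><th>Phrase</th><th>Count</th></tr>",
    ]
    for phrase, count in rows:
        # Escape review text so characters like < or & cannot break the markup.
        lines.append(f"  <tr><td>{escape(phrase)}</td><td>{count}</td></tr>")
    lines.append("</table>")
    return "\n".join(lines)


table_html = to_html_table(
    [("easy to clean", 2), ("love it <3", 1)],
    heading="Most common 3-word phrases",
)
```

Because review text is user-generated, escaping each phrase before writing it into the table matters more here than in a typical report.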

Final Result

The client received a simple Amazon review scraping and keyword analysis tool that could be run from the command line or a browser. Instead of manually reading hundreds of reviews, the client could quickly extract customer language and use the results for product research, listing optimization, ad copy and market analysis.
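A command-line entry point for a tool like this might be sketched as follows. The argument names (product, --format, --top) and their defaults are hypothetical and only show how the inputs described above could be exposed.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for a hypothetical review-analysis command."""
    parser = argparse.ArgumentParser(
        description="Scrape reviews for one product and count common phrases."
    )
    parser.add_argument("product", help="Amazon product URL or ASIN")
    parser.add_argument(
        "--format",
        choices=["text", "html"],
        default="text",
        help="output as a text file or an HTML table",
    )
    parser.add_argument(
        "--top",
        type=int,
        default=20,
        help="number of words and phrases to report per phrase length",
    )
    return parser


# Parse an example argument list instead of the real command line.
args = build_parser().parse_args(["B0EXAMPLE1", "--format", "html"])
```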

This case study shows how Amazon review scraping can be combined with text analysis to turn customer feedback into useful product and marketing insights.