Amazon Product Weight Data Scraping

A client needed to fill in missing weight data for nearly 3,000 products. Their dataset already contained UPC codes but lacked shipping information such as package weight. They wanted an automated way to look up each product on Amazon by UPC and return its weight in a structured Excel or database format.

To solve this, we developed a custom Amazon scraping solution that matched UPC codes with Amazon product listings and extracted weight-related product information directly from Amazon pages and product metadata. The system was designed to process thousands of products efficiently while maintaining accurate product matching.

Project Requirements

The client provided a list of products containing UPC_10 identifiers. The goal was to:

  • Search Amazon products using UPC codes
  • Identify the correct matching product
  • Extract product weight details
  • Return the final dataset in Excel format
  • Automate the process for thousands of products
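The write-up does not say exactly what the UPC_10 column holds. If it is the ten-digit body of a UPC-A code (without the leading number-system digit and the trailing check digit), one possible first step is to rebuild the full 12-digit code before searching. The sketch below shows that idea; the function names and the default number-system digit of "0" are assumptions for illustration, not details from the project.

```python
def upc_a_check_digit(first_eleven: str) -> int:
    """Return the UPC-A check digit for the first 11 digits of a code."""
    if len(first_eleven) != 11 or not first_eleven.isdigit():
        raise ValueError("expected exactly 11 digits")
    digits = [int(ch) for ch in first_eleven]
    odd_sum = sum(digits[0::2])   # 1st, 3rd, ..., 11th digits
    even_sum = sum(digits[1::2])  # 2nd, 4th, ..., 10th digits
    return (10 - (odd_sum * 3 + even_sum) % 10) % 10


def upc10_to_upc_a(upc_10: str, number_system: str = "0") -> str:
    """Rebuild a 12-digit UPC-A from a 10-digit body.

    Assumption: the missing leading digit is `number_system` ("0" here).
    Leading zeros lost in a spreadsheet are restored with zfill.
    """
    body = number_system + upc_10.strip().zfill(10)
    return body + str(upc_a_check_digit(body))


print(upc10_to_upc_a("3600029145"))
```

If the column turns out to hold something else, only the padding and prefix logic would need to change; the check-digit arithmetic is the standard UPC-A rule.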

Challenges

One of the biggest challenges was product matching accuracy. Some UPC codes returned multiple Amazon listings, while others returned listings with inconsistent or incomplete product information. In some cases, weight data appeared in different sections of the product page depending on the category.

Additional challenges included:

  • Missing structured weight fields
  • Inconsistent product metadata
  • Duplicate listings
  • Amazon page structure variations
  • Handling large-scale lookup requests efficiently
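Because weight values can show up as free text in different places and units, a normalization step helps make them comparable. The following is a minimal sketch of that idea, assuming US-style numbers and a small set of unit spellings; the example strings and the function name `parse_weight_grams` are illustrative and not taken from the project.

```python
import re
from typing import Optional

# Conversion factors to grams for the unit spellings this sketch recognizes.
_GRAMS_PER_UNIT = {
    "lb": 453.59237, "lbs": 453.59237, "pound": 453.59237, "pounds": 453.59237,
    "oz": 28.349523125, "ounce": 28.349523125, "ounces": 28.349523125,
    "kg": 1000.0, "kilogram": 1000.0, "kilograms": 1000.0,
    "g": 1.0, "gram": 1.0, "grams": 1.0,
}

# A number followed by a word, e.g. "12 Ounces" or "0.5lb".
_NUMBER_UNIT = re.compile(r"(\d+(?:\.\d+)?)\s*([a-zA-Z]+)")


def parse_weight_grams(text: str) -> Optional[float]:
    """Return the first recognizable weight in `text`, in grams, else None."""
    # Assumption: commas are thousands separators ("1,200 g"), so drop them.
    cleaned = text.replace(",", "")
    for match in _NUMBER_UNIT.finditer(cleaned):
        unit = match.group(2).lower()
        if unit in _GRAMS_PER_UNIT:
            return float(match.group(1)) * _GRAMS_PER_UNIT[unit]
    return None  # no weight found; the caller can flag the row for review


for sample in ("Item Weight: 12 Ounces", "3.5 x 2 x 1 inches; 4 Ounces", "n/a"):
    print(sample, "->", parse_weight_grams(sample))
```

Returning None rather than a default value keeps missing weights visible, so they can be counted or rechecked instead of silently entering the dataset.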

Our Solution

We built a custom Amazon data extraction workflow that combined UPC-based product lookup with automated scraping logic. The scraper searched Amazon product listings, identified the best matching ASIN, and extracted package weight and shipping-related details from the product page.
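How the best matching ASIN was chosen is not described in detail. One common approach, sketched below, is to compare each candidate listing's title with a product name from the client's catalog and keep the closest one above a threshold. This assumes the client's list includes a product name per UPC; the `Candidate` type, the 0.6 threshold, and the ASIN values are placeholders for illustration.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import Optional, Sequence


@dataclass(frozen=True)
class Candidate:
    asin: str   # placeholder ASINs are used in the example below
    title: str


def title_similarity(a: str, b: str) -> float:
    """Case-insensitive similarity between two titles, from 0.0 to 1.0."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def pick_best_match(
    reference_title: str,
    candidates: Sequence[Candidate],
    min_score: float = 0.6,  # hypothetical threshold; tune on real data
) -> Optional[Candidate]:
    """Return the most similar candidate, or None if none is close enough."""
    best: Optional[Candidate] = None
    best_score = min_score
    for candidate in candidates:
        score = title_similarity(reference_title, candidate.title)
        if score >= best_score:
            best, best_score = candidate, score
    return best


listings = [
    Candidate("B000000001", "Acme Stainless Steel Water Bottle, 32 oz, Blue"),
    Candidate("B000000002", "Generic Plastic Cup 12 pack"),
]
match = pick_best_match("Acme Stainless Steel Water Bottle 32 oz", listings)
print(match.asin if match else "needs manual review")
```

Returning None when no candidate clears the threshold lets ambiguous UPCs be routed to manual review instead of being matched to the wrong listing.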

The workflow included:

  • UPC to ASIN matching
  • Automated Amazon product page scraping
  • Product metadata extraction
  • Data validation checks
  • Export to Excel and database-ready formats

The scraper was optimized to process thousands of products while minimizing failed lookups and incorrect matches.
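The specific validation checks are not listed here, so the following is only a sketch of what such checks might look like: a format check on the UPC, a check that the ASIN is ten uppercase alphanumeric characters, and a plausibility range for the weight. The function name, the accepted UPC lengths, and the 100 kg cap are assumptions chosen for illustration.

```python
import re
from typing import List, Optional

ASIN_PATTERN = re.compile(r"[A-Z0-9]{10}")
MAX_PLAUSIBLE_GRAMS = 100_000.0  # hypothetical sanity cap (100 kg)


def validation_problems(
    upc: str, asin: Optional[str], weight_grams: Optional[float]
) -> List[str]:
    """Return a list of human-readable problems; an empty list means the row passes."""
    problems: List[str] = []
    if not (upc.isdigit() and len(upc) in (10, 12)):
        problems.append("upc is not 10 or 12 digits")
    if asin is None or not ASIN_PATTERN.fullmatch(asin):
        problems.append("asin missing or malformed")
    if weight_grams is None:
        problems.append("weight missing")
    elif not 0 < weight_grams <= MAX_PLAUSIBLE_GRAMS:
        problems.append("weight outside plausible range")
    return problems


print(validation_problems("036000291452", "B000000001", 340.2))
print(validation_problems("36000", None, -1.0))
```

Collecting every problem for a row, rather than stopping at the first, makes it easier to see whether failures come from the lookup, the match, or the extraction step.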

Data Extracted

The final dataset included:

  • UPC code
  • Product title
  • ASIN
  • Product weight
  • Shipping weight
  • Product URL
  • Category information
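As an illustration of how these fields could be held in code and handed over, the sketch below defines one record per product and writes the records as CSV, which Excel and most databases can import. The class name, field names, and sample values (including the ASIN and URL) are placeholders, and CSV stands in here for whatever export format the project actually used.

```python
import csv
import io
from dataclasses import asdict, dataclass, fields
from typing import Iterable


@dataclass
class ProductWeightRecord:
    upc: str
    title: str
    asin: str
    product_weight: str
    shipping_weight: str
    url: str
    category: str


def records_to_csv(records: Iterable[ProductWeightRecord]) -> str:
    """Serialize records to CSV text with a header row in field order."""
    buffer = io.StringIO()
    column_names = [f.name for f in fields(ProductWeightRecord)]
    writer = csv.DictWriter(buffer, fieldnames=column_names)
    writer.writeheader()
    for record in records:
        writer.writerow(asdict(record))
    return buffer.getvalue()


sample = ProductWeightRecord(
    upc="036000291452",
    title="Example Product, 2 Pack",  # the comma is quoted automatically
    asin="B000000001",                # placeholder ASIN
    product_weight="12 ounces",
    shipping_weight="14 ounces",
    url="https://www.amazon.com/dp/B000000001",  # placeholder URL
    category="Example Category",
)
print(records_to_csv([sample]))
```

Keeping the UPC as a string rather than a number preserves leading zeros when the file is reloaded.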

Final Result

The client received a clean and structured dataset containing product weight information for thousands of Amazon listings. The automated workflow significantly reduced manual research time and made it easier to enrich their existing product catalog with accurate shipping data.

This project demonstrates how custom Amazon scraping solutions can be used to collect missing marketplace data at scale for inventory management, logistics, catalog enrichment, and eCommerce operations.