📦 Amazon Web Scraping and Analysis Project

📌 Overview

This project is a Python-based Amazon web scraper that extracts product details (title, price, rating, number of reviews, and link) for a search keyword supplied by the user. The scraped data is then analyzed and visualized, and the product with the highest score, computed as (Rating × Reviews) / Price, is recommended.

🚀 Features

✅ Scrapes product data from Amazon using BeautifulSoup and requests
✅ Allows users to input search keywords and number of pages to scrape
✅ Performs data analysis on extracted product data (Price, Rating, Reviews)
✅ Uses Seaborn & Matplotlib for visualization
✅ Recommends the best product based on a calculated score

📁 Project Structure

amazon-scraper/
│── amazon_scraper.py       # Web scraper script  
│── form_server.py          # Flask server for user input  
│── templates/  
│   ├── index.html          # Web form UI  
│── static/  
│   ├── style.css           # Styling for web pages  
│── amazon_products.csv     # Scraped data (Generated)  
│── README.md               # Project Documentation  
│── requirements.txt        # Dependencies

🛠️ Technologies Used

Python (BeautifulSoup, requests, pandas)
Flask (for the web interface)
Matplotlib & Seaborn (for data visualization)
GitHub (for version control)

🎯 How It Works

Run the Flask server → Opens a webpage to enter a search keyword & number of pages
Scrapes Amazon → Extracts product details and saves them in a CSV file
Performs Data Analysis → Determines the best product using a score metric
Displays Results → Shows analysis graphs and the best product link
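
Between the scraping step and the analysis step, the scraped text has to become numbers before a score can be computed. The sketch below shows one way this could be done. The helper names and the input formats ("$1,299.99", "4.5 out of 5 stars", "1,234") are assumptions for illustration, not taken from amazon_scraper.py, and abbreviated counts such as "1.2K" are not handled.

```python
import re


def parse_price(text):
    """Return the first number in a price string as a float, or None."""
    match = re.search(r"\d[\d,]*(?:\.\d+)?", text or "")
    return float(match.group().replace(",", "")) if match else None


def parse_rating(text):
    """Return the leading number of a rating string as a float, or None."""
    match = re.search(r"\d+(?:\.\d+)?", text or "")
    return float(match.group()) if match else None


def parse_review_count(text):
    """Return all digits of a review-count string as an int, or None."""
    digits = re.sub(r"[^\d]", "", text or "")
    return int(digits) if digits else None
```

Returning None for unparseable values, rather than 0, keeps missing data from silently distorting the score later on.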

📌 Installation & Usage

🔹 Prerequisites

Make sure Python 3 or later is installed, then install the dependencies:

pip install -r requirements.txt

🚀 Running the Project

🔹 Step 1: Start the Flask Server

Run the following command in your terminal:

python form_server.py

🔹 Step 2: Open the Web Interface

Open your browser and go to:

http://localhost:5000

Enter the search keyword (e.g., "laptop") and the number of pages to scrape.

Click the "Scrape" button to start the process.
The scraper will extract product details and save them in a CSV file.

Once completed, the page will display:

✅ A link to the scraped data
📊 Visualizations of the analysis
🏆 A link to the best-recommended product
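
The keyword and page count entered in the form need to be validated and turned into one search URL per page. A minimal sketch follows, assuming the search endpoint and its `k` and `page` query parameters; `build_search_urls` is a hypothetical helper, not a function from form_server.py.

```python
from urllib.parse import urlencode

# Assumed search endpoint; the project's scraper may target a different domain.
BASE_URL = "https://www.amazon.com/s"


def build_search_urls(keyword, pages):
    """Validate the form input and return one search URL per page."""
    keyword = keyword.strip()
    if not keyword:
        raise ValueError("keyword must not be empty")
    pages = int(pages)  # form values arrive as strings
    if pages < 1:
        raise ValueError("pages must be at least 1")
    return [
        f"{BASE_URL}?{urlencode({'k': keyword, 'page': page})}"
        for page in range(1, pages + 1)
    ]
```

For example, `build_search_urls("laptop", "2")` returns two URLs, one ending in `page=1` and one ending in `page=2`.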

📊 Data Analysis & Visualization

🔹 Scatter Plot: Rating vs. Reviews

X-axis: Number of Reviews
Y-axis: Rating
Bubble Size: Product Price
Purpose: Shows the relationship between customer ratings and the number of reviews.
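
A plot like this one could be produced roughly as follows. This is a sketch using plain Matplotlib, not the project's actual plotting code. The function name, the dictionary keys, and the bubble-size scaling are assumptions, and prices are assumed to be positive.

```python
import matplotlib

matplotlib.use("Agg")  # render off-screen, since a web server has no display
import matplotlib.pyplot as plt


def plot_rating_vs_reviews(products, path=None):
    """Scatter rating against review count, with bubble size scaled by price."""
    reviews = [p["reviews"] for p in products]
    ratings = [p["rating"] for p in products]
    max_price = max(p["price"] for p in products)
    # Scale marker areas so the most expensive product gets the largest bubble.
    sizes = [30 + 300 * p["price"] / max_price for p in products]

    fig, ax = plt.subplots(figsize=(8, 5))
    ax.scatter(reviews, ratings, s=sizes, alpha=0.6)
    ax.set_xlabel("Number of Reviews")
    ax.set_ylabel("Rating")
    ax.set_title("Rating vs. Reviews (bubble size = price)")
    if path:
        fig.savefig(path, bbox_inches="tight")
    return fig, ax
```

Passing a `path` saves the figure as an image file that a results page could embed.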

🔹 Scatter Plot: Price vs. Score

X-axis: Price
Y-axis: Score, calculated as (Rating × Reviews) / Price
Color Gradient: Score Intensity
Purpose: Identifies the most cost-effective product based on rating and popularity.
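
The second plot could be sketched in the same way, with a colormap standing in for the score gradient. The function name, dictionary keys, and the choice of the "viridis" colormap are assumptions for illustration.

```python
import matplotlib

matplotlib.use("Agg")  # render off-screen, since a web server has no display
import matplotlib.pyplot as plt


def plot_price_vs_score(products, path=None):
    """Scatter score against price, coloring each point by its score."""
    prices = [p["price"] for p in products]
    # Score as defined in this README: (Rating * Reviews) / Price.
    scores = [p["rating"] * p["reviews"] / p["price"] for p in products]

    fig, ax = plt.subplots(figsize=(8, 5))
    points = ax.scatter(prices, scores, c=scores, cmap="viridis")
    fig.colorbar(points, ax=ax, label="Score")
    ax.set_xlabel("Price")
    ax.set_ylabel("Score")
    ax.set_title("Price vs. Score")
    if path:
        fig.savefig(path, bbox_inches="tight")
    return fig, ax
```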

🏆 Best Product Selection Criteria

The best product is determined using the formula: $$\text{Score} = \frac{\text{Rating} \times \text{Reviews}}{\text{Price}}$$
The product with the highest score is recommended.
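
As a worked illustration of this rule, the sketch below scores a few made-up products and picks the highest. The product data and the helper names are hypothetical, and skipping rows with a missing or zero price is an assumption about how incomplete data should be handled.

```python
def score(product):
    """Score = (Rating * Reviews) / Price, as defined in this README."""
    return product["rating"] * product["reviews"] / product["price"]


def best_product(products):
    """Return the product with the highest score, or None if none can be scored."""
    scorable = [
        p for p in products
        if p.get("price")  # skips missing and zero prices (avoids division by zero)
        and p.get("rating") is not None
        and p.get("reviews") is not None
    ]
    return max(scorable, key=score) if scorable else None


# Made-up example data, not scraped results.
products = [
    {"title": "A", "rating": 4.5, "reviews": 1200, "price": 60.0},
    {"title": "B", "rating": 4.8, "reviews": 300, "price": 40.0},
    {"title": "C", "rating": 4.0, "reviews": 5000, "price": 400.0},
    {"title": "D", "rating": 4.9, "reviews": 10, "price": None},
]
winner = best_product(products)
```

Here product A wins with a score of (4.5 × 1200) / 60 = 90, ahead of C at 50 and B at roughly 36. Because the score is proportional to the review count, it favors cheap, heavily reviewed products over higher-rated ones with few reviews.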

📌 Visualization Output

The analysis graphs are generated using Matplotlib & Seaborn.
The graphs are displayed on the results page along with the best product link.

💡 Future Improvements

✅ Implement Scrapy for faster scraping
✅ Enhance error handling for blocked requests
✅ Deploy on cloud (Heroku/AWS) for remote access

📜 License

This project is for educational purposes only. Amazon does not allow automated scraping of its site, so use this scraper responsibly.
