Capstone Project

Python-Web Scraping

Objective

Develop a Python application to scrape and analyze product data from Snapdeal's men's sports shoes category, accessible at Snapdeal Men's Sports Shoes. The goal is to extract detailed information on various products to facilitate market analysis, price comparison, and consumer insights.

 Scope of Work

1. Data Extraction:

Retrieve the following details for each product:

  • Product Name
  • Brand
  • Price (Original and Discounted)
  • Discount Percentage
  • Product Rating and Number of Reviews

2. Data Storage:

Store the extracted data in a structured format such as CSV for ease of analysis

 Scope of Work

4. Visualization (Optional):

Create visual representations (e.g., bar charts, histograms) to illustrate findings from the data analysis

3. Data Analysis (Optional):

  • Identify top brands based on the number of products listed
  • Analyze price distribution and discount trends
  • Determine the most common sizes available

Technical Requirements

  • Python
  • requests and BeautifulSoup for web scraping 
  • pandas for data manipulation 
  • matplotlib or seaborn for data visualization (if analysis is performed)

Libraries:

Programming Language:

Deliverables

4

3

2

1

Python script(s) performing the web scraping

Collected dataset in CSV or JSON format

 Analytical report or visualizations (if analysis is conducted)

Documentation detailing the project's methodology,challenges faced,and solutions implemented

Python-Web Scraping

By Content ITV

Python-Web Scraping

  • 7