muneer22/syntarget

Target.com Spider

This Scrapy spider is designed to scrape product data from www.target.com.

Running the Spider

To run the spider, use the following command:

scrapy crawl target_com -a url=https://www.target.com/p/-/A-79344798

To write the output to a JSON file, for example for testing, append -o filename.json to the command:

scrapy crawl target_com -a url=https://www.target.com/p/-/A-79344798 -o target_data.json
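
Before starting a crawl, it can be useful to check that the url argument looks like a product page. The following is a minimal sketch and not part of the spider itself: the function name extract_product_id is hypothetical, and it assumes that product URLs end in a path segment of the form A-<digits>, as in the example command above.

```python
import re
from urllib.parse import urlparse

# Assumption: product pages end in a path segment of the form "A-<digits>".
_PRODUCT_ID = re.compile(r"/A-(\d+)/?$")


def extract_product_id(url):
    """Return the digits after "A-" in a target.com product URL, or None."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https"):
        return None
    if parsed.netloc.lower() not in ("target.com", "www.target.com"):
        return None
    match = _PRODUCT_ID.search(parsed.path)
    return match.group(1) if match else None


if __name__ == "__main__":
    print(extract_product_id("https://www.target.com/p/-/A-79344798"))
```

A None result suggests the URL is not in the expected form and is worth checking before it is passed to scrapy crawl.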

Installation

  1. Create a Python 3 virtual environment:
virtualenv -p python3 <env_name>
  2. Activate the virtual environment:
source <env_name>/bin/activate
  3. Install the required dependencies from the requirements.txt file:
pip install -r requirements.txt

Assumptions

  • specs: This field is assumed to correspond to the section labeled "Specifications" on the product page.

Example Data

The repository includes a JSON file, target_data.json, which contains data scraped from three example URLs.
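
A quick way to sanity-check an output file is to load it and look for items without a specs value. This is a rough sketch under stated assumptions: the helper names load_items and items_missing_field are hypothetical, the file is assumed to be a single JSON array of objects (the shape Scrapy's JSON feed export writes), and specs is the only field name taken from this README.

```python
import json


def load_items(path):
    """Load a JSON feed export, assumed to be a single JSON array of items."""
    with open(path, encoding="utf-8") as fh:
        items = json.load(fh)
    if not isinstance(items, list):
        raise ValueError("expected a JSON array of items")
    return items


def items_missing_field(items, field="specs"):
    """Return the indexes of items where `field` is absent or empty."""
    return [i for i, item in enumerate(items) if not item.get(field)]
```

For example, items_missing_field(load_items("target_data.json")) returns an empty list when every item has a non-empty specs value.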
