Skip to content

Practice parser that parses pep documents using scrapy

Notifications You must be signed in to change notification settings

alisher-nil/scrapy_parser

Repository files navigation

Static Badge Static Badge Static Badge

scrapy_parser_pep

This is a project that utilizes Scrapy to parse PEP (Python Enhancement Proposals) documents.

Table of Contents

Installation

basic

  1. Clone the repository:
git clone https://github.com/alisher-nil/scrapy_parser_pep.git
  1. Navigate to the project directory:
cd scrapy_parser_pep
  1. Create virtual environment
python -m venv .venv
  1. Update pip
python -m pip install --upgrade pip
  1. Install the required dependencies:
pip install -r requirements.txt

uv

The project can be initialized with uv.

# after moving to a project's directory
uv sync

Usage

To run the PEP parser, use the following command from the project directory:

scrapy crawl pep
# or with uv
uv run scrapy crawl pep

The result should be two csv files:

  1. pep_< date >.csv containing a list with numbers, names and current statuses of all peps.
  2. status_summary_< date >.csv containing a list of statuses and their counts among all the peps in descending order with a total.

Author

Please feel free to contact me with any questions or feedback:

About

Practice parser that parses pep documents using scrapy

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages