This is a project that utilizes Scrapy to parse PEP (Python Enhancement Proposals) documents.
- Clone the repository:
git clone https://github.com/alisher-nil/scrapy_parser_pep.git- Navigate to the project directory:
cd scrapy_parser_pep- Create virtual environment
python -m venv .venv- Update pip
python -m pip install --upgrade pip- Install the required dependencies:
pip install -r requirements.txtThe project can be initialized with uv.
# after moving to a project's directory
uv syncTo run the PEP parser, use the following command from the project directory:
scrapy crawl pep
# or with uv
uv run scrapy crawl pepThe result should be two csv files:
- pep_< date >.csv containing a list with numbers, names and current statuses of all peps.
- status_summary_< date >.csv containing a list of statuses and their counts among all the peps in descending order with a total.
Please feel free to contact me with any questions or feedback:
- Email: alisher.nil@gmail.com
- GitHub: alisher-nil