This repository was archived by the owner on Jun 29, 2023. It is now read-only.

neolaw84/image-scraper

image-scraper

Downloads images from online galleries into a simple directory structure.

Change Log

  • 2021-08-05 : MVP

    • Minimal configuration
    • Supports sites with pagination
    • Each page has multiple sub-pages (model pages)
    • Each model page has multiple sibling pages
    • Each model page has links to images
    • Produces a URL-to-UUID table as metadata
    • Adds random delays
  • 2021-08-06 : Check if downloaded

    • Checks the meta file to see whether an image has already been downloaded
    • Appends new downloads to the meta file
    • Replaces the config with easydict
  • 2021-08-17 : Refactor

    • The default config scrapes all jpg/jpeg/png images from the initial page
    • It also increments through the pages
    • It also follows every href (both relative and absolute) up to 3 links deep
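
The behaviour described in the change log can be illustrated with a minimal sketch. This is not the repository's actual code: every name here (extract_links, load_meta, record_download, crawl), the CSV layout of the meta file, and the reading of "3 links deep" as three hops from the start page are assumptions made for illustration. The page fetcher is passed in as a function, and the image download itself is left out, so the sketch only records which URLs would be new.

```python
import csv
import random
import time
import uuid
from html.parser import HTMLParser
from pathlib import Path
from urllib.parse import urljoin, urlparse

IMAGE_EXTENSIONS = (".jpg", ".jpeg", ".png")


class LinkCollector(HTMLParser):
    """Collects raw <a href> and <img src> values from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            is_link = (tag == "a" and name == "href") or (tag == "img" and name == "src")
            if value and is_link:
                self.links.append(value)


def extract_links(base_url, html):
    """Return (image_urls, page_urls) as absolute URLs, in document order."""
    collector = LinkCollector()
    collector.feed(html)
    images, pages = [], []
    for raw in collector.links:
        absolute = urljoin(base_url, raw)  # handles relative and absolute hrefs
        path = urlparse(absolute).path.lower()
        (images if path.endswith(IMAGE_EXTENSIONS) else pages).append(absolute)
    return images, pages


def load_meta(meta_path):
    """Read the url -> uuid table; a missing file means nothing was downloaded."""
    path = Path(meta_path)
    if not path.exists():
        return {}
    with path.open(newline="") as handle:
        return {row[0]: row[1] for row in csv.reader(handle) if len(row) == 2}


def record_download(meta_path, url):
    """Append a url,uuid row to the meta file and return the new uuid."""
    name = uuid.uuid4().hex
    with open(meta_path, "a", newline="") as handle:
        csv.writer(handle).writerow([url, name])
    return name


def crawl(start_url, fetch, meta_path, max_depth=3, delay_range=(0.0, 0.0)):
    """Visit pages up to max_depth hops from start_url; return new image URLs."""
    meta = load_meta(meta_path)
    seen_pages = {start_url}
    frontier = [start_url]
    new_images = []
    for _ in range(max_depth + 1):  # depth 0 is the start page itself
        next_frontier = []
        for page_url in frontier:
            time.sleep(random.uniform(*delay_range))  # random delay per page
            images, pages = extract_links(page_url, fetch(page_url))
            for image_url in images:
                if image_url not in meta:  # skip anything already recorded
                    meta[image_url] = record_download(meta_path, image_url)
                    new_images.append(image_url)
            for link in pages:
                if link not in seen_pages:
                    seen_pages.add(link)
                    next_frontier.append(link)
        frontier = next_frontier
    return new_images
```

In practice, fetch could be any callable that returns a page's HTML as a string, and a non-zero delay_range such as (1.0, 3.0) would space out the requests. Running crawl a second time against the same meta file returns an empty list, because every image URL is already in the table.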
