Skip to content

Conversation

@jechols
Copy link
Contributor

@jechols jechols commented Nov 12, 2014

Simplifies the robots.txt rules and adds an index to disallow spidering of /images/tiles. In chronam core, /images/tiles isn't the only URL that generates tiles (see ticket #109 ), but I believe it is the primary URL for OpenSeadragon.

Unfortunately, any sites still using the old URL style (/lccn/snxxxxxxxxx/YYYY-MM-DD/ed-#/seq-#/image_514x514_from_1022,4094_to_2050,5122.jpg) won't benefit from this, but addressing #109, and having users update local themes, would allow this to work for everybody. (Or extra rules could be added to catch this style of URL, too)

It might seem strange to disallow bots from hitting the tiles, but Apache log analysis showed us that we were getting a huge amount of traffic from bots - including thumbnails and the dynamic tiles.

Blocks all bots instead of just named bots, and blocks thumbnails since
they're all dynamically generated and bots can really slow the site down
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants