Jump to content

Blocking Web Scraping


Sarak

Recommended Posts

Recently I've noticed a site/crawler called "webextract.net" online at my store - http://www.totalfancydress.com

 

After some research, I found that it's a site scraper that crawls your website and copies designs, product info & images for the purposes of duplicating and/or selling it.

 

In my robots.txt file, the secure pages aren't allowed to be accessed by anyone. But how do I prevent "Scraping" websites from accessing my site at all without blocking every other agent, such as Google?

Edited by Sarak (see edit history)
Link to comment
Share on other sites

  • 2 years later...

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
×
×
  • Create New...