Rich:

> - keep changing the design (or just HTML) so that they have to change
their
>parsing code


This hasn't stopped them so far, we even went as far as 'randomising' the
names of the query parameters used in the search that gets scraped for each
visit, and they cracked the formula.

Paul:

>Going just on what you posted
>Would it be possible to do a check on CGI.HTTP_REFERER contains the
>current domain name ?
>So only pages in the site can call subpages and they cannot be called
direct ?
>Or are they just viewing a dynamic page ? that anyone can see ?
>
>After all .. file - save as ;)




What happens is the end user buys this bit of software from the competitor
and this software sends a request to our sites search engine. I assume the
software spoofs a user_agent and looks like any other user. And yes, they
just request a dynamic pace and then parse out the good bits.

I should add that we are not the only site that this software 'scrapes', we
just seem to be the only ones who can be arsed to try and stop them.


-- 
** Archive: http://www.mail-archive.com/dev%40lists.cfdeveloper.co.uk/

To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
For human help, e-mail: [EMAIL PROTECTED]

Reply via email to