Rich:
> - keep changing the design (or just HTML) so that they have to change their
> parsing code
This hasn't stopped them so far. We even went as far as 'randomising' the names of the query parameters used in the search that gets scraped for each visit, and they cracked the formula.

Paul:
>Going just on what you posted
>Would it be possible to do a check on CGI.HTTP_REFERER contains the
>current domain name ?
>So only pages in the site can call subpages and they cannot be called direct ?
>Or are they just viewing a dynamic page ? that anyone can see ?
>
>After all .. file - save as ;)

What happens is the end user buys this bit of software from the competitor, and this software sends a request to our site's search engine. I assume the software spoofs a user_agent and looks like any other user. And yes, they just request a dynamic page and then parse out the good bits.

I should add that we are not the only site that this software 'scrapes'; we just seem to be the only ones who can be arsed to try and stop them.
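For reference, the CGI.HTTP_REFERER check Paul suggests could be sketched roughly as follows. This is only an illustration in Python, not code from the thread: the function name `is_internal_referer` and the host `example.com` are made up, and on a ColdFusion site the header value would come from CGI.HTTP_REFERER instead. Software that already spoofs a user_agent can forge the Referer header just as easily, so a check like this would probably only raise the bar a little.

```python
from urllib.parse import urlsplit


def is_internal_referer(referer, site_host):
    """Return True if the Referer value points at site_host or a subdomain.

    Both the function name and its parameters are hypothetical; they are
    not part of any framework mentioned in the thread.
    """
    if not referer:
        # Direct requests (no Referer header at all) are treated as external.
        return False
    try:
        host = urlsplit(referer).hostname
    except ValueError:
        # urlsplit raises ValueError on some malformed values (e.g. bad IPv6).
        return False
    if host is None:
        # Relative or otherwise unparseable values carry no host name.
        return False
    host = host.lower()
    site_host = site_host.lower()
    # Exact match, or a subdomain such as www.example.com for example.com.
    return host == site_host or host.endswith("." + site_host)


if __name__ == "__main__":
    print(is_internal_referer("http://www.example.com/search", "example.com"))
    print(is_internal_referer("", "example.com"))
```

Matching on the parsed host name, not on a plain "contains" test against the whole string, avoids accepting a value such as `http://example.com.evil.org/`, which contains the domain name but belongs to a different site.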
