> This hasn't stopped them so far, we even went as far as 'randomising' the
> names of the query parameters used in the search that gets scraped for each
> visit, and they cracked the formula.
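
For anyone following along, the 'randomising' described above could be
sketched roughly as below. This is only a guess at the general idea, not
Peter's actual code: the parameter names, the helper functions and the use of
a random per-visit secret kept in the server-side session are all my
assumptions.

```python
import hashlib
import hmac
import secrets

# Hypothetical canonical parameter names for a search form (assumed).
CANONICAL_PARAMS = ("keywords", "region", "page")


def new_session_key() -> bytes:
    """Generate a random per-visit secret, to be kept in the server-side session."""
    return secrets.token_bytes(16)


def alias_for(session_key: bytes, name: str) -> str:
    """Derive the per-visit alias used in the HTML form for a canonical name."""
    digest = hmac.new(session_key, name.encode("utf-8"), hashlib.sha256).hexdigest()
    return "p" + digest[:10]


def decode_query(session_key: bytes, query: dict) -> dict:
    """Map aliased names in an incoming query back to canonical names.

    Names that do not match this visit's aliases are dropped.
    """
    reverse = {alias_for(session_key, n): n for n in CANONICAL_PARAMS}
    return {reverse[k]: v for k, v in query.items() if k in reverse}
```

Because the aliases depend on a random secret rather than a fixed formula,
they can't be predicted in advance. A scraper that fetches the search form
first can still read them straight out of the HTML, though, so this only
raises the cost of scraping and doesn't prevent it.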

If your site carries a valid copyright statement covering use of the data,
then probably the only thing you can do against such a stubborn pirate is to
begin legal action.

Even Peter's graphical option won't be much use if they hire someone to sit
there and manually scrape the data, which they may well do if they're half
as determined as they sound.

Some legal action and bad publicity can do wonders to prevent this sort of
thing, as we've discovered before.

The Flash thing *is* an option (though, again, it won't stop someone manually
writing out all the data). But do you a) want your customers to have to use
Flash as an interface, and b) want to go through all the hassle of creating
the Flash interface, given the dangers looming over the Flash plugin in
browsers at the moment?

(oooh, controversial).

> -----Original Message-----
> From: Peter Dray [mailto:[EMAIL PROTECTED]
> Sent: 15 September 2003 13:54
> To: [EMAIL PROTECTED]
> Subject: RE: [ cf-dev ] Stopping 'db scrapes'
> 
> 
> Rich:
> 
> > - keep changing the design (or just HTML) so that they have to change
> > their parsing code
> 
> 
> This hasn't stopped them so far, we even went as far as 'randomising' the
> names of the query parameters used in the search that gets scraped for each
> visit, and they cracked the formula.
> 
> Paul:
> 
> >Going just on what you posted
> >Would it be possible to do a check on CGI.HTTP_REFERER contains the
> >current domain name ?
> >So only pages in the site can call subpages and they cannot be called
> >direct ?
> >Or are they just viewing a dynamic page ? that anyone can see ?
> >
> >After all .. file - save as ;)
> 
> 
> 
> 
> What happens is the end user buys this bit of software from the competitor
> and this software sends a request to our site's search engine. I assume the
> software spoofs a user_agent and looks like any other user. And yes, they
> just request a dynamic page and then parse out the good bits.
> 
> I should add that we are not the only site that this software 'scrapes', we
> just seem to be the only ones who can be arsed to try and stop them.
> 
> 
> -- 
> ** Archive: http://www.mail-archive.com/dev%40lists.cfdeveloper.co.uk/
> 
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> For human help, e-mail: [EMAIL PROTECTED]
> 

