You could MD4 the parts you care about, store that, fetch it and compare. If there is a reliable timestamp, you could use that. But that would be app-dependent.
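
A rough sketch of the hash-and-compare idea, in case it helps. Everything here is an assumption for illustration: the function names, the dict standing in for whatever store you use, and the field names. It uses SHA-256 from Python's hashlib in place of MD4, since MD4 is not guaranteed to be available in every build.

```python
import hashlib


def fingerprint(doc, fields):
    """Hash only the fields that matter for change detection."""
    h = hashlib.sha256()
    for name in fields:
        value = doc.get(name, "").encode("utf-8")
        # Prefix each value with its field name and byte length so that
        # different field splits cannot produce the same input stream.
        h.update(f"{name}:{len(value)}:".encode("utf-8"))
        h.update(value)
    return h.hexdigest()


def needs_reindex(doc_id, doc, fields, stored):
    """Return True if the doc is new or changed, and record its digest.

    `stored` is a hypothetical dict mapping doc id -> last seen digest;
    a real setup would keep this in a database or in the index itself.
    """
    digest = fingerprint(doc, fields)
    if stored.get(doc_id) == digest:
        return False
    stored[doc_id] = digest
    return True
```

Fields left out of the list (a fetch date, say) can change without triggering a reindex, which is the point of hashing only the parts you care about.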
In general, you need to store some info about each source document and figure out whether it is new. This gets much hairier with a web spider, when you have dupes and servers that go away and then come back.

wunder

On 9/14/07 6:08 PM, "Chris Hostetter" <[EMAIL PROTECTED]> wrote:

> How do you know that the document hasn't changed since the last time you
> indexed it?