hi Erik
that was food for thought..content length may not work if
substitutions leave length unchanged..
Will look into L distance ..thanks for the suggestion
regards
harry

> Content length (which you could also get using the HTTP header "Content 
> Length") won't necessarily tell you if content has changed. I think your 
> problem is a candidate 
> forhttp://en.wikipedia.org/wiki/Levenshtein_distance(calculating the 
> "distance" between two strings), for which I think there are Python 
> implementations.
>
> Depending on your requirements, you could add other heuristics to detect 
> major changes, e.g. load the page into an XML parser and only check certain 
> <div>'s. But further suggestions would require more information on your 
> problem.

-- 
You received this message because you are subscribed to the Google Groups 
"Django users" group.
To post to this group, send email to django-us...@googlegroups.com.
To unsubscribe from this group, send email to 
django-users+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/django-users?hl=en.

Reply via email to