Author: Anar
Email: [EMAIL PROTECTED]
Message:
Could You plz tell me, why the clones-detection does not work properly - 
there is different crc32, counted for the same document (just urls are with www. or 
without it):

mysql> select rec_id,status,url,crc32  from url where url like 
'%/pubs/ai/20011208_009%' limit 2;
+--------+--------+-----------------------------------------------------+------------+
| rec_id | status | url                                                 | crc32      |
+--------+--------+-----------------------------------------------------+------------+
|  11636 |    200 | http://www.bakupages.com/pubs/ai/20011208_009_EN.asp| 1904538298 |
| 154623 |    200 | http://bakupages.com/pubs/ai/20011208_009_EN.asp    |  535608886 |
+--------+--------+-----------------------------------------------------+------------+

but it is the same document.
Why does it have different crc32 for it?
And, how can one deal with it?

thank You in advance

Reply: <http://www.mnogosearch.org/board/message.php?id=3720>

___________________________________________
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]

Reply via email to