Hi everyone,

after merging one database into another database there seems to 
be missing entrys in the new database.

I have 2 configurations:
1. destination_environment.conf which is nearly empty. There are only very 
   few pages to dig
2. source_environment.conf which is the result of a larger dig.

First i create source_environment:
# htdig -t -i -a -s -v  -c /usr/htdig/conf/source_environment.conf
# htmerge -a -s -v -c /usr/htdig/conf/source_environment.conf

Then i create destination_environment:
# rm /usr/htdig/db_destination_environment/*
# htdig/bin/htdig -i -s -v -c /usr/htdig/conf/destination_environment.conf
# htdig/bin/htmerge -s -v -c /usr/htdig/conf/destination_environment.conf

Now i merge source_environment into destination_environment:
# htmerge -s -v -v -c /usr/htdig/conf/destination_environment.conf -m 
/usr/htdig/conf/source_environment.conf

The problem: Some (not all!) entrys which htsearch do find  
in source_environment he does not find in the destination_environment.

If i compare the size of the databases for source_environment and 
destination_environment i can see that the files for the 
destination_environment are smaller than those for source_environment
which also indicates a problem.

# ls -l db_source_environment
-rw-rw-r--    1 utz      utz      63290368 Dez 22 15:26 db.docdb
-rw-rw-r--    1 utz      utz      53361491 Dez 22 14:31 db.docs
-rw-rw-r--    1 utz      utz       5239808 Dez 22 15:06 db.docs.index
-rw-rw-r--    1 utz      utz      72872594 Dez 22 14:31 db.urls
-rw-rw-r--    1 utz      utz      350006513 Dez 22 15:04 db.wordlist
-rw-rw-r--    1 utz      utz      260100096 Dez 22 15:04 db.words.db

# ls -l db_destination_environment
-rw-rw-r--    1 utz      utz      58299392 Dez 22 15:52 db.docdb
-rw-rw-r--    1 utz      utz       5213184 Dez 22 15:52 db.docs.index
-rw-rw-r--    1 utz      utz           134 Dez 22 15:25 db.urls
-rw-rw-r--    1 utz      utz      350015346 Dez 22 15:51 db.wordlist
-rw-rw-r--    1 utz      utz      236006400 Dez 22 15:51 db.words.db

Can you please give me some hints how to analyse or solve the
problem. Thanks a lot.

Utz




_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to