Hello,

We have just released aspseek-1.1.2. Please note that it is a development release.

Changes are:
- Ability to incrementally calculate citations, ranks, lastmod
- Fixed a bug in the unicode version which could lead to incorrect indexing
- Optimized internal index caches (improved concurrency)
- Fixed incorrect searching with pattern (unicode version)
- Option to set byte order in wordurl[1].word (unicode version)
- Charset guesser improvements
- Minor cleanup of code, configs and docs

The most notable thing in the new version is "Incremental citations". It is disabled by default and is not compatible with the current data files, so if you want to use it you have to reindex everything from scratch. But it helps a lot in a situation where you want to index many URLs and have a relatively small amount of RAM. More precise figures have yet to be determined, but WITHOUT incremental citations you can deal with 5-6 million URLs with 1 GB of RAM, and WITH incremental citations you can deal with about 10 million (this is a rough estimate). Note that this is experimental and there may be some bugs.

Also, the index caches of words and hrefs (which are there to improve speed) were rewritten in a way that improves cache concurrency, which leads to fewer lockups and further improves indexing speed. For those who care, the rewritten functions are CWordCache::GetWordID and CStoredHrefs::GetHref.

In the charset guesser, the code that selects the pieces of a document to be used for guessing is substantially improved, so the guesser should work more correctly now.

Download and enjoy!

--
[EMAIL PROTECTED] http://kir.sever.net ICQ 7551596
--
If you can't stand the heat, sit down or leave the sauna
