Hello,

We have just released aspseek-1.1.2. Please note that it is devel
release.

Changes are:
- Ability to incrementally calculate citations, ranks, lastmod
- Fixed bug in unicode version which can lead to incorrect indexing
- Optimized internal index caches (improved concurrency)
- Fixed incorrect searching with pattern (unicode version)
- Option to set byte order in wordurl[1].word (unicode version)
- Charset guesser improvements
- Minor cleanup of code, configs and docs

Most notable thing in new version is "Incremental citations".
It is disabled by default, and is not compatible with the current
data files, so if you want to use it you have to reindex everything
from scratch. But it helps very much in a cituation where you want
to index many URLs and have relatively small amount of RAM. More
precise figures are yet have to be determined, but WITHOUT incremental
citations you can deal with 5-6 millions of URLs having 1 Gig of RAM,
and WITH incremental citations you can deal with about 10 millions
(this is rough estimation). Note that this is experimental stuff and
there can be some bugs.

Also, index caches of words and hrefs (that are there to improve
speed) were rewritten in a way that improved cache concurrency, which
leads to less lockups and further improves indexing speed. For those
who care, rewritten functions are CWordCache::GetWordID and
CStoredHrefs::GetHref.

In charset guesser the code that selects a pieces from document to be
taken for guessing is substantially improved, so guesser should work
more correctly now.

Download and enjoy!

-- [EMAIL PROTECTED]  http://kir.sever.net  ICQ 7551596 --
If you can't stand the heat, sit down or leave the sauna

Reply via email to