Re: [htdig-dev] building htdig-3.2.0_beta6 on amd64

2005-07-11 Thread Geoff Hutchison
On Jul 10, 2005, at 4:24 PM, Renat Lumpau wrote: I'm one of the Gentoo maintainers of htdig. We have recently received a bug report [1] that may be of interest to you. The bug has to do with building htdig on AMD64 and the need for -fPIC -DPIC. Please take a look and advise. I'm confused.

Re: [htdig-dev] Release this code! 3.2 is done!??!

2005-03-09 Thread Geoff Hutchison
On Mar 8, 2005, at 6:44 AM, Dan Langille wrote: If the Pre-Release Checklist has not been done, I have no clues about LeakTester, checker, purify, gprof. Anyone? I can vouch that essentially everything in the pre-release checklist has been done. The utilities you mention are used for finding

Re: [htdig-dev] Release this code! 3.2 is done!??!

2005-03-01 Thread Geoff Hutchison
Geoff: could you create a release tarball (and maybe create a document detailing how you generally do this) Such a document already exists. It's a little out-of-date (since it's back in the days before SourceForge), but it's still pretty much correct: http://htdig.org/dev/checklist.html I'd

Re: [htdig-dev] New Mirror

2004-11-30 Thread Geoff Hutchison
What's actually the procedure for updating the htdig-website. If I knew what to do, I could help webmastering the website. The website runs out of the maindocs directory of CVS. If you change files in the maindocs CVS, they will be taken up on the website and the mirrors. Due to SF.net

Re: [htdig-dev] Mirror site update?

2004-11-30 Thread Geoff Hutchison
On Nov 30, 2004, at 2:32 AM, Claus Larsen wrote: But now more than 3 months later nothing has happened, according to the last modified date on http://www.htdig.org/mirrors.html nothing has happened since 2004-08-18. I apologize. Updating the mirror list seems to be one of the tasks/jobs that has

[htdig-dev] Current Status as of snapshot 3.2.0b6-20040926

2004-09-26 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b6: Scheduled: 31 May 2004. 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have

Re: [htdig-dev] Licenses..

2004-07-23 Thread Geoff Hutchison
On Jul 22, 2004, at 12:24 PM, Gilles Detillieux wrote: My understanding, though I may be wrong (Geoff Hutchison could provide the definitive answer), is that the 3.1.x code base does not include any extensions or customisations to the Sleepycat Berkeley DB code, This is correct. Over the 3.1.x

[htdig-dev] Current Status as of snapshot 3.2.0b6-20040711

2004-07-11 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b6: Scheduled: 31 May 2004. 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have

[htdig-dev] Current Status as of snapshot 3.2.0b6-20040704

2004-07-04 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b6: Scheduled: 31 May 2004. 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have

[htdig-dev] Current Status as of snapshot 3.2.0b6-20040620

2004-06-20 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b6: Scheduled: 31 May 2004. 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have

[htdig-dev] Current Status as of snapshot 3.2.0b5-20040530

2004-05-30 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b6: Scheduled: 31 May 2004. 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have

[htdig-dev] Current Status as of snapshot 3.2.0b5-20040523

2004-05-23 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b6: Scheduled: 31 May 2004. 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have

[htdig-dev] Current Status as of snapshot 3.2.0b5-20040516

2004-05-16 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b6: Scheduled: 31 May 2004. 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have

[htdig-dev] Current Status as of snapshot 3.2.0b5-20040502

2004-05-02 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b6: Scheduled: 31 May 2004. 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have

[htdig-dev] Current Status as of snapshot 3.2.0b5-20040418

2004-04-18 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

[htdig-dev] Current Status as of snapshot 3.2.0b5-20040411

2004-04-11 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

[htdig-dev] Current Status as of snapshot 3.2.0b5-20040404

2004-04-04 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

[htdig-dev] Current Status as of snapshot 3.2.0b5-20040321

2004-03-21 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

[htdig-dev] Current Status as of snapshot 3.2.0b5-20040314

2004-03-14 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

[htdig-dev] Current Status as of snapshot 3.2.0b5-20040307

2004-03-07 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

[htdig-dev] Current Status as of snapshot 3.2.0b5-20040222

2004-02-22 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

[htdig-dev] Current Status as of snapshot 3.2.0b5-20040215

2004-02-15 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

[htdig-dev] Current Status as of snapshot 3.2.0b5-20040201

2004-02-01 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

[htdig-dev] Current Status as of snapshot 3.2.0b5-20031228

2003-12-28 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

[htdig-dev] Current Status as of snapshot 3.2.0b5-20031214

2003-12-14 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

[htdig-dev] Current Status as of snapshot 3.2.0b5-20031207

2003-12-07 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

[htdig-dev] Current Status as of snapshot 3.2.0b5-20031130

2003-11-30 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

[htdig-dev] Current Status as of snapshot 3.2.0b5-20031123

2003-11-23 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

Re: [htdig-dev] (Attn: Geoff) Re: Numbered HTML Entities mangled in Result Blurbs

2003-11-18 Thread Geoff Hutchison
On Nov 17, 2003, at 7:32 PM, Neal Richter wrote: This part of the code is pretty cheese-whizzy, so attention Geoff! Any insights? I am assuming that at some point this worked properly. I *really* don't have much time. I'm attempting to finish my last chapter by Friday and that's going to take

[htdig-dev] Current Status as of snapshot 3.2.0b4-20031109

2003-11-11 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Released: 10 Nov 2003. 3.2.0b4: Cancelled. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything added here should have a tracker PR# so we can be sure

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030928

2003-09-29 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x CHECKLIST FOR 3.2.0b5: * Check bugs listed in bug-tracker... * Polish release docs (Geoff) * Must be able to (a) make check and (b) index www.htdig.org using robotstxt_name: master-htdig on all systems listed as supported. Systems tested so

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030921

2003-09-21 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x CHECKLIST FOR 3.2.0b5: * Apply memory leak patches (Neal) * Check bugs listed in bug-tracker... * Polish release docs (Geoff) * Must be able to (a) make check and (b) index www.htdig.org using robotstxt_name: master-htdig on all systems listed

Fwd: [htdig-dev] Current Status as of snapshot 3.2.0b4-20030914

2003-09-15 Thread Geoff Hutchison
Begin forwarded message: From: David Bannon [EMAIL PROTECTED] Date: Sun Sep 14, 2003 5:38:32 PM America/Chicago To: [EMAIL PROTECTED] Cc: Geoff Hutchison [EMAIL PROTECTED] Subject: RE: [htdig-dev] Current Status as of snapshot 3.2.0b4-20030914 Reply-To: [EMAIL PROTECTED] I try it under

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030914

2003-09-14 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x CHECKLIST FOR 3.2.0b5: * Apply memory leak patches (Neal) * Check bugs listed in bug-tracker... * Polish release docs (Geoff) * Must be able to (a) make check and (b) index www.htdig.org using robotstxt_name: master-htdig on all systems listed

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030907

2003-09-07 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x CHECKLIST FOR 3.2.0b5: * Apply memory leak patches (Neal) * Check bugs listed in bug-tracker... * Polish release docs (Geoff) * Must be able to (a) make check and (b) index www.htdig.org using robotstxt_name: master-htdig on all systems listed

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030803

2003-08-03 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x CHECKLIST FOR 3.2.0b5: * Apply memory leak patches (Neal) * Check bugs listed in bug-tracker... * Polish release docs (Geoff) * Must be able to (a) make check and (b) index www.htdig.org using robotstxt_name: master-htdig on all systems listed

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030727

2003-07-27 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x CHECKLIST FOR 3.2.0b5: * Apply memory leak patches (Neal) * Check bugs listed in bug-tracker... * Polish release docs (Geoff) * Must be able to (a) make check and (b) index www.htdig.org using robotstxt_name: master-htdig on all systems listed

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030713

2003-07-13 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x CHECKLIST FOR 3.2.0b5: * Apply memory leak patches (Neal) * Check bugs listed in bug-tracker... * Polish release docs (Geoff) * Must be able to (a) make check and (b) index www.htdig.org using robotstxt_name: master-htdig on all systems listed

Re: [htdig-dev] htdig stores program data in /etc instead of /var

2003-07-11 Thread Geoff Hutchison
as part of your package creates this file. As I do not use Debian, I cannot give you any more information--only that it's not an upstream issue. Cheers, -Geoff -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig-dev] 64-bit clean (was 3.2b4 snapshots)

2003-07-07 Thread Geoff Hutchison
big are your databases exactly? Are your problems limited to htmerge? (In which case, I likely know the problem, and it's not due to 64-bit addressing.) -Geoff

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030706

2003-07-06 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x CHECKLIST FOR 3.2.0b5: * Apply memory leak patches (Neal) * Check bugs listed in bug-tracker... * Polish release docs (Geoff) * Must be able to (a) make check and (b) index www.htdig.org using robotstxt_name: master-htdig on all systems listed

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030629

2003-06-29 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x CHECKLIST FOR 3.2.0b5: * Apply memory leak patches (Neal) * Check bugs listed in bug-tracker... * Polish release docs (Geoff) * Must be able to (a) make check and (b) index www.htdig.org using robotstxt_name: master-htdig on all systems listed

Re: [htdig-dev] Re: Report on Native Win32 memory leak fixes

2003-06-23 Thread Geoff Hutchison
On Saturday, June 21, 2003, at 09:03 AM, Lachlan Andrew wrote: have to implement a proper fix before the beta goes out. However, I don't think I'll have time for that for the next two months :( Translation: I'm in favour of your checking it in. Ditto. -Geoff

Re: [htdig-dev] 3.2.0b5 Next progress check :)

2003-06-23 Thread Geoff Hutchison
I don't think Geoff was saying we *shouldn't* use 2.7. It's just that we haven't actually *tested* it under 2.7. If (when :) you can confirm that it still works under 2.8, Right. Sorry for the confusion. If it works under gcc-2.7, so much the better! But IMHO, we should be pushing people towards

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030622

2003-06-22 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x CHECKLIST FOR 3.2.0b5: * Apply memory leak patches (Neal) * Check bugs listed in bug-tracker... * Polish release docs (Geoff) * Must be able to (a) make check and (b) index www.htdig.org using robotstxt_name: master-htdig on all systems listed

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030615

2003-06-15 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x CHECKLIST FOR 3.2.0b5: * Apply memory leak patches (Neal) * Check bugs listed in bug-tracker... * Polish release docs (Geoff) * Must be able to (a) make check and (b) index www.htdig.org using robotstxt_name: master-htdig on all systems listed

[htdig-dev] Re: Compute Page Rank GPL Code

2003-06-12 Thread Geoff Hutchison
http://slashdot.org/article.pl?sid=03/06/12/1742209&mode=thread&tid=134 Basically, it looks like there's now a GPL'd Java implementation of what Google has published about their PageRank feature. The current ht://Dig backlink_factor is a quick-and-dirty hack I put in to try to get some of the

Re: [htdig-dev] Score factors almost ignored

2003-06-11 Thread Geoff Hutchison
I'm guessing you mean the scoring in the 3.2 code? The base score of documents I search for is typically 0.0001, while the backlink factor is typically 2000. Since these are added, the weight given to the document itself is approximately zero! Does anyone know how this came about? Well, that
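To make the arithmetic in this message concrete, here is a small sketch using the figures quoted above (a base score of about 0.0001 and a backlink factor of about 2000, added together). Only those two numbers come from the message; the function and variable names are hypothetical and are not taken from the ht://Dig source.

```python
def document_share(base_score: float, backlink_term: float) -> float:
    """Fraction of an additive total score contributed by the document itself."""
    total = base_score + backlink_term
    return base_score / total


# Figures quoted in the message; everything else is illustrative.
share = document_share(0.0001, 2000.0)
print(f"document share of total score: {share:.2e}")
```

With these inputs the document's own score accounts for roughly five parts in a hundred million of the total, which is why the message describes its weight as approximately zero.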

[htdig-dev] Re: SunOS cc problem

2003-06-03 Thread Geoff Hutchison
1. It works fine with --disable-bigfile and I'd be inclined to leave it at that for 3.2.0b5. (If people have indexes over 4GB, then I say eliminating redundancy from the database structure is a higher priority than enabling big file support...) So are we saying that for SunOS native cc, we're

Re: [htdig-dev] Re: Fwd: OS X and libtool feedback

2003-06-01 Thread Geoff Hutchison
On Saturday, May 31, 2003, at 12:00 AM, Jim Cole wrote: the time being. We can always add a FAQ telling people to use --disable-shared if it comes up a lot. Or we can continue the hack that I put into the configure scripts to set --disable-shared as the default on powerpc-*-* targets. Peter

Re: [htdig-dev] SunOS cc problem

2003-06-01 Thread Geoff Hutchison
Sorry I've been AWOL. There was a big grant review this last week and lots of things were dumped on me. The problem with make check on SunOS with native cc is the size of off_t, the size of an offset in a file. This seems to be related to the --enable-bigfile configure option. Does anyone

[htdig-dev] Re: Copyright and 3.2.0b5

2003-06-01 Thread Geoff Hutchison
Hmm. We need to update copyright information before releasing 3.2.0b5. 1) Files need to have current copyright, especially if they have been touched since 2001. 2) As per the ht://Dig group decision, the source is now available under the LGPL. Thus the COPYING file, as well as the per-file

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030601

2003-06-01 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x CHECKLIST FOR 3.2.0b5: * Add more items to checklist :-) * Must be able to (a) make check and (b) index www.htdig.org using robotstxt_name: master-htdig on all systems listed as supported. Systems tested so far: - Mandrake 8.2, gcc 3.2

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030330

2003-03-30 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Next release, First quarter 2003??? 3.2.0b4: In progress -- snapshots called 3.2.0b4 until prerelease. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030309

2003-03-09 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Next release, First quarter 2003??? 3.2.0b4: In progress -- snapshots called 3.2.0b4 until prerelease. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything

[htdig-dev] Re: Several questions...

2003-03-03 Thread Geoff Hutchison
That could have its own problems. If they are labelled -1, -2, ... then phrase searching would have to match *backwards* for negative numbers. Then if true positions overflowed into negative numbers, ...very negative number, then it is essentially starting from a very large (unsigned) location.
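The wraparound described in this message can be illustrated with a short sketch. It assumes 32-bit word positions, which the message does not state, and the helper names are hypothetical.

```python
BITS = 32  # assumed width of a stored word position (not stated in the message)
MASK = (1 << BITS) - 1


def as_unsigned(value: int) -> int:
    """Reinterpret a signed two's-complement value as unsigned."""
    return value & MASK


def as_signed(value: int) -> int:
    """Reinterpret an unsigned value as signed two's complement."""
    value &= MASK
    return value - (1 << BITS) if value >= (1 << (BITS - 1)) else value


# A label of -1 or -2 shares its bit pattern with a very large unsigned position.
for label in (-1, -2):
    print(label, "->", as_unsigned(label))

# A true position one past the largest signed value wraps to a very negative number.
overflowed = as_signed(1 << (BITS - 1))
print("overflowed position:", overflowed)
```

Under this assumption, negative labels and overflowed true positions occupy the same range of bit patterns, which is the ambiguity the message points at.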

Re: [htdig-dev] Several questions...

2003-02-28 Thread Geoff Hutchison
1. Why do the documentation for external_parser and the comments before Retriever::got_word both say that the word location must be in the range 0-1000? That's a 3.1-ism. The documentation is wrong. Oops. first word of any *other* entry. Could we add meta information at successive

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030223

2003-02-23 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Next release, First quarter 2003??? 3.2.0b4: In progress -- snapshots called 3.2.0b4 until prerelease. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything

[htdig-dev] Fwd: Formation of the ht://Dig Group

2003-02-22 Thread Geoff Hutchison
Begin forwarded message: From: Dave Stevens [EMAIL PROTECTED] Date: Sun Feb 9, 2003 9:47:57 PM US/Central To: Geoff Hutchison [EMAIL PROTECTED] Subject: Re: Formation of the ht://Dig Group Dear Mr. Hutchison, My apologies for not replying sooner to your invitation. I got your mail because I

Re: [htdig-dev] Feature Idea -or- tell me it's been done!

2003-02-21 Thread Geoff Hutchison
vars tell me! I'd guess they're less widely used, but: bad_querystr, bad_extensions, valid_extensions, and server_max_docs can also limit things (as would the robots.txt and meta-robots tag).

Re: [htdig-dev] update to Documentation via defaults.xml

2003-02-21 Thread Geoff Hutchison
http://www.tedmasterweb.com/htdig/ Always appreciate the feedback... I think it's looking pretty good overall. It'd be nice to have lists by category, programs, etc. and I'm sure that's on your TODO list. Minor nit-picky things. It might not be a bad idea to have letters (with no links)

Re: Antwort: Re: [htdig-dev] Problem with illegal instruction with Version 3.2.0.b3 on AIX 5.1L (RS6000)

2003-02-21 Thread Geoff Hutchison
I'm now working with this Snapshot, but have the same trouble with lex like Jesse on AIX. - lex -L `test -f conf_lexer.lxx || echo './'`conf_lexer.lxx It shouldn't need to be running flex/lex. The code does have the appropriately-generated file. Make sure that the conf_lexer.cxx hasn't

[htdig-dev] Re: Release schedule for 3.2.0b5?

2003-02-21 Thread Geoff Hutchison
Well, I had an oral exam on Thursday, so I've been quite busy the last few weeks and fortunately things should settle down a bit. (I can't say I followed much e-mail unless my filters threw it into the Family mailbox, sorry.) SHOWSTOPPER: * Still need thorough testing of the database, with

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030216

2003-02-16 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Next release, First quarter 2003??? 3.2.0b4: In progress -- snapshots called 3.2.0b4 until prerelease. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that everything

Re: [htdig-dev] Feature Idea -or- tell me it's been done!

2003-02-14 Thread Geoff Hutchison
On Fri, 14 Feb 2003, Neal Richter wrote: What if we had a feature that stripped the querystrs from a URL contained in bad_querystr rather than rejecting them? url_rewrite_rules ?
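The idea in this exchange, removing unwanted query strings from a URL instead of rejecting the URL, can be sketched with ordinary regular-expression rewriting. This is an illustration only: the rule format, the function name, and the example URLs and parameter name are assumptions and do not reproduce ht://Dig's actual url_rewrite_rules syntax.

```python
import re

# Hypothetical (pattern, replacement) pairs; here a session parameter is dropped.
REWRITE_RULES = [
    (re.compile(r"([?&])sessionid=[^&]*&?"), r"\1"),
    (re.compile(r"[?&]$"), ""),  # tidy a dangling '?' or '&'
]


def rewrite_url(url: str) -> str:
    """Apply each rewrite rule in order and return the cleaned URL."""
    for pattern, replacement in REWRITE_RULES:
        url = pattern.sub(replacement, url)
    return url


print(rewrite_url("http://example.com/page?sessionid=abc123&id=7"))
print(rewrite_url("http://example.com/page?sessionid=abc123"))
```

Rewriting keeps the page in the index under a canonical URL, whereas a rejection list would drop it entirely.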

Re: [htdig-dev] Compile farm's Results and commit

2003-02-04 Thread Geoff Hutchison
On Monday, February 3, 2003, at 12:48 PM, Gabriele Bartolini wrote: What kind of problem have you had specifically, Geoff? At the time, it didn't correctly compile C++ code. While it sounds like that's fixed, I guess I'm also just trying to say that we have plenty of OS X testers. -Geoff

Re: [htdig-dev] request for manage_attributes.pl (creates attributes based on defaults.xml)

2003-02-03 Thread Geoff Hutchison
Also, I'm still new to XML so pardon what may be a stupid question, but, rather than writing your own DTD, would it be possible for us to borrow the DTD for XHTML and then modify that to meet our needs (adding our own custom elements)? That way we could include all types of html in the

Re: [htdig-dev] Re: not allowing getopt::std

2003-02-03 Thread Geoff Hutchison
Could someone who knows what exact: and hidden: mean please explain what they are for (and/or document them officially)? I don't want to break anything while trying to fix the bug. These are fuzzy algorithms essentially. You could have endings:blah. You're right that it's undocumented, and

Re: [htdig-dev] Compile farm's Results and commit

2003-02-03 Thread Geoff Hutchison
Everything is ok on all Linux on all platforms (i686, Alpha, Sparc); MacOS X 10.1 still has that problem with shared libraries (as it was before) whereas Solaris on a Sparc R220 doesn't go. Just a note that I've had strange problems with the MacOS X 10.1 node on the compile farm. Since there

Re: [htdig-dev] Questions about Dictionary::hashCode

2003-02-03 Thread Geoff Hutchison
I'm wondering if we couldn't add a String to the Dictionary class and use that instead of doing a malloc/strcpy every time; this function is called jillions of times. That's probably a good idea. I'm also curious as to why not use Knuth's golden ratio hash function, it's a well studied
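For readers unfamiliar with the technique mentioned here, the sketch below shows multiplicative hashing with a golden-ratio constant in the style Knuth describes. It is not the ht://Dig Dictionary::hashCode implementation; the string-folding step, the 32-bit width, and the function names are assumptions made for illustration.

```python
GOLDEN_32 = 2654435769  # floor(2**32 / golden ratio), i.e. 0x9E3779B9


def fold_string(key: str) -> int:
    """Fold a string into a 32-bit integer (a simple assumed pre-hash)."""
    value = 0
    for byte in key.encode("utf-8"):
        value = (value * 31 + byte) & 0xFFFFFFFF
    return value


def golden_ratio_hash(key: str, table_bits: int = 10) -> int:
    """Multiplicative hash: multiply by the constant, keep the top table_bits bits."""
    product = (fold_string(key) * GOLDEN_32) & 0xFFFFFFFF
    return product >> (32 - table_bits)


for word in ("htdig", "htsearch", "htmerge"):
    print(word, golden_ratio_hash(word))
```

Taking the high bits of the product, not the low ones, is what spreads nearby keys across the table in this scheme.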

Re: [htdig-dev] Release schedule for 3.2.0b5?

2003-02-03 Thread Geoff Hutchison
Is there a list of tasks which *must* be completed before the release of 3.2.0b4/5? If the STATUS file is that list, can I suggest that some things be classed as not essential (at least defaults.xml, and preferably most of it)? The STATUS file is the list, though it's intended to be updated by

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030202

2003-02-02 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Next release, tentatively 1 Feb 2003. 3.2.0b4: In progress -- snapshots called 3.2.0b4 until prerelease. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030126

2003-01-26 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Next release, tentatively 1 Feb 2003. 3.2.0b4: In progress -- snapshots called 3.2.0b4 until prerelease. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030119

2003-01-19 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Next release, tentatively 1 Feb 2003. 3.2.0b4: In progress -- snapshots called 3.2.0b4 until prerelease. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that

[htdig-dev] Re: File Locking ...

2003-01-16 Thread Geoff Hutchison
. But the item hasn't changed in any substantial way since the previous discussion of locking (which looks like Sep-2002). http://sourceforge.net/mailarchive/message.php?msg_id=2014435

[htdig-dev] Current Status as of snapshot 3.2.0b4-20030112

2003-01-12 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Next release, tentatively 1 Feb 2003. 3.2.0b4: In progress -- snapshots called 3.2.0b4 until prerelease. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that

[htdig-dev] Current Status as of snapshot 3.2.0b4-20021229

2002-12-29 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Next release, tentatively 1 Dec 2002. 3.2.0b4: In progress -- snapshots called 3.2.0b4 until prerelease. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that

Re: [htdig-dev] regex, sym-links, OBJEXT and CVS

2002-12-19 Thread Geoff Hutchison
- Why was regex.h renamed gregex.h in 3.1.6? It seems to break the configure script, so that it always reports HAVE_BROKEN_REGEX. Strange, it wasn't doing that for me, but perhaps that's because I was using gcc-3.x? The change was made because certain systems have serious problems

Re: [htdig-dev] CVE: CAN-2000-1191 ----Help

2002-12-18 Thread Geoff Hutchison
And it only responded with Unable to read configuration file...it did not return back the .conf file location. ... Can you please tell me where to fix this. Yes. You will need to update to htdig-3.1.6. http://www.htdig.org/where.html

[htdig-dev] Current Status as of snapshot 3.2.0b4-20021215

2002-12-15 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Next release, tentatively 1 Dec 2002. 3.2.0b4: In progress -- snapshots called 3.2.0b4 until prerelease. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that

Re: [htdig-dev] stemming

2002-12-02 Thread Geoff Hutchison
Indexing the stems is a good suggestion. It would certainly give faster searching. If it replaced the unstemmed inverted file then it would also save on storage requirements, but it would mean we couldn't search on the unstemmed version (if that is of concern). The general strategy used by

[htdig-dev] Current Status as of snapshot 3.2.0b4-20021201

2002-12-01 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Next release, tentatively 1 Dec 2002. 3.2.0b4: In progress -- snapshots called 3.2.0b4 until prerelease. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that

Re: [htdig-dev] htDig history

2002-11-25 Thread Geoff Hutchison
On Monday, November 25, 2002, at 12:18 PM, Andrea Capiluppi wrote: I was analyzing htDig, but my problem is that I don't have the sequence of the different versions. If you put together http://www.htdig.org/RELEASE.html http://www.htdig.org/dev/htdig-3.2/RELEASE.html (in reverse

Re: [htdig-dev] Re: Forward porting changes for 3.1.6 to 3.2.0b4

2002-11-25 Thread Geoff Hutchison
Is the policy to have all possible stemmings, even if they are non-words, like unrealises? If so, we can really go to town on the affixes :) No, and I'd expect that ispell doesn't want them either. Of course many people have moved away from ispell too... Is the release still scheduled for 1

[htdig-dev] Current Status as of snapshot 3.2.0b4-20021124

2002-11-24 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Next release, tentatively 1 Dec 2002. 3.2.0b4: In progress -- snapshots called 3.2.0b4 until prerelease. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that

[htdig-dev] Current Status as of snapshot 3.2.0b4-20021117

2002-11-17 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Next release, tentatively 1 Dec 2002. 3.2.0b4: In progress -- snapshots called 3.2.0b4 until prerelease. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that

[htdig-dev] Re: developer position

2002-11-17 Thread Geoff Hutchison
Hi Ryan, There's no set position for most free software development. If you'd like to contribute, great--give what time you can. The mifluz project per se doesn't really exist, as the main developer, Loic Dachary, has moved on to other things. However, the mifluz project is (and has always

Re: [htdig-dev] defaults.cc

2002-11-10 Thread Geoff Hutchison
On Sunday, November 10, 2002, at 06:43 PM, Lachlan Andrew wrote: While we're changing Configuration.cc, what do people think about issuing a warning if an attribute is not found, rather than silently using the default_value argument? That would remind developers to add the attribute to
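
The proposal quoted above (warn when an attribute is missing instead of silently falling back to the default_value argument) could look roughly like the sketch below. This is only an illustration: WarnConfig, Add, Find and MissingCount are hypothetical names invented here, not the interface of ht://Dig's Configuration.cc.

```cpp
#include <iostream>
#include <map>
#include <string>

// Hypothetical stand-in for a configuration lookup table.
// It is NOT ht://Dig's actual Configuration class.
class WarnConfig {
public:
    // Register an attribute and its value.
    void Add(const std::string& name, const std::string& value) {
        values_[name] = value;
    }

    // Return the stored value. If the attribute is unknown, print a
    // warning to stderr, count the miss, and return default_value.
    std::string Find(const std::string& name, const std::string& default_value) {
        auto it = values_.find(name);
        if (it != values_.end()) {
            return it->second;
        }
        ++missing_;
        std::cerr << "warning: attribute '" << name
                  << "' not found, using default '" << default_value << "'\n";
        return default_value;
    }

    // Number of lookups that fell back to a default.
    int MissingCount() const { return missing_; }

private:
    std::map<std::string, std::string> values_;
    int missing_ = 0;
};
```

Counting the misses as well as printing them is a choice made for this sketch; it lets a test suite fail whenever a lookup falls back to a default.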

Re: [htdig-dev] Re: Forward porting changes for 3.1.6 to 3.2.0b4

2002-11-10 Thread Geoff Hutchison
On Sunday, November 10, 2002, at 06:40 PM, Lachlan Andrew wrote: they are what I was planning to change first. That leaves lots of changes to documentation and configuration files. The documentation changes are, of course, a bit tricky. After all, you can't directly compare attrs.html

Re: [htdig-dev] Questions

2002-11-08 Thread Geoff Hutchison
On Fri, 8 Nov 2002, Lachlan Andrew wrote: Regarding the flags, I can see why it makes sense to store the information, but it doesn't need to be as a bit-field. I do think it makes sense to have a bit field. Remember that we're not just planning a database for HTML documents. Yes, some of

Re: [htdig-dev] defaults.xml

2002-11-07 Thread Geoff Hutchison
like to do it.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig-dev] db.words.db using only key and empty value?

2002-11-07 Thread Geoff Hutchison
, I don't remember enough of the Berkeley DB details to know if that's the only method for comparing keys. If we change it, will things stay consistent? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig-dev] PHP and htsearch

2002-11-07 Thread Geoff Hutchison
the matches_per_page attribute to something quite high in your config file, or the matchesperpage form variable: http://www.htdig.org/attrs.html#matches_per_page http://www.htdig.org/hts_form.html -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig-dev] db.words.db using only key and empty value?

2002-11-04 Thread Geoff Hutchison
Sorry, I've been really busy and haven't had much time to comment on this. On Saturday, November 2, 2002, at 08:29 PM, Gilles Detillieux wrote: How much of this database fragmentation would be due to the fact that there are records of different lengths, and how much would be due to updating a

[htdig-dev] Current Status as of snapshot 3.2.0b4-20021027

2002-10-27 Thread Geoff Hutchison
STATUS of ht://Dig branch 3-2-x RELEASES: 3.2.0b5: Next release, tentatively 1 Dec 2002. 3.2.0b4: In progress -- snapshots called 3.2.0b4 until prerelease. 3.2.0b3: Released: 22 Feb 2001. 3.2.0b2: Released: 11 Apr 2000. 3.2.0b1: Released: 4 Feb 2000. (Please note that

Re: [htdig-dev] ULR.cc patch

2002-10-26 Thread Geoff Hutchison
it? This patch also contains the patches I've submitted earlier, since I can't find a snapshot which incorporates them. (I realise that you are very busy...) I apologize--I think somehow I missed them. I did inspect them tonight and am integrating them. I *think* they should make this next

Re: [htdig-dev] db.words.db using only key and empty value?

2002-10-22 Thread Geoff Hutchison
On Tuesday, October 22, 2002, at 02:50 PM, Neal Richter wrote: It looks to me like the db.words.db is using only a 'key' value, and has a blank 'value' for each and every key! Nope. Remember that value as it currently stands is the anchor--if any. So if your documents don't have anchors

Re: 3.2.0b4 release [htdig-dev] (was 3.2 Stability)

2002-10-17 Thread Geoff Hutchison
I talked to Neal off-list, so I'd like to clarify as well. I think the three of us are thinking basically the same thing, but it doesn't help when we talk about 3.3 or 4.0. So let's talk about how to get 3.2.0b4 out soon. On Thu, 17 Oct 2002, Gilles Detillieux wrote: I guess it comes down

Re: [htdig-dev] Status of defaults.xml

2002-10-16 Thread Geoff Hutchison
On Wednesday, October 16, 2002, at 02:27 AM, Brian White wrote: * 95% of htdocs/attrs.html I guess I'm not clear on what 95% means. Does this refer to the markup that you mentioned before? I still need to bundle up the changes - I was thinking of creating a patch based on 3.2.0b4 and

Re: [htdig-dev] Re: 3.2 Stability (was [htdig-members] reasons for objecting to LGPL change)

2002-10-16 Thread Geoff Hutchison
I'm going to take two separate issues and separate them for the moment: 1) What changes are needed for a solid 3.2.0 release. 2) The mifluz merge (in a separate e-mail). Please don't take any of my comments as overly critical or flaming. You're new to the project and attempting to take on some
