Re: [htdig] GNU Make

2000-05-26 Thread Geoff Hutchison
At 10:02 AM -0700 5/26/00, John Melo wrote: Where is possible to download the GNU Make? This is confuse for me. ftp://ftp.gnu.org/pub/gnu/make/ -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list

Re: [htdig] pause

2000-05-25 Thread Geoff Hutchison
, this directory structure is not standard across platforms. otherwise, how do I retrieve it easily ? I'd probably use some form of the ps command. If you have a 'killall' command, you might use that or at least look at the script itself. -- -Geoff Hutchison Williams Students Online http

Re: [htdig] Indexing question.

2000-05-25 Thread Geoff Hutchison
write PDF files that are just graphics or bitmaps and so these cannot be indexed. What sort of indication do you have that they are parsed when you run htdig -vvv? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Indexing large amount of non-related files

2000-05-24 Thread Geoff Hutchison
. What features of the glimpse request language do you think are missing in ht://Dig? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Replacing db files?

2000-05-24 Thread Geoff Hutchison
with multiple parts. So if you wanted to have some locking like this, you would need some sort of "versioning" so that htsearch would know that all the databases were from the same set. -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] cookies and session-ids

2000-05-24 Thread Geoff Hutchison
-server and per-url attribute blocks, you could specify the cookie for a given group of URLs or server. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Indexing large amount of non-related files

2000-05-24 Thread Geoff Hutchison
which is poor.) No problem. There's a regex fuzzy in the 3.2 code. Right now it's only for strict regular expressions, but we could probably make a "basic regex" syntax as well. -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] URL substitution (or other solution)

2000-05-24 Thread Geoff Hutchison
?pagename=pagename.html You want to use the url_part_aliases attribute. You set up this in your digging config file: url_part_aliases: http://hostname/ *1 Then you make up a *new* config file for searching with this: url_part_aliases: http://hostname/cgi-bin/program/loadpage?pagename= *1 -- -Geoff
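The two-config trick above can be modelled in a few lines. This is only a sketch of the idea, not ht://Dig's code: the function names are made up for illustration, and it assumes each alias applies to the start of the URL. The digging config stores URLs with the token *1 in place of the real prefix, and the searching config expands *1 into a different prefix.

```python
# Illustrative model of url_part_aliases; NOT ht://Dig's actual implementation.
# Assumption: each alias pair is applied to the start of the URL only.

def encode_url(url, aliases):
    """Dig time: replace the first matching 'from' string with its token."""
    for prefix, token in aliases:
        if url.startswith(prefix):
            return token + url[len(prefix):]
    return url


def decode_url(stored, aliases):
    """Search time: expand a token back into that config's 'from' string."""
    for prefix, token in aliases:
        if stored.startswith(token):
            return prefix + stored[len(token):]
    return stored


# Pairs taken from the two config files quoted in the message.
dig_aliases = [("http://hostname/", "*1")]
search_aliases = [("http://hostname/cgi-bin/program/loadpage?pagename=", "*1")]

stored = encode_url("http://hostname/pagename.html", dig_aliases)
shown = decode_url(stored, search_aliases)
print(stored)
print(shown)
```

Because the database only ever holds the token form, the same index can be presented under whichever prefix the searching config supplies.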

Re: [htdig] Problem with content-type text/html; charset=SOMETHING

2000-05-24 Thread Geoff Hutchison
as well as fairly detailed reasoning from htdig as far as rejecting or indexing things. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] pause

2000-05-24 Thread Geoff Hutchison
this flag, it will read this list in and resume. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this.

Re: [htdig] excluding index-pages

2000-05-23 Thread Geoff Hutchison
that's the target of the link. So that hyperlink text counts for the message.html pages automatically. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] compilation issue with 3.2.0b2

2000-05-23 Thread Geoff Hutchison
o have to bring back the autoconf make tests. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Re: the mysterious Deleted, no excerpt problem

2000-05-23 Thread Geoff Hutchison
leave it alone if possible." My preferred editor on the Mac (Alpha) does this as well. So nulls won't automatically be stripped. Out of curiosity, how does it show in browsers? This may be one of those "real world" issues where the HTML parser just needs to be more liberal. -- -Geof

Re: [htdig] htdig -s option for checking dead links

2000-05-23 Thread Geoff Hutchison
al documents, now we are getting 3,400.??? My guess is that some of these "documents" are really the stubs for broken links or documents forbidden by the robots.txt file. If you have an alternate way of counting URLs, this would be a useful comparison. -- -Geoff Hutchison

Re: [htdig] Indexing large amount of non-related files

2000-05-23 Thread Geoff Hutchison
you might get some use out of local_urls, though if they don't have extensions, you might see it hit the HTTP server a lot as it tries to figure out the MIME type. Also remember that ht://Dig currently doesn't have any sort of "index this directory" feature. -- -Geoff Hutchison William

Re: [htdig] delayed remove of bad urls

2000-05-23 Thread Geoff Hutchison
ht://Dig stores the access time of documents in the database, there is currently no use of this data. One suggestion would be to ping the servers before you index them. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Replacing db files?

2000-05-23 Thread Geoff Hutchison
nge states, it will probably give some sort of error. There isn't any sort of version system to ensure the databases are all of one set. (Hmm, that's not a bad idea...) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Installation Problems with 3.2b2

2000-05-22 Thread Geoff Hutchison
es a whole bunch of tests--some of them will work and some of them will not. Some platforms require libl (or liblex) to compile. Do you have compilation errors when you run make? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

RE: [htdig] Indexing binary files by filename

2000-05-22 Thread Geoff Hutchison
to support external converters, to avoid a lot of the complications of writing external parsers. So this functionality appeared in 3.1.4 and later. Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] https and htdig

2000-05-22 Thread Geoff Hutchison
for searching. http://www.htdig.org/attrs.html#url_part_aliases In the indexing one: url_part_aliases: http:// *1 Then in the searching one: url_part_aliases: https:// *1 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Installation Problems with 3.2b2

2000-05-22 Thread Geoff Hutchison
, but that's only useful if we know what the user-level configure output says when it stops! (The config.log file is more of a debugging help than anything else.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

RE: [htdig] Segfault indexing a site with 3.2.0b2

2000-05-22 Thread Geoff Hutchison
is this? This is definitely a bug (and hasn't been reported yet). Can you take a look at the core file with gdb and give us a backtrace? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

[htdig] Re: htdig-3.2.0b2 install

2000-05-22 Thread Geoff Hutchison
ece of urban lore. I create a completely new set each month to prevent the possibility of corrupted databases. But I don't know how this would affect htsearch performance at all. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Stability of beta, and a couple newbie questions

2000-05-19 Thread Geoff Hutchison
it just do an update? If you specify -i, it deletes them and then does a complete rebuild. Similarly, if you don't specify -i and there aren't any old databases, it does a rebuild instead of an update. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Indexing binary files by filename

2000-05-19 Thread Geoff Hutchison
bably want a minimal script that returns something for a title and something for an excerpt. (My suggestion would be to return the file type as an excerpt, like "QuickTime movie" or "MP3 file" but anything is fine.) Then you'd probably want to remove these file types f

Re: [htdig] Weird endings problem

2000-05-19 Thread Geoff Hutchison
as a key and this should be ignored before it ever gets to the database. If it managed to build the databases I'm sure they're OK. These messages are really more debugging output or warnings than errors. Thanks for the report though, you probably found a bug. -- -Geoff Hutchison Williams Students

Re: [htdig] How to search phrases ?

2000-05-18 Thread Geoff Hutchison
that. You enter "water mill" with the quotes into the search form and it will match phrases. That's it. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Error when linking on Alpha server

2000-05-18 Thread Geoff Hutchison
config files. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Endings databases of two languages

2000-05-18 Thread Geoff Hutchison
easy thing to do. If someone wants to carefully modify the Endings source to loop through a series of languages and merge the results into one database, that might be a useful starting point. -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] Accent problem.

2000-05-17 Thread Geoff Hutchison
with different library versions. (Thus the whole LSB project...) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

RE: [htdig] %20 chars in visible htsearch output

2000-05-17 Thread Geoff Hutchison
it's a URL that actually works. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] installing htdig on a server without telent access

2000-05-16 Thread Geoff Hutchison
have to be wary of things like library incompatibilities and so on if the machines are even running different versions of the same OS or distribution. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

RE: [htdig] 2 questions: nbsp; and bad_words

2000-05-16 Thread Geoff Hutchison
that Apache has attributes beyond what's in the .conf files, why is it any different for ht://Dig? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Indexing news articles ?

2000-05-15 Thread Geoff Hutchison
? If they are linked and you are indexing over HTTP (or if you don't have local_urls_only set), then they will be indexed. Most likely, your webserver will send them as text files. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] A Suggestion on Accents

2000-05-15 Thread Geoff Hutchison
lly.) All that said, it would be possible to patch the code in WordList.cc and remove accents before storing the word. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] result format

2000-05-15 Thread Geoff Hutchison
restrict" value="_1.php3" Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] how does the dig the ranking ?

2000-05-15 Thread Geoff Hutchison
you consider backlink_factor and date_factor, but this is about it. The 3.2 betas are a bit more complicated and there are still some scoring issues to be cleaned up, but the result should be much better. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Looking for Sol.7.0 Binary on catdoc

2000-05-15 Thread Geoff Hutchison
would check sunfreeware.com which at least *used* to have binaries for different versions of SunOS for major programs. I would also look up catdoc on freshmeat.net and see if the original author has binaries. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] What's New?

2000-05-15 Thread Geoff Hutchison
it myself, but I imagine it works OK. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] BAD TAG IN SERIALIZED DATA: 110 and DB2 error messages

2000-05-15 Thread Geoff Hutchison
debugging (-g) is turned on. Then follow my instructions about running it through gdb. If we can get a backtrace at the time of the DB2 error message, it would greatly help. Thanks, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Accent problem.

2000-05-15 Thread Geoff Hutchison
attribute). See my other message just now about the +/- of this approach or simply stripping accented words. As you noted in your message, the two words do not mean the same thing! -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Can't build htdig 3.1.5 (htfuzzy)

2000-05-12 Thread Geoff Hutchison
expression, REG_ICASE | REG_NOSUB | REG_EXT We could probably arrange a regex configure test for this, but we include our own regex code for exactly this reason--many platforms have "broken" regex functions. You will need to fix the errors, but you can probably ignore the

Re: [htdig] Attention RPM users - quick poll (slightly off)

2000-05-12 Thread Geoff Hutchison
-0mandrake.i586.rpm I usually take Gilles's RPMs and rebuild them for LinuxPPC and upload them as well. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Question about writing parsers

2000-05-11 Thread Geoff Hutchison
mation you want and pass the HTML back to htdig. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Implement an htdig-like feature

2000-05-11 Thread Geoff Hutchison
(of course) that ht://Dig is licensed under the GPL. Judging from your domain name, this doesn't seem to be a likely problem. :-) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Some pages are parsed very slowly.

2000-05-11 Thread Geoff Hutchison
of 3-4 dozen links without any sort of # mark. This would provide some element of comparison. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Htdig unable to read link

2000-05-11 Thread Geoff Hutchison
are returning HTTP headers with every request. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Q: Can I ...

2000-05-11 Thread Geoff Hutchison
. Otherwise, I'd have a database indexed by checksums pointing to DocIDs. This would be an analogue to the db.docs.index file which is a database of URLs pointing to DocIDs. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/
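A rough sketch of the checksum idea suggested above, assuming an in-memory dictionary stands in for the on-disk database and MD5 stands in for whatever signature method would actually be chosen. The class and method names are hypothetical and nothing here is taken from the ht://Dig source.

```python
import hashlib


class ChecksumIndex:
    """Maps a content checksum to the DocID of the first document seen with it."""

    def __init__(self):
        self.by_checksum = {}  # checksum -> DocID (the proposed new database)
        self.by_url = {}       # URL -> DocID (loosely analogous to db.docs.index)
        self.next_id = 1

    def add(self, url, body):
        """Return (doc_id, is_new); is_new is False for duplicate content."""
        key = hashlib.md5(body).hexdigest()
        if key in self.by_checksum:
            doc_id, is_new = self.by_checksum[key], False
        else:
            doc_id, is_new = self.next_id, True
            self.by_checksum[key] = doc_id
            self.next_id += 1
        self.by_url[url] = doc_id
        return doc_id, is_new


index = ChecksumIndex()
first = index.add("http://hostname/a.html", b"<html>same page</html>")
copy = index.add("http://hostname/mirror/a.html", b"<html>same page</html>")
other = index.add("http://hostname/b.html", b"<html>different</html>")
print(first, copy, other)
```

A content checksum catches copied files that an inode comparison would miss, at the cost of reading and hashing every document.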

Re: [htdig] htdig-3.1.5 +prune_parent_dir_href patch version 0.0

2000-05-11 Thread Geoff Hutchison
it. The code for this should be in the Retriever and should essentially just mark the duplicate URL as _notfound or _noindex. (Any additional discussion on this is probably best moved to the htdig3-dev mailing list.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] htdig-3.1.5 +prune_parent_dir_href patch version 0.0

2000-05-10 Thread Geoff Hutchison
sense to ultimately have a few possible signature methods. Inode lookup is fast, but it won't catch copies of files. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] htdig-3.1.5 +prune_parent_dir_href patch version 0.0

2000-05-10 Thread Geoff Hutchison
n the 3.2 tree, URLs are indexed by hopcount, so you would only want to store the URL for the first time you see a document. It is not easy to "replace" the URL in the database. There are a few pieces that need to be updated carefully. This is another reason for sticking to the first URL...

Re: [htdig] unknown locale, but really strange, not the normalproblem

2000-05-10 Thread Geoff Hutchison
not at all. The config parser does not care about any sort of order. This is why it is so confusing... -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] How to manage infinite-loop conditions in Htsearch.(3.1.5)

2000-05-09 Thread Geoff Hutchison
ng example of this/equivalent? Sure, you can do this through things like process accounting and sometimes the ulimit command. But these are brute-force approaches. My suggestion is to use the many available built-in controls like max_hop_count or server_max_docs if you think you're getting infinite loops.

Re: [htdig] htdig-3.1.5 +prune_parent_dir_href patch version 0.0

2000-05-09 Thread Geoff Hutchison
it also has not been tried. This would probably need to be an option since it might misidentify "duplicate" servers. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] multiple DBs

2000-05-07 Thread Geoff Hutchison
re. Do you mean the patch posted on the list for "collections" of databases? If so, are the individual config entries being repeated? (It would be nice to see more details.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Suse 6.2 + symlinks + htdig 3.1.5

2000-05-05 Thread Geoff Hutchison
not have changed *htsearch*, but the code also assumes the databases will be in a proper form--there's some limited checking but it is possible to corrupt them. That said, it should be pretty hard to produce a segfault. If you can provide a backtrace with gdb, it would be greatly appreciated. -- -Geoff

Re: [htdig] htdig-3.1.5 +prune_parent_dir_href patch version 0.0

2000-05-05 Thread Geoff Hutchison
at the time you run configure. For example: CFLAGS="-O3 -funroll-loops" CXXFLAGS="-O3" ./configure This is what I do personally. But I'm also aware as a maintainer that this is not something to be blindly followed--I've tested the compiler and the code at this level and it works

Re: [htdig] Htdig unable to read link

2000-05-05 Thread Geoff Hutchison
things like tcp_max_retries and tcp_wait_time as mentioned in the release notes and documentation. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Problem with Accents.0 patch

2000-05-03 Thread Geoff Hutchison
of the path in the patch. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] indexing multiple web sites

2000-05-03 Thread Geoff Hutchison
date/index. (The latter was broken until this last snapshot.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] restricting urls and the html templates

2000-05-03 Thread Geoff Hutchison
at the build_select_lists attribute: http://www.htdig.org/attrs.html#build_select_lists Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] search questions

2000-05-03 Thread Geoff Hutchison
tdig.org/FAQ.html#q1.9 Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Suse 6.2 + htdig 3.1.5: looping again

2000-05-02 Thread Geoff Hutchison
e is a bug in there, but where ? What makes this part of the tree different from all others ? Apache has fancy indexing turned on. Good question. If you try to index *just* this directory, do you see anything funny? Cheers, -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] Suse 6.2 + htdig 3.1.5

2000-05-02 Thread Geoff Hutchison
ery URL that doesn't match this prefix will be ignored. Effectively, you've limited the indexing to only that directory and its subdirectories (i.e. each year and month). My guess is that your limit_urls_to is a bit liberal. As I asked earlier, what does your config file look like? Cheers,

Re: [htdig] Configure errors

2000-05-01 Thread Geoff Hutchison
to compile. If that doesn't work correctly, then we'll start looking at things like the config.log. Sorry for the long-winded explanation. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Suse 6.2 + htdig 3.1.5: looping again

2000-05-01 Thread Geoff Hutchison
. What you describe is *exactly* what htdig uses. There are a variety of issues if you're browsing through the Retriever code, but suffice to say there is a hash of the visited URLs. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu
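As a sketch of why a hash of visited URLs stops a crawl from looping: the toy crawler below walks a hard-coded link graph that contains cycles and still terminates. The graph, the function name and the hop limit parameter (named after the max_hop_count attribute mentioned elsewhere on this list) are illustrative assumptions, not the Retriever code.

```python
from collections import deque


def crawl(start, links, max_hop_count=None):
    """Breadth-first walk of `links`, visiting each URL at most once."""
    visited = {start}            # the hash of URLs already seen
    queue = deque([(start, 0)])  # (url, hops from the start URL)
    order = []
    while queue:
        url, hops = queue.popleft()
        order.append(url)
        if max_hop_count is not None and hops >= max_hop_count:
            continue             # do not follow links beyond the hop limit
        for target in links.get(url, []):
            if target not in visited:
                visited.add(target)
                queue.append((target, hops + 1))
    return order


# A deliberately cyclic graph: every page links back to the start page.
links = {
    "/": ["/a", "/b"],
    "/a": ["/", "/b"],
    "/b": ["/"],
}
print(crawl("/", links))
print(crawl("/", links, max_hop_count=0))
```

A visited set cannot help when a server generates an endless supply of distinct URLs, which is where limits such as a hop count come in.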

Re: [htdig] BAD TAG IN SERIALIZED DATA: XXX

2000-05-01 Thread Geoff Hutchison
) and indexing for the first time (or from scratch), then there is something very wrong. Otherwise my suggestion would be to reindex from scratch. You can do this easily if you use the -a option and you don't have any databases ending in .work. -- -Geoff Hutchison Williams Students Online http

Re: [htdig] Can htdig index chinese pages?

2000-05-01 Thread Geoff Hutchison
Not at the moment. Sorry, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] htdig and PHP

2000-04-27 Thread Geoff Hutchison
tp://www.devshed.com/Server_Side/PHP/search/ There are also several PHP wrappers and Perl wrappers in the contrib/ directory that comes with your source distribution and on http://www.htdig.org/contrib/ Cheers, -- -Geoff Hutchison Williams Students Online htt

Re: [htdig] htdig / Suse 6.2: very long run ?

2000-04-26 Thread Geoff Hutchison
actually lock cleanly. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] htdig / Suse 6.2: very long run ?

2000-04-26 Thread Geoff Hutchison
believe it already handles PostScript files nicely. Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Remove_bad_urls ?!

2000-04-26 Thread Geoff Hutchison
e that had the original typo may not have changed, so you won't see the report again (since it won't be re-parsed). Ref:'s are also getting lost for bad urls .. can anyone explain why that happens ? As above. Since it doesn't re-parse the files, it doesn't know what the referring page was.

Re: [htdig] Spanish words

2000-04-26 Thread Geoff Hutchison
of South American and Mexican sites in the list of users: http://www.htdig.org/uses.html No one has yet contributed such a list, but it would be appreciated! -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Latin documents

2000-04-26 Thread Geoff Hutchison
On Wed, 26 Apr 2000, Alexey Rodriguez wrote: I am having problems to index documents with latin letters. For example the word "matem&aacute;ticas" is split in two. Is there an option to the configuration file to enable it? See the FAQ: http://www.htdig.org/FAQ.html#q5.8

Re: [htdig] Indexing scope

2000-04-26 Thread Geoff Hutchison
matching one of the patterns. If one of your patterns happened to just be http://, you'd index the whole web. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/
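To see why a pattern of just http:// would admit the whole web, here is a small sketch of pattern-based URL filtering. It assumes plain substring matching and borrows the attribute names limit_urls_to and exclude_urls as parameter names; the real matching rules live in the ht://Dig documentation and source, so treat this only as an illustration.

```python
def url_allowed(url, limit_urls_to, exclude_urls=()):
    """Accept a URL if it contains a limit pattern and no exclude pattern."""
    if any(pattern in url for pattern in exclude_urls):
        return False
    return any(pattern in url for pattern in limit_urls_to)


# A narrow pattern keeps the dig on one site.
print(url_allowed("http://www.htdig.org/FAQ.html", ["http://www.htdig.org/"]))
print(url_allowed("http://example.com/index.html", ["http://www.htdig.org/"]))

# A pattern of just "http://" matches every HTTP URL.
print(url_allowed("http://example.com/index.html", ["http://"]))

# An exclude pattern such as "/cgi-bin/" (assumed here) rejects CGI URLs.
print(url_allowed("http://www.htdig.org/cgi-bin/htsearch",
                  ["http://www.htdig.org/"], ["/cgi-bin/"]))
```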

Re: [htdig] htdig / Suse 6.2: very long run ?

2000-04-25 Thread Geoff Hutchison
n previous versions, you can also get this from parsing the db.docs file created by using the -t switch. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] noindex_start but follow?

2000-04-25 Thread Geoff Hutchison
and are increasingly being used by browsers and robots alike for similar purposes. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] german umlauts with htsearch

2000-04-25 Thread Geoff Hutchison
On Tue, 25 Apr 2000, florian dfkdf wrote: See the FAQ, e.g. http://www.htdig.org/FAQ.html#q5.8 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] htdig / Suse 6.2: very long run ?

2000-04-25 Thread Geoff Hutchison
mails from the list due to the spam filter. If there was any mail on this $SUBJ between my (plp) and your (Geoff H.)'s emails, then please say so. From now on, list messages make it. Nothing posted to the list at least. Cheers, -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] coldfusion with htdig

2000-04-24 Thread Geoff Hutchison
are the extensions of the documents you want to index? (.html? or .cfm?) Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Problem with Javascript

2000-04-24 Thread Geoff Hutchison
nding your file as an attachment. (This is good practice for any largish file anyway.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Indexing through ezmlm-cgi

2000-04-20 Thread Geoff Hutchison
On Thu, 20 Apr 2000, James Moore wrote: Why is the url getting rejected? Anyone have any luck indexing ezmlm archives?? What have you set for your exclude_urls attribute? Remember that the default includes cgi-bin, so you'll need to set it to empty. exclude_urls: Cheers, -- -Geoff

Re: [htdig] Searching php docs mySQL db together

2000-04-19 Thread Geoff Hutchison
abase plus static pages). I started looking thru the htdig archives, but Yes, assuming you have some sort of CGI that serves up parts of your database. Ht://Dig has no problems indexing dynamic pages. -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] Can I use ht://dig on any web server?

2000-04-19 Thread Geoff Hutchison
on the license if you haven't heard of it.) I say "usually" because it's not very hard to set up if you have shell access and have installed UNIX programs before. Otherwise you may want to ask for help from your ISP. Cheers, -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] Fraze searching

2000-04-19 Thread Geoff Hutchison
On Wed, 19 Apr 2000, Tomas Tikovsky wrote: Can i search for frazes using htdig? See http://www.htdig.org/FAQ.html#q1.9 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] compiling 3.2b2 under Solaris

2000-04-19 Thread Geoff Hutchison
any suggestions. If you take a look further up your configure script, it will tell you that the script thinks you have a cross-compiler. Are you using a compiler you received as a binary package? Are you sure this compiler is for Solaris 2.6? -- -Geoff Hutchison Williams Students Online http

Re: [htdig] Compiling 3.2b2 under Solaris (X86)

2000-04-19 Thread Geoff Hutchison
sable-shared because currently there are problems building shared libraries on Solaris. It is, unfortunately, a known (but annoying) bug. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Fatal DB2 Problem

2000-04-19 Thread Geoff Hutchison
e so the program can write to it. These used to be caused by two parts of the code trying to write at the same time. I'm assuming nothing else is trying to write to this file? From your paths, I'm guessing you're running this on MacOS X Server, right? If so, what compiler are you using?

Re: [htdig] Htdig and MySql

2000-04-19 Thread Geoff Hutchison
L. I hope that answers your question, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Only getting 9 search results back

2000-04-18 Thread Geoff Hutchison
xperienced this with Netscape in a few versions.) If you see the links in the webpage source, then it's not a problem with the code or the databases, it's either a problem in the templates or the browser. -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] Index word doc meta tags (toc)

2000-04-18 Thread Geoff Hutchison
to ignore everything but this. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Unable to search for ipnumbers and such...

2000-04-17 Thread Geoff Hutchison
At 3:13 PM +0200 4/17/00, Johan Edback wrote: I cant get the ht://Dig package to search for ipnumbers, version numbers and such things. Ie: it interprets 10.250.1.17 as 10250117 and since that doesn't exsits... See http://www.htdig.org/FAQ.html#q4.12 -- -Geoff Hutchison Williams Students Online
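The symptom in the question (10.250.1.17 becoming 10250117) is what you would get if punctuation inside a word is dropped before the word is stored. The sketch below reproduces that effect and shows how keeping the dot as a word character changes the result. The function and its two parameters are hypothetical, and the actual attribute to change is whatever the FAQ entry linked above describes.

```python
def normalize_word(word, strip_chars=".", keep_chars=""):
    """Drop characters listed in strip_chars unless they are also in keep_chars."""
    return "".join(
        ch for ch in word
        if ch not in strip_chars or ch in keep_chars
    )


# With dots stripped, an IP address collapses into one long number.
print(normalize_word("10.250.1.17"))

# Treating "." as a word character keeps the address searchable as typed.
print(normalize_word("10.250.1.17", keep_chars="."))
```

Whatever setting is changed, the same normalization has to apply when indexing and when searching, otherwise the stored word and the query word will not match.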

Re: [htdig] Indexing scope

2000-04-17 Thread Geoff Hutchison
le, w/ just the URL's and add in the spaces, from the db...OT newbie question, could I put that Perl script in cron? Yep. Or you could call it from the script that calls htdig, but just beforehand. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Indexing scope

2000-04-16 Thread Geoff Hutchison
who wishes he could figure these things out as fast as you responded :). It helps knowing some of the code and answering a few of these before. ;-) Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] ASCII database layout

2000-04-13 Thread Geoff Hutchison
to differentiate from the meta description.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Using H1 instead of TITLE

2000-04-13 Thread Geoff Hutchison
in the search results. In addition to everything that Gilles pointed out, you can also edit the templates. There isn't a template variable for the H1, but you could certainly take out the $(TITLE) variable there--perhaps simply the URL would be better for you. -- -Geoff Hutchison Williams Students

Re: [htdig] Sort by Date from Meta Tags [patch]

2000-04-12 Thread Geoff Hutchison
At 1:37 PM +0200 4/12/00, Michael Pfennich wrote: -- What about limiting the Results by time, eg, search for docs, containing "Searchword", and creation/modification Time between Jan 1999 and Feb 1999 This is on the TODO list. Cheers, -- -Geoff Hutchison Williams Students O

Re: [htdig] Why did I receive this email!

2000-04-12 Thread Geoff Hutchison
tally." To unsubscribe either follow the instructions at the bottom of the e-mail, or go to http://www.htdig.org/mailing.html. You will need to confirm an unsubscription as well (so that someone doesn't unsubscribe someone else). Cheers, -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] The document contained no data?! Help

2000-04-12 Thread Geoff Hutchison
At 10:21 PM +0100 4/12/00, Richard Booth wrote: I take it that each time I want to perform a search of my cache using htdig and wwwoffle that I need to refresh the database? Yes, otherwise there's no way for ht://Dig to know that there's anything new. -- -Geoff Hutchison Williams Students

Re: [htdig] htdig and alt img tags

2000-04-11 Thread Geoff Hutchison
, it gets header_factor_X...). If you really want to disable indexing alt tags right now, I'd suggest commenting out or removing that appropriate part of HTML.cc (it's fairly well labeled). -- -Geoff Hutchison Williams Students Online http://wso.williams.edu
