Re: [htdig] 3.2.0b2 compile problems on RedHat 7.0

2000-10-24 Thread Geoff Hutchison
failure on all platforms that I'm following up on, but otherwise I'd recommend the snapshot over the 3.2.0b2 release. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL

RE: [htdig] key word file or DB

2000-10-24 Thread Geoff Hutchison
files. (Normally I would expect that they'd also add in commands so that the rundig script would update those as well, but that's clearly not the case.) One way or another I think you really need to talk to the person who installed ht://Dig in the first place. -- -Geoff Hutchison Williams Students

Re: [htdig] Help : Incomplete Search Results

2000-10-23 Thread Geoff Hutchison
. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http

Re: [htdig] restricting result set

2000-10-23 Thread Geoff Hutchison
/attrs.html#valid_punctuation http://www.htdig.org/attrs.html#extra_word_chars So this makes #BIB equivalent to bib. If you want to separate them, you'll probably want to add # to the extra_word_chars attribute. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Search file type

2000-10-23 Thread Geoff Hutchison
that match .html or .pdf and don't care about trailing slashes, you can use the restrict field in the search form: http://www.htdig.org/hts_form.html -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Search multiple directories

2000-10-23 Thread Geoff Hutchison
want to restrict a search to a specific URL pattern, use the "restrict" or "exclude" fields in the search form: http://www.htdig.org/hts_form.html -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: AW: [htdig] HTMERGE doesn't remove URL

2000-10-20 Thread Geoff Hutchison
something that could be added to the new 'htpurge' tool at some point. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Basics - Already searched archives

2000-10-20 Thread Geoff Hutchison
al case" in your server config for the indexer. Keep in mind it's probably coming from a particular IP and can have a specific user-agent field. 2) Grab the Cygwin package for running UNIX apps on Win32 machines and try using local_filesystem digging. (No, I don't know if this works.)

Re: [htdig] Including Pull-Down Menu Pages

2000-10-20 Thread Geoff Hutchison
f the game with ht://Dig than with other packages for the reasons I just outlined. Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] How do I search a subset of my database?

2000-10-20 Thread Geoff Hutchison
the entire database. Is there something else I need to do? Nope, you don't need a separate conf file. Just use the restrict or exclude fields in the search form to limit result URLs to matching certain patterns. http://www.htdig.org/hts_form.html -- -Geoff Hutchison Williams Students Online http

Re: [htdig] Indexing Flash (was Including Pull-Down Menu Pages)

2000-10-20 Thread Geoff Hutchison
to *run* it to get any sort of URL information. Unless a future version becomes "search engine friendly" nothing can (or will) improve. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

RE: [htdig] key word file or DB

2000-10-19 Thread Geoff Hutchison
automatically generates keywords from the html pages contained within the site? thanks This is correct--I have no idea why you were asked to compile a list of keywords, but ht://Dig indexes the site and generates an index of words in the documents. -- -Geoff Hutchison Williams Students Online http

RE: [htdig] key word file or DB

2000-10-19 Thread Geoff Hutchison
On Thu, 19 Oct 2000, Steve Webster wrote: Also, How do I find out which version I'm running? Most versions you can get the version from the usage information. Try typing: htdig -? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] 3.2.0b2 compile problems on RedHat 7.0

2000-10-19 Thread Geoff Hutchison
assurances to the contrary, C++ compilation with that compiler is NOT reliable. Of course version 3.1.5 compiles cleanly (and is supposedly on the PowerTools disk). -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] General Question

2000-10-19 Thread Geoff Hutchison
other set.) Digging and searching at the same time should be possible in the 3.2.0 code when it's released but it has not been tested fully. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Local digging files without extension

2000-10-18 Thread Geoff Hutchison
into file:// URLs for this purpose. Since we have some nice regex parsing for config files why not use them here as well and extend it to the full filename? Interesting idea. Of course no other mime.types support works this way, so we couldn't easily use a standard mime.types file. -- -Geoff

Re: [htdig] key word file or DB

2000-10-18 Thread Geoff Hutchison
I highly suggest that you take this opportunity to upgrade. If pages are no longer present, they will be removed from the databases automatically when updating them. For this reason, most people set periodic update jobs using the cron program. -- -Geoff Hutchison Williams Students

Re: [htdig] Some indexing problems

2000-10-17 Thread Geoff Hutchison
URLs above and did not change the limit_urls_to, then it will only index the one URL. Since no other pages will match the pattern set in limit_urls_to, it will reject all links. Try something like: limit_urls_to: http://www.foo.com/en/ See http://www.htdig.org/attrs.html#limit_urls_to -- -Geoff
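The rejection described above can be illustrated with a small Python sketch. It assumes, as a simplification, that a URL passes limit_urls_to when it contains at least one of the configured patterns as a substring. The function name passes_limit and the example URLs are hypothetical and not part of ht://Dig.

```python
def passes_limit(url, limit_urls_to):
    """Return True if the URL contains at least one configured pattern.

    Assumption: plain substring matching, used here only for illustration.
    """
    return any(pattern in url for pattern in limit_urls_to)


# A limit set to a single page only matches that page.
too_narrow = ["http://www.foo.com/en/index.html"]
# A limit set to the directory matches everything beneath it.
directory = ["http://www.foo.com/en/"]

# Hypothetical links discovered on the start page.
links = [
    "http://www.foo.com/en/index.html",
    "http://www.foo.com/en/products.html",
    "http://www.foo.com/de/index.html",
]

narrow_hits = [u for u in links if passes_limit(u, too_narrow)]
directory_hits = [u for u in links if passes_limit(u, directory)]
```

With the narrow pattern only the start page survives, which matches the "only index the one URL" symptom; with the directory pattern the sibling page under /en/ is accepted as well.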

[htdig] Re: Ornamentation?

2000-10-16 Thread Geoff Hutchison
, but there is no requirement to do so. The GNU GPL only has restrictions on the distribution of the source code, not on any form of use. There are many examples of sites that do not include any sort of ht://Dig logo, e.g. http://www.linux.com/ -- -Geoff Hutchison Williams Students Online http

Re: [htdig] Local digging files without extension

2000-10-16 Thread Geoff Hutchison
. I'd be glad to give pointers in the 3.2 code if someone is interested in this. It would obviously be well-received. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Local digging files without extension

2000-10-16 Thread Geoff Hutchison
the patch to 3.2 code than to 3.1.5, but I'd port it as necessary. (please don't hold your breath for the patch though), We're all volunteers here. Nothing comes before its time. ;-) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] spam

2000-10-16 Thread Geoff Hutchison
filter as well. I'll investigate tomorrow. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] 3.1.5: Completed large index

2000-10-15 Thread Geoff Hutchison
not be parsed at all by xpdf. http://www.htdig.org/FAQ.html#q4.9 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] KICK THE SPAMMERS OFF ASAP PLEASE

2000-10-12 Thread Geoff Hutchison
I'm from a small enough place that I *hate* having to lock my doors. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Distributions (was Red Hat 7.0 RPMS?)

2000-10-11 Thread Geoff Hutchison
, this does not make it a high priority. If, for some reason, a significant security hole is discovered or other significant bug, then a 3.1.6 release will be made to fix that. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Indexing a whole site by parts

2000-10-10 Thread Geoff Hutchison
to make a whole index? Sure. This is one of the features of htmerge after version 3.1.0. If you'd like to do this frequently, you might be interested in the multidig scripts that semi-automate the process. (I'm a bit biased here.) http://www.htdig.org/files/contrib/scripts/ -- -Geoff Hutchison

RE: [htdig] Distributions (was Red Hat 7.0 RPMS?)

2000-10-10 Thread Geoff Hutchison
d size, so even a small package like ht://Dig has to take the place of something else. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Re: 3.1.5 strange freeze problem (fwd)

2000-10-09 Thread Geoff Hutchison
. Did you notice much swapping or a drop in the amount of memory used? You said that cutting out large parts of the tree helped, so this would be my first guess at a culprit. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: AW: [htdig] One htdig for different searches

2000-10-09 Thread Geoff Hutchison
t type="hidden" name="restrict" value="~membername" This will do *exactly* what you describe. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Red Hat 7.0 RPMS?

2000-10-08 Thread Geoff Hutchison
. Of course you can also recompile the SRPMS with: rpm --rebuild ... -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Question about url_part_aliases

2000-10-05 Thread Geoff Hutchison
: ... url_part_aliases: http:// *site search.conf: include: htdig.conf url_part_aliases: https:// *site This serves to keep the HTTP URLs when indexing (since the indexer cannot index HTTPS) and serve up HTTPS when searching. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu
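The two-config trick above can be sketched in Python. This is a simplified model that treats each alias as a plain string substitution applied when a URL is stored and reversed when it is displayed. The helper names encode and decode and the example URL are hypothetical; ht://Dig's actual handling may differ in detail.

```python
def encode(url, aliases):
    """Replace each real URL part with its alias before storing the URL."""
    for part, alias in aliases:
        url = url.replace(part, alias)
    return url


def decode(stored, aliases):
    """Expand each alias back into the URL part configured for this program."""
    for part, alias in aliases:
        stored = stored.replace(alias, part)
    return stored


# Mirrors the two config files in the message: the indexer maps http:// to
# *site, while the search config maps https:// to the same alias.
dig_aliases = [("http://", "*site")]
search_aliases = [("https://", "*site")]

stored = encode("http://www.example.com/page.html", dig_aliases)
shown = decode(stored, search_aliases)
```

Because both configurations share the alias *site, a page fetched over HTTP by the indexer is presented with an HTTPS URL at search time.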

RE: [htdig] puzzled by htdig

2000-10-05 Thread Geoff Hutchison
there. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] ... but not changed

2000-10-04 Thread Geoff Hutchison
d the date of the file itself, but what if it includes a file that has changed, or an actual CGI? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] header.html

2000-10-04 Thread Geoff Hutchison
? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] puzzled by htdig

2000-10-04 Thread Geoff Hutchison
.htdig.org/attrs.html#limit_urls_to -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Problems with PDF files

2000-10-03 Thread Geoff Hutchison
, is that one or more of your PDF files is actually damaged. In that case, the best thing to do is to run htdig with more debugging turned on and send both STDOUT and STDERR to a file to peruse. Obviously the files reported just before this output would be ones to check. -- -Geoff Hutchison

Re: [htdig] Question about Htdig's database

2000-10-03 Thread Geoff Hutchison
At 10:40 AM +0100 10/3/00, Adam Rice wrote: Geoff Hutchison wrote: Second question -- can I use such a database from my own Perl script? Can you use a Berkeley DB? Sure, use the DBI interface--it should be part of your Perl 5 installation. No. Well, maybe there's a DBI driver

Re: [htdig] Last modified date revisited - Apache

2000-10-03 Thread Geoff Hutchison
request, and yet we can't find any answer in the apache docs or htdig. Since people seem to have this problem with increasing frequency, why don't you post some information about your Apache configuration? Include as much as you can. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] ... but not changed

2000-10-03 Thread Geoff Hutchison
If they're the same, it ignores the file. An author is maintaining that she added a link to a page and that an update run of htdig failed to follow the new link(s) she had added. Are these static or dynamic pages? If the server is not returning Last-Modified headers, then this could be the problem.

Re: [htdig] from IIS searching to HtDig

2000-10-03 Thread Geoff Hutchison
the same way. Yes, you can do this (and more) using just the search form to ht://Dig. To mirror the IIS behaviour, you'd want the "restrict" field. http://www.htdig.org/hts_form.html -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] Question about Htdig's database

2000-10-02 Thread Geoff Hutchison
) that works reasonably well. But it doesn't work under 3.2, though there are other approaches to using 3.2's databases through the htdump/htload programs. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Sort : Too many open files

2000-10-01 Thread Geoff Hutchison
of saving keystrokes, the command is actually 'ulimit.' Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] 3.2.0b2 and zlib

2000-09-28 Thread Geoff Hutchison
available. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] ht://Dig

2000-09-28 Thread Geoff Hutchison
to try out (or use) the package and ask for help if you have problems. But keep in mind that we're not a company--we're a group of volunteers. Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

[htdig] Re: htdig things

2000-09-28 Thread Geoff Hutchison
onyms" then htsearch won't use the database you created. Beyond that first step, it would be helpful to know what version of ht://Dig you're using, how it was installed (binary or did you compile it yourself), your OS, etc. Cheers, -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] 3.2.0b2 and zlib

2000-09-28 Thread Geoff Hutchison
nit here--this happened *after* 3.2.0b2 was released. You can certainly use htpurge in 3.2.0b2 in this fashion, but htmerge also still works the way it used to. But for 3.2.0b3 and later, you'll want to update your rundig script. (Obviously the one installed will be updated correctly.) -- -Geoff

Re: [htdig] 3.2.0b2 and zlib

2000-09-27 Thread Geoff Hutchison
On Wed, 27 Sep 2000, Erik Lyons wrote: Actually, I tried this. Configure and install completed, but then when I tried to run htdig it suffered an Arithmetic Exception and dumped core. I assumed this was because of --without-zlib, but perhaps not... (?) Sigh. You said you're running on

Re: [htdig] Searching in Subdirectories

2000-09-26 Thread Geoff Hutchison
will never find them. All it does is follow links (like any good web spider) from one page to the next, so if there aren't links, there won't be indexing. Granted, you can add as many URLs to the start_url as you want. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu
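The link-following behaviour described above can be sketched as a breadth-first traversal in Python. The site map below is a hypothetical in-memory stand-in for real pages, and crawl is an illustrative helper, not ht://Dig code.

```python
from collections import deque


def crawl(start_urls, links):
    """Follow links breadth-first from the start URLs; return every page reached."""
    seen = set(start_urls)
    queue = deque(start_urls)
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return seen


# Hypothetical site: each page maps to the pages it links to.
# Nothing links to /hidden/orphan.html.
site = {
    "/index.html": ["/docs/a.html"],
    "/docs/a.html": ["/docs/b.html"],
    "/docs/b.html": [],
    "/hidden/orphan.html": [],
}

found_default = crawl(["/index.html"], site)
found_extra = crawl(["/index.html", "/hidden/orphan.html"], site)
```

The unlinked page is only reached in the second call, where it is listed as an additional start URL, which corresponds to adding more URLs to start_url.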

[htdig] Re: htdig: Modified Date

2000-09-21 Thread Geoff Hutchison
/time the document was generated. Just thinking aloud: Can htdig translate META tags into $(MODIFIED)? I can easily "seed" each file with its system date... Funny you mention that... See ftp://ftp.ccsf.org/htdig-patches/3.1.5/SortMetaDate.0 -- -Geoff Hutchison Williams Students O

Re: [htdig] configuration question

2000-09-21 Thread Geoff Hutchison
or 3.0.8b1.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Including Pull-Down Menu Pages

2000-09-21 Thread Geoff Hutchison
"Links." If you want a nice exact URL, try: http://web3.w3.org/TR/html4/struct/links.html#h-12.3 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] configuration question

2000-09-20 Thread Geoff Hutchison
ter versed in the configuration that they're offering and might be able to explain this. As it is, I wonder if they're using a wrapper around the actual htsearch binary. You mention that this only happens when you should get the "No Match" page. I assume this means the rest of your modifications t

Re: [htdig] CSS style

2000-09-20 Thread Geoff Hutchison
the FAQ: http://www.htdig.org/FAQ.html#q4.2 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] configuration question

2000-09-20 Thread Geoff Hutchison
in the common directory. What is in that directory? Can you send a directory listing? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ And in the "htdig" file under the "www" folder, there aren't those pages either. So my first question is, in the

[htdig] Re: sunos 5.6 / htdig?

2000-09-14 Thread Geoff Hutchison
using a copy of gcc you compiled yourself, or is it a binary installation? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ At 1:14 PM -0400 9/13/00, support wrote: I thought i read in the htdig search/postings that you were having trouble with solaris. the compiler

Re: [htdig] no server running during indexing after arbitrary point

2000-09-14 Thread Geoff Hutchison
from stats.) Obviously you will want to set server_wait_time as appropriate. Users indexing other people's machines often set this at 30 sec. though in a multi-user setting htdig will alternate servers by default. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Reindexing single document

2000-09-14 Thread Geoff Hutchison
into the positions w/o the .work extension before you run the htmerge command. Does that make sense? htmerge -m $CONFDIR/htsingle.conf -c real.conf I'm running 3.1.5, if that matters. That's obviously newer than 3.1.0. :-) Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

[htdig] Re: Problems with htdig 3.2b2

2000-09-13 Thread Geoff Hutchison
index just that page if I have problems and if so, I'd turn up the debugging by running htdig -. Since you mention using 3.2.0b2, I'd also see if it might be a bug that was fixed in a later snapshot. -- -Geoff Hutchison Williams Students Online http://wso.wi

Re: [htdig] configure under BSD not finding fstream.h

2000-09-13 Thread Geoff Hutchison
and are included in CXXFLAGS. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] configure under BSD not finding fstream.h

2000-09-13 Thread Geoff Hutchison
that I don't have root access to?G) I can understand your complaint. Any way to discuss things with the person who installed the compilers? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] configure under BSD not finding fstream.h

2000-09-13 Thread Geoff Hutchison
ix= but in the 3.1.x series the paths are all in CONFIG, not the Makefiles. (Yes, this is fairly non-standard.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] htdig 3.2 beta will not show any results

2000-09-12 Thread Geoff Hutchison
on the version you're using. My guess is that you have a server misconfiguration if you can get results from the command-line. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] has anybody seen this before ?

2000-09-12 Thread Geoff Hutchison
ve checked for all of this and everything is fine. I just threw this in on the off chance this was somehow related to the errors I was seeing. No, but if you are having problems with this too, we can troubleshoot that, but let's take one problem per thread. :-) -- -Geoff Hutchison Williams Stude

Re: [htdig] a question

2000-09-12 Thread Geoff Hutchison
. This is the purpose of the restrict and exclude fields in the search form. They restrict searches to specific URL patterns (or exclude certain patterns from the search). See http://www.htdig.org/hts_form.html -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] configure under BSD not finding fstream.h

2000-09-12 Thread Geoff Hutchison
as a consequence you won't be able to use the compression_level attribute to compress document excerpts. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Another indexing external_parsers issue

2000-09-11 Thread Geoff Hutchison
At 12:40 PM +0200 9/11/00, Martin Mielke wrote: The problem is that running rundig gives an error like: sh: /usr/local/bin/conv_doc.pl: No such file or directory This usually means that the first line of the script (which points to the location of Perl) is incorrect. So perhaps

RE: [htdig] Another indexing external_parsers issue

2000-09-11 Thread Geoff Hutchison
... Fine, but remember you can also look at the output of a run with debugging turned on to see reasons for rejecting links. Of course it's faster to see if there are links in the first place, which I'm suggesting above. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] opening search site in a new window

2000-09-11 Thread Geoff Hutchison
in your 'common' directory. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] HtDig searcher in perl ?

2000-09-11 Thread Geoff Hutchison
/ directory as well. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] opening search site in a new window

2000-09-09 Thread Geoff Hutchison
to add "target" attributes to the links in the result templates. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Indexing

2000-09-09 Thread Geoff Hutchison
an explicit index.html with the appropriate links, or use the automatic directory listing features of most webservers. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

[htdig] Re: Antwort: Re: [htdig] [htdig] DB2 problem ...: missing or empty key value specified

2000-09-08 Thread Geoff Hutchison
significantly to code bloat. For another, there still wouldn't be a way to work out exactly what links are supposed to be indexed--we'd have to anticipate user action, etc. Better to just ignore it. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] -

2000-09-08 Thread Geoff Hutchison
On Fri, 8 Sep 2000, Jorge Becerra wrote: help Could you be a bit more specific? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] using ht://Dig with Apache Virtual hosts ?

2000-09-08 Thread Geoff Hutchison
exactly why the attribute exists.) See http://www.htdig.org/attrs.html#url_part_aliases url_part_aliases: http://www.xyz.com/ *1 v. (in the other config file) url_part_aliases: https://secure.xyz.com/ *1 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] index always scores 100

2000-09-07 Thread Geoff Hutchison
backlink_factor are probably the reason you're getting these "phantom" matches. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] htdig 3.1.5 indexing

2000-09-07 Thread Geoff Hutchison
result in bugs that cannot be fixed easily. But you have partial matching with the prefix and substring algorithms already: http://www.htdig.org/attrs.html#search_algorithm -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] external_parsers: ignored

2000-09-07 Thread Geoff Hutchison
On Thu, 7 Sep 2000, Klaus Gröger wrote: external_parsers: "application/pdf; charset=iso-8859-1" "/usr/share/htdig/parse_doc.pl" Hmm. Does it work if you just have "application/pdf;"? I'm assuming the ExternalParser code thought that the semicolon was par

Re: [htdig] the -l option

2000-09-07 Thread Geoff Hutchison
the log file if htdig is interrupted. So if you start the command and then kill the job/process, then a log file will be created and read in on the next go-round. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] [htdig] DB2 problem ...: missing or empty key value specified

2000-09-07 Thread Geoff Hutchison
her things. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

RE: [htdig] Problems indexing our intranet

2000-09-07 Thread Geoff Hutchison
into your Apache server. In any case, my point is that if you're having problems doing it with a browser, it's a server issue, not an issue with htdig. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Maximum url character size

2000-09-07 Thread Geoff Hutchison
? If Sure. You can customize all of the output. See http://www.htdig.org/FAQ.html#q4.2 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

RE: [htdig] the -l option

2000-09-07 Thread Geoff Hutchison
On Thu, 7 Sep 2000, John Dispirito wrote: providing the kill is with a -15 and not a 9, correct? The code explicitly traps for INT, QUIT, TERM and HUP signals. So yes, a KILL signal would kill the process outright. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu
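The distinction between trappable signals and KILL can be shown with a short Python sketch for POSIX systems (kill -15 sends TERM, kill -9 sends KILL). This is only an analogy to the behaviour described above, not ht://Dig's own code; the names remember_signal and can_trap are hypothetical.

```python
import signal

interrupted = []


def remember_signal(signum, frame):
    """Record the signal so a main loop could save its state and exit cleanly."""
    interrupted.append(signum)


# The four signals the message says are trapped.
TRAPPED = [signal.SIGINT, signal.SIGQUIT, signal.SIGTERM, signal.SIGHUP]
for sig in TRAPPED:
    signal.signal(sig, remember_signal)


def can_trap(sig):
    """Return True if a handler can be installed for this signal."""
    try:
        signal.signal(sig, remember_signal)
        return True
    except (OSError, ValueError):
        return False


# The operating system refuses handlers for SIGKILL, so a process
# killed with -9 gets no chance to write anything out.
kill_is_trappable = can_trap(signal.SIGKILL)
```

A process that receives TERM can run its handler and write out a log before exiting; one that receives KILL is stopped by the kernel with no handler involved.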

Re: [htdig] external_parsers: ignored

2000-09-06 Thread Geoff Hutchison
it doesn't have pdf_parser: set in it somewhere? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] index always scores 100

2000-09-06 Thread Geoff Hutchison
, if it doesn't go away, please send me some debugging output and I'll take a look at what's going on. Alternatively, I could try indexing your site myself to see if the problem is reproducible. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

RE: [htdig] Problems indexing our intranet

2000-09-06 Thread Geoff Hutchison
a subdirectory link in your browser. Does it look like a listing of files with links to them? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] url_part_aliases

2000-09-06 Thread Geoff Hutchison
about his code because I couldn't see it and vice-versa. N.B. This shouldn't be an issue in 3.2 since the databases are keyed differently, but I'll make sure to test this. In any case, it would be very useful to get platform information to winnow down the possibilities. -- -Geoff Hutchison

Re: [htdig] Bad words

2000-09-06 Thread Geoff Hutchison
will be read and htsearch will allow all words to go through. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Bad words

2000-09-06 Thread Geoff Hutchison
On Wed, 6 Sep 2000, Vishal Shah wrote: oh ok. so what do I have to do to make htsearch read that bad words file? I do not want htsearch to be searching on those words in the bad words file. See http://www.htdig.org/attrs.html#bad_words_list bad_words_list: /path/to/a/file.txt -- -Geoff

Re: [htdig] Using a different program for digging

2000-09-06 Thread Geoff Hutchison
in a result. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Digging problem, probably css-related?

2000-09-05 Thread Geoff Hutchison
omatically *ignores* script code, where 3.1.x needs to be told to ignore it with the noindex_start tags. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Re: [htdig3-dev] htdig 3.1.5 indexing

2000-09-05 Thread Geoff Hutchison
using? The messages are fairly minor, but I'm sure you'd prefer they didn't fill up your error log! -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] htdig 3.1.5 indexing

2000-09-05 Thread Geoff Hutchison
. thanks ;). -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] index always scores 100

2000-09-05 Thread Geoff Hutchison
see what I mean. I've tried deleting the databases and indexing again, but I still got the same results If you continue to have problems, could you send me the output from running htdig - off-list (to cut down on bandwidth). I'll see if I can find anything in that. -- -Geoff Hutchiso

Re: [htdig] htdig 3.1.5 indexing

2000-09-05 Thread Geoff Hutchison
? That would be a good idea--I wonder if there's a difference between the libdb that the code expects and the version on SuSE 6.4. The source version includes the db code to link statically. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Installation material -- 3.1.5

2000-09-05 Thread Geoff Hutchison
On Tue, 5 Sep 2000 [EMAIL PROTECTED] wrote: I've got 3.1.5 up and running. Due to disk-space considerations, am considering removal of the "installation" folder structure. Does not appear to me that this participates in routine execution of htdig . . . This is correct.

Re: [htdig] Digging problem, probably css-related?

2000-09-04 Thread Geoff Hutchison
probably don't want to index CSS files, so I'd add .css to bad_extensions in your config file: http://www.htdig.org/attrs.html#bad_extensions -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] indexing files in directories

2000-09-04 Thread Geoff Hutchison
automatic directory indexes (e.g. Apache's FancyIndex in mod_autodir). The current 3.2 development code has support for file:// URLs, but as of now, these do not make their own directory lists either. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Problems indexing our intranet

2000-09-04 Thread Geoff Hutchison
directory indexes, then you don't need to have index.html files for each subdirectory. Otherwise, you'll need to make index.html files to link to the files you want. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Digging problem, probably css-related?

2000-09-04 Thread Geoff Hutchison
it. I don't understand at all ... Perhaps I don't understand what you're trying to do. Those messages certainly indicate that the .css file isn't indexing. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/

Re: [htdig] Problems indexing our intranet

2000-09-04 Thread Geoff Hutchison
rather than a directory, it will only index that one URL. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/
