Re: [htdig] more problems with CONFIG

2001-01-22 Thread Geoff Hutchison
out what's going wrong. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail

Re: [htdig] latest 2.95.2

2001-01-22 Thread Geoff Hutchison
On Mon, 22 Jan 2001, Ronald Edward Petty wrote: um... where is the latest 2.95.2 , the highest number i see is 2.91... The latest stable releases of libstc++ come with the compiler itself. Download versions of GCC from GNU mirrors or from: http://gcc.gnu.org/releases.html -- -Geoff Hutchison

Re: [htdig] How Long???

2001-01-22 Thread Geoff Hutchison
and 64MB of RAM is naturally going to take a while. (BTW, things scale more on the number of URLs, not really the amount of data.) Remember if you're hitting the network it also depends a lot on your bandwidth, network congestion, etc. -- -Geoff Hutchison Williams Students Online http

Re: [htdig] External site list line termination

2001-01-21 Thread Geoff Hutchison
. Are there any special things at all required in the external URL file? Nope. It just needs to be a whitespace-separated list. (One per line usually looks nice to us humans.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from

Re: [htdig] more problems with CONFIG

2001-01-21 Thread Geoff Hutchison
he full HTTP headers. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html

Re:[htdig] db.docdb db.wordlist

2001-01-21 Thread Geoff Hutchison
://www.htdig.org/RELEASE.html Did you know that there are RPMs available? http://www.htdig.org/files/binaries/ If you run htdig -v by itself, what does it say? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig

Re: [htdig] multiple run instances?

2001-01-20 Thread Geoff Hutchison
copy isn't writing to a database at the same time. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives

Re: [htdig] Htdig 3.20b3 -- installation problems.

2001-01-20 Thread Geoff Hutchison
While this is fairly off-topic, I can say that I have never seen zlib miscompile. I'd make sure you get a copy from a reasonable source and make sure it transfers using binary. What version of gcc are you using? (I ask because you're using a pretty old kernel.) -- -Geoff Hutchison Williams Students

Re: [htdig] Strategy for a dynamic site

2001-01-20 Thread Geoff Hutchison
? This is pretty reasonable. If you don't want to index the "bogus pages," make sure they have META robots tags in them, e.g. META name="robots" content="follow,noindex" -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ --

Re: [htdig] Multiple kinds of links?

2001-01-20 Thread Geoff Hutchison
: in your indexing config: url_part_aliases: http://www.foo.com/bogus/ *1 in your searching config: url_part_aliases: http://www.foo.com/normal/ *1 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send

Re: [htdig] Search words seem to become truncated

2001-01-20 Thread Geoff Hutchison
re is a maximum word length for indexing. See http://www.htdig.org/attrs.html#maximum_word_length -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive

Re: [htdig] Plural, singular

2001-01-20 Thread Geoff Hutchison
e .aff files for Spanish? (See, for example the notes in the FAQ http://www.htdig.org/FAQ.html#q4.10 The endings algorithm should generate root words from words with endings. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe fro

Re: [htdig] How to exclude particular files from indexing?

2001-01-19 Thread Geoff Hutchison
On Fri, 19 Jan 2001, Martin Mielke wrote: how do I tell ht:/Dig not to index particular filenames (indexint.html and userdata.dat) ? See exclude_urls http://www.htdig.org/attrs.html#exclude_urls -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] append database problem

2001-01-19 Thread Geoff Hutchison
documents will be reindexed. However, once it has updated the URLs, it will go through any new URLs listed in the start_url. So I would be very much surprised if it "never" got to the new URLs. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ ---

Re: [htdig] Duplicate results with directories some questions/bugs?

2001-01-19 Thread Geoff Hutchison
and I do not need to write all the subdirectories in the conf file. If you do not supply a password to htdig, it will be rejected when it attempts to access password-protected areas. So if parts of your site are protected, you will need to supply the password for them to be indexed. -- -Geoff Hutchison

Re: [htdig] rundig error

2001-01-19 Thread Geoff Hutchison
tories set in the CONFIG file. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdi

Re: [htdig] Problem compiling?

2001-01-19 Thread Geoff Hutchison
On Fri, 19 Jan 2001, htdighelp wrote: Is this error a light one while compiling? I doubt you'll ever see this while compiling the program itself, though you may see it when indexing. It is not a fatal error, but usually indicates some form of database corruption. -- -Geoff Hutchison Williams

[htdig] Re: Htdig ?

2001-01-18 Thread Geoff Hutchison
for your answer Yours. Kamel FRANCE If you can output a text-file list of URLs, it's quite simple to insert this list into the config file, e.g. start_url: `/path/to/url.file` -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Spelling Help

2001-01-18 Thread Geoff Hutchison
code that produces a list of suggestion words from an input, we can probably port it. 2)Can anybody recommend a _good_ (UK English) spell checker for IRXIX 6.5? Yes. Try ispell with the UK dictionaries. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Changing sites list midway

2001-01-18 Thread Geoff Hutchison
with -l. (This is now the default behavior in the 3.2 code.) But this doesn't mean that changes to the exclude_urls or other limits will change the current list of URLs. But if you just want to update the list of starting URLs, this will probably work OK for you. -- -Geoff Hutchison Williams

Re: [htdig] Merging two databases

2001-01-18 Thread Geoff Hutchison
two in one command-line call. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail

Re: [htdig] Indexing a given list of file

2001-01-18 Thread Geoff Hutchison
. It's the new "-m" flag to htdig--it should work as advertised, though I haven't tested it recently. http://dev.htdig.org/htdig-3.2/htdig.html -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig ma

Re: [htdig] Other search tools?

2001-01-18 Thread Geoff Hutchison
to htsearch. But also remember that you can easily change the style of output to fit your needs. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You

Re: [htdig] HOWTO? setp-by-step?

2001-01-18 Thread Geoff Hutchison
but can they be read by the webserver user? Can the directories leading up to the databases be read by the webserver user? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECT

Re: [htdig] Problems with the page dates.

2001-01-17 Thread Geoff Hutchison
not return a Last-Modified: header for the URL. This frequently happens with CGIs or other server-generated content (e.g. SSI, ASP, JSP...). Try using the modification_time_is_now attribute, which uses the current time for the date. http://www.htdig.org/attrs.html#modification_time_is_now

RE: [htdig] Unable to contact server-revisisted

2001-01-17 Thread Geoff Hutchison
what you want to do, though perhaps a config attribute is needed for this. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm

Re: [htdig] problem indexing a site

2001-01-17 Thread Geoff Hutchison
imeout set in the connection code and the 3.1.5 version should be good about killing connections if they timeout. How long is "a long while?" -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send

Re: [htdig] config-questions: special characters and search restriction

2001-01-17 Thread Geoff Hutchison
3.2 code keeps track of where words were found, at the moment there isn't any htsearch code to restrict searches in this manner. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [

Re: [htdig] solaris 2.6 and htdig 3.1.5

2001-01-17 Thread Geoff Hutchison
configuration/installation questions are better off asked on the gcc mailing list or newsgroups. See for example, http://gcc.gnu.org/ -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message

RE: [htdig] problem indexing a site

2001-01-17 Thread Geoff Hutchison
If you aren't using port 80, you will need to set this in the start_url, e.g.: start_url: http://www.foo.com:81/ Cheers, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ On Wed, 17 Jan 2001, Elsa Chan wrote: It just hangs for 10 to 15 minutes. If port 80 is not what

Re: [htdig] List of search words

2001-01-17 Thread Geoff Hutchison
/attrs.html#logging which will log queries via syslog. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List

RE: [htdig] problem indexing a site

2001-01-17 Thread Geoff Hutchison
On Wed, 17 Jan 2001, Elsa Chan wrote: 1:0:http://www.site.com New Server www.site.com , 80 I think we need to see your config file--if you did change your htdig.conf, then you have done it in a manner that htdig does not recognize. -- -Geoff Hutchison Williams Students Online http

Re: [htdig] environment problems

2001-01-17 Thread Geoff Hutchison
libraries you linked in), you'll need to install them on this server too. You'll also probably need to set the environment variable LD_LIBRARY_PATH to include these libraries if they're in different locations than on the compiling machine. -- -Geoff Hutchison Williams Students Online http

Re: [htdig] basic script

2001-01-17 Thread Geoff Hutchison
_dir: http://www.htdig.org/confindex.html -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdi

Re: [htdig] document contains no data

2001-01-16 Thread Geoff Hutchison
ontains no data" message? 3) Any ideas on resolution? Can you run the htsearch program from the command line? What do you see in the server error log after these requests? And of course, what version are you running? -- -Geoff Hutchison Williams Students Online htt

Re: [htdig] document contains no data

2001-01-16 Thread Geoff Hutchison
, you'll have a password in plaintext either in your config file or in a script or cron job. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive

Re: [htdig] htdig 3.2.0 under Solaris

2001-01-16 Thread Geoff Hutchison
a consequence of the automake-generated Makefiles. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http

Re: [htdig] Multiple same results

2001-01-16 Thread Geoff Hutchison
) and multiple *identical* URLs, which would be a bug. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives

Re: [htdig] htdig ignores *.doc file extension

2001-01-15 Thread Geoff Hutchison
linking to these .doc or .pdf files, it doesn't list the link? Or do you not have any documents linking to these files? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL

Re: [htdig] static compilation of htdig

2001-01-15 Thread Geoff Hutchison
even link against libc statically, you'll have to edit the Makefiles. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm

Re: [htdig] htdig ignores *.doc file extension

2001-01-15 Thread Geoff Hutchison
it ignores the Windows 2000 server. Are these also password-protected? If so, are they using the "Basic" authentication scheme? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send

Re: [htdig] NEED HELP with indexing

2001-01-15 Thread Geoff Hutchison
ions "alt" somewhere in it, try running "rundig -a" which will update the databases using alternate .work files. That should get you started in the right direction. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsub

Re: [htdig] how do you index local pages in 3.1.5?

2001-01-15 Thread Geoff Hutchison
directory listings if this is something very important to you. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List

Re: [htdig] more solaris problems

2001-01-15 Thread Geoff Hutchison
compile on Solaris, so I would normally think it's some sort of compiler error, though most of these are usually picked up by the configure script. I did as -version and it is gnu... so um what is wrong ... any ideas? No, but someone on the gcc mailing list might have some. -- -Geoff Hutchison

Re: [htdig] Numbers

2001-01-14 Thread Geoff Hutchison
At 11:05 AM -0600 1/12/01, htdighelp wrote: I've added the allow_numbers directive in the conf file yet searching for numbers is And have you reindexed? You may be allowing numbers in the search, but if there aren't numbers in the databases, you won't have much luck. -- -Geoff Hutchison

Re: [htdig] 3.2v?

2001-01-12 Thread Geoff Hutchison
At 9:47 AM +0100 1/12/01, Emilio Bueso wrote: When will the 3.2 version of ht://Dig be available? Betas are available now--the 3.2.0b3 release will be made fairly soon. When will 3.2.0 be ready? When it's finished. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] prepare a search index for a different URL

2001-01-12 Thread Geoff Hutchison
to read: url_part_aliases: http://www.customersdomain.com/ *1 This will encode the URLs when indexing and make sure a different pattern is there for decoding on the customer's end. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

RE: [htdig] 3.2v?

2001-01-12 Thread Geoff Hutchison
. But there will be ways to update indexes after this point. Reade the htdoc/RELEASE.html and htdoc/upgrade.html files for instructions. When will 3.2.0 be ready? When it's finished. A long way to go?:) I'd guess we need at least one more beta before aiming at 3.2.0. -- -Geoff Hutchison

Re: [htdig] how to set the $(PERCENT)? -it always show 1%

2001-01-12 Thread Geoff Hutchison
://www.htdig.org/files/snapshots/ -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org

Re: [htdig] htdig ignores *.doc file extension

2001-01-12 Thread Geoff Hutchison
output. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ

Re: [htdig] Grouping words

2001-01-12 Thread Geoff Hutchison
On Fri, 12 Jan 2001, Jason Meyering wrote: Is it possible to group words in a search via quotes or anything? For See the FAQ: http://www.htdig.org/FAQ.html#q1.9 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from

RE: [htdig] htdig

2001-01-11 Thread Geoff Hutchison
suspect you have an infinite loop or close to that. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives

RE: [htdig] htdig

2001-01-11 Thread Geoff Hutchison
No regular expressions needed. You can limit URLs based on query patterns already. See the bad_querystr attribute: http://www.htdig.org/attrs.html#bad_querystr -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ On Thu, 11 Jan 2001, Richard Bethany wrote: Geoff, I'm

Re: [htdig] dig taking forever

2001-01-11 Thread Geoff Hutchison
ve you set the server_wait_time attribute or anything of that sort? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archi

Re: [htdig] htdig

2001-01-10 Thread Geoff Hutchison
big are the databases--did they fill the disks or get larger than 2GB each? You might try running htdig alone with the command-line flags "-v -i" which will create the databases from scratch (-i) and provide a progress report as it goes (-v). -- -Geoff Hutchison Williams Students O

Re: [htdig] installing as a user..... possible?

2001-01-10 Thread Geoff Hutchison
At 8:38 PM -0800 1/10/01, Carlos Ramirez wrote: Just as long as you have access to a c++ compiler you should be able build it. And you have some way of running the htsearch CGI as a "normal user." On many web servers, CGIs must be installed by root. YMMV. -- -Geoff Hutchison William

Re: [htdig] Unable to specify config-file ?

2001-01-09 Thread Geoff Hutchison
On Tue, 9 Jan 2001, Trond Arve Nordheim wrote: Unable to read configuration file '/usr/local/conf/testhtdig.conf' What's this? Check your form, you have more than one field named "config"--so htsearch is catenating the fields. -- -Geoff Hutchison Williams Students O

[htdig] Re: htdig.org outages

2001-01-09 Thread Geoff Hutchison
://www.htdig.org/mirrors.html Thanks for your understanding, -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List

Re: [htdig] Merging two databases

2001-01-09 Thread Geoff Hutchison
On Tue, 9 Jan 2001, Peterman, Timothy P wrote: I have a related question. Can I merge more that two databases at a time? Not at the moment. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list

Re: [htdig] PDFs, numbers, and percent signs

2001-01-09 Thread Geoff Hutchison
there? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ

Re: [htdig] Document contains no data

2001-01-09 Thread Geoff Hutchison
assume it fixes the one you're referring to. (One bug in 3.2.0b2 is that it requires the .weakcmpr file to be writable even for htsearch.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send

Re: [htdig] uninstalling htdig

2001-01-09 Thread Geoff Hutchison
ust remove the files and it's gone. No "registry" to change.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm

Re: [htdig] keep temp files while running indexer? How to...

2001-01-09 Thread Geoff Hutchison
script in the contrib/ section (I'm pretty sure it's in the releases, but it's definitely on the FTP server.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED

Re: [htdig] How to put the brakes on htDig - it's crashing ourserver

2001-01-07 Thread Geoff Hutchison
At 11:30 AM +0100 1/7/01, SMantscheff wrote: Is there a way to put a brake on htDig so that it pauses between requests? Yes. See server_wait_time: http://www.htdig.org/attrs.html#server_wait_time -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] How to put the brakes on htDig - it's crashing ourserver

2001-01-07 Thread Geoff Hutchison
At 11:11 AM -0500 1/7/01, Phil Barnett wrote: On 7 Jan 2001, at 8:43, Geoff Hutchison wrote: At 11:30 AM +0100 1/7/01, SMantscheff wrote: Is there a way to put a brake on htDig so that it pauses between requests? Yes. See server_wait_time: http://www.htdig.org/attrs.html

Re: [htdig] server_wait_time

2001-01-07 Thread Geoff Hutchison
conds even with the interleaving.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/men

Re: [htdig] robotstxt_name

2001-01-07 Thread Geoff Hutchison
this will affect how htdig considers robot-specific directives in the robots.txt file. This is useful, for example, when you want to make directives specific to your copy of htdig, etc. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] restrict and exclude more than one pattern

2001-01-07 Thread Geoff Hutchison
tent="url1|url2|url3" (so results would have to match url1 OR url2 OR url3. The same works for exclude, but AND'ing is not possible in the 3.1 code. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdi

[htdig] Re: can ht://dig run with iPlanet web server

2001-01-07 Thread Geoff Hutchison
At 8:13 PM -0800 1/7/01, Edward Lu wrote: Thank you very much for your reply! one more question. Can ht://dig run with iPlanet web server? Looking forward your reply. Thanks! -Ed I don't know of anyone that's reported that, but I doubt it would be a problem. -- -Geoff Hutchison Williams

Re: [htdig] 3.1.5 engine on 3.1.3 db

2001-01-05 Thread Geoff Hutchison
. to create the databases, why don't you just replace htsearch at the same time? (Perhaps I miss what you're saying.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED

Re: [htdig] 3.1.5 engine on 3.1.3 db

2001-01-05 Thread Geoff Hutchison
. Granted I was never sure whether Solaris x86 was big or little-endian. But the root question is: Why are you having problems compiling 3.1.5 on IRIX? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing

Re: [htdig] https links being rejected - help.

2001-01-04 Thread Geoff Hutchison
be support at some point in the 3.2 code, though this does not exist at the moment. A patch exists to the 3.1.x code to use the OpenSSL library to support HTTPS. This can be found at ftp://ftp.ccsf.org/htdig-patches/ -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] multiple dbases from one search form?

2001-01-04 Thread Geoff Hutchison
in the current code. This is supported in the 3.2 code, though it has received minimal testing (at best) at the moment and not much documentation exists there either. (Partly since the htsearch code is badly in need of a rewrite.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] questions

2001-01-04 Thread Geoff Hutchison
. That's what they called it and Apache started as a patched httpd, so it's stuck around. -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message

Re: [htdig] what does --host in configure do?

2001-01-04 Thread Geoff Hutchison
.2 would set (if it does anything), but I can't imagine it would do much that CFLAGS and CXXFLAGS shouldn't do anyway. Configure is complaining because it's definitely *not* a standard cpu-company-system -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] A little automation

2001-01-04 Thread Geoff Hutchison
Perl stuff in the ConfigDig project can be used from different frontends. (I believe an X/Tk frontend was in the works.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL

Re: [htdig] multiple dbases from one search form?

2001-01-04 Thread Geoff Hutchison
do it with v. 3.1.5? Not really. Use the config field in the search form if you wish to pick a different config file--using flags through the CGI is insecure (at best). -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from th

Re: [htdig] Remote Request?

2001-01-02 Thread Geoff Hutchison
something else in mind? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail

Re: [htdig] time spent for specific search

2001-01-02 Thread Geoff Hutchison
htsearch. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ

Re: [htdig] Indexing a database

2001-01-02 Thread Geoff Hutchison
to suit as htDig input. Can I do this? Sure. Use an external parser: http://www.htdig.org/attrs.html#external_parser -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL

Re: [htdig] Remote Request?

2001-01-02 Thread Geoff Hutchison
e familiar with using rsh in this fashion, but it's insecure.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ At 10:46 AM -0600 1/2/01, htdighelp wrote: In this fashion, you'd probably want to index on the DB server itself and then transfer the ht://Dig databases if you wanted.

Re: [htdig] unix directory index duplication

2001-01-02 Thread Geoff Hutchison
FancyIndexing turned on. You can either turn off the column-heading links, or you can do something like this: exclude_urls: ?M=D ?D=A ?D=D [...] -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list

Re: [htdig] Indexing problems

2001-01-02 Thread Geoff Hutchison
-Since: header? (Or put another way, are many of these documents server-parsed or other dynamic content?) If so, try setting the modification_time_is_now attribute: http://www.htdig.org/attrs.html#modification_time_is_now -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Options to htdig

2000-12-30 Thread Geoff Hutchison
e lot of explanation that the files will only be renamed if $alt is set. Just my $0.02 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a

Re: [htdig] Search error - Copying of text...

2000-12-29 Thread Geoff Hutchison
On Thu, 28 Dec 2000, Dudley Jane wrote: Error: Copying of text from this document is not allowed. Is this from trying to index a pdf file? That would be my initial guess. What program are you using to parse or convert PDF files? -- -Geoff Hutchison Williams Students Online http

Re: [htdig] Options to htdig

2000-12-27 Thread Geoff Hutchison
t.) For an example of how this might be useful, you can see my rundig.sh script in the contributed section of the website (along with several other scripts). http://www.htdig.org/contrib/ -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ ---

Re: [htdig] Re: Help deciphering error message

2000-12-27 Thread Geoff Hutchison
On Wed, 27 Dec 2000, Allan Trick wrote: ld.so.1: htsearch: fatal: libstdc++.so.2.8.1.1: open failed: No such file or directory See the FAQ: http://www.htdig.org/FAQ.html#q5.7 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Duplicates

2000-12-27 Thread Geoff Hutchison
he 3.2.0b3 code and look at the RELEASE.html file in it. There is now code to compute an md5 checksum to eliminate this problem. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message

[htdig] Re: * Web Crawler Help!** (PR#982)

2000-12-23 Thread Geoff Hutchison
, it can take more than one URL). Index. Use htsearch and you'll have an index of all six sites--if you want to restrict the search, use the "restrict" and/or "exclude" fields in the search form. -- -Geoff Hutchison Williams Students Online htt

Re: [htdig] Directory Argument to htdig

2000-12-22 Thread Geoff Hutchison
). -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http

Re: [htdig] newbie question, sorry

2000-12-22 Thread Geoff Hutchison
On Thu, 21 Dec 2000, Max Trummer wrote: how can i get htdig to search the files under DocumentRoot rather than only searching what it finds via links? It can only discover URLs that you give in the start_url attribute (which can be more than one) or via links. -- -Geoff Hutchison Williams

Re: [htdig] Titles in search Results.

2000-12-22 Thread Geoff Hutchison
On Fri, 22 Dec 2000, Vladislav Klimov wrote: Can htdig use information in h2 tags as name of page in search results? Not by default. You can edit the HTML.cc file (specifically the do_tag procedure) to change this if you need to. -- -Geoff Hutchison Williams Students Online http

[htdig] Re: Htdig - hint

2000-12-21 Thread Geoff Hutchison
then make a new config file, say search.conf: url_part_aliases: www.foo.com *1 -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ At 4:47 PM +0100 12/20/00, CED Città di Arzignano (Giorgio Roncolato) wrote: I installed and test your powerful htdig search engine. But I have

Re: [htdig] query string

2000-12-20 Thread Geoff Hutchison
On Wed, 20 Dec 2000, Dave Salisbury wrote: Is this endemic of this version, or am i doing something wrong?? The former: http://www.htdig.org/FAQ.html#q5.15 http://www.htdig.org/RELEASE.html -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Indexing Restricted Pages

2000-12-20 Thread Geoff Hutchison
://www.htdig.org/attrs.html#authorization -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http

Re: [htdig] Indexing Restricted Pages

2000-12-20 Thread Geoff Hutchison
e in front of you.) -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/men

Re: [htdig] Going for the big dig

2000-12-19 Thread Geoff Hutchison
other search engine either. True, though other search engines usually also ignore certain patterns (e.g. cgi-bin). I also heavily use the META robots tag, though these are not as old a standard and sometimes are still ignored. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu

Re: [htdig] Going for the big dig

2000-12-19 Thread Geoff Hutchison
mentioned earlier. There are also simple "link checker" scripts which can give you a count of the number of URLs on a site. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send

Re: [htdig] Data mining on Htdig DB

2000-12-19 Thread Geoff Hutchison
needs? -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ

Re: [htdig] Going for the big dig

2000-12-18 Thread Geoff Hutchison
e hosts without some careful checks. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdi

  1   2   3   4   5   6   7   8   9   10   >