RE: [htdig] locale:ru on Solaris

2000-12-13 Thread Eldar Imangulov

Hello!

Thanks for your will to help me.

Here it is the core.



Regards,
Eldar Imangulov
project manager (design  hosting)
[EMAIL PROTECTED]
phone/fax.: +7 095 777.09.10

Global Chance
Bld.1, 42 Bolshaya Yakimanka st.,
Moscow 117049 Russia

//  -Original Message-
//  From: Gilles Detillieux [mailto:[EMAIL PROTECTED]]
//  Sent: Tuesday, December 12, 2000 8:33 PM
//  To: Eldar Imangulov
//  Cc: [EMAIL PROTECTED]
//  Subject: Re: [htdig] locale:ru on Solaris
//  
//  
//  According to Eldar Imangulov:
//   I'm useing Solaris 7
//   
//   I made the htDig and now I try to make search my site in russian
//   (windows-1251).
//   
//   in htdig.conf I said the
//   locale : ru
//   
//   The website indexing is going well but the htsearch does not work
//   (coredump).
//   
//   But without russian language (indexing by default = without 
//  locale:ru)
//   indexing  htsearch works well togather.
//   
//   What is the problem???
//  
//  Hard to say, but from what you describe it sounds like a 
//  problem with the
//  locale tables for your locale, or a database corruption problem of some
//  sort, perhaps.  Could you give us a stack backtrace of htsearch's core
//  dump to narrow things down a bit?
//  
//  See the latter part of http://www.htdig.org/FAQ.html#q5.14
//  
//  -- 
//  Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
//  Spinal Cord Research Centre   WWW:
//  http://www.scrc.umanitoba.ca/~grdetil
//  Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
//  Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930
//  
 core.zip


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html


[htdig] Htdig as external Link Checker? (Maybe off-topic)

2000-12-13 Thread Reich, Stefan

Hi community,

I need to generate a List for my boss, which contains all external Links of
our Web-Site (which gets already indexed by htdig) including the status
(means if the target of this link exists or not)

Can HTDIG help me with this by:

1. Create a List of external URLs (all URLs, which HTDIG finds during
indexing, but doesn't follow because of the restrict URL config). I could
use this list by some other tools like wget to check the connection to this
links

or (the preferred way)

2. Can HTDIG provide me with a list of broken external links?

Any ideas?

tnx

  Stefan


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




[htdig] Hi, I really need your help!

2000-12-13 Thread Sean Harris

Dear Sir or Madam,

I want to use htdig to index my personnal website.  i succeeded in
using htdig to gather the files and htmerge to establish the datebase.
Even i succeeded in searching by using English words,  such as "good".
BUT, if i use a chinese word to search for, IT FAILS!

I really need your help.  If you know how the htsearch search the database
, please tell me!


Thank you very much!


Sean






To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




[htdig] Hi, need help with searching database.

2000-12-13 Thread Akshay Guleria

Hi Everyone,

I'll need help with htdig.

I just installed Redhat7.0 on my machine. And then installed htdig rpm. I
can see the page
http://myhost/htdig/ which is the search page.
I make a search and for any search I make, it returns a page saying
"No matches found for ... "

Now, I ran rundig and it increased the file sizes in /var/lib/htdig. So, I
presume the database was created. And then I ran htmerge. But I still get
the
"No matches found .." page.

What am i missing?
Any help will be appreciated.

Thanx,
Akshay



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] Installation Problem under HPUX 10.20

2000-12-13 Thread Intekhab Choudhury

I have installed gcc using swinstall utility that
comes with HPUX

% swinstall -s /tmp/gcc-2.95.2-sd-10.20.depot 

and swinstall analyzed the packaged and installed
it (no error).  I am not a C/C++ programmer, so did
not try to compile any other program, but I
encountered the same problem with another machine
running the same version of HP-UX 10.20.  

Pardon my ignorance, but I went to the include dir
under /opt/gcc and ran make stdio.h and some other C
library, and they do not respond with error.  

For example, for %make stdio.h (output is 'stdio.h' is
up to date) and same result for other *.h file 

No, I did not do "make boostrap" (do you mean make
bootstrap) .  Because I used swinstall which I use for
all kinds of installation and worksout nicely all the
time.

Regards,

Intekhab




--- Geoff Hutchison [EMAIL PROTECTED] wrote:
 I have gone to gnu's website and reinstalled
 gcc-2.95.2-sd-10.20.depot.gz, even rebooted :-p 
 but
 no luck yet.  Any pointer?
 
 Have you tried compiling other programs with this
 compiler? When you 
 installed gcc, did you do it as a "make boostrap?"
 
 The configure script basically tries to compile
 "hello world" and if 
 it didn't work, then there's something wrong with
 the compiler. You 
 can get more info from the config.log file.
 
 --
 -Geoff Hutchison
 Williams Students Online
 http://wso.williams.edu/


__
Do You Yahoo!?
Yahoo! Shopping - Thousands of Stores. Millions of Products.
http://shopping.yahoo.com/


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] locale:ru on Solaris

2000-12-13 Thread Gilles Detillieux

Sorry, but there's absolutely nothing I can do with the core file itself,
as I don't have a Solaris system.  What I want is for you to get a stack
backtrace using your htsearch executable and your core dump file using
your debugger.  Either that or run htsearch directly under your debugger,
and when it fails get the stack backtrace directly from the in-memory copy
of the program.  If you use gdb, the procedure is described in the FAQ.
If you use another debugger, you'll need to figure out how to do it with
that debugger.

According to Eldar Imangulov:
 Hello!
 
 Thanks for your will to help me.
 
 Here it is the core.
 
 
 
 Regards,
 Eldar Imangulov
 project manager (design  hosting)
 [EMAIL PROTECTED]
 phone/fax.: +7 095 777.09.10
 
 Global Chance
 Bld.1, 42 Bolshaya Yakimanka st.,
 Moscow 117049 Russia
 
 //  -Original Message-
 //  From: Gilles Detillieux [mailto:[EMAIL PROTECTED]]
 //  Sent: Tuesday, December 12, 2000 8:33 PM
 //  To: Eldar Imangulov
 //  Cc: [EMAIL PROTECTED]
 //  Subject: Re: [htdig] locale:ru on Solaris
 //  
 //  
 //  According to Eldar Imangulov:
 //   I'm useing Solaris 7
 //   
 //   I made the htDig and now I try to make search my site in russian
 //   (windows-1251).
 //   
 //   in htdig.conf I said the
 //   locale : ru
 //   
 //   The website indexing is going well but the htsearch does not work
 //   (coredump).
 //   
 //   But without russian language (indexing by default = without 
 //  locale:ru)
 //   indexing  htsearch works well togather.
 //   
 //   What is the problem???
 //  
 //  Hard to say, but from what you describe it sounds like a 
 //  problem with the
 //  locale tables for your locale, or a database corruption problem of some
 //  sort, perhaps.  Could you give us a stack backtrace of htsearch's core
 //  dump to narrow things down a bit?
 //  
 //  See the latter part of http://www.htdig.org/FAQ.html#q5.14


-- 
Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] Htdig as external Link Checker? (Maybe off-topic)

2000-12-13 Thread Gilles Detillieux

According to Reich, Stefan:
 I need to generate a List for my boss, which contains all external Links of
 our Web-Site (which gets already indexed by htdig) including the status
 (means if the target of this link exists or not)

You should have a look at Gabriele's ht://check program, which is partly
based on htdig.  It's on the sourceforge.org web site, I believe.

-- 
Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] result count is too small ?

2000-12-13 Thread Gilles Detillieux

According to Dennis Director:
 I am running htdig-3.2.0b2, I recently moved from htdig-3.1.5.
 Sometimes, the result count that I get back from a search is too small.
 For instance, below it said I have ten matches but only gave me two.

It's hard to say for sure what's happening, but 3.2.0b2 has a number of
known bugs, which are fixed in the latest development snapshot for 3.2.0b3.
The infamous scoring bugs might account for the behaviour you see.

-- 
Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] Hi, I really need your help!

2000-12-13 Thread christopher . murtagh

On Wed, 13 Dec 2000, Sean Harris wrote:
I want to use htdig to index my personnal website.  i succeeded in
using htdig to gather the files and htmerge to establish the datebase.
Even i succeeded in searching by using English words,  such as "good".
BUT, if i use a chinese word to search for, IT FAILS!

Check FAQ 4.10. How do I index documents in other languages?

http://www.htdig.org/FAQ.html#q4.10

Cheers,

Chris


-- 

Christopher Murtagh
Webmaster / Web Communications Group
McGill University
Montreal, Quebec
Canada




To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




[htdig] Re: I need your help [from ellenliu]

2000-12-13 Thread Gilles Detillieux

Hi, Ellen.  First of all, you should always send these questions to
the list, and not to me personally.  I don't have all the answers.
See http://www.htdig.org/FAQ.html#q1.16

According to ellenliu:
 Dear Gilles R. Detillieux:
   I'm very grateful for your kind help last time.
   
   All these problems happened before compilation,during the Configure process.
  
   Because I can't get the most recent development snapshot of 3.2.0b3

They're in http://www.htdig.org/files/snapshots/

However, if you don't need any of the new features in the 3.2 series, you're
probably better off with 3.1.5.

 I  run 3.1.5 instead,but there  still exit some problems.
 I entered :
 "sh ./configure" ,
 
 it prompts:
 ".
 checking host system type ... ./configure: ./config.guess: no such file or directory 
configure
 configure:error:can not guess host type ;you must specify one
 configure :error :./configure failed for db/dist"
 I think that it can't pass through the check of 'host system type',I
 have read through the ./config.guess file ,but I 'm not clear what
 should I do yet.I know the default value of $host is NONE,whether need
 I set a type according to my machine?
 
 as I said last time when I run 3.2.0b2
 the output  prompts:
 "
 ...
 checking whether make sets ${MAKE}(cached) yes
 configure :error: can not run ./config.sub"
 
 in ./configure file I find the line (933):"if ${CONFIG_SHELL-/bin/sh} $ac_config_sub 
sun4 dev/null 21;then "
 why set  the parameter sun4 ?
 
 would you tell me what I shoulddo next ?
 Thanks.
 configure:
 cpu :PIII 550M
 os: red hat linux 6.2 kernel 2.2.14-5.0

We've never seen anything like this before on Red Hat Linux systems of any
version.  Certainly not on 6.2.  As I said last time, you may very well be
missing some critical packages from your Red Hat distribution which are
needed to compile and install software.

The other thing I'm noticing is that there seems to be a problem with
execution of scripts on your system.  How did you extract the files from
the .tar.gz distributions of either 3.1.5 or 3.2.0b2?  Did you use chmod
on any of the files, and in doing so turn off execute permissions on them?
If you did, that's definitely going to be a problem!

-- 
Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] Hi, need help with searching database.

2000-12-13 Thread Gilles Detillieux

According to Akshay Guleria:
 I just installed Redhat7.0 on my machine. And then installed htdig rpm. I
 can see the page
 http://myhost/htdig/ which is the search page.

Which htdig rpm did you install?  For Red Hat 7.0, you should use the RPM
for htdig-3.1.5-6 that comes with the 7.0 PowerTools.

 I make a search and for any search I make, it returns a page saying
 "No matches found for ... "
 
 Now, I ran rundig and it increased the file sizes in /var/lib/htdig. So, I
 presume the database was created. And then I ran htmerge. But I still get
 the
 "No matches found .." page.

If you run rundig, you don't need to run htmerge separately.  The rundig
script will run htdig followed by htmerge.  You should try running
your /var/www/cgi-bin/htsearch program right from the command line
first, to see if that works.  If it does, it may be an Apache server
configuration problem, or a problem with your search form.  Did you
make any changes to the /var/www/html/htdig/search.html search form?
If so, see http://www.htdig.org/FAQ.html#q5.17

-- 
Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] htdig missing subdirectories (was: Incremental indexing)

2000-12-13 Thread Gilles Detillieux

Please direct your questions to the list, not to me personally.
See FAQ 1.16.  Also, you're off topic, as this has nothing to do with
last week's "Incremental indexing" thread, so you should pick a more
descriptive subject.

According to crosstar:
 I have copiously poured over the messages
 in the mailing list, as well as references in FAQ.
 I am not very technical, but my situation is that htdig is
 missing a lot of files, words and subdirectories, altogether.
 
 I'm wondering if there is a simpler adjustment in
 htdig.conf to remedy this?  I simply do not understand
 the instrtuctions, as given, unfortunately, and note that
 one reader says that he thinks tinkering with the
 server is not the answer.

Did you follow the recommendations in FAQ 5.25  5.27?  That's probably
where you should focus your attention.  Running htdig with the -vvv
option will give you tons of output, but if you trace your way through
there you might be able to see why it's missing parts of your site.

 I tried running htfuzzy but get the error:
 htfuzzy: No algorithms specified 

You need to tell htfuzzy which database to build.  This won't solve your
problem above, though.  It's just for building databases for fuzzy match
algorithms.

 I have changed one default up upping to: 
 max_head_length:5

That will make htdig keep more of each document for use in excerpts for
matched pages, but it won't get you more matches.  However, upping the
max_doc_size may get htdig to index more stuff if it was missing links from
really large pages.  See FAQ 5.1.

-- 
Gilles R. Detillieux  E-mail: [EMAIL PROTECTED]
Spinal Cord Research Centre   WWW:http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:(204)789-3930


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




Re: [htdig] HTTPS Indexing

2000-12-13 Thread Jason Scharlach

Joshua

  Here are all of the active lines (non-comments) in my config file.

database_dir:   /wwwsys/src/htdig/db
start_url:  https://www.myurl.com/pub/en/index.html
limit_urls_to:  ${start_url}
bad_extensions: .wav .gz .z .sit .au .zip .tar .hqx .exe .com
.gif \
.jpg .jpeg .aiff .class .map .ram .tgz .bin .rpm .mpg
.mov .avi
maintainer: [EMAIL PROTECTED]
max_head_length:1
max_doc_size:   20
no_excerpt_show_top:true
search_algorithm:   exact:1 synonyms:0.5 endings:0.1
.
. (page layout stuff)
.

Also, I tried running the rundig script but I get the same "Unable to
build connection" error as before.  Let me know if there's anything else
I can do to help.

  Jason


Joshua Gerth wrote:
 
 Hi Jason,
 
I've done some more poking around and I've gotten openssl to work -
  atleast to the extent where I can successfully connect to my secure
  webserver using ./openssl s_client.
 
Once I got that working I figured I'd get a fresh start and
  recompile/reinstall htdig.  Now when I try to run ./htdig -i - I get
  the following output:
 
   ./htdig -i -v
  URL: https://www.myurl.com/ 1:0:https://www.myurl.com/
  New server: www.myurl.com, 443
  Unable to build connection with www.myurl.com:443
   pushed
  pick: www.myurl.com, # servers = 1
  
 
Any ideas?
 
 I just tried a fresh install and mine works so I don't think its the
 patch.  Can you include the first couple of line from your config file ...
 like everything down to 'maintainer'?
 
 Also, have you tried using the 'rundig' script?  I doubt this is the
 problem but I normally run:
 ./bin/rundig -vvv -c ./conf/myurl.conf  myurl.out 
 
 then I can run a
 tail -f myurl.out
 
 to watch the results.
 
 Joshua
 
 
 To unsubscribe from the htdig mailing list, send a message to
 [EMAIL PROTECTED]
 You will receive a message to confirm this.
 List archives:  http://www.htdig.org/mail/menu.html
 FAQ:http://www.htdig.org/FAQ.html


To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html




[htdig] htmerge - tons of output

2000-12-13 Thread Dennis Director


Running htdig-3.2.0b3-112600, when I try to merge two database, I get
a huge amout of output to (stdout?), I stopped it before it completed with
CNTRL-C, and it left a core file.

I used the command:

htmerge -c network.conf -m gourmetspot.conf | tee mergelog

A sample of the output is:

BitStream::Show: ntags:0 size:6018 buffsize:   753 ::: 

[htdig] Words and files not being found or indexed

2000-12-13 Thread crosstar

I am not too technical, so I hope this sounds clear.

I have htdig installed.  But, although it works fine with no
errors, many files and words are being left out of the search and
indexing.

I have checked all of the relevant FAQ, but either do not
understand what I am to do or am falling short, in some other way.

In reply to my earlier message, I was told to check the output using
-vvv.  I did so and here is what I found.

For example, I have a subdirectory which contains 70 files,
in /news/archives/2000/.

7 of these files turn up listed in the output.  But, where are the other 63?
They are not there and there is no reference to them in the entire
output file.

So, I am stumped as to what to do now.

Any assistance appreciated.  

HQ
-
The Nationalist Movement
PO Box 2000
Learned MS 39154
(601) 885-2288
Clinic: http://www.nationalist.org/board/html/index.php
Crosstarlist: http://www.nationalist.org/docs/resources/list.html
E-mail: mailto:[EMAIL PROTECTED]
Forum: http://www.nationalist.org/forum/index.php
Home Page: http://www.nationalist.org
ICQ: 5429992
Newsgroup: alt.national
Views not necessarily those of The Nationalist Movement
© 2000 by The Nationalist Movement
-

END



To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED]
You will receive a message to confirm this.
List archives:  http://www.htdig.org/mail/menu.html
FAQ:http://www.htdig.org/FAQ.html