RE: [htdig] locale:ru on Solaris
Hello! Thanks for your will to help me. Here it is the core. Regards, Eldar Imangulov project manager (design hosting) [EMAIL PROTECTED] phone/fax.: +7 095 777.09.10 Global Chance Bld.1, 42 Bolshaya Yakimanka st., Moscow 117049 Russia // -Original Message- // From: Gilles Detillieux [mailto:[EMAIL PROTECTED]] // Sent: Tuesday, December 12, 2000 8:33 PM // To: Eldar Imangulov // Cc: [EMAIL PROTECTED] // Subject: Re: [htdig] locale:ru on Solaris // // // According to Eldar Imangulov: // I'm useing Solaris 7 // // I made the htDig and now I try to make search my site in russian // (windows-1251). // // in htdig.conf I said the // locale : ru // // The website indexing is going well but the htsearch does not work // (coredump). // // But without russian language (indexing by default = without // locale:ru) // indexing htsearch works well togather. // // What is the problem??? // // Hard to say, but from what you describe it sounds like a // problem with the // locale tables for your locale, or a database corruption problem of some // sort, perhaps. Could you give us a stack backtrace of htsearch's core // dump to narrow things down a bit? // // See the latter part of http://www.htdig.org/FAQ.html#q5.14 // // -- // Gilles R. Detillieux E-mail: [EMAIL PROTECTED] // Spinal Cord Research Centre WWW: // http://www.scrc.umanitoba.ca/~grdetil // Dept. Physiology, U. of Manitoba Phone: (204)789-3766 // Winnipeg, MB R3E 3J7 (Canada) Fax:(204)789-3930 // core.zip To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html
[htdig] Htdig as external Link Checker? (Maybe off-topic)
Hi community, I need to generate a List for my boss, which contains all external Links of our Web-Site (which gets already indexed by htdig) including the status (means if the target of this link exists or not) Can HTDIG help me with this by: 1. Create a List of external URLs (all URLs, which HTDIG finds during indexing, but doesn't follow because of the restrict URL config). I could use this list by some other tools like wget to check the connection to this links or (the preferred way) 2. Can HTDIG provide me with a list of broken external links? Any ideas? tnx Stefan To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html
[htdig] Hi, I really need your help!
Dear Sir or Madam, I want to use htdig to index my personnal website. i succeeded in using htdig to gather the files and htmerge to establish the datebase. Even i succeeded in searching by using English words, such as "good". BUT, if i use a chinese word to search for, IT FAILS! I really need your help. If you know how the htsearch search the database , please tell me! Thank you very much! Sean To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html
[htdig] Hi, need help with searching database.
Hi Everyone, I'll need help with htdig. I just installed Redhat7.0 on my machine. And then installed htdig rpm. I can see the page http://myhost/htdig/ which is the search page. I make a search and for any search I make, it returns a page saying "No matches found for ... " Now, I ran rundig and it increased the file sizes in /var/lib/htdig. So, I presume the database was created. And then I ran htmerge. But I still get the "No matches found .." page. What am i missing? Any help will be appreciated. Thanx, Akshay To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html
Re: [htdig] Installation Problem under HPUX 10.20
I have installed gcc using swinstall utility that comes with HPUX % swinstall -s /tmp/gcc-2.95.2-sd-10.20.depot and swinstall analyzed the packaged and installed it (no error). I am not a C/C++ programmer, so did not try to compile any other program, but I encountered the same problem with another machine running the same version of HP-UX 10.20. Pardon my ignorance, but I went to the include dir under /opt/gcc and ran make stdio.h and some other C library, and they do not respond with error. For example, for %make stdio.h (output is 'stdio.h' is up to date) and same result for other *.h file No, I did not do "make boostrap" (do you mean make bootstrap) . Because I used swinstall which I use for all kinds of installation and worksout nicely all the time. Regards, Intekhab --- Geoff Hutchison [EMAIL PROTECTED] wrote: I have gone to gnu's website and reinstalled gcc-2.95.2-sd-10.20.depot.gz, even rebooted :-p but no luck yet. Any pointer? Have you tried compiling other programs with this compiler? When you installed gcc, did you do it as a "make boostrap?" The configure script basically tries to compile "hello world" and if it didn't work, then there's something wrong with the compiler. You can get more info from the config.log file. -- -Geoff Hutchison Williams Students Online http://wso.williams.edu/ __ Do You Yahoo!? Yahoo! Shopping - Thousands of Stores. Millions of Products. http://shopping.yahoo.com/ To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html
Re: [htdig] locale:ru on Solaris
Sorry, but there's absolutely nothing I can do with the core file itself, as I don't have a Solaris system. What I want is for you to get a stack backtrace using your htsearch executable and your core dump file using your debugger. Either that or run htsearch directly under your debugger, and when it fails get the stack backtrace directly from the in-memory copy of the program. If you use gdb, the procedure is described in the FAQ. If you use another debugger, you'll need to figure out how to do it with that debugger. According to Eldar Imangulov: Hello! Thanks for your will to help me. Here it is the core. Regards, Eldar Imangulov project manager (design hosting) [EMAIL PROTECTED] phone/fax.: +7 095 777.09.10 Global Chance Bld.1, 42 Bolshaya Yakimanka st., Moscow 117049 Russia // -Original Message- // From: Gilles Detillieux [mailto:[EMAIL PROTECTED]] // Sent: Tuesday, December 12, 2000 8:33 PM // To: Eldar Imangulov // Cc: [EMAIL PROTECTED] // Subject: Re: [htdig] locale:ru on Solaris // // // According to Eldar Imangulov: // I'm useing Solaris 7 // // I made the htDig and now I try to make search my site in russian // (windows-1251). // // in htdig.conf I said the // locale : ru // // The website indexing is going well but the htsearch does not work // (coredump). // // But without russian language (indexing by default = without // locale:ru) // indexing htsearch works well togather. // // What is the problem??? // // Hard to say, but from what you describe it sounds like a // problem with the // locale tables for your locale, or a database corruption problem of some // sort, perhaps. Could you give us a stack backtrace of htsearch's core // dump to narrow things down a bit? // // See the latter part of http://www.htdig.org/FAQ.html#q5.14 -- Gilles R. Detillieux E-mail: [EMAIL PROTECTED] Spinal Cord Research Centre WWW:http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax:(204)789-3930 To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html
Re: [htdig] Htdig as external Link Checker? (Maybe off-topic)
According to Reich, Stefan: I need to generate a List for my boss, which contains all external Links of our Web-Site (which gets already indexed by htdig) including the status (means if the target of this link exists or not) You should have a look at Gabriele's ht://check program, which is partly based on htdig. It's on the sourceforge.org web site, I believe. -- Gilles R. Detillieux E-mail: [EMAIL PROTECTED] Spinal Cord Research Centre WWW:http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax:(204)789-3930 To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html
Re: [htdig] result count is too small ?
According to Dennis Director: I am running htdig-3.2.0b2, I recently moved from htdig-3.1.5. Sometimes, the result count that I get back from a search is too small. For instance, below it said I have ten matches but only gave me two. It's hard to say for sure what's happening, but 3.2.0b2 has a number of known bugs, which are fixed in the latest development snapshot for 3.2.0b3. The infamous scoring bugs might account for the behaviour you see. -- Gilles R. Detillieux E-mail: [EMAIL PROTECTED] Spinal Cord Research Centre WWW:http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax:(204)789-3930 To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html
Re: [htdig] Hi, I really need your help!
On Wed, 13 Dec 2000, Sean Harris wrote: I want to use htdig to index my personnal website. i succeeded in using htdig to gather the files and htmerge to establish the datebase. Even i succeeded in searching by using English words, such as "good". BUT, if i use a chinese word to search for, IT FAILS! Check FAQ 4.10. How do I index documents in other languages? http://www.htdig.org/FAQ.html#q4.10 Cheers, Chris -- Christopher Murtagh Webmaster / Web Communications Group McGill University Montreal, Quebec Canada To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html
[htdig] Re: I need your help [from ellenliu]
Hi, Ellen. First of all, you should always send these questions to the list, and not to me personally. I don't have all the answers. See http://www.htdig.org/FAQ.html#q1.16 According to ellenliu: Dear Gilles R. Detillieux: I'm very grateful for your kind help last time. All these problems happened before compilation,during the Configure process. Because I can't get the most recent development snapshot of 3.2.0b3 They're in http://www.htdig.org/files/snapshots/ However, if you don't need any of the new features in the 3.2 series, you're probably better off with 3.1.5. I run 3.1.5 instead,but there still exit some problems. I entered : "sh ./configure" , it prompts: ". checking host system type ... ./configure: ./config.guess: no such file or directory configure configure:error:can not guess host type ;you must specify one configure :error :./configure failed for db/dist" I think that it can't pass through the check of 'host system type',I have read through the ./config.guess file ,but I 'm not clear what should I do yet.I know the default value of $host is NONE,whether need I set a type according to my machine? as I said last time when I run 3.2.0b2 the output prompts: " ... checking whether make sets ${MAKE}(cached) yes configure :error: can not run ./config.sub" in ./configure file I find the line (933):"if ${CONFIG_SHELL-/bin/sh} $ac_config_sub sun4 dev/null 21;then " why set the parameter sun4 ? would you tell me what I shoulddo next ? Thanks. configure: cpu :PIII 550M os: red hat linux 6.2 kernel 2.2.14-5.0 We've never seen anything like this before on Red Hat Linux systems of any version. Certainly not on 6.2. As I said last time, you may very well be missing some critical packages from your Red Hat distribution which are needed to compile and install software. The other thing I'm noticing is that there seems to be a problem with execution of scripts on your system. How did you extract the files from the .tar.gz distributions of either 3.1.5 or 3.2.0b2? Did you use chmod on any of the files, and in doing so turn off execute permissions on them? If you did, that's definitely going to be a problem! -- Gilles R. Detillieux E-mail: [EMAIL PROTECTED] Spinal Cord Research Centre WWW:http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax:(204)789-3930 To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html
Re: [htdig] Hi, need help with searching database.
According to Akshay Guleria: I just installed Redhat7.0 on my machine. And then installed htdig rpm. I can see the page http://myhost/htdig/ which is the search page. Which htdig rpm did you install? For Red Hat 7.0, you should use the RPM for htdig-3.1.5-6 that comes with the 7.0 PowerTools. I make a search and for any search I make, it returns a page saying "No matches found for ... " Now, I ran rundig and it increased the file sizes in /var/lib/htdig. So, I presume the database was created. And then I ran htmerge. But I still get the "No matches found .." page. If you run rundig, you don't need to run htmerge separately. The rundig script will run htdig followed by htmerge. You should try running your /var/www/cgi-bin/htsearch program right from the command line first, to see if that works. If it does, it may be an Apache server configuration problem, or a problem with your search form. Did you make any changes to the /var/www/html/htdig/search.html search form? If so, see http://www.htdig.org/FAQ.html#q5.17 -- Gilles R. Detillieux E-mail: [EMAIL PROTECTED] Spinal Cord Research Centre WWW:http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax:(204)789-3930 To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html
Re: [htdig] htdig missing subdirectories (was: Incremental indexing)
Please direct your questions to the list, not to me personally. See FAQ 1.16. Also, you're off topic, as this has nothing to do with last week's "Incremental indexing" thread, so you should pick a more descriptive subject. According to crosstar: I have copiously poured over the messages in the mailing list, as well as references in FAQ. I am not very technical, but my situation is that htdig is missing a lot of files, words and subdirectories, altogether. I'm wondering if there is a simpler adjustment in htdig.conf to remedy this? I simply do not understand the instrtuctions, as given, unfortunately, and note that one reader says that he thinks tinkering with the server is not the answer. Did you follow the recommendations in FAQ 5.25 5.27? That's probably where you should focus your attention. Running htdig with the -vvv option will give you tons of output, but if you trace your way through there you might be able to see why it's missing parts of your site. I tried running htfuzzy but get the error: htfuzzy: No algorithms specified You need to tell htfuzzy which database to build. This won't solve your problem above, though. It's just for building databases for fuzzy match algorithms. I have changed one default up upping to: max_head_length:5 That will make htdig keep more of each document for use in excerpts for matched pages, but it won't get you more matches. However, upping the max_doc_size may get htdig to index more stuff if it was missing links from really large pages. See FAQ 5.1. -- Gilles R. Detillieux E-mail: [EMAIL PROTECTED] Spinal Cord Research Centre WWW:http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax:(204)789-3930 To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html
Re: [htdig] HTTPS Indexing
Joshua Here are all of the active lines (non-comments) in my config file. database_dir: /wwwsys/src/htdig/db start_url: https://www.myurl.com/pub/en/index.html limit_urls_to: ${start_url} bad_extensions: .wav .gz .z .sit .au .zip .tar .hqx .exe .com .gif \ .jpg .jpeg .aiff .class .map .ram .tgz .bin .rpm .mpg .mov .avi maintainer: [EMAIL PROTECTED] max_head_length:1 max_doc_size: 20 no_excerpt_show_top:true search_algorithm: exact:1 synonyms:0.5 endings:0.1 . . (page layout stuff) . Also, I tried running the rundig script but I get the same "Unable to build connection" error as before. Let me know if there's anything else I can do to help. Jason Joshua Gerth wrote: Hi Jason, I've done some more poking around and I've gotten openssl to work - atleast to the extent where I can successfully connect to my secure webserver using ./openssl s_client. Once I got that working I figured I'd get a fresh start and recompile/reinstall htdig. Now when I try to run ./htdig -i - I get the following output: ./htdig -i -v URL: https://www.myurl.com/ 1:0:https://www.myurl.com/ New server: www.myurl.com, 443 Unable to build connection with www.myurl.com:443 pushed pick: www.myurl.com, # servers = 1 Any ideas? I just tried a fresh install and mine works so I don't think its the patch. Can you include the first couple of line from your config file ... like everything down to 'maintainer'? Also, have you tried using the 'rundig' script? I doubt this is the problem but I normally run: ./bin/rundig -vvv -c ./conf/myurl.conf myurl.out then I can run a tail -f myurl.out to watch the results. Joshua To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html
[htdig] htmerge - tons of output
Running htdig-3.2.0b3-112600, when I try to merge two database, I get a huge amout of output to (stdout?), I stopped it before it completed with CNTRL-C, and it left a core file. I used the command: htmerge -c network.conf -m gourmetspot.conf | tee mergelog A sample of the output is: BitStream::Show: ntags:0 size:6018 buffsize: 753 :::
[htdig] Words and files not being found or indexed
I am not too technical, so I hope this sounds clear. I have htdig installed. But, although it works fine with no errors, many files and words are being left out of the search and indexing. I have checked all of the relevant FAQ, but either do not understand what I am to do or am falling short, in some other way. In reply to my earlier message, I was told to check the output using -vvv. I did so and here is what I found. For example, I have a subdirectory which contains 70 files, in /news/archives/2000/. 7 of these files turn up listed in the output. But, where are the other 63? They are not there and there is no reference to them in the entire output file. So, I am stumped as to what to do now. Any assistance appreciated. HQ - The Nationalist Movement PO Box 2000 Learned MS 39154 (601) 885-2288 Clinic: http://www.nationalist.org/board/html/index.php Crosstarlist: http://www.nationalist.org/docs/resources/list.html E-mail: mailto:[EMAIL PROTECTED] Forum: http://www.nationalist.org/forum/index.php Home Page: http://www.nationalist.org ICQ: 5429992 Newsgroup: alt.national Views not necessarily those of The Nationalist Movement © 2000 by The Nationalist Movement - END To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this. List archives: http://www.htdig.org/mail/menu.html FAQ:http://www.htdig.org/FAQ.html