Re: UdmSearch: Webboard: Segfault (grrr)

2001-02-16 Thread Alexander Barkov
It seems that search.cgi does not unescape URL limit. Check this URL with unescaped URL limit: http://search.freewinds.cx/cgi-bin/v3.cgi?ps=20o=0m=allwf=22210cat=ul=http://www.xs4all.nl/q=anything Zenon Panoussis wrote: Alexander Barkov skrev: [ul limit] search.cgi outputs some

Re: 2 questions

2001-02-16 Thread Alexander Barkov
Rick Roberts wrote: Can I run indexer and MySQL on a separate server from the web server? Yes. How do I get indexer to search through pdf files? Use an external parser. Check doc/parsers.txt ___ If you want to unsubscribe send "unsubscribe general"

Re: 1 question

2001-02-16 Thread Alexander Barkov
Rick Roberts wrote: How can I get search.cgi to look for the database (MySQL) at another host? If it requires source code mod then that's O.K.. (A point to the appropriate module would be helpful) DBAddr mysql://hostname/dbname/ Where hostname is DNS host name of ANOTHER host.

Re: search in pdf

2001-02-17 Thread Alexander Barkov
Hannes Kriegner wrote: Hi Have a possibility to search in PDF - Files Hannes Use external parser, for example pdftotext. Check doc/parsers.txt.

Re: Direct Database Injection And Some probably stupid questions

2001-02-17 Thread Alexander Barkov
"Briggs, Gary" wrote: OK, so I've been reading the source code, and I'm having real trouble with what some parts of the url table are for... Why bother to compute a crc32 for the urls? do I need it? [I'm currently using crc-multi db mode on mySQL] It is url.crc32 of document content. It

Re: UdmSearch: Webboard: Segfault (grrr)

2001-02-17 Thread Alexander Barkov
Zenon Panoussis wrote: ./configure --with-mysql --prefix=/usr/local/mn3110-4 --localstatedir=/var/mn3110-4 --enable-fast-tag --enable-fast-cat --enable-fast-site [OK] make [...] creating libudmsearch.la (cd .libs rm -f libudmsearch.la ln -s ../libudmsearch.la libudmsearch.la) gcc

Re: Need Help/Search Engine

2001-02-18 Thread Alexander Barkov
[EMAIL PROTECTED] wrote: Hi I need a little more info on how to install the search engine Thank you George INSTALL file has all information required to install search engine. What is your problem?

Re: 3.1.10: rating==1 bug almost found: UdmGroupByURL()

2001-02-19 Thread Alexander Barkov
Looking in CVS Web I found this sql.c comment for v1.4 (ram) very suspicious: Document rating store in pos field of wrd[] = n_uniq_words According to this, rating always will be 1 for one-word searches, since uniq word is always equal 1. Rating is totally meaningless in this

Re: UdmSearch: space in url cause error

2001-02-19 Thread Alexander Barkov
I can't reproduce this. I created test pages at http://gw.udmsearch.izhnet.ru/~bar/test/ and they are indexed without any problems. So I do guess that problem is not with spaces. Caffeinate The World wrote: --- Alexander Barkov [EMAIL PROTECTED] wrote: I think the problem

Webboard: Long URLS in httdb

2001-02-19 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: I want to index URLs generated from a Lotus Notes Database. The indexing works so far (after some tweaking: these URLs are longer than 127 bytes, so I had to increase it in the sources). There is a problem, however, with indexing

Re: Yahoo! Auto Response

2001-02-19 Thread Alexander Barkov
Zenon Panoussis wrote: [EMAIL PROTECTED] skrev: This mailbox is for sending only. Do not send me any messages to this address. Please use my other accounts instead. These messages are flooding the list. Is there anything that can be done about it? I've unsubscribed him already.

Webboard: FollowOutside problems.

2001-02-19 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: I use the following config file: DBAddr mysql://user:pass@localhost/dbname/ Alias http://domain/ http://mirror/ Server http://domain/startfile.php FollowOutside no Disallow \.gif$ \.jpg$ \.class$ \.doc$ \.css$ Disallow \.pdf

Webboard: Search on certain servers only

2001-02-20 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Hi, how can I search only through certain servers in the flat? I could not figure it out yet. Can anyone give me a hint please??? Thanx Use URL limits. It is to be passed in ul form variable. Check default template. Reply: http

Re: Segmentation Fault core dump (gdb report include in this mail)

2001-02-26 Thread Alexander Barkov
Please compile with -g cc flags: export CFLAGS=-g ./configure make then run gdb backtrace again after crash. It's interesting what's happening in UdmAddURL() function. Use "frame 3" then "where" commands. "Briggs, Gary" wrote: I'm getting this too. Solaris 2.6 MySQL 3.23.33

Webboard: Different results for different languages.

2001-02-26 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: It is better to use tag command. Something like this: Tag ru Realm http://your.server.name/*.ru.* Tag de Realm http://your.server.name/*.de.* Tag en Realm http://your.server.name/*.en.* Reply: http://search.mnogo.ru/board
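A rough sketch of how the Tag/Realm pairs above could sort URLs by language. It assumes that Realm patterns behave like shell-style wildcards, which Python's fnmatch only approximates; the function name tag_for and the sample URLs are illustrative and are not part of mnoGoSearch.

```python
from fnmatch import fnmatchcase

# (tag, realm pattern) pairs copied from the Tag/Realm commands above.
REALMS = [
    ("ru", "http://your.server.name/*.ru.*"),
    ("de", "http://your.server.name/*.de.*"),
    ("en", "http://your.server.name/*.en.*"),
]


def tag_for(url):
    """Return the tag of the first realm whose pattern matches url, else None.

    Hypothetical helper: it only mimics wildcard matching, it does not
    reproduce indexer's real matching code.
    """
    for tag, pattern in REALMS:
        if fnmatchcase(url, pattern):
            return tag
    return None
```

With these patterns a page such as http://your.server.name/index.ru.html would get the tag ru, while a URL with no language marker gets no tag at all.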

Crash in UdmAddURL() fix

2001-02-26 Thread Alexander Barkov
Hello! Please find a patch which fixes crash in UdmAddURL() function. Thanks everybody who helped to debug! Please give feedback. add_url.3.1.11.diff.gz

Webboard: SyBase on HP-UX is it possible?

2001-02-26 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Hi, All! I've HP-UX with SyBase installed. Can I install and configure MnoGoSearch to be able to store indexing data in SyBase tables on my UNIX? I know suggestion about SyBase via ODBC, but my HP-UX doesn't support ODBC yet

Re: BOUNCE general@mnogosearch.org: Non-member submission from [Luis Bravo lebravo@emcali.net.co]

2001-02-27 Thread Alexander Barkov
What is your OS? Have you any defines MSG_xxx in sys/socket.h or in other includes? [EMAIL PROTECTED] wrote: From [EMAIL PROTECTED] Sat Feb 24 23:03:20 2001 Received: (from nobody@localhost) by tr155m.krystech.ru (8.9.3/8.9.3) id XAA08651; Sat, 24 Feb 2001 23:02:46 +0300

Webboard: Excluding a directory

2001-02-27 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Hi everybody, How can I exclude a directory from the indexing process? Using allow/disallow commands, or using robots.txt file if you are indexing your own site. Reply: http://search.mnogo.ru/board/message.php?id=1541

Re: Bug report

2001-02-27 Thread Alexander Barkov
poelzi wrote: UdmSearch version: 3.1.11 Platform: OS:Linux fabala.intra.agi.de 2.4.1 #1 Don Feb 15 01:33:33 CET 2001 i686 unknown Database: mysql Ver 11.12 Distrib 3.23.33, for pc-linux-gnu (i686) Statistics: server over ServerTable only registered as urls but

Re: Webboard: indexer -C

2001-02-28 Thread Alexander Barkov
This is from doc/cachemode.txt: 1. "indexer -C" WITHOUT subsection control does not do anything with word index tree. You should delete /var/tree directory manually. 2. "indexer -C ... "

Re: Crosswords and servertables

2001-02-28 Thread Alexander Barkov
This feature works in single and multi modes only. Thanks for noticing this, we'll add this into documentation. "Briggs, Gary" wrote: having looked some more, the docs say that it is "not supported in built-in database and Cachemode". No matter what I do, I can't get it to work at all

Re: Crosswords and servertables

2001-02-28 Thread Alexander Barkov
Hi! It seems to be a bug. CrossWeight is not working when servers are loaded from SQL table using ServerTable command. Thanks for reporting! "Briggs, Gary" wrote: having looked some more, the docs say that it is "not supported in built-in database and Cachemode". No matter what I do,

Re: Crosswords and servertables

2001-02-28 Thread Alexander Barkov
Sorry for this letter, I was wrong. It should work in all SQL modes, the problem is in ServerTable. Alexander Barkov wrote: This feature works in single and multi modes only. Thanks for noticing this, we'll add this into documentation. "Briggs, Gary" wrote: having looked

Re: Crosswords and servertables

2001-02-28 Thread Alexander Barkov
You are right! "Briggs, Gary" wrote: I think that's because it doesn't read anything from the ServerTable about crossweight by default, and then sets the crossweight to "0".

Re: 3.1.11 patched dont launch parsers

2001-02-28 Thread Alexander Barkov
Laurent LEVIER wrote: Hi Guys, Is it me or the udmsearch 3.1.11 does not launch any parser ? I put a log on my parsing gateway, and it seems never indexer is using parsers. I also noticed that Disallow Regex are not working anymore. Notes: Same configuration file was used in 3.1.9 in

Re: BOUNCE general@mnogosearch.org: Non-member submission from [Igor Lyubimov mr_july@nm.ru]

2001-02-28 Thread Alexander Barkov
Hi! Please find a patch from Darko Koruga here: http://gw.udmsearch.izhnet.ru/~bar/patches/built-in.diff.gz Regards. From [EMAIL PROTECTED] Tue Feb 27 19:35:08 2001 Received: (from nobody@localhost) by tr155m.krystech.ru (8.9.3/8.9.3) id TAA12525; Tue, 27 Feb 2001

Re: Phrase Search (cache mode)

2001-02-28 Thread Alexander Barkov
I tested this document, everything is indexed as expected and full phrases are searched, doesn't matter two or three words in query. I guess that you have no "Phrase yes" command in your search template. Probably you added it only into indexer.conf. Frederic Tschannen wrote: Hi,

Webboard: Indexing subdirs(second attempt)

2001-02-28 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Hi All! Does anybody know HOW TO indexing subdirs in my site without links to them? Or I must write URLs to all of them in server-access parameter? A lot of! Thanks. Either using indexer -i -u http://hostname/path/to/subdir

Re: Crosswords and servertables

2001-02-28 Thread Alexander Barkov
ut this [I like being able to use php...] Thanks, Gary (-; -Original Message- From: Alexander Barkov [SMTP:[EMAIL PROTECTED]] Sent: Wednesday, February 28, 2001 10:54 AM To: [EMAIL PROTECTED]; Briggs, Gary Subject: Re: Crosswords and servertables I'm sorry for that

Re: BOUNCE general@mnogosearch.org: Non-member submission from [Zenon Panoussis lrh@xs4all.nl]

2001-02-28 Thread Alexander Barkov
I'm just not sure what version you are using. Is it here: }else{ /* Unknown Content-Type */ if(Method!=UDM_HEAD){ crc32=UdmCRC32(Doc->content, (size_t)realsize); changed=!(crc32==Doc->crc32);

Webboard: Alias news://xyz news://123 ?

2001-02-28 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: What exactly didn't work? Alias? Does indexer connect to the original server? Or something else? Alias news://news.external.com/ news://10.0.100.1 Server news://news.external.com/a_newsgroup It didn't work. Tried this instead

Re: Searching multiple tags

2001-02-28 Thread Alexander Barkov
"Briggs, Gary" wrote: Is there any way that I can search from multiple tags in one search? Gary (-; Just submit several t=XXX pairs from HTML form. At least search.cgi can do it.
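A minimal sketch of what "several t=XXX pairs" looks like in a query string. The parameter names q and t come from the messages in this list; the helper name search_query and the tag values are made up for illustration.

```python
from urllib.parse import urlencode


def search_query(words, tags):
    """Build a query string that repeats the t parameter once per tag.

    Hypothetical helper; search.cgi itself simply receives the pairs
    that the HTML form submits.
    """
    params = [("q", words)] + [("t", tag) for tag in tags]
    return urlencode(params)
```

Passing a list of pairs rather than a dict is what allows the same key to appear more than once, which mirrors a form with several t fields.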

Re: BOUNCE general@mnogosearch.org: Non-member submission from [Zenon Panoussis lrh@xs4all.nl]

2001-02-28 Thread Alexander Barkov
OK. Please check also this: print realsize print *Doc Zenon Panoussis wrote: please run the following commands in gdb: frame 1 print content_type print Method print Doc print Doc->content print Doc->url #0 0x80600ca in UdmCRC32 (buf=0x4021c03e "", size=4294967295) at

Re: BOUNCE general@mnogosearch.org: Non-member submission from [Zenon Panoussis lrh@xs4all.nl]

2001-03-01 Thread Alexander Barkov
) is returned only in two places: 1. in open_host() function, when port is 0. 2. in UdmHTTPGet() function, when select() returns an error. I have no idea what is happening. Zenon Panoussis wrote: Alexander Barkov skrev: OK. Please check also this: print realsize print *Doc (gdb) frame

Re: Searching multiple tags

2001-03-01 Thread Alexander Barkov
Sergey, any comments? "Briggs, Gary" wrote: Hmmm. The php one can't . Gary (-; -Original Message- From: Alexander Barkov [SMTP:[EMAIL PROTECTED]] Sent: Thursday, March 01, 2001 5:13 AM To: [EMAIL PROTECTED]; Briggs, Gary Subject: Re: Searching mul

Re: 3.1.11: stopwords substring search

2001-03-01 Thread Alexander Barkov
"Andrey A. Chernov" wrote: There is apparent conflict between stopwords and substring search, f.e. try to search Russian 'надо' (which is stopword by itself) in {beg,end,sub}string mode - search stopped with no result because stopword immediately found, but it prevents

Webboard: 3.1.11: stopwords substring search

2001-03-01 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: There is apparent conflict between stopwords and substring search, f.e. try to search Russian 'надо' (which is stopword by itself) in {beg,end,sub}string mode - search stopped with no result because stopword immediately

Webboard: bugs or what?

2001-03-01 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: First, I've tried to set up mnogosearch in my home directory, since I'm not root on the system. The first discrepancy was that for the built-in database we need to specify the --with-built-in keyword in configure - that's not described

Webboard: bugs or what?

2001-03-01 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: I found that problem source - it was that mnogosearch tried to use SQL in stopword section, but I used a built-in database and when I commented that string indexer was working... But I want to ask you, how to index a ftp - I gave "

Webboard: Indexer gets killed for not apparent reason - ongoing topic

2001-03-01 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Is core file produced? I know I have posted this message before but I got no replies, the problem is that indexer gets killed after the var/dict.txt files reaches ~7MB, I have more then enough space on the device (36GB) and it's

Webboard: indexer inserts unwanted url into database

2001-03-01 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Use either robots.txt file or allow/disallow commands. I don't want to index full server,only some directories ,so i am giving commands like: Server http://www.raaga.ch/de/dienst/ Server http://www.raaga.ch/de/kontakt/ Server

Re: php-mnogo-1.01

2001-03-01 Thread Alexander Barkov
Old PHP front-end uses SQL functions to access mnoGoSearch data. New front-end is native module for PHP: http://www.php.net/manual/en/ref.mnogo.php "Briggs, Gary" wrote: This is probably a REALLY stupid question, but what's the difference between this and the PHP frontend that appears

Webboard: INDEXER die

2001-03-02 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: We've found a bug which may be a reason of such behaviour. Please find it here: http://gw.udmsearch.izhnet.ru/~bar/server.c.diff.gz I know I had posted this message before but I get no answer. My indexer process stops after

server.c bug

2001-03-02 Thread Alexander Barkov
Hello! We've found a bug in server.c. It was the cause of some crashes. The patch is attached. server.c.diff.gz

Re: server.c bug

2001-03-05 Thread Alexander Barkov
Use "patch" program PNTCD wrote: Thank you, Should I replace the old server.c with server.c.diff and recompile mnoGoSearch? -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On Behalf Of Alexander Barkov Sent: Friday, March 02, 20

Webboard: Indexer gets killed for not apparent reason - ongoing topic

2001-03-05 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: I have no idea what's happening. Something or somebody kills the process. Probably you have quotas. By the way, how much memory does indexer use before it is killed? Reply: http://search.mnogo.ru/board/message.php?id=1614

Webboard: crc-multi and cache mode

2001-03-05 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: My database is in crc-multi mode, and now we are going to run the last version of mnogosearch in cache mode. First, we want to index in cache mode the same database, and we do not delete the word tables, because we want the people

Webboard: Windows Version Availibility

2001-03-05 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: is there windows source available for this like the linux version has? Windows version is built from UNIX sources. The only difference is that it has graphical interface. Reply: http://search.mnogo.ru/board/message.php?id=1620

Re: Big bug in indexer confirmed

2001-03-05 Thread Alexander Barkov
Try to apply a patch http://gw.udmsearch.izhnet.ru/~bar/patches/server.c.diff.gz This fixes strange bug. sprintf(NULL,"bla bla bla") doesn't crash on some operating systems, at least on FreeBSD 4.2 (both RELEASE and STABLE) and Linux with 2.2.14 kernel. It looks like memory leak, program eats

Re: Big bug in indexer confirmed

2001-03-05 Thread Alexander Barkov
Try to apply the patch http://gw.udmsearch.izhnet.ru/~bar/patches/server.c.diff.gz We checked indexer for memory leaks using memprof (btw nice program). It does not find any memory leaks. Probably, the reason is our bug together with sprintf() strange behaviour I mentioned earlier. Tõnu

Re: link validation config help!

2001-03-05 Thread Alexander Barkov
Run indexer -a -s0 - I've been doing a bit of poking around, and have noticed a couple of odd behaviours. I'm using the most recent version from CVS but I've experienced very similar behaviour from 3.1.9. First, if I do this: # indexer foo (after a few URLS, ^C to

Re: Is this, by any chance, a Bad Thing (TM)?

2001-03-05 Thread Alexander Barkov
"Briggs, Gary" wrote: 010305 9:28:18 Aborted connection 394 to db: 'udmsearch' user: 'udm' host: `localhost' (Got a packet bigger than 'max_allowed_packet') From the mySQL error log. Change max_allowed_packet mysqld variable. btw, what was the last sql query? Can you check this in

Webboard: Windows Spyware?

2001-03-06 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: UdmSearch for Windows is on a number of spyware lists. Is this a rumor? If not, will you be removing this "feature" from the next release? Thanks, Denny It is not true. There is not a single line that sends anything to the internet

Webboard: Windows Spyware?

2001-03-06 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: UdmSearch for Windows is on a number of spyware lists. Is this a rumor? If not, will you be removing this "feature" from the next release? By the way, who did you tell about that? Reply: http://search.mnogo.ru/board

Webboard: Windows Spyware?

2001-03-07 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: The only thing I can tell you is that mnoGoSearch is not spyware. I don't know who decided to post it on grc.com, or why. This is not true. The Windows version is built from the UNIX sources. You can check them for spy code. The only difference

Webboard: multiple dictionaries for the same language?

2001-03-07 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Hi! There are American and British dictionaries (using the same affix rules). And there is old and new typography of German (different word lists and affix files). Is it possible to use multiple ispell wordlists and/or affix

Webboard: Cache mode: File permissions

2001-03-07 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: I would like to know, since cache mode is not working properly, at least in my box, if the following permissions are properly set by the program: doc, etc... var: drwxrwsr-x var/cache: drwxrwxrwx var/raw, splitter & tree

Re: indexer sped up with Posgresql

2001-03-07 Thread Alexander Barkov
Caffeinate The World wrote: mnogosearch-cvs, postgresql-cvs i just wanted to let everyone who uses postgresql with mnogosearch how i got this incredible speed increase. I had about 500,000 URLs and when i started indexing, it took for ever between each URL. per alex's suggestion, i

Re: indexer sped up with Posgresql

2001-03-12 Thread Alexander Barkov
Thanks! I've added this into pgsql create script. Caffeinate The World wrote: mnogosearch-cvs, postgresql-cvs i just wanted to let everyone who uses postgresql with mnogosearch how i got this incredible speed increase. I had about 500,000 URLs and when i started indexing, it took for

Re: redirect url too long error

2001-03-13 Thread Alexander Barkov
Current version does not support URLs longer than 128 bytes. We'll fix this in 3.2 branch. Caffeinate The World wrote: [1] http://minneapolis.about.com/citiestowns/midwestus/minneapolis/local/midwestus/minneapolis/library/weekly/aa092199.htm [1] Redirect URL too long:

Webboard: SESSION/COOKIE

2001-03-13 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: It is not implemented. We have it on our TODO. I have a little problem, I have a website which use SESSION for print the right page (like language ) and the parser of mnogosearch don't accept cookie therefore php use TANS_ID

Re: cache mode

2001-03-13 Thread Alexander Barkov
Hello! Teofilis Martisius wrote: MnoGoSearch v3.1.11/mysql Solaris/sparc 7 I had some questions about cache mode. First, I have this problem- in my search engine a lot of queries return results with URLs that have no title, take up 0 bytes and so on. I think it is because these URLs

Webboard: Windows Version Licensing Fees Availability

2001-03-14 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Well, I am hanging around here daily just to know the answer to your question! Binaries seem to work fine. However, pricing and documentation are not ready yet. So, we can't make a release. Reply: http://search.mnogo.ru/board

Webboard: can udmsearch

2001-03-22 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: No, it can't do it. i have classification of web site, just like directory. There is education, region, job and so on. I have many directory in my server and it is always added file by the user. Could udmsearch classify them by itself

Webboard: Searching is terribly slow - any ideas why?

2001-03-22 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Take a look into sql.c, add #define DEBUG_SQL 1 then recompile. Run search.cgi from command line: ./search.cgi word /dev/null You'll see every SQL query sent to backend with time spent on their execution. What is the most long query

Webboard: Ranking of hits

2001-03-22 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Hey folks. Just wondering, how do I tell the search to rank URLs with long pathnames lower than direct hits on let's say a domain name with no directory path? Unfortunately there is no way to do it. Reply: http://search.mnogo.ru

Webboard: How to config windows version to put full content into TXT field?

2001-03-22 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: i asked the same question before , with no real answer ..! We need to know how to index the full page , rather then part of it .. It does index whole document. So, word will be found if it is in the bottom. However indexer does

Webboard: Last-Modified META tag

2001-03-23 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: It's taken from Last-Modified HTTP response header. META http-equiv=Last-Modified is not processed. What is the source for last-modified info that is stored in indexed database? I am trying to use <META http-equiv="Last

Webboard: How long will the Windows trial version be available?

2001-03-23 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: I noticed the message on the Windows Trial Beta said "Available only till March 15 2001". Yet it still works. Is this crippled in any way? Will it become crippled at some time? (when?) Thanks! Bob Don't worry, it's

Webboard: Different indexes

2001-03-23 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Hallo, does mnogosearch work as a crawler or a local indexer, that indexes the file system. If it works as a crawler, is it possible to use different indexes, e.g. whole site, protocols It supports HTTP, FTP, NEWs and local

Webboard: Missing table: mnoGoSearch.ndict2

2001-03-23 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Table definition is in the create/mysql/crc-multi.txt I've configured MnoGoSearch to use mysql. I can't find any reference to a table called ndict2 in the table creation script. When I start the indexer I see the following

Re: Denying a URL

2001-03-25 Thread Alexander Barkov
If squid works fine with browser, it will work with indexer. Just use "Proxy ..." indexer.conf command. André Luiz dos Santos wrote: I've made a redirector program for squid that blocks out ad banners and a big list of websites stored in a postgresql database, and I'd like to use it to

Webboard: 15-20% 302 - Moved Temporarily?

2001-03-25 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: I've got 315.000 URLs indexed now, and about 50.000 of them have status 302 - Moved Temporarily. I have no experience with this, but I think it's a rather high number. Any thoughts? Best regards, Ørjan Sandland

Webboard: ukrainization

2001-03-26 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Dear Sirs, Does somebody have an experience of an ukrainization of UDMSEARCH have faced a problem here. Such as: It doesn't want to take some(!) ukrainian letters. Server: Apache 1.3.12 (ENG) Locale: uk_UA Server's Encoding cp1251

Webboard: The value of intag in the dict tables

2001-03-26 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: This field consists of two values. The lowest two bytes store the weight value. The other two bytes are either word position or word count, depending on whether phrases are used. How is the value of the intag calculated? For example
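The layout described in this reply can be sketched as a pair of bit operations. This assumes that intag is an unsigned 32-bit integer and that "lowest two bytes" means the least significant 16 bits; the function name split_intag is hypothetical and does not come from the mnoGoSearch sources.

```python
def split_intag(intag):
    """Split an intag value into (weight, upper_half).

    Assumes a 32-bit value: the low 16 bits hold the weight, and the
    high 16 bits hold either the word position or the word count,
    depending on whether phrases are used.
    """
    weight = intag & 0xFFFF         # lowest two bytes
    upper = (intag >> 16) & 0xFFFF  # the other two bytes
    return weight, upper
```

For instance, under these assumptions the value 0x00050002 would carry weight 2 and an upper half of 5.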

Webboard: Premature end of indexing?

2001-03-26 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Try indexer -a -s504 I'm running latest mnogosearch and redhat 7 with mysql rpm installed. I compiled mnogosearch with support for pthreads btw. Mode is crc-multi, and several stop word lists is in use. The server is on a 1Mbit

Re: *Major* drawback in using Mnogosearch

2001-03-26 Thread Alexander Barkov
News extensions require another tables structure and search front-end. Take them from our download page. Emre Bastuz wrote: Hi, after spending the whole day with experimenting with Mnogosearch (latest) I came to the following conclusion: indexer ist not able to adapt it´s Mysql query

Webboard: Different indexes

2001-03-26 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: So i can use it as a crawler and a local file indexer ?! yes, you can If i use it as a crawler, can i then use different indizes?? What do you mean? Reply: http://search.mnogo.ru/board/message.php?id=1824

Webboard: realm

2001-03-27 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: hi all, i commented out the #server and wanted to index all the pages in the .nl domain. so i added Realm http://*.nl/* then i ran indexer and it gave me nothing, whats the right way to do that. thanks Realm command does

Webboard: more info

2001-03-29 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Hi! Spider must send Host header even in HTTP/1.0 This allows to index virtual hosts located on the same machine. This is from RFC 2068: 14.23 Host The Host request-header field specifies the Internet host and port
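To make the point about virtual hosts concrete, here is a small sketch of an HTTP/1.0 request that still carries a Host header. The function name build_request and the host name are placeholders; this is not how indexer builds its requests, only an illustration of the header on the wire.

```python
def build_request(host, path="/"):
    """Return the raw bytes of a minimal HTTP/1.0 GET request with a Host header.

    Hypothetical helper for illustration only.
    """
    lines = [
        f"GET {path} HTTP/1.0",
        f"Host: {host}",  # lets the server pick the right virtual host
        "",               # blank line ends the header block
        "",
    ]
    return "\r\n".join(lines).encode("ascii")
```

Two sites that share one IP address receive byte-identical request lines, so the Host header is the only thing that tells them apart.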

Webboard: real (again)

2001-03-29 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: You are on the right track, it will index whole .nl domain, not only www.zonnet.nl hi all, i want to index the sites in the .nl domain. i cleared my database and typed in: indexer -i -u http://www.zonnet.nl/ then i added

Re: Webboard: real (again)

2001-03-29 Thread Alexander Barkov
In this case indexer will follow any link, not *.nl only, "Briggs, Gary" wrote: You probably want a "follow world" in there, too. -Original Message- From: Doos [SMTP:[EMAIL PROTECTED]] Sent: Wednesday, March 28, 2001 11:28 AM To: [EMAIL PROTECTED] Subject:

Re: Bug report

2001-03-29 Thread Alexander Barkov
Oops. % signs are to be used in the SQL version only; they are SQL LIKE wildcards. The built-in database uses substring match. So, try to use .am without the % signs. Artashes Kalantarian wrote: UdmSearch version: 3.1.12 Platform: Dual Proccessor PIII 600, 389M ram OS:Solaris 5.8
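The difference between the two matching styles can be sketched as follows. Both functions are hypothetical stand-ins: like_match emulates SQL LIKE with a regular expression (real databases may differ in case sensitivity and escaping), and substring_match is a plain substring test, which is how this reply describes the built-in database.

```python
import re


def like_match(value, pattern):
    """Emulate SQL LIKE: % matches any run of characters, _ matches exactly one."""
    parts = []
    for ch in pattern:
        if ch == "%":
            parts.append(".*")
        elif ch == "_":
            parts.append(".")
        else:
            parts.append(re.escape(ch))
    return re.fullmatch("".join(parts), value, re.DOTALL) is not None


def substring_match(value, pattern):
    """Plain substring test: the pattern is taken literally, % included."""
    return pattern in value
```

A limit of %.am% works under LIKE but finds nothing as a literal substring, because no URL contains the % characters themselves; the bare .am is what the substring test needs.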

Re: Bug report

2001-03-30 Thread Alexander Barkov
Please find a patch here: http://gw.udmsearch.izhnet.ru/~bar/patches/conf.c.diff Regards Vadim Spichak wrote: UdmSearch version: 3.1.12 Platform: Linux. etc/// OS: Database: mysql Statistics:many 3.1.10 works fine for same config

Re: Search engine

2001-03-31 Thread Alexander Barkov
Laurent is right. Those file formats are really supported. But we didn't test our software on MacOS. Laurent LEVIER wrote: Philippe, UDM/Mnogo Search is reading the stdout that should contain only ASCII (ISO 8859-X) documents. You define a parser and this parser reads the file

Webboard: One URL per domain

2001-04-02 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: I wrote your suggestion to our TODO. Thanks. At times a search term happens to be repeated lots of times on different pages of the same site, so that the results get clogged. Imagine looking for shops in your area that offer "

Webboard: AM_INIT_AUTOMAKE (solved)

2001-04-03 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Thanks for Suggestion! Dmitry, please consider to add this instruction into README.CVS. The solution (on Linux RH 7.0): Run aclocal (just like that) after autoconf and before running ./configure. That's another step

Webboard: Underscore symbol.

2001-04-03 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Hello. Does anyone know is there possibility to interpret underscore symbol "_" as part of word, not as delimiter. So, the indexer would interpret "aaa_bbb" as single word, not 2 words aaa and bbb. Many thanks

Re: Webboard: Underscore symbol.

2001-04-03 Thread Alexander Barkov
It should work with dollar sign too. But I didn't test. - Original Message - From: Zenon Panoussis [EMAIL PROTECTED] To: [EMAIL PROTECTED] Sent: Tuesday, April 03, 2001 10:15 PM Subject: Re: Webboard: Underscore symbol. Alexander Barkov skrev: Open charset.c and add

Webboard: Compliments

2001-04-04 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Thanks! In the past couple of days I've been looking at urls scrolling up the screen, ending in things like Indexer[12895]: [1] Done (100122 seconds) and 200 0 190277 OK In the meanwhile, no core dumps

Webboard: SSI

2001-04-05 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: Use $iurl(http://some/include.html) template syntax. It includes the given URL. You may also use $if(/usr/local/httpd/include.html). This command includes the given file from the local system. Does anybody know of any way to put server side

Re: indexer does not work.

2001-04-06 Thread Alexander Barkov
./indexer -am See the documentation for further information. - Original Message - From: Hellinger Balazs [EMAIL PROTECTED] To: [EMAIL PROTECTED] Sent: Wednesday, April 04, 2001 5:14 PM Subject: indexer does not work. I have installed udmsearch, then i indexed my site. For for first

Re: CRC32 in URL table

2001-04-06 Thread Alexander Barkov
CRC32 field of URL table is CRC32 of whole document. And yes, it's used for clone detection. - Original Message - From: Briggs, Gary [EMAIL PROTECTED] To: [EMAIL PROTECTED] Sent: Wednesday, April 04, 2001 5:13 PM Subject: CRC32 in URL table What is this? I'm unable to find what it
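A minimal sketch of clone detection by checksum, as described in this reply. It uses zlib.crc32 as a stand-in; whether it yields the same values as mnoGoSearch's own UdmCRC32 is not confirmed here. The function name find_clones and the sample data are illustrative.

```python
import zlib
from collections import defaultdict


def find_clones(documents):
    """Group URLs whose content has the same CRC32.

    documents maps URL -> content bytes. Only groups with more than one
    URL are returned. A matching checksum suggests identical content but
    does not prove it, since CRC32 values can collide.
    """
    by_crc = defaultdict(list)
    for url, content in documents.items():
        by_crc[zlib.crc32(content)].append(url)
    return [urls for urls in by_crc.values() if len(urls) > 1]
```

Storing one 32-bit value per document keeps the comparison cheap: candidate clones are found by comparing integers instead of whole documents.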

Re: Https with mnoGoSearch

2001-04-06 Thread Alexander Barkov
This is explained in documentation supplied together with the sources. Download *.tar.gz, unpack it and take a look into /doc directory. - Original Message - From: Erwin LEUBAZ [EMAIL PROTECTED] To: [EMAIL PROTECTED] Sent: Thursday, April 05, 2001 8:20 PM Subject: Https with mnoGoSearch

Re: Problems with indexing files on local hard drive.

2001-04-09 Thread Alexander Barkov
Cliff Olle wrote: This is what indexer reports Indexer[28709]: indexer from mnogosearch-3.1.12/MySQL started with '/var/local/mnogosearch-3.1.12/etc/indexer.conf' Indexer[28716]: [1] Done (0 seconds) This is my indexer file DBAddr mysql://myuser:mypass@localhost/mnogosearch/ robots

Re: $iurl and template vars

2001-04-09 Thread Alexander Barkov
Dmitry Illarionov wrote: Hello, Could you, please, tell, is it possible to use template variables in the files included with $iurl command? It is not possible to use them directly. However, you can write in template: $iurl(http://servername/include.cgi?q=$q) Query text will be

Webboard: Search Form - 'wf' Select

2001-04-09 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: ok, i had to remove the Select named 'wf' to get results from the indexer. Whats this supposed to do and why isnt returning any results with it being used? (all wf options were tested and non returned results) until further notice

Webboard: web bug in search.htm ?

2001-04-09 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: You can change your template yourself. It is absolutely configurable. The distributed search.htm contains this: <a href="http://search.mnogo.ru/"><IMG BORDER=0 SRC="http://search.mnogo.ru/img/udm.gif"></A>

Webboard: TAG question

2001-04-09 Thread Alexander Barkov
Author: Alexander Barkov Email: [EMAIL PROTECTED] Message: You have to reindex with indexer -am after changing Tag commands. How to give to the local_host1 server the tag value 0 and the tag value 1 to the local_host2 server, in the same indexer.conf file !? I tried this, but when I look
