UdmSearch: Webboard: search.cgi and method=post
Author: gluke Email: [EMAIL PROTECTED] Message: Use method 'GET' instead. Reply: http://search.mnogo.ru/board/message.php?id=776 __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
UdmSearch: Webboard: Strange SQL error
Author: gluke Email: [EMAIL PROTECTED] Message: Increase tmp_table_size mysqld variable. Reply: http://search.mnogo.ru/board/message.php?id=777 __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
UdmSearch: Crashes
I'm using mnoGoSearch 3.1.8 + mysql 3.22.32 at FreeBSD 4.1.1, and my indexer often makes server hang until reboo. During this, /var/log/messages if full of messages like: Nov 20 15:06:34 degon /kernel: rl0: no memory for tx list Nov 20 15:06:49 degon last message repeated 31 times i've ran indexer with and withour threads support. Have anyone problem like this? -- WBR Dmitry Guyvoronsky [EMAIL PROTECTED] [Webstore System Administrator] __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
Re: UdmSearch: How do I keep this thing under control? :D
Roy Nasser wrote: about the Realm with .br, will itdeletetheinapropriatelinks from the DB? is there a way todo thus? itwilldoit. If no more "Realm" and "Server" commands are given. -Mensagem original- De: Alexander Barkov Para: Roy Nasser Cc: [EMAIL PROTECTED] Enviada em: 20/11/2000 09:38 Assunto: Re: UdmSearch: How do I keep this thing under control? :D Hi! Roy Nasser wrote: Hi! I have finally been able to install mnoGoSearch 3.1.8, with crc-multi, and, MySQL 3.23.27... We run a Meta-search engine, so capturing links is very easy (We have over 10,000 searches /day, and have about 20,000 unique addresses being added by a "redirector" which we have adding all URLs that are lciked to our database...) I would, therefore, like to know how we can restrict some things such as: Dont spider at all, Use "Follow no" in your indexer.conf spider only up to 2 subpages, or something Subpage number is not implemented, you may find "MaxHops" to be usefull though. I would also like to know what tag would be put in indexer to index only sites that end in .br, basically XXX.X.br/X Use "Realm http://.*\.br/.*" (this appeared in 3.1.x and do not work in 3.0.x) Finally, is there any work to make mnoGoSearch a high-precision engine? I have heard some people comment that it would be of no use to have a "I'm feeling lucky" button (that takes ytou straight to the 1st link), so I ask myself, is it possible to make it high precision, kind of like Google, with an algorithm that ranks by 1: clicks and 2: referrals (Basically the more pages that link to one page, give it more points, and pages which have more points, grant extra points to their referring pages- basically what google uses) We consider how to implement this now. Propably something will appear in next release. __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
Re: UdmSearch: Mnogosearch-perl: $search-work_time in mod_perl ...
Hi, The Hermit Hacker wrote: what does it represent? I've set my restop to be: !--restop-- Search results: small$W/smallHR Displaying documents $f-$l of total B$t/B found. ($SearchTime secs) !--/restop-- and the resutls are coming out as: Displaying documents 1-10 of total 458 found. (1705334.945 secs) Have you the same results without modperl or with search.cgi ? What is your mnogosearch / mnogosearch-perl version ? -- Dubun Guillaume [EMAIL PROTECTED] __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
Re: UdmSearch: Mnogosearch-perl: $search-work_time in mod_perl ...
On Tue, 21 Nov 2000, Dubun Guillaume wrote: Hi, The Hermit Hacker wrote: what does it represent? I've set my restop to be: !--restop-- Search results: small$W/smallHR Displaying documents $f-$l of total B$t/B found. ($SearchTime secs) !--/restop-- and the resutls are coming out as: Displaying documents 1-10 of total 458 found. (1705334.945 secs) Have you the same results without modperl or with search.cgi ? What is your mnogosearch / mnogosearch-perl version ? latest version ... and the number of secs keeps increasing, as if its time since the modperl script was loaded: Displaying documents 1-10 of total 512 found. (1733825.195 secs) ~7hrs since the last time I tested, which is about right: 1733825 - 1705334 28491 28491 / 60 474 474 / 60 7 __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
UdmSearch: Limitations?
Are there any inherent limitations on how long the Server path list can get? Would the indexer work with, say, a 2 MB list of URLs to index, or would it choke? Z __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
Re: UdmSearch: Limitations?
Hi! Tuesday, November 21, 2000, 5:24:56 PM, you wrote: ZP Are there any inherent limitations on how long the Server path ZP list can get? Would the indexer work with, say, a 2 MB list of ZP URLs to index, or would it choke? It should work. If it coredumps or smth else - please report to us. -- Regards, Sergey aka gluke. __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
UdmSearch: Adapting UDM Search results into other PHP Pages?
Hi, I would like to iuse the PHP-interface to mbed results from MNOGOSearch results into a php page, is this possible? Has anyone done this easily? Is there a way to Class-it or somthing? Has anyone though about making it a class? Thanks! Roy __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
UdmSearch: One more question ...
Hello Gary ! Thanks for your answer. Now it raises a new question. Suppose that I have installed pdftotext, deroff and catdoc. If I want to index a PDF file, for instance, I must first transform it using 'pdftotext' and then indexing this new file or directly 'indexer' carry on this goal ? ... I mean, call automatically the 'pdftotext' parser and index the file ?. If so, it does the samething with deroff and catdoc ? If this task is not currently done by 'indexer' it would be possible to automate this process ? Thanks for your help ... Arturo Of course, for that to work ,you need gs in your path, and pdftotext and deroff and catdoc all available... gs, you work it out [took me a whole bloody day to compile it, and it's so skanky I can't even copy it onto another of my systems...] deroff is avbailable in /usr/bin on solaris by default as far as I know catdoc I got from http://www.htdig.org/contrib/, along with xpdf [which contains pdftotext] Although I've updated to a more recent version of xpdf that is on that page, IIRC Gary (-; -Original Message- From: Arturo [SMTP:[EMAIL PROTECTED]] Sent: Friday, November 17, 2000 12:53 AM To: [EMAIL PROTECTED] Subject:UdmSearch: a little doubt ... Hello there, It says in the features of mnoGoSearch that Udmsearch has external parsers to support other type file indexing. If I want to index a PDF,PS, or DOC file how can I do it ? Thanks for your help. Regards Ing. Arturo Pulido __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED] __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
Re: UdmSearch: One more question ...
On Wed, 22 Nov 2000, Arturo wrote: Hello Gary ! Thanks for your answer. Now it raises a new question. Suppose that I have installed pdftotext, deroff and catdoc. If I want to index a PDF file, for instance, I must first transform it using 'pdftotext' and then indexing this new file or directly 'indexer' carry on this goal ? ... I mean, call automatically the 'pdftotext' parser and index the file ?. If so, it does the samething with deroff and catdoc ? If this task is not currently done by 'indexer' it would be possible to automate this process ? Thanks for your help ... Arturo You need to add the correct information to the configuration file in the Mime section; the sample config file has instructions. Then the text will be automagically extracted and fed to indexer. -- David Robley| WEBMASTER Mail List Admin RESEARCH CENTRE FOR INJURY STUDIES | http://www.nisu.flinders.edu.au/ AusEinet| http://auseinet.flinders.edu.au/ Flinders University, ADELAIDE, SOUTH AUSTRALIA __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
UdmSearch: Webboard: No 'Server' command for url... deleted.
Author: ken Email: [EMAIL PROTECTED] Message: What does "No 'Server' command for url... deleted." means when I run Indexer. Reply: http://search.mnogo.ru/board/message.php?id=778 __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
UdmSearch: Webboard: Installation question
Author: Eugene Chiu Email: [EMAIL PROTECTED] Message: I installed mnogosearch-3.1.8 with mysql 3.22.30 when I try to run the indexer I get the following error. Indexer[13097]: [1] Error: '#1054: Unknown column 'msg_id' in 'field list'' Any idea?? I have run all create scripts for mysql... Am i missing something? Reply: http://search.mnogo.ru/board/message.php?id=779 __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
UdmSearch: Webboard: Installation question
Author: Matthew Sullivan Email: [EMAIL PROTECTED] Message: I installed mnogosearch-3.1.8 with mysql 3.22.30 when I try to run the indexer I get the following error. Indexer[13097]: [1] Error: '#1054: Unknown column 'msg_id' in 'field list'' Any idea?? I have run all create scripts for mysql... Am i missing something? To use news-extensions you need to use the news-extensions create script. You could recompile without the news-extensions if they are not nessesary. Reply: http://search.mnogo.ru/board/message.php?id=781 __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
Re: UdmSearch: Adapting UDM Search results into other PHP Pages?
Hi! Tuesday, November 21, 2000, 10:49:39 PM, you wrote: RN I would like to iuse the PHP-interface to mbed results from MNOGOSearch RN results into a php page, is this possible? Has anyone done this easily? Yes. This can be done easily by using ? include('search.php'); ? in your php page. -- Regards, Sergey aka gluke. __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]
UdmSearch: Webboard: Setting tags or categories (based on URL?)
Author: gluke Email: [EMAIL PROTECTED] Message: Please read categories.txt at the doc directory. Shortly you should use Catgory indexer.conf command. Reply: http://search.mnogo.ru/board/message.php?id=783 __ If you want to unsubscribe send "unsubscribe udmsearch" to [EMAIL PROTECTED]