Hello!
it should be "ntemplates" there, not "ntempl".
I recently found that we have two variables with the
same name. The first one is static and the second is
LoadTemplate() function argument. To make code cleaner,
I renamed static variable to "ntemplates", but did not
try to compile with enabl
Hi!
No, cache mode requires low-level file operations,
bit operations, a lot of sorts. It is impossible to
implement in PHP. But we have an idea together with PHP
team to include native UdmSearch support into PHP. If
we have success, it will be possible to use any mode.
Sergey Stukov wrote:
>
Hi!
Also it may help to compile both MySQL and UdmSearch itself with
-O6 flags - high optimization level.
Shane Wegner wrote:
>
> On Tue, Oct 10, 2000 at 05:12:34PM -0700, Thomas Yengst wrote:
> >
> > I'm using mysql 3.22.32 on a FreeBSD 4.1 P-II 400 Mhz with SCSI disk to
> > index 10,000 H
"Artyom A. Tolstobrov" wrote:
>
> How can I estimate the size required for all indexer databases knowing
> the total size of documents to index?
It depends on storage mode being used. But for HTML text is usually
from 1/5 to 1/3 of original size.
--
Alexander Barkov
IZ
e the 3.1.6 splitter on data indexed by the indexer
> from 3.1.5 or 3.1.2-pre1?
>
> -- Dan Hanks
>
> On Wed, 11 Oct 2000, Alexander Barkov wrote:
>
> > BTW. 3.1.6 we've fixed two major bugs in cache mode.
> > So I affraid that evebody have to drop da
I'm expericening a fata morgana or not :-)
>
> Thanks
> Fredy
>
> BTW. There is another minor issue in search.cgi: The 2nd page is
> numbered from 1 to 20 instead 21 to 40 and so on (default
> settings).
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED]
TW. There is another minor issue in search.cgi: The 2nd page is
> numbered from 1 to 20 instead 21 to 40 and so on (default
> settings).
I can't see this on given URL. Second pages writes "21 to 40" to me.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED]
n that $if() was not work has been fixed in search.cgi
- A bug in splitter for cache mode has been fixed.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ: 7748759
__
If you want
> 1) Dumb caches that don't follow standards
> 2) Dumb versions of browsers that don't follow standards
> 3) Dumb users fiddling with language settings and putting wierd stuff
> in.
>
> The apache team is working on some workarounds for most of these, if
> possible.
s
> 2) Dumb versions of browsers that don't follow standards
> 3) Dumb users fiddling with language settings and putting wierd stuff
> in.
>
> The apache team is working on some workarounds for most of these, if
> possible. For a reasonably heavily used website you can expect an email
> about it once a week or so.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ: 7748759
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]
t... the one that goes in the
> bin/ directory of the compiled source)
>
> -Original Message-
> From: Alexander Barkov
> To: Roy Nasser
> Cc: 'Shane Wegner '; ''[EMAIL PROTECTED] ' '
> Sent: 10/10/2000 03:23
> Subject: Re: UdmSearch: Ques
should i pass a special
> parameter
> > to the indexer for it to fetch like, 5 or 10 URLs simultaneously?
>
> Well there may be gcc options which could improve your
> performance. The various -m options come to mind but from
> my work with UDMSearch, it's pretty fast and
in
> indexer.conf, however this did not work:
>
> Allow http://*.ch/
> Disallow http://*.*/
>
> or
> Disallow http://*.de/
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ:
x27;
> collect2: ld returned 1 exit status
> make[1]: *** [search.cgi] Error 1
> make[1]: Leaving directory `/usr/local/udmsearch-3.0.23/src'
> make: *** [all-recursive] Error 1
> [root@pc20 udmsearch-3.0.23]#
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED]
sd/ssi/index_left_top.html
> Indexer[11108]: [1] http://websd/ssi/main.shtml
> Indexer[11108]: [1] http://websd/ssi/
>
> Configuration of udmsearch (indexer.conf):
>
> <>
>
> --------
>Name: indexer.conf
>indexer.confType: unspecified
> On Sun, 08 Oct 2000 22:37:01 +0500, Alexander Barkov wrote:
>
> >Frederic Tschannen wrote:
> >>
> >> Hi,
> >>
> >> our 'indexer' does not follow cl
Frederic Tschannen wrote:
>
> Hi,
>
> our 'indexer' does not follow client side image map.
> Is it implemented?
>
> Our HTML code lokks like:
>
>
>
>
>
>
>
>
>
>
>
>
> Does it work? How?
Hello!
I thought that it works. Is y
Hello!
This is already fixed in 3.1.6 sources.
We'll try to make it available on Monday.
thanks for reporting!
Nefer wrote:
>
> UdmSearch version: 3.1.5
> Platform: 4x XEON 550
> OS: RedHat 6.2
> Database: MySQL
> Statistics: ~33
>
> ./splitter output:
> -
7; keyword for example ?
> Thanks for your ideas.
> I work on this for days...and still love udmsearch. I hope to be soon
> clever enough to help you on the list...
Thanks.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412)
collect2: ld returned 1 exit status
> make[1]: *** [search.cgi] Error 1
> make[1]: Leaving directory `/usr/local/udmsearch-3.0.23/src'
> make: *** [all-recursive] Error 1
> [root@pc20 udmsearch-3.0.23]#
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ: 7748759
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]
Khlutchin Sergey Yurievich wrote:
>
> Is it possibe to fetch server list from database (preferably with tags and other
>parameters as well)?
>
No. It is not supported. But we have a plan to implement this
in near future.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMA
first 4 urls then dies
> with:
>
> Error:#1100 Table "dict2" was not locked with LOCK TABLES'
>
> Is this a bonafide bug, or have I done something wrong?
>
> On an aside, which would be faster? CRC mode or MULTI mode? (since I
> cannot get CRC-MULTI work
literal.
> %s%s%s",
> ---^
> make: *** [id3.lo] Error 1
>
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ: 7748759
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]
ww.cis.tu-graz.ac.at/zid/lang/
> Phone: +43 (0) 316 / 873 - 8508 ICQ: 69372257
>
> 1.79 x 10^12 furlongs per fortnight -- it's not just a good idea, it's
> the law!
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Mario Lang wrote:
>
> Hi.
>
> I am moving over the backend things to an SGI IRIX 6.5 MIPS machine.
> When I try to compile UdmSearch, I get the following (after
> some successful files).
> cc -DHAVE_CONFIG_H -I. -I. -I../include -I../include -I/search/mysql/include -g
>-DUDM_CONF_DIR=\"/search/
Hi!
This is fixed in 3.1.6 sources.
Thanks for reporting.
Nagy Erno wrote:
>
> Has somebody any idea what may be wrong?
>
> Thanks,
>Ned
>
> ned:~$ /usr/local/udmsearch/sbin/indexer
> indexer from UdmSearch v.3.1.5-pre4/MySQL started with
> '/usr/local/udmsearch-3.1.5-pre4//etc/indexer.
rom the cmd line...
>
> It just doesnt give any results (It says 0 results found... no matter what)
>
>
> -----Mensagem original-
> De: Alexander Barkov
> Para: Roy Nasser
> Cc: '[EMAIL PROTECTED]'
> Enviada em: 3/10/2000 05:28
> Assunto: Re: Ud
SET rec_id=YYY WHERE rec_id=TTT;
where TTT is any unused rec_id, for example 0x.
This scheme has a problem of simultanious or
multi-threaded indexers thougth :-(
But it is possible to do this in external script after indexing on
small databases.
Alexander Barkov wrote:
>
> Mari
e mode, but may be implemented
easier in any of SQL modes by using this query:
UPDATE dict SET url_id=XXX WHERE url_id=YYY with some additional
code.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
IC
ssible ?
>
> --
> Kaj-Michael Lang
> [EMAIL PROTECTED]
>
> __
> If you want to unsubscribe send "unsubscribe udmsearch"
> to [EMAIL PROTECTED]
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (
0 1 Moved Permanently
>404 0 2 Not found
>-----
> Total 0118
>
> Alexander Barkov a écrit :
>
> > Try to run with "-v6". High verbose level usually gives
> > an idea of why something is going wrong.
h had this error, but remember that
> 3.1.4 did also not really index external parser mimetypes...)
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ: 7748759
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]
uot;.
> Any references to existing installations would be great... I can add more
> RAM etc. right away if needed also, I have a couple of other machines
> that could serve as http servers and just dedicate the 233 to MySQL only if
> that helps...
--
Alexander Barkov
IZHCOM,
newver yields any results...
>
> How can i fix this?
>
> http://200.194.112.66/bj/ (php interface - debug on)
> http://200.194.112.66/cgi-search/search.cgi (c interface)
>
> Thanks
> Roy
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
IL PROTECTED]/udmsearch/
> DBMode single
> LocalCharset iso-8859-1
> MaxDocSize 10048576
> Server http://s-li-1.cg13.fr/
> Allow \.htm$ \.html$ \.shtml$ \.txt$ \/$
> Disallow .*
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone
Hi!
Thanks for reporting this and sorry for late reply.
We'll fix this soon.
Nagy Erno wrote:
>
> Has somebody any idea what may be wrong?
>
> Thanks,
>Ned
>
> ned:~$ /usr/local/udmsearch/sbin/indexer
> indexer from UdmSearch v.3.1.5-pre4/MySQL started with
> '/usr/local/udmsearch-3.1.5-p
ucture is created properly.
Use MySQL structure as example.
The problem is that there is no search front-end. Authors of extended
news mode Perl front-end used MySQL and I haven't an idea of how it is
dufficalt to port to other databases.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EM
returning search results
> primarily, but not at the sacrifice of being unable to actually spider
> the sites I want to index inside of a month or so of running indexer.
> As it stands now I am ready to return to crc mode since it is
> performing better with over 100,000 records than crc/mul
t; char last_index_time[64]="";
>
> #ifdef HEIKODEBUG
>
> Tin Le
> ----
> http://tin.le.org
> Tin Le - [EMAIL PROTECTED]
> Firewall and Security Consulting
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
P
s listed.
> With next pages (np=1,2..) all O.K.
> When tried version 3.1.3 search.cgi
> all O.K.
> I need 3.1.4 indexer, because it handles IIS
> pages better.
> So no I have hybrid version with 3.1.4 indexer and
> 3.1.3 search.
> regards Kustaa Luoma
> YLE - Finland
-
08:26 GMT
> Indexer[29810]: [1] Server: Apache/1.3.12 (Unix) PHP/4.0.1pl2
> Indexer[29810]: [1] Location: /pubs/monitor15/monitor1503.html
> Indexer[29810]: [1] Connection: close
>
> There is more if you want it :-) And the outcome is the same - only 302
> entries in the URL tabl
t people. What does not work for you?
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ: 7748759
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]
the strcmp near the top of
>
> __INDLIB__ int UdmURLFile(UDM_AGENT *Indexer, int action) in indexer.c.
>
> Perhaps in the line:
>
> if(!strcmp(Indexer->Conf->url_file_name,"-"))
>
> I'm downloading kdbg right now (not being too familiar with
> gdb...ala
Try this command:
indexer -am -s302 -v6
David Robley wrote:
>
> On Thu, 28 Sep 2000, Alexander Barkov wrote:
> > Note that page which cause redirect is stored too.
> > For example if http://www/index.htm redirects to
> > http://www/index.html you have both of them inde
ot; field of HTTP-header instead.
>
> But it seems there may be no location field issued? Or should this not show
> up with a GET?
>
> I've temporarily solved the problem for some directories by changing .htm
> to .html in the index page, but I am sure my boss is going to d
enother server. We
want to test 20-50 million documents.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ: 7748759
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]
s after running splitter.
Or gzip them for backup purposes.
Total size of tree after distributing logs is 8.4 Gb. That means
8,4/42=0.2 of original volume.
SQL version is not so effitient. For example 223 Gb of original
documents on http://search.udm.net takes 78 Gb word indexes. So
the ratio is
6 unknown
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ: 7748759
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]
has anyone run across trouble doing this?
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ: 7748759
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]
aid HD.
Please note that this machine is overloaded by other running
tasks. So one may see delays sometime.
Search itself does not require much resources. Splitter does.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax:
will be implemented in near future:
---
1. Parallel indexing, both threaded version and several simultanious
indexers even on different machines.
2. Automatize database management: remove all manual operation such
as log moving etc.
3. Make it possible
You have to add nont linked pages manualy.
Using either Server command or -i key for indexer.
"Alain Tésio" wrote:
> > There\'s also other pages placed in /Archive,
> > which are NOT linked from 1st, main site in
> > any way (so can\'t be reached by following links).
>
> > - What I need --
Hello!
There is no such plans.
Roy Nasser wrote:
> Hi Everyone...
> I am new to UDMSearch, and to this mailing list so excuse me if this is not
> the usual format for e-mails to be sent...
> I would like to know if there is an addon, or at least a plan for an addon
> to make UDM Search function
re?
Hello!
This is correct. files.c is a built-in database driver which
is compiled when no SQL database selected in configure.
If you use one of supported SQL database, file sql.c is compiled
instead.
All non-static functions in both files have the same interface.
--
Alexander Barkov
IZHCOM, Izh
Hello!
UdmSearch-3.0.23 is available from our site http://search.mnogo.ru
Major memory leak in PostgreSQL driver has been fixed.
Regards.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
7 on a Sun box clone of some sort - not my system. Not sure of the RAM).
You may create new database and respider into it while old one
may work for search.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ: 7748759
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]
Hello!
Nagy, thanks for reporting this!
I've fixed bugs in 3.1.5 sources.
Nagy Erno wrote:
> Hello!
>
> If I want to compile the udmsearch version 3.1.4 with a DEC C
> compiler on a TRU64 UNIX, it made the following errors:
>
>
Did you run configure with --enable-news-extensions?
If so, it is useless without perl front-end for extended news mode.
You have to create specialized database structure which is supplied
with perl front-end.
You can find it here:
http://www.izhcom.ru/~bar/news-3.1.tar.gz
- Original Mess
The patch itself.
Alexander Barkov wrote:
>
> Hello!
>
> The attached patch againts sql.c should fix memory
> leak problem in PostgreSQL driver.
>
> I tested it with documentation supplied with Apache.
>
> Old indexer take 2.4M at startup and 6.8M before indexin
always takes 2.5M
PostgreSQL users, please give feedback!
P.S.
Patch is suitable for both 3.0.22 and 3.1.4 versions.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ: 7748759
12:25:51PM +0500, Alexander Barkov wrote:
> > > "S.S.Chang" wrote:
> > >Can someone tell me that does udmsearch can be used to index/search
> > > web pages of double bytes character, e.g., Chinese Big-5?
> > No, this is not supported.
> This is a pain,
8 in free () from /lib/libc.so.6
> #1 0x400a5aed in free () from /lib/libc.so.6
> #2 0x8050059 in UdmFreeParsers (Env=0x806c468) at parser.c:84
> #3 0x805f047 in UdmFreeEnv (Env=0x806c468) at conf.c:935
> #4 0x804a5fa in main (argc=3, argv=0xbd70) at main.c:551
--
Alexander Bark
at indexer.c:721
> 721 if(!strcmp(Indexer->Conf->url_file_name,"-"))
> (gdb) backtrace
> #0 0x804c129 in UdmURLFile (Indexer=0x0, action=4) at indexer.c:721
> #1 0x804a561 in main (argc=4, argv=0xb9b4) at main.c:425
> (gdb)
> --
> DIV
> [E
> ---
>
> This looks mad to me. Here is how I invoked configure:
>
> fzidpc73:/usr/src/udmsearch-3.1.4# ./configure --prefix=/usr/local/udm31
>--enable-linux-pthreads --disable-syslog --enable-mp3 --disable-file --disable-htdb
>--with-mysql
>
> Any ideas how c
in here people think one might actually wish to search on?
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ: 7748759
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]
this a memory leak? Or is that UdmSearch needs a very large footprint
> to run with this URL quantity?
This is memory leak. We'll try to fix it as soon as possible in 3.1.x
development thread.
> And... how can I use the multithreading capabilities of UdmSearch?
indexer -N XXX , where
English stop-list has been added.
* Danish stop-list has been added.
* Slovak stop-list has been added.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ: 7748759
__
If yo
> "S.S.Chang" wrote:
>
> Hello!
>
>Can someone tell me that does udmsearch can be used to index/search
> web pages of double bytes character, e.g., Chinese Big-5?
>
> Thank you!
Hello!
No, this is not supported.
--
Alexander Barkov
IZHCOM,
> I have several udmsearch indexes (several mysql dbs) that I would like to
search using just one interface. I would like the user to be able to do one
search and have the system scan through all of the indexes.
>
> Is there a method in place for doing this? Or do I need to go in and
heavily modif
1]
>http://www.ananova.com/feedback/feedback.html?destination=budgetphone
> Indexer[10508]: [1] http://www.ananova.com/
> Indexer[10508]: [1] http://www.ananova.com/search/search_tips.html
> Indexer[10508]: [1] http://www.ananova.com/sport/
> Indexer[10508]: [1] http://www.ana
Hi!
This is implemented in the latest 3.1.x
- Original Message -
From: Truman Leung <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Thursday, September 07, 2000 12:31 AM
Subject: UdmSearch: Categories?
> Hi,
>
> Is the development team planning to include support for categorizing li
- Original Message -
From: Shane Wegner <[EMAIL PROTECTED]>
To: Alexander Barkov <[EMAIL PROTECTED]>
Cc: <[EMAIL PROTECTED]>
Sent: Thursday, September 07, 2000 5:51 AM
Subject: Re: UdmSearch: Stopwords
> Words with both numbers and digits are indexed. People
hecks in the code. Can anyone see
> any use in indexing single characters and digit pairs?
Hello!
Sorry for late reply.
About digist and alpha/digis combinations:
You can use this in your indexer.conf:
NumberFactor 0
AlNumFactor 0
to disallow indexing of such words.
--
Alexander Bark
ble.net/~dipclans/clans.html
> Indexer[4665]: [1] http://www.concentric.net/robots.txt
> Indexer[4665]: [1] http://www.concentric.net/~Bethtim/tordex.htm
> Indexer[4665]: [1] http://www.dcoc.homepage.com/robots.txt
> Indexer[4665]: [1] http://www.dcoc.homepage.com/
> Indexer[466
- A bug that "last_index_time" field was not modified has been fixed.
- Danish stop-list has been added.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 51-20-80
ICQ: 7748759
__
I
y. (for example 2825120600.html, 2825130245.html ... )
> then, I can't have the name of all of new files before indexing and then I
> can't do command : indexer -i -u http://my.domain/x.html x is not
> known)
>
> thanks, christophe deltheil
>
--
Alexan
r[32237]: [1] 'CheckOnly\.doc$'
> Indexer[32237]: [1] Alias: 'file:/mnt/nts1/PROJECT/1997/020bf/fax
> matser.doc'
> Indexer[32237]: [1] HTTP/1.0 404 Not found
> Indexer[32237]: [1] HTTP/1.0 404 Not found ? 0
> Indexer[32237]: [1] Deleting URL
--
Alexander Bar
abel wrote:
>
> Is udmsearch search engine able to support double-byte(Chinese) keyword ?
No.
> Is udmsearch able to index and search the local disks?
Yes.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11
command to reject such
bad formed URLs until this is not fixed in the sources.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 78-70-10
ICQ: 7748759
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]
ou find in 'all' european languages.
>
Select "iso-8859-1" LocalCharset.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 78-70-10
ICQ: 7748759
__
If you want to unsub
gt; better idea.
> >
> > Yes, that's right.
> >
>
> in your case rather than adding the '-' character you will need to add
> the swedish specific characters.
That is wrong. You should use "LocalCharset" command.
--
Alexander Barkov
IZHCOM, Iz
hracters (in
> Swedish: åäöé) do not work.
> Example:
> Search word: brännskada : No results found
> Search word: nnskada : Docs 1-20 from 20 found
>
> So, why is it that the serch word is truncated at the ä ?
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED]
udmsearch for this condition??
>
You can ask somebody with the same OS and platform to compile
it for you.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 78-70-10
ICQ: 7748759
__
> On Mon, Aug 28, 2000 at 04:23:38PM -0700, Steve Quezadas wrote:
> > I am using udmsearch to crawl my website and I noticed that the indexer
program separates words with a dash ("-"). In my case, I have many web pages
with model numbers for products (MVC-FD88 for example) so udmsearch is
indexi
- Original Message -
From: xiao shibin <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Tuesday, August 29, 2000 4:39 AM
Subject: UdmSearch: pages become expired
> How does UDMSearch defines a expired page?
>
> or, in what situation, pages become expired.
After "Period" seconds.
ve but there doesn't seem to be anyone else who has
> questioned this message. Any help about why I'm getting this message
> or how to fix it would be greatly appreciated.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (
L table? It has 1.5 million rows. Or
> do I need to dump the contents of the table out to a text file?
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 78-70-10
ICQ: 7748759
__
If you wa
>
> msg6.html
> msg7.html
> msg8.html
>
> New messages are always added to the current year. So msg00012.html will go
> in the www.shadow-lands.com/2000 directory. I want to be able to get
> indexer to recognize that I added msg00012.html and
t? :) )
>
> OS: Linux.
> DBI: Oracle.
> Platform: Intel Pentiun II
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 78-70-10
ICQ: 7748759
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]
exing using Disallow commands.
2) Write a shell or perl script which will execute antiword and kill him
for example after a minute. Then use this script as a parser instead of
direct antiword call.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Pho
hint,text_escaped,title_escaped,content_type,size,
> keywords_escaped,descript_escaped,digest,lang,url_id);
> sql_query(((DB*)(Indexer->db)),qbuf);
> if(UdmDBErrorCode(Indexer->db))return(IND_ERROR);
> return(IND_OK);
>
> -
t; Ispell import options:
> -L language Two letters Language code (en, ru, de etc.)
> -A filename ispell Affix file
> -D filename ispell Dictionary file
> -ddump to stdout instead of storing to database
>
> Misc. options:
> -h,-? print this hel
adow-lands.com/sml/1999/
> Server http://www.shadow-lands.com/sml/1998/
> Server http://www.shadow-lands.com/sml/1997/
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 78-70-10
ICQ: 7748759
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]
f.so.1...done.
> #0 0x805bc61 in crc32_sz (
> buf=0x80b41fd "The Web server cannot find the file or script you asked for.
>Please check the URL to ensure that the path is correct.\r\n\r\nPlease contact
>the server's
> administrator if this problem persists. 103
t;CFLAGS" environment variable with value "-DDEBUG_SQL".
2) run "configure"
3) make
4) make install
All SQL queries will be printed to STDERR.
Or you may just open sql.c and add one line in the top of this file:
#define DEBUG_SQL 1
Regards
--
Alexander Barkov
IZHCOM, Izhevsk
;s nice :-)
> One strange thing I noticed is that mysql (version 3.22.19b) choked on the
> /*!32302 TEMPORARY */ bit in parse.inc of the php front end (it gave an
> SQL syntax error). It seems to work fine after removing it.
MySQL-3.22.19 does not support this syntax yet.
So removin
ell
2) Use search.cgi if you want ispell support.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 78-70-10
ICQ: 7748759
__
If you want to unsubscribe send "unsubscribe udmsearch"
to [EMAIL PROTECTED]
e type
> contents too, like gzip archives, or postscript files.
>
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 78-70-10
ICQ: 7748759
__
If you want to unsubscribe send "unsubscr
ork.
Bugs in robots.txt and gethostbyname() mutexes locking have been fixed.
Regards.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 78-70-10
ICQ: 7748759
__
If you want to unsubscr
ru64. Threaded SQL version is is tested
on Linux and FreeBSD only.
--
Alexander Barkov
IZHCOM, Izhevsk
email:[EMAIL PROTECTED] | http://www.izhcom.ru
Phone:+7 (3412) 51-32-11 | Fax: +7 (3412) 78-70-10
ICQ: 7748759
__
If you want to unsubscribe send "unsubscri
501 - 600 of 857 matches
Mail list logo