It tries to process the first server, fails and stops
with no error message :

21:16:51 search ~/etc $echo YES | indexer -C ; indexer -v5
You are going to delete database 'search' content
Are you sure?(YES/no)Deleting...Done
Indexer[728]: indexer from mnogosearch-3.1.12/MySQL started with
'/home/search/etc/indexer.conf'
Indexer[730]: [1] Load stopword table 'stopword'
Indexer[730]: [1] http://onesite.org/robots.txt
Indexer[730]: [1] Server 'http://onesite.org/'
Indexer[730]: [1] Allow by default
Indexer[730]: [1] HTTP/1.1 404 Not Found
Indexer[730]: [1] Date: Fri, 27 Apr 2001 20:11:50 GMT
Indexer[730]: [1] Server: Apache/1.3.4 (Unix) FrontPage/4.0.4.3
Indexer[730]: [1] Connection: close
Indexer[730]: [1] text/html
Indexer[730]: [1] HTTP/1.1 404 Not Found text/html 319
Indexer[730]: [1] Deleting URL
Indexer[730]: [1] Done (0 seconds)

This fails whatever the first line is.
Compiling with or without linux-pthreads doesn't help, robots yes/no
neither.

Linux Debian stable
Kernel 2.2.17
gcc 2.95.2
glibc 2.1
mnoGoSearch 3.1.12
MySQL 3.22.32

Any idea ?

Alain


============= file indexer.conf

DBAddr          mysql://search:xxxxxxxxxxxxx@localhost/search/
DBMode crc-multi

Affix en /home/search/src/dict/en/english.aff
Spell en /home/search/src/dict/en/american.med+

MaxDocSize 5000000

StopwordTable stopword

HTTPHeader User-Agent: Dip Search engine at
http://onesite.org/dipsearch/search.php3
HTTPHeader Accept-Language: en
HTTPHeader From: [EMAIL PROTECTED]

Period 604800
MaxNetErrors 4
ReadTimeOut 30

Robots yes

Follow path

Disallow /cgi-bin/ \.cgi /nph \?
Disallow \.b$    \.sh$   \.md5$  \.rpm$
Disallow \.arj$  \.tar$  \.zip$  \.tgz$  \.gz$ \.z$  \.bz2$ \.r[0-9][0-9]$
\.a[0-9][0-9]$
Disallow \.lha$  \.lzh$  \.tar\.Z$  \.rar$  \.zoo$ \.ha$
Disallow \.gif$  \.jpg$  \.jpeg$ \.bmp$  \.tiff$ \.tif$ \.xpm$ \.xbm$ \.pcx$
Disallow \.vdo$  \.mpeg$ \.mpe$  \.mpg$  \.avi$  \.movie$ \.mov$  \.dat$
Disallow \.mid$  \.mp3$  \.rm$   \.ram$  \.wav$  \.aiff$ \.ra$
Disallow \.vrml$ \.wrl$  \.png$
Disallow \.exe$  \.com$  \.cab$  \.dll$  \.bin$  \.class$ \.ex_$
Disallow \.tex$  \.texi$ \.xls$  \.doc$  \.texinfo$
Disallow \.rtf$  \.pdf$  \.cdf$  \.ps$
Disallow \.ai$   \.eps$  \.ppt$  \.hqx$
Disallow \.cpt$  \.bms$  \.oda$  \.tcl$
Disallow \.o$ \.a$ \.la$ \.so$ \.so\.[0-9]$
Disallow \.pat$ \.pm$ \.m4$ \.am$
Disallow \?D=A$ \?D=A$ \?D=D$ \?M=A$ \?M=D$ \?N=A$ \?N=D$ \?S=A$ \?S=D$
Disallow /[.]{1,2} /\%2e /\%2f
Disallow [^:]//

AddType text/plain \.pl$ \.js$ \.txt$ \.h$ \.c$ \.pm$ \.e$
AddType text/html \.html$ \.htm$
AddType image/x-xpixmap \.xpm$
AddType image/x-xbitmap \.xbm$
AddType image/gif \.gif$
AddType application/unknown .*

Mime application/msword      text/plain;cp1251      "catdoc $1"
Mime text/x-postscript   text/plain  "ps2ascii"
Mime "application/pdf; charset=iso-8859-1"  "text/plain"
"pdftotext $1"

Include /home/search/etc/indexer_servers.conf

============= end of file indexer.conf

============= file indexer_servers.conf

Server  http://onesite.org/
Server  http://darkwing.uoregon.edu/~epederso/gd/gdhome.html
Server  http://dog.tcr.com/~prem/dip.html
Server  http://frontpage.lightspeed.net/mharvath/
Server  http://home.earthlink.net/~hensley/dip.htm
Server  http://home.pacbell.net/andyhre/main.html
Server  http://members.tripod.com/~masseyd/vermont.html
Server  http://members.xoom.com/stab_at_idg/
Server
http://msnhomepages.talkcity.com/RedmondAve/paul_martinsen/WarRoom.html
Server  http://ogham.ucc.ie/~pascal/diplomacy.html
Server  http://ourworld.compuserve.com/homepages/DiplomacyWorld/
Server  http://ourworld.compuserve.com/homepages/emeric/diplomac.htm
Server  http://pages.prodigy.net/sdk/diplod.htm
Server  http://reges.net/index.htm
Server  http://starship.python.net/crew/manus/dpjudge/
Server  http://starship.python.net/crew/manus/dpjudge/payola/
Server  http://starship.python.net/crew/manus/dpjudge/xtalball/
Server  http://thingy.apana.org.au/~ozdip/
Server  http://village.fortunecity.com/priscilla/853/postmo.htm
Server  http://www.bfree.on.ca/cdo/home.htm
Server  http://www.cgocable.net/~dipclans/clans.html
Server  http://www.concentric.net/~Bethtim/tordex.htm
Server  http://www.dcoc.homepage.com/
Server  http://www.diplomacy-archive.com/
Server  http://www.diplomacy-online.com/
Server  http://www.diplomacy.org.uk/
Server  http://www.diplomacyworld.org/
Server  http://www.ellought.demon.co.uk/
Server  http://www.fiendishgames.demon.co.uk/index.htm
Server  http://www.fiendishgames.demon.co.uk/words/mfg/mfgtoc.htm
Server  http://www.fortunecity.com/boozers/brewerytap/1/
Server  http://www.ftech.net/~monark/dip/index.hts
Server  http://www.gate.net/~lancast/diplomacy/
Server  http://www.geocities.com/Pipeline/Ramp/6217/clan.html
Server  http://www.geocities.com/TimesSquare/9580/
Server  http://www.geocities.com/TimesSquare/Adventure/1217/clan.html
Server  http://www.geocities.com/TimesSquare/Battlefield/8761/
Server  http://www.geocities.com/TimesSquare/Corner/1436/
Server  http://www.geocities.com/TimesSquare/Dome/8542/
Server  http://www.geocities.com/bpdiplo/
Server  http://www.geocities.com/dipchief44/
Server  http://www.geocities.com/diplomacy_world/
Server  http://www.geocities.com/~diplomacyden/
Server  http://www.gooner.redhotant.com/
Server  http://www.iguazu.nl/
Server  http://www.lancedal.demon.co.uk/dip2000/
Server  http://www.mewlinghill.com/war_room.html
Server  http://www.modernhof.webprovider.com/
Server  http://www.neocrypto.com/armada/
Server  http://www.net-gate.com/~pdkenny/
Server  http://www.oxford.net/~ravgames/dipmaps/dipmaps.htm
Server  http://www.phoenixat.com/~magicfan/dindex.htm
Server  http://www.redscape.com/
Server  http://www.spoonland.com/other/dip/
Server  http://www.stud.uni-bayreuth.de/~a0011/dip/ai/index.html
Server  http://www.svart.com/isaksson-dip/
Server  http://www.thekleimans.com/diplomacy/
Server  http://www.tkblack.com/Diplomacy/
Server  http://www.users.bigpond.com/kennedy4/
Server  http://www.users.globalnet.co.uk/~gduke/Diplomacy/
Server  http://www.users.waitrose.com/~kelletts/fes/
Server  http://www.wam.umd.edu/~hturner/Wu/SCD.html

Server  http://devel.diplom.org/DipPouch/
Disallow        http://devel.diplom.org/DipPouch/openings/
Server http://devel.diplom.org/DipPouch/Online/Openings/textversion/

Server  ftp://devel.diplom.org/pub/diplomacy
Server  ftp://ftp.sentex.net/usr/nick/
Server  ftp://ftp.ugcs.caltech.edu/pub/diplomacy/

MaxHops 0
Server  http://www.diplomacy.no/welcome.html
Server  http://www.diplomacy.no/edc4.html

============= end of file indexer_servers.conf


___________________________________________
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]

Reply via email to