WinXP

Franz Ruebe Mon, 06 Jun 2005 01:48:18 -0700

Hi solprovider,

> >log4j.properties must be on the CLASSPATH.  The ANT Batch file has:
> >SET CLASSPATH=.
> >so having log4j.properties in the tools/bin directory works.

> I have my log4j.properties in the CLASSPATH set by the ant.bat (In mycase

> d:\apache-lenya-1.2.3\tools\bin).

I set my CLASSPATH in Windows and la voil�, there's a lucene.log. Setting myclasspath was not necessary before, because I had an ant 1.5.1 in myPATH-Variable, which I deleted after the new installation. Does this soundsreasonable?

If it is in the correct directory, the errors go away and "lucene.log"
is created.  (Like the filename?  I invented it myself!)

The login errors went away, and I like the name very muchacho (mirrors yourgenius ;-)) .But is the location sensful? Shouldn't it be a log dir (There are still notenough dirs in lenya).

But the log is empty (creation date is crawl-try-date) and I got still abunch of Java errors. So just the old problem...


So once again some questions:

What to do when just want to use the 'normal way of searching' (I agreeafter understanding your way indexing the xml is much smarter, but one afterthe other...)?I made a crawler-live.xconf (Inhttp://lenya.apache.org/1_2_x/components/search/lucene.html is a mistake, Ithink)

"Note that there is a search.properties file in build/lenya/webapp/lenya/binthat you may have to change. crawler.xconf needs to have the followingelements:"

Shouldn't that be crawler-live.xconf as written in the ant command?

Here is my crawler-live.xconf
<?xml version="1.0"?>
<crawler>
 <user-agent>lenya</user-agent>
 <base-url href="http://localhost:8888/mypub/live/index.html"/>
 <scope-url href="http://localhost:8888/mypub/live/"/>
 <uri-list src="work/search/lucene/uris.txt"/>

<htdocs-dump-dirsrc="work/search/lucene/htdocs_dump/live/lenya.apache.org"/>

</crawler>

"Note that there is a search.properties..."
Ok, in there's just the webapp.dir=../../ ; seems to be ok...

org.apache.avalon.excalibur.io.FileUtil.catPath(FileUtil.java:509)
forgot to check the bounds before using substring.  Might work if you
wrap the code:

if(lowerbound >= 0){ newString = oldString.substring(lowerbound [,upperbound]);

}
but it would be better to read the code and figure out why the
lowerbound is -1.  Usually the -1 comes from searching with indexOf()
or lastIndexOf() for a substring that is not there .

Wouldn't that mean, that everybody would have the same probs in using thecrawler?

Does anybody else uses the crawler?
I'm not (yet??) the right guy to change java-classes.

> Is there a way to get an index without crawling?

Yes.  That was the point of the How-To.  I wanted to filter the
results based on language and  filepath, and that is easier working
from Lenya's XML than from HTML.  It seems silly to crawl the website
and put a copy on the hard drive, when all the contents are already on
the hard drive in a much better format.

How embarassing. I read the stuff several times, but obviously didn'tunderstand it. Have the red lines on your sitehttp://solprovider.com/lenya/search have already been yesterday there? Or isthis a tribute to my reading-over-without-understanding? This line in theapache how-to would spare dudes like me a lot of time.


Thanx for your time!

"Programming is like war between engineers trying to make better softwareand the universe producing idiots. So far, universe wins"


Sorry for BSE
Franz

_________________________________________________________________

Machen Sie l�stigen E-Mails ein Ende. MSN Hotmail mit Junk-Mail-Filter.http://www.msn.de/antispam/prevention/junkmailfilter Jetzt kostenlosanmelden!



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: lucene problems in lenya 1.2.2 / Servlet/ WinXP

Reply via email to