Hi solprovider,

> >log4j.properties must be on the CLASSPATH.  The ANT Batch file has:
> >SET CLASSPATH=.
> >so having log4j.properties in the tools/bin directory works.
> I have my log4j.properties in the CLASSPATH set by the ant.bat (In my case
> d:\apache-lenya-1.2.3\tools\bin).
I set my CLASSPATH in Windows and la voil�, there's a lucene.log. Setting my classpath was not necessary before, because I had an ant 1.5.1 in my PATH-Variable, which I deleted after the new installation. Does this sounds reasonable?

If it is in the correct directory, the errors go away and "lucene.log"
is created.  (Like the filename?  I invented it myself!)
The login errors went away, and I like the name very muchacho (mirrors your genius ;-)) . But is the location sensful? Shouldn't it be a log dir (There are still not enough dirs in lenya).

But the log is empty (creation date is crawl-try-date) and I got still a bunch of Java errors. So just the old problem...

So once again some questions:
What to do when just want to use the 'normal way of searching' (I agree after understanding your way indexing the xml is much smarter, but one after the other...)? I made a crawler-live.xconf (In http://lenya.apache.org/1_2_x/components/search/lucene.html is a mistake, I think)

"Note that there is a search.properties file in build/lenya/webapp/lenya/bin that you may have to change. crawler.xconf needs to have the following elements:"
Shouldn't that be crawler-live.xconf as written in the ant command?

Here is my crawler-live.xconf
<?xml version="1.0"?>
<crawler>
 <user-agent>lenya</user-agent>
 <base-url href="http://localhost:8888/mypub/live/index.html"/>
 <scope-url href="http://localhost:8888/mypub/live/"/>
 <uri-list src="work/search/lucene/uris.txt"/>
<htdocs-dump-dir src="work/search/lucene/htdocs_dump/live/lenya.apache.org"/>
</crawler>

"Note that there is a search.properties..."
Ok, in there's just the webapp.dir=../../ ; seems to be ok...

org.apache.avalon.excalibur.io.FileUtil.catPath(FileUtil.java:509)
forgot to check the bounds before using substring.  Might work if you
wrap the code:
if(lowerbound >= 0){ newString = oldString.substring(lowerbound [, upperbound]);
}
but it would be better to read the code and figure out why the
lowerbound is -1.  Usually the -1 comes from searching with indexOf()
or lastIndexOf() for a substring that is not there .

Wouldn't that mean, that everybody would have the same probs in using the crawler?
Does anybody else uses the crawler?
I'm not (yet??) the right guy to change java-classes.

> Is there a way to get an index without crawling?

Yes.  That was the point of the How-To.  I wanted to filter the
results based on language and  filepath, and that is easier working
from Lenya's XML than from HTML.  It seems silly to crawl the website
and put a copy on the hard drive, when all the contents are already on
the hard drive in a much better format.

How embarassing. I read the stuff several times, but obviously didn't understand it. Have the red lines on your site http://solprovider.com/lenya/search have already been yesterday there? Or is this a tribute to my reading-over-without-understanding? This line in the apache how-to would spare dudes like me a lot of time.

Thanx for your time!

"Programming is like war between engineers trying to make better software and the universe producing idiots. So far, universe wins"

Sorry for BSE
Franz

_________________________________________________________________
Machen Sie l�stigen E-Mails ein Ende. MSN Hotmail mit Junk-Mail-Filter. http://www.msn.de/antispam/prevention/junkmailfilter Jetzt kostenlos anmelden!


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to