Re: Long URL's in results

2007-04-15 Thread Paul Liddelow
Hi Neal Thanks for that it worked a treat. Cheers Paul On 4/15/07, Neal Whitley [EMAIL PROTECTED] wrote: I use the following code right before I display the url in search.jsp: From above... % String url2 = detail.getValue(url); % % if (url2.length() 70) url2 =

Re: Long URL's in results

2007-04-15 Thread Paul Liddelow
. Don't display URL in the JSP pages - modify it the jsp pages.. i think you can just comment it out.. i.e. displaying url. Regards raj On 4/14/07, Paul Liddelow [EMAIL PROTECTED] wrote: Hi In my results there are a few that have really long URL's that go right off the page. Here is an example

Index compression

2007-04-15 Thread Paul Liddelow
Hi I'm just wondering about the index that Nutch creates and whether it is compressed in any way. I have checked through all the mailing list entries and can't find anything about compression. I found something on Sami Siren's blog that mentioned it but it didn't really answer my question. My

Long URL's in results

2007-04-14 Thread Paul Liddelow
Hi In my results there are a few that have really long URL's that go right off the page. Here is an example: Search Results ... of 2006) 3. Interpretation Anti-Discrimination Act 1998 (No. 46 of ...

Nutch changes 0.9.txt

2007-04-06 Thread Paul Liddelow
Hi Does anybody know what this means exactly: 8. NUTCH-338 - Remove the text parser as an option for parsing PDF files in parse-plugins.xml (Chris A. Mattmann via siren) In my crawl log file it says: Error parsing:

Re: Nutch changes 0.9.txt

2007-04-06 Thread Paul Liddelow
be .. 1. parse-pdf plugin is not enabled plugin in nutch-site.xml .. you need to enable it.. 2. The pdf file is over the content limit .. you need to increase the content limit value in nutch-site.xml. 3. Something else that i don't know.. Regards On 4/6/07, Paul Liddelow [EMAIL PROTECTED] wrote

Problems crawling a URL

2007-03-19 Thread Paul Liddelow
Hi I have set Nutch up and the crawler (following the intranet tutorial) and can fetch results OK for the few URL's I have tested, but for some reason I cannot get any results returned when I try to crawl this URL:

Re: Newbie questions about followed links

2007-03-08 Thread Paul Liddelow
exactly what I was going to say! Cheers Paul On 3/8/07, Hasan Diwan [EMAIL PROTECTED] wrote: Sir: On 08/03/07, Jeroen Verhagen [EMAIL PROTECTED] wrote: Surely these links look ordinary enough to be seen and followed by nutch? Could someone please tell me what could be causing these links