Hi,

You have two options:

1. Don't crawl/index URLs longer than X characters. You can edit this
value in nutch-site.xml.
2. Don't display the URL in the JSP pages. Modify the JSP pages; I
think you can just comment out the part that displays the URL.
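
To make the two options concrete, here is a minimal sketch in plain Java. It is not Nutch's own API: the class UrlLengthFilter, its methods and the limit are hypothetical names used only for illustration. accept() shows the length check behind option 1, and shortenForDisplay() shows a milder variant of option 2 that trims the URL for display instead of hiding it completely.

```java
/**
 * Illustrative helper only; not part of Nutch.
 * The class name, method names and limit are assumptions for this sketch.
 */
public class UrlLengthFilter {

    // The "X" from option 1: the longest URL we are willing to keep or show.
    private final int maxLength;

    public UrlLengthFilter(int maxLength) {
        if (maxLength < 1) {
            throw new IllegalArgumentException("maxLength must be at least 1");
        }
        this.maxLength = maxLength;
    }

    /** Option 1: returns true only if the URL is short enough to crawl/index. */
    public boolean accept(String url) {
        return url != null && url.length() <= maxLength;
    }

    /**
     * Variant of option 2: returns the URL unchanged if it fits, otherwise a
     * truncated copy ending in "..." that is exactly maxLength characters long.
     * Intended for display text only, never for the actual link target.
     */
    public String shortenForDisplay(String url) {
        if (url == null || url.length() <= maxLength) {
            return url;
        }
        if (maxLength <= 3) {
            // No room for the ellipsis, so just cut the string.
            return url.substring(0, maxLength);
        }
        return url.substring(0, maxLength - 3) + "...";
    }
}
```

In a JSP page the shortened string would only replace the visible text; the href should still use the full URL so the link keeps working.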

Regards
raj

On 4/14/07, Paul Liddelow <[EMAIL PROTECTED]> wrote:
> Hi
> In my results there are a few that have really long URLs that go
> right off the page. Here is an example:
>
>
>
> Search Results
> ... of 2006) 3. Interpretation Anti-Discrimination Act 1998 (No. 46 of ...
> http://www.thelaw.tas.gov.au/results/index.w3p;actT=;amActT=;amsrT=;docno=;docyear=;domain=;eIndex=10;lastSearch=;pointInTime=;rta=;rti=44%2B%2B2003%2BAT%40EN%2B20070407000000;sIndex=1;sc1=;sessional=;sortBy=;srT=;ss=;sub=;title=Relationships%20Act%202003;tx1=;type=;wh1=
>
>
> Does anybody know why this might occur and how to fix it?
>
> Cheers
> Paul
>

_______________________________________________
Nutch-general mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-general