Vinci, Please use the mailing list to ask questions and discuss first, not JIRA. Also, please include an example of what you are describing, if you can.
Thanks, Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch ----- Original Message ---- From: Vinci (JIRA) <[EMAIL PROTECTED]> To: nutch-dev@lucene.apache.org Sent: Sunday, March 30, 2008 7:55:24 AM Subject: [jira] Created: (NUTCH-624) Better parsed text Better parsed text ------------------ Key: NUTCH-624 URL: https://issues.apache.org/jira/browse/NUTCH-624 Project: Nutch Issue Type: Improvement Reporter: Vinci I found the parsed text by default parser Neko is not easy to process - it just add a space to the end of the tag. Can neko (or other parser) change the behaviour to 1.adding tab (for inline element) 2.add a tab+newline for block level element end instead of space, so we can have a better parsed text? -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.