[
https://issues.apache.org/jira/browse/SOLR-1003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12670093#action_12670093
]
Fergus McMenemie commented on SOLR-1003:
----------------------------------------
What is the difference between the HTMLStripTransformer and what is proposed
here? Surely both would return:-
"This text is bold and this text is underlined!"
> XPathEntityprocessor must allow slurping all text from a given xml node and
> its children
> ----------------------------------------------------------------------------------------
>
> Key: SOLR-1003
> URL: https://issues.apache.org/jira/browse/SOLR-1003
> Project: Solr
> Issue Type: New Feature
> Components: contrib - DataImportHandler
> Affects Versions: 1.4
> Reporter: Noble Paul
> Priority: Minor
> Fix For: 1.4
>
> Attachments: SOLR-1003.patch
>
>
> take an example:
> {code:xml}
> <xhtml:p>This text is
> <xhtml:b>bold</xhtml:b> and this text is
> <xhtml:u>underlined</xhtml:u>!
> </xhtml:p>
> {code}
> It may be useful to get all the text from all the tags in <xhtml: p> ignoring
> the tag names .
> the configuration of the field may look like
> {code:xml}
> <field column="para" xpath="/p" flatten="true"/>
> {code}
>
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.