[ https://issues.apache.org/jira/browse/SOLR-1973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Koji Sekiguchi updated SOLR-1973:
---------------------------------

             Priority: Minor  (was: Major)
    Affects Version/s:     (was: 3.1)
                       1.4
                       1.4.1
        Fix Version/s: 4.0
                       3.1

> Empty fields in update messages confuse DataImportHandler
> ---------------------------------------------------------
>
>                 Key: SOLR-1973
>                 URL: https://issues.apache.org/jira/browse/SOLR-1973
>             Project: Solr
>          Issue Type: Bug
>          Components: contrib - DataImportHandler
>    Affects Versions: 1.4, 1.4.1
>         Environment: CentOS 5, Java 1.6, Tomcat 6
>            Reporter: Sixten Otto
>            Assignee: Koji Sekiguchi
>            Priority: Minor
>             Fix For: 3.1, 4.0
>
>         Attachments: SOLR-1973-test.patch, SOLR-1973.patch, SOLR-1973.patch
>
>
> I seem to be running into an issue with Solr (maybe just the 
> DataImportHandler?) not liking empty field elements in the docs, and getting 
> the wrong values into the fields of the index. Here's the entity declaration 
> from data-config.xml for my isolated example:
>  <document>
>   <entity name="contentAsSolrAdd"
>     dataSource="xml"
>     processor="XPathEntityProcessor"
>     stream="true"
>     url="http://example.com/Content.xml"
>     useSolrAddSchema="true">
>   </entity>
>  </document>
> And here's the Content.xml being pulled in by the DIH:
>  <add>
>   <doc>
>     <field name="empty"></field>
>     <field name="full">Lorem Ipsum Dolor</field>
>     <field name="other">Some content is me!</field>
>   </doc>
>  </add>
> And here's the relevant portion of the output from the DIH in debug mode:
>  <lst name="document#1">
>   <str name="query">
>     http://example.com/Content.xml
>   </str>
>   <str name="time-taken">0:0:0.6</str>
>   <str>----------- row #1-------------</str>
>   <str name="other">Some content is me!</str>
>   <str name="empty">Lorem Ipsum Dolor</str>
>   <str>---------------------------------------------</str>
>  </lst>
> Notice that the field "full" doesn't appear here, but the following field 
> "empty" has the content that was there for "full". The "other" field, which 
> was non-empty, and preceded by a non-empty field, shows up correctly.
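
For comparison, here is a minimal sketch of the field-to-value mapping one
would expect from the Content.xml above. It is not DIH code: it uses Python's
standard xml.etree module, the function name expected_rows is hypothetical,
and it assumes that an empty field should simply be left out of the row (the
actual fix may handle empty fields differently).

```python
import xml.etree.ElementTree as ET

# The sample update message from the report, copied verbatim.
CONTENT_XML = """
<add>
  <doc>
    <field name="empty"></field>
    <field name="full">Lorem Ipsum Dolor</field>
    <field name="other">Some content is me!</field>
  </doc>
</add>
"""


def expected_rows(xml_text):
    """Return one {field name: text} dict per <doc> element.

    Assumption: a field with no text is omitted from the row rather
    than being stored as an empty string.
    """
    rows = []
    root = ET.fromstring(xml_text.strip())
    for doc in root.iter("doc"):
        row = {}
        for field in doc.findall("field"):
            text = field.text or ""
            if text.strip():
                # Each value stays attached to its own field name.
                row[field.get("name")] = text
        rows.append(row)
    return rows


rows = expected_rows(CONTENT_XML)
print(rows)
```

Under that assumption the row holds "full" and "other" with their own values
and has no entry for "empty", which is the opposite of the debug output quoted
above.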

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


