[ 
https://issues.apache.org/jira/browse/SOLR-7383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15936499#comment-15936499
 ] 

Varun Thacker commented on SOLR-7383:
-------------------------------------

bq. What would you expect in that file? The core name is by default the same as 
directory name. 

Yeah i wasn't sure if that logic still holds true . Like when we start 
{{bin/solr -e techproducts}} the core.properties files writes out the core 
name. I used to think that today that's the only way solr discovers core names 
but I am probably wrong. Anyways not an issue here.

bq. If we switch to eDisMax to search the original fields, then the string 
fields such as author will not be easily searchable and/or will require a 
secondary copy into a text field to be searched properly.

Fair point. Let's stick to the current approach only then. The edismax idea 
will bring more complexity than good here.

Can we also update the README under example-DIH ? More specifically this section

{code}
See also README.txt in the solr subdirectory, and check
http://wiki.apache.org/solr/DataImportHandler for detailed
usage guide and tutorial.
{code}

I'm not sure what README it's referring to ? And instead of the wiki maybe 
point users to this : 
https://cwiki.apache.org/confluence/display/solr/Uploading+Structured+Data+Store+Data+with+the+Data+Import+Handler
 ?



> DIH: rewrite XPathEntityProcessor/RSS example as the smallest good demo 
> possible
> --------------------------------------------------------------------------------
>
>                 Key: SOLR-7383
>                 URL: https://issues.apache.org/jira/browse/SOLR-7383
>             Project: Solr
>          Issue Type: Bug
>          Components: contrib - DataImportHandler
>    Affects Versions: 5.0, 6.0
>            Reporter: Upayavira
>            Assignee: Alexandre Rafalovitch
>            Priority: Minor
>         Attachments: atom_20170315.tgz, rss-data-config.xml
>
>
> The DIH example (solr/example/example-DIH/solr/rss/conf/rss-data-config.xml) 
> is broken again. See associated issues.
> Below is a config that should work.
> This is caused by Slashdot seemingly oscillating between RDF/RSS and pure 
> RSS. Perhaps we should depend upon something more static, rather than an 
> external service that is free to change as it desires.
> <dataConfig>
>     <dataSource type="URLDataSource" />
>     <document>
>         <entity name="slashdot"
>                 pk="link"
>                 url="http://rss.slashdot.org/Slashdot/slashdot";
>                 processor="XPathEntityProcessor"
>                 forEach="/RDF/item"
>                 transformer="DateFormatTransformer">
>                               
>             <field column="source" xpath="/RDF/channel/title" 
> commonField="true" />
>             <field column="source-link" xpath="/RDF/channel/link" 
> commonField="true" />
>             <field column="subject" xpath="/RDF/channel/subject" 
> commonField="true" />
>                       
>             <field column="title" xpath="/RDF/item/title" />
>             <field column="link" xpath="/RDF/item/link" />
>             <field column="description" xpath="/RDF/item/description" />
>             <field column="creator" xpath="/RDF/item/creator" />
>             <field column="item-subject" xpath="/RDF/item/subject" />
>             <field column="date" xpath="/RDF/item/date" 
> dateTimeFormat="yyyy-MM-dd'T'HH:mm:ss" />
>             <field column="slash-department" xpath="/RDF/item/department" />
>             <field column="slash-section" xpath="/RDF/item/section" />
>             <field column="slash-comments" xpath="/RDF/item/comments" />
>         </entity>
>     </document>
> </dataConfig>



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to