[ 
https://issues.apache.org/jira/browse/SOLR-2301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12975639#action_12975639
 ] 

Karl Wright commented on SOLR-2301:
-----------------------------------

bq. I heard somewhere that Manifold or the "Connector Framework" were all going 
to be integrated in to Lucene/Solr. Any thoughts on that?

ManifoldCF has always been integrated with Solr.  The only thing missing is a 
Solr SearchComponent that enforces document security.  There is a Solr ticket 
with a patch that provides that SearchComponent for people willing to build it 
themselves.  But you don't need any of that to do a simple RSS crawl that 
indexes into Solr.

Download the current ManifoldCF 0.1 release candidate from 
http://people.apache.org/~kwright and unpack it.  Follow the README directions 
for using the quick-start example.  Set up a Solr output connection and an RSS 
repository connection, then create a job that uses both connections.  If you 
need help doing that, the online documentation should assist you.


> RSS Feed URL Breaking
> ---------------------
>
>                 Key: SOLR-2301
>                 URL: https://issues.apache.org/jira/browse/SOLR-2301
>             Project: Solr
>          Issue Type: Bug
>          Components: clients - C#
>    Affects Versions: 1.4.1, 4.0
>         Environment: Windows 7
>            Reporter: Adam Estrada
>
> This is an odd one. I am trying to index RSS feeds and have come across 
> several issues. Some are more pressing than others. Referring to SOLR-2286 ;-)
> Anyway, the CDC has a list of RSS feeds that the Solr dataimporter can't work 
> with.
> Home page:
> http://emergency.cdc.gov/rss/
> Page to Index:
> http://www2a.cdc.gov/podcasts/createrss.asp?t=r&c=19
> The console reports the following and as you can see it's because it does not 
> like the param "c". Any ideas on how to fix this?
> INFO: Processing configuration from solrconfig.xml: {config=./solr/conf/dataimporthandler/rss.xml}
> [Fatal Error] :18:63: The reference to entity "c" must end with the ';' delimiter.
> Dec 28, 2010 2:39:46 PM org.apache.solr.handler.dataimport.DataImportHandler inform
> SEVERE: Exception while loading DataImporter
> org.apache.solr.handler.dataimport.DataImportHandlerException: Exception occurred while initializing context
>         at org.apache.solr.handler.dataimport.DataImporter.loadDataConfig(DataImporter.java:193)
>         at org.apache.solr.handler.dataimport.DataImporter.<init>(DataImporter.java:100)
>         at org.apache.solr.handler.dataimport.DataImportHandler.inform(DataImportHandler.java:112)
>         at org.apache.solr.core.SolrResourceLoader.inform(SolrResourceLoader.java:539)
>         at org.apache.solr.core.SolrCore.<init>(SolrCore.java:596)
>         at org.apache.solr.core.CoreContainer.create(CoreContainer.java:660)
>         at org.apache.solr.core.CoreContainer.load(CoreContainer.java:412)
>         at org.apache.solr.core.CoreContainer.load(CoreContainer.java:294)
>         at org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:243)
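
A note on the quoted error, offered as a likely reading rather than a confirmed 
diagnosis: the message about entity "c" is what an XML parser reports when it 
meets a bare "&" followed by a name, as in "&c=19".  Assuming the feed URL was 
pasted unescaped into rss.xml, writing the ampersand as "&amp;" should let the 
file parse.  The sketch below is not Solr code; it uses Python's standard 
library and a hypothetical one-line <entity> element (the name "feed" and the 
helper functions are made up for illustration) to show the difference.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import quoteattr

# Feed URL taken from the issue description.
FEED_URL = "http://www2a.cdc.gov/podcasts/createrss.asp?t=r&c=19"


def entity_xml(quoted_url):
    """Build a hypothetical, stripped-down <entity> element.

    `quoted_url` must already include its surrounding quotes.
    """
    return "<entity name=\"feed\" url=%s/>" % quoted_url


def parses(xml_text):
    """Return True if xml_text is well-formed XML, False otherwise."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


# URL pasted as-is: the bare "&c" looks like an unterminated entity reference.
raw = entity_xml('"' + FEED_URL + '"')

# quoteattr() escapes "&" as "&amp;" and adds the surrounding quotes.
escaped = entity_xml(quoteattr(FEED_URL))

if __name__ == "__main__":
    print("raw parses:    ", parses(raw))
    print("escaped parses:", parses(escaped))
    print("escaped line:  ", escaped)
```

After parsing, the attribute value is the original URL again, so the escaping 
only affects how the URL is written in the config file, not what is fetched.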
