That sounds like a perfect match for 
http://wiki.apache.org/solr/DataImportHandler#CachedSqlEntityProcessor :)

On Thursday, May 16, 2013 at 6:01 PM, O. Olson wrote:

> I am using the DataImportHandler to Query a SQL Server and populate Solr.
> Unfortunately, SQL does not have an understanding of hierarchical
> relationships, and hence I use Table Joins. The following is an outline of
> my table structure:  
>  
>  
> PROD_TABLE
> -> SKU (Primary Key)
> -> Title (varchar)
> -> Descr (varchar)
>  
> CAT_TABLE
> -> SKU (Foreign Key)
> -> CategoryLevel (int i.e. 1, 2, 3 …)
> -> CategoryName (varchar)
>  
> I specify the SQL Query in the db-data-config.xml file – a snippet of which
> looks like:  
>  
> <dataConfig>
> <dataSource driver="com.microsoft.sqlserver.jdbc.SQLServerDriver"
> url="jdbc:sqlserver://localhost\...."/>
> <document>
> <entity name="Product"  
> query="SELECT SKU, Title, Descr FROM PROD_TABLE">
> <field column="SKU" name="SKU" />
> <field column="Title" name="Title" />
> <field column="Descr" name="Descr" />
>  
> <entity name="Cat1"  
> query="SELECT CategoryName from CAT_TABLE where
> SKU='${Product.SKU}' AND CategoryLevel=1">
> <field column="CategoryName" name="Category1" />  
> </entity>
> <entity name="Cat2"  
> query="SELECT CategoryName from CAT_TABLE where
> SKU='${Product.SKU}' AND CategoryLevel=2">
> <field column="CategoryName" name="Category2" />  
> </entity>
> <entity name="Cat3"  
> query="SELECT CategoryName from CAT_TABLE where
> SKU='${Product.SKU}' AND CategoryLevel=3">
> <field column="CategoryName" name="Category3" />  
> </entity>
>  
> </entity>
> </document>
> </dataConfig>
>  
> It seems like the DataImportHandler handler sends out three or four queries
> for each Product. This results in a very slow import. Is there any way to
> speed this up? I would not mind an intermediate step of first extracting SQL
> and then putting it into Solr.
>  
> Thank you for all your help.  
> O. O.
>  
>  
>  
>  
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Speed-up-import-of-Hierarchical-Data-tp4063924.html
> Sent from the Solr - User mailing list archive at Nabble.com 
> (http://Nabble.com).
>  
>  


Reply via email to