Amit,

You might want to take a look at LuSql[1] and see if it may be
appropriate for the issues you have.

thanks,

Glen

[1]http://lab.cisti-icist.nrc-cnrc.gc.ca/cistilabswiki/index.php/LuSql

2009/4/27 Amit Nithian <anith...@gmail.com>:
> All,
> I have a few questions regarding the data import handler. We have some
> pretty gnarly SQL queries to load our indices and our current loader
> implementation is extremely fragile. I am looking to migrate over to the
> DIH; however, I am looking to use SolrJ + EmbeddedSolr + some custom stuff
> to remotely load the indices so that my index loader and main search engine
> are separated.
> Currently, unless I am missing something, the data gathering from the entity
> and the data processing (i.e. conversion to a Solr Document) is done
> sequentially and I was looking to make this execute in parallel so that I
> can have multiple threads processing different parts of the resultset and
> loading documents into Solr. Secondly, I need to create temporary tables to
> store results of a few queries and use them later for inner joins was
> wondering how to best go about this?
>
> I am thinking to add support in DIH for the following:
> 1) Temporary tables (maybe call it temporary entities)? --Specific only to
> SQL though unless it can be generalized to other sources.
> 2) Parallel support
>  - Including some mechanism to get the number of records (whether it be
> count or the MAX(custom_id)-MIN(custom_id))
> 3) Support in DIH or Solr to post documents to a remote index (i.e. create a
> new UpdateHandler instead of DirectUpdateHandler2).
>
> If any of these exist or anyone else is working on this (OR you have better
> suggestions), please let me know.
>
> Thanks!
> Amit
>



-- 

-

Reply via email to