Re: Basic Solr Configurations and best practice

2010-11-30 Thread Lance Norskog
Solr 4? Do you mean the Solr 'trunk' source, or the Solr 1.4.1 release? The 1.4.1 release does not have the TikaEntityProcessor, only the /extract code. The Solr 3.x branch and the trunk both have the TikaEP. I use the 3.x branch; the TikaEP there has a few problems, but they can be hacked around.

Re: Basic Solr Configurations and best practice

2010-11-28 Thread Darx Oman
Thanks, Alexey. I downloaded Solr 4 and implemented the TikaEntityProcessor. It worked fine with Tika 0.6, but it did not work with Tika 0.7 or the Tika 0.8 SNAPSHOT.
On Sat, Nov 27, 2010 at 4:05 AM, Alexey Serba ase...@gmail.com wrote:
1- How to combine data from DIH and content extracted from file

Re: Basic Solr Configurations and best practice

2010-11-26 Thread Alexey Serba
1- How to combine data from DIH and content extracted from file system document into one document in the index?
http://wiki.apache.org/solr/TikaEntityProcessor
You can have one SQL entity that retrieves metadata from the database and another nested entity that parses the binary file into additional
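
For readers following along, here is a rough sketch of what such a nested DIH setup might look like in data-config.xml. It is only an illustration under stated assumptions, not the configuration used in this thread: the JDBC driver, URL, credentials, the `documents` table, its `id`, `title`, and `file_path` columns, and the Solr field names `id`, `title`, and `content` are all hypothetical placeholders. The column case in `${meta.file_path}` may also need adjusting to whatever the JDBC driver returns.

```xml
<dataConfig>
  <!-- Hypothetical database connection; replace driver, url, user, password with real values -->
  <dataSource name="db" type="JdbcDataSource"
              driver="org.example.Driver"
              url="jdbc:example://localhost/docs"
              user="solr" password="secret"/>
  <!-- Binary data source that TikaEntityProcessor reads files from -->
  <dataSource name="bin" type="BinFileDataSource"/>

  <document>
    <!-- Outer entity: one row of metadata per document (table and columns are assumed) -->
    <entity name="meta" dataSource="db"
            query="SELECT id, title, file_path FROM documents">
      <field column="id" name="id"/>
      <field column="title" name="title"/>

      <!-- Nested entity: parses the file referenced by the current row and
           adds the extracted text to the same Solr document -->
      <entity name="content" processor="TikaEntityProcessor"
              dataSource="bin" url="${meta.file_path}" format="text">
        <field column="text" name="content"/>
      </entity>
    </entity>
  </document>
</dataConfig>
```

Because the Tika entity is nested inside the SQL entity, the database fields and the extracted text end up in one indexed document per database row.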

Basic Solr Configurations and best practice

2010-11-23 Thread Darx Oman
Hi guys, I'm kind of new to Solr and I'm wondering how to configure Solr to best fulfill my requirements. The requirements are as follows: I have 2 data sources: a database and file system documents. Every document in the file system has related information stored in the database. Both the file