Re: TikaEntityProcessor Not Finding My Files

2015-06-16 Thread Paden
I thought it might be useful to list the logging errors as well. Here they are. There are just three. WARN FileDataSourceFileDataSource.basePath is empty. Resolving to: /home/paden/Downloads/solr-5.1.0/server/. ERRORDocBuilder Exception while processing: file document : Solr

Re: TikaEntityProcessor + multivalue field as url source

2014-01-29 Thread Bustaa
Thanks for you suggestions Ahmet. We are using the Typo3 CMS (with custom extensions / db-schemas). We are using Solarium to connect to the Solr instance. The schema is pretty simple:

Re: TikaEntityProcessor + multivalue field as url source

2014-01-29 Thread Ahmet Arslan
Hi Bustaa, Can you paste your data-config.xml?  Also, did you consider using ManifoldCF [1] to crawl/index your CMS? What CMS are you using? [1] http://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html#repositoryconnectiontypes On Wednesday, January 29, 2014 1:03 PM,

Re: TikaEntityProcessor not working?

2011-11-21 Thread kumar8anuj
Thanks for the reply Gora, I tried Googling but didn't find anything on this. I didn't try this on Tika mailing list ,I will post this to tika mailing list now. Thanks for the suggestion On Mon, Nov 21, 2011 at 9:10 PM, Gora Mohanty-3 [via Lucene] < ml-node+s472066n3525046...@n3.nabble.com>

Re: TikaEntityProcessor not working?

2011-11-21 Thread Gora Mohanty
On Mon, Nov 21, 2011 at 8:45 PM, kumar8anuj wrote: > So where can i get some information on this issue, Can you please help ? Have you tried simple things like searching Google, using the Tika site, and, failing these, asking on a Tika-specific mailing list? No offence, but you might do some basi

Re: TikaEntityProcessor not working?

2011-11-21 Thread kumar8anuj
So where can i get some information on this issue, Can you please help ? On Mon, Nov 21, 2011 at 8:17 PM, Erick Erickson [via Lucene] < ml-node+s472066n3524905...@n3.nabble.com> wrote: > Sorry, but I don't really have that info. > > Erick > > On Mon, Nov 21, 2011 at 9:37 AM, kumar8anuj <[hidden

Re: TikaEntityProcessor not working?

2011-11-21 Thread Erick Erickson
Sorry, but I don't really have that info. Erick On Mon, Nov 21, 2011 at 9:37 AM, kumar8anuj wrote: > Erick, >          Need your help on this. Waiting for resolution. Please help ... > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/TikaEntityProcessor-not-working-tp85

Re: TikaEntityProcessor not working?

2011-11-21 Thread kumar8anuj
Erick, Need your help on this. Waiting for resolution. Please help ... -- View this message in context: http://lucene.472066.n3.nabble.com/TikaEntityProcessor-not-working-tp856965p3524881.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: TikaEntityProcessor not working?

2011-11-14 Thread kumar8anuj
Earlier issue has been resolved but stuck up on something else. Can you tell me which poi jar version would work with tika.0.6. Currently I have poi-3.7.jar. Error which i am getting is this SEVERE: Exception while processing: js_logins document : SolrInputDocument[{id=id(1.0)={100984}, comp

Re: TikaEntityProcessor not working?

2011-11-08 Thread Erick Erickson
What's not clear is what you are doing to insure that the file names pulled from your database are being read (from disk? from a shared filesystem somewhere?), analyzed and sent to Solr. So, somewhere you need to actually use the file name to pass on to one of the processors that'll actually send

Re: TikaEntityProcessor not working?

2011-11-08 Thread kumar8anuj
Erick, As Brad has configured the system, I configured it in the same way and then no document indexing was happening and i was not even getting any errors in the log. I then changed my Tika to 0.6 and tried it but no success. So table columns are getting indexed but document is not. Let me know if

Re: TikaEntityProcessor not working?

2011-11-07 Thread Erick Erickson
You have to provide a lot more information about what you're doing. Are you trying to use DIH? the extracting update request handler? What do your config files look like? Please review: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Mon, Nov 7, 2011 at 8:18 AM, kumar8anuj wrote: >

Re: TikaEntityProcessor not working?

2011-11-07 Thread kumar8anuj
I tried to do the same but problem still persist and my document is not getting indexed. I am using solr 3.4.0 and it was having tika 0.8 i replaced core and parser jar with the 0.6 but document is not getting indexed. Please help and nothing is coming in my logs related to that. -- View this mes

Re: TikaEntityProcessor

2011-04-20 Thread firdous_kind86
after reading this post i hoped that i could achieve.. but couldnt find any success in almost a week http://lucene.472066.n3.nabble.com/TikaEntityProcessor-not-working-td856965.html#a867572 -- View this message in context: http://lucene.472066.n3.nabble.com/TikaEntityProcessor-tp2839188p2843084.

Re: TikaEntityProcessor

2011-04-20 Thread Andreas Kemkes
ch already uses Tika 0.8 - still re-compiled from code (no changes as far as I remember) - never tried the library replacement - don't think it's possible. Andreas From: firdous_kind86 To: solr-user@lucene.apache.org Sent: Wed, April 20, 201

Re: TikaEntityProcessor

2011-04-20 Thread firdous_kind86
hi, i asked that :) didnt get that.. what dependencies? i am using solr 1.4 and tika 0.9 i replaced tika-core 0.9 and tika-parsers 0.9 at /contrib/extraction/lib also replaced old version of dataimporthandler-extras by apache-solr-dataimporthandler-extras-3.1.0.jar but still same problem.. som

Re: TikaEntityProcessor

2011-04-19 Thread Oleg Tikhonov
I don't know, will ask him. On Tue, Apr 19, 2011 at 7:02 PM, Li wrote: > Looks like dependencies. Did you or him included the dependencies in the > solrconfig? > > Sent from my iPhone > > On Apr 19, 2011, at 8:35 AM, Oleg Tikhonov wrote: > > >> Hello everybody, > >> > >> Recently, I got a mess

Re: TikaEntityProcessor

2011-04-19 Thread Li
Looks like dependencies. Did you or him included the dependencies in the solrconfig? Sent from my iPhone On Apr 19, 2011, at 8:35 AM, Oleg Tikhonov wrote: >> Hello everybody, >> >> Recently, I got a message from a guy who was asking about >> TikaEntityProcessor. >> He uses Solr 1.4 and Tika 0

Re: TikaEntityProcessor

2011-04-19 Thread Oleg Tikhonov
> Hello everybody, > > Recently, I got a message from a guy who was asking about > TikaEntityProcessor. > He uses Solr 1.4 and Tika 0.8. > Here is a stack: > SEVERE: Full Import failed > org.apache.solr.handler. > dataimport.DataImportHandlerException: Unable to load En > tityProcessor implementati

RE: TikaEntityProcessor on Solr 1.4?

2010-06-08 Thread Tim Gilbert
[mailto:six...@sfko.com] Sent: Tuesday, June 08, 2010 3:53 PM To: solr-user@lucene.apache.org Subject: Re: TikaEntityProcessor on Solr 1.4? 2010/5/22 Noble Paul നോബിള്‍ नोब्ळ् : > just copy the dih-extras jar file from the nightly should be fine Now that I've finally got a server on

Re: TikaEntityProcessor on Solr 1.4?

2010-06-08 Thread Sixten Otto
2010/5/22 Noble Paul നോബിള്‍ नोब्ळ् : > just copy the dih-extras jar file from the nightly should be fine Now that I've finally got a server on which to attempt to set these things up... this turns out not to be a viable solution. The extras jar does contain the TikaEntityProcessor class, but NOT

Re: TikaEntityProcessor not working?

2010-06-04 Thread Brad Greenlee
You are my hero. I replaced the Tika 0.8 snapshots that were included with Solr with 0.6 and it works now. Thank you! Brad On Jun 3, 2010, at 6:22 AM, David George wrote: > > Which version of Tika do you have? There was a problem introduced somewhere > between Tika 0.6 and Tika 0.7 whereby the

Re: TikaEntityProcessor not working?

2010-06-03 Thread David George
Which version of Tika do you have? There was a problem introduced somewhere between Tika 0.6 and Tika 0.7 whereby the TikaConfig method config.getParsers() was returns an empty parser list due to class loader scope issues with Solr running under an application server. There is a fix in the Tika 0

Re: TikaEntityProcessor not working?

2010-05-31 Thread Brad Greenlee
It is a file. Only the filename is stored in the database. Brad On May 31, 2010, at 2:59 AM, Noble Paul നോബിള്‍ नो ब्ळ् wrote: BinFileDataSource will only work with file, Try FieldStreamDataSource On Mon, May 31, 2010 at 3:30 AM, Brad Greenlee wrote: Hi. I'm trying to get Solr to i

Re: TikaEntityProcessor not working?

2010-05-31 Thread Noble Paul നോബിള്‍ नोब्ळ्
BinFileDataSource will only work with file, Try FieldStreamDataSource On Mon, May 31, 2010 at 3:30 AM, Brad Greenlee wrote: > Hi. I'm trying to get Solr to index a database in which one column is a > filename of a PDF document I'd like to index. My configuration looks like > this: > > > url=

Re: TikaEntityProcessor on Solr 1.4?

2010-05-22 Thread Noble Paul നോബിള്‍ नोब्ळ्
just copy the dih-extras jar file from the nightly should be fine On Sat, May 22, 2010 at 3:12 AM, Sixten Otto wrote: > On Fri, May 21, 2010 at 5:30 PM, Chris Harris wrote: >> Actually, rather than cherry-pick just the changes from SOLR-1358 and >> SOLR-1583 what I did was to merge in all DataIm

Re: TikaEntityProcessor on Solr 1.4?

2010-05-21 Thread Sixten Otto
On Fri, May 21, 2010 at 5:30 PM, Chris Harris wrote: > Actually, rather than cherry-pick just the changes from SOLR-1358 and > SOLR-1583 what I did was to merge in all DataImportHandler-related > changes from between the 1.4 release up through Solr trunk r890679 > (inclusive). I'm not sure if that

Re: TikaEntityProcessor on Solr 1.4?

2010-05-21 Thread Chris Harris
You are right that TikaEntityProcessor has a couple of other prereqs beyond stock Solr 1.4. I think the main point is that they're relatively minor. I've merged TikaEntityProcessor (and some prereqs) and its dependencies into my Solr 1.4 tree and it compiles fine, though I haven't yet tested that T

Re: TikaEntityProcessor on Solr 1.4?

2010-05-21 Thread Sixten Otto
2010/5/19 Noble Paul നോബിള്‍ नोब्ळ् : > I guess it should work because Tika Entityprocessor does not use any > new 1.4 APIs > > On Wed, May 19, 2010 at 1:17 AM, Sixten Otto wrote: >> The TikaEntityProcessor class that enables DataImportHandler to >> process business documents was added after the

Re: TikaEntityProcessor on Solr 1.4?

2010-05-19 Thread Noble Paul നോബിള്‍ नोब्ळ्
I guess it should work because Tika Entityprocessor does not use any new 1.4 APIs On Wed, May 19, 2010 at 1:17 AM, Sixten Otto wrote: > Sorry to repeat this question, but I realized that it probably > belonged in its own thread: > > The TikaEntityProcessor class that enables DataImportHandler to

Re: TikaEntityProcessor in Solr1.4

2010-04-27 Thread Monmohan Singh
typo: Also, is there a timeframe on Solr1. release? should be Also, is there a timeframe on Solr1.5 release? On Tue, Apr 27, 2010 at 8:10 AM, monmohan wrote: > > Hi, > I would like to use TikaEntityProcessor with Solr1.4. > https://issues.apache.org/jira/browse/SOLR-1358 shows that this is added