Re: Need help on Solr Cell usage with specific Tika parser

2010-06-15 Thread olivier sallou
Thanks,
moving it to direcxt child worked.

Olivier

2010/6/14 Chris Hostetter 

>
> : In solrconfig, in update/extract requesthandler I specified  : name="tika.config">./tika-config.xml , where tika-config.xml is in
> : conf directory (same as solrconfig).
>
> can you show us the full requestHandler decalration? ... tika.config needs
> to be a direct child of the requestHandler (not in the defaults)
>
> I also don't know if using a "local" path like that will work -- depends
> on how that file is loaded (if solr loads it, then you might want to
> remove the "./";  if solr just gives the path to tika, then you probably
> need an absolute path.
>
>
> -Hoss
>
>


Re: Need help on Solr Cell usage with specific Tika parser

2010-06-14 Thread Chris Hostetter

: In solrconfig, in update/extract requesthandler I specified ./tika-config.xml , where tika-config.xml is in
: conf directory (same as solrconfig).

can you show us the full requestHandler decalration? ... tika.config needs 
to be a direct child of the requestHandler (not in the defaults)

I also don't know if using a "local" path like that will work -- depends 
on how that file is loaded (if solr loads it, then you might want to 
remove the "./";  if solr just gives the path to tika, then you probably 
need an absolute path.


-Hoss



Re: Need help on Solr Cell usage with specific Tika parser

2010-06-14 Thread olivier sallou
Yeap, I do.
As magic is not set, this is the reason why it looks for this specific
mime-type. Unfortunatly, It seems it either do not read my specific
tika-config file or the mime-type file. But there is no error log concerning
those files... (not trying to load them?)


2010/6/14 Ken Krugler 

> Hi Olivier,
>
> Are you setting the mime type explicitly via the stream.type parameter?
>
> -- Ken
>
>
> On Jun 14, 2010, at 9:14am, olivier sallou wrote:
>
>  Hi,
>> I use Solr Cell to send specific content files. I developped a dedicated
>> Parser for specific mime types.
>> However I cannot get Solr accepting my new mime types.
>>
>> In solrconfig, in update/extract requesthandler I specified > name="tika.config">./tika-config.xml , where tika-config.xml is in
>> conf directory (same as solrconfig).
>>
>> In tika-config I added my mimetypes:
>>
>> > class="org.irisa.genouest.tools.readseq.ReadSeqParser">
>>   biosequence/document
>>   biosequence/embl
>>   biosequence/genbank
>>   
>>
>> I do not know for:
>>  
>>
>> whereas path to tika mimetypes should be absolute or relative... and even
>> if
>> this file needs to be redefined if "magic" is not used.
>>
>>
>> When I run my update/extract, I have an error that "biosequence/document"
>> does not match any known parser.
>>
>> Thanks
>>
>> Olivier
>>
>
> 
> Ken Krugler
> +1 530-210-6378
> http://bixolabs.com
> e l a s t i c   w e b   m i n i n g
>
>
>
>
>


Re: Need help on Solr Cell usage with specific Tika parser

2010-06-14 Thread Ken Krugler

Hi Olivier,

Are you setting the mime type explicitly via the stream.type parameter?

-- Ken

On Jun 14, 2010, at 9:14am, olivier sallou wrote:


Hi,
I use Solr Cell to send specific content files. I developped a  
dedicated

Parser for specific mime types.
However I cannot get Solr accepting my new mime types.

In solrconfig, in update/extract requesthandler I specified name="tika.config">./tika-config.xml , where tika-config.xml  
is in

conf directory (same as solrconfig).

In tika-config I added my mimetypes:


   biosequence/document
   biosequence/embl
   biosequence/genbank
   

I do not know for:
 

whereas path to tika mimetypes should be absolute or relative... and  
even if

this file needs to be redefined if "magic" is not used.


When I run my update/extract, I have an error that "biosequence/ 
document"

does not match any known parser.

Thanks

Olivier



Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c   w e b   m i n i n g






Need help on Solr Cell usage with specific Tika parser

2010-06-14 Thread olivier sallou
Hi,
I use Solr Cell to send specific content files. I developped a dedicated
Parser for specific mime types.
However I cannot get Solr accepting my new mime types.

In solrconfig, in update/extract requesthandler I specified ./tika-config.xml , where tika-config.xml is in
conf directory (same as solrconfig).

In tika-config I added my mimetypes:


biosequence/document
biosequence/embl
biosequence/genbank


I do not know for:
  

whereas path to tika mimetypes should be absolute or relative... and even if
this file needs to be redefined if "magic" is not used.


When I run my update/extract, I have an error that "biosequence/document"
does not match any known parser.

Thanks

Olivier