Re: Need help on Solr Cell usage with specific Tika parser
Thanks, moving it to direcxt child worked. Olivier 2010/6/14 Chris Hostetter > > : In solrconfig, in update/extract requesthandler I specified : name="tika.config">./tika-config.xml , where tika-config.xml is in > : conf directory (same as solrconfig). > > can you show us the full requestHandler decalration? ... tika.config needs > to be a direct child of the requestHandler (not in the defaults) > > I also don't know if using a "local" path like that will work -- depends > on how that file is loaded (if solr loads it, then you might want to > remove the "./"; if solr just gives the path to tika, then you probably > need an absolute path. > > > -Hoss > >
Re: Need help on Solr Cell usage with specific Tika parser
: In solrconfig, in update/extract requesthandler I specified ./tika-config.xml , where tika-config.xml is in : conf directory (same as solrconfig). can you show us the full requestHandler decalration? ... tika.config needs to be a direct child of the requestHandler (not in the defaults) I also don't know if using a "local" path like that will work -- depends on how that file is loaded (if solr loads it, then you might want to remove the "./"; if solr just gives the path to tika, then you probably need an absolute path. -Hoss
Re: Need help on Solr Cell usage with specific Tika parser
Yeap, I do. As magic is not set, this is the reason why it looks for this specific mime-type. Unfortunatly, It seems it either do not read my specific tika-config file or the mime-type file. But there is no error log concerning those files... (not trying to load them?) 2010/6/14 Ken Krugler > Hi Olivier, > > Are you setting the mime type explicitly via the stream.type parameter? > > -- Ken > > > On Jun 14, 2010, at 9:14am, olivier sallou wrote: > > Hi, >> I use Solr Cell to send specific content files. I developped a dedicated >> Parser for specific mime types. >> However I cannot get Solr accepting my new mime types. >> >> In solrconfig, in update/extract requesthandler I specified > name="tika.config">./tika-config.xml , where tika-config.xml is in >> conf directory (same as solrconfig). >> >> In tika-config I added my mimetypes: >> >> > class="org.irisa.genouest.tools.readseq.ReadSeqParser"> >> biosequence/document >> biosequence/embl >> biosequence/genbank >> >> >> I do not know for: >> >> >> whereas path to tika mimetypes should be absolute or relative... and even >> if >> this file needs to be redefined if "magic" is not used. >> >> >> When I run my update/extract, I have an error that "biosequence/document" >> does not match any known parser. >> >> Thanks >> >> Olivier >> > > > Ken Krugler > +1 530-210-6378 > http://bixolabs.com > e l a s t i c w e b m i n i n g > > > > >
Re: Need help on Solr Cell usage with specific Tika parser
Hi Olivier, Are you setting the mime type explicitly via the stream.type parameter? -- Ken On Jun 14, 2010, at 9:14am, olivier sallou wrote: Hi, I use Solr Cell to send specific content files. I developped a dedicated Parser for specific mime types. However I cannot get Solr accepting my new mime types. In solrconfig, in update/extract requesthandler I specified name="tika.config">./tika-config.xml , where tika-config.xml is in conf directory (same as solrconfig). In tika-config I added my mimetypes: biosequence/document biosequence/embl biosequence/genbank I do not know for: whereas path to tika mimetypes should be absolute or relative... and even if this file needs to be redefined if "magic" is not used. When I run my update/extract, I have an error that "biosequence/ document" does not match any known parser. Thanks Olivier Ken Krugler +1 530-210-6378 http://bixolabs.com e l a s t i c w e b m i n i n g
Need help on Solr Cell usage with specific Tika parser
Hi, I use Solr Cell to send specific content files. I developped a dedicated Parser for specific mime types. However I cannot get Solr accepting my new mime types. In solrconfig, in update/extract requesthandler I specified ./tika-config.xml , where tika-config.xml is in conf directory (same as solrconfig). In tika-config I added my mimetypes: biosequence/document biosequence/embl biosequence/genbank I do not know for: whereas path to tika mimetypes should be absolute or relative... and even if this file needs to be redefined if "magic" is not used. When I run my update/extract, I have an error that "biosequence/document" does not match any known parser. Thanks Olivier