Re: Solr Indexing Rich Documents

2013-04-27 Thread Ahmet Arslan
Indexing Rich Documents Yes, file system 2013/4/27 Ahmet Arslan > hi, > > Where do you store your rich documents? File system? > > > > > - Original Message - > From: Furkan KAMACI > To: solr-user@lucene.apache.org > Cc: > Sent: Friday, April 26, 2013 6:19

Re: uniqueKey required false for multivalued id when indexing rich documents

2013-04-27 Thread Furkan KAMACI
Ok, thanks for the answer. 2013/4/26 Gora Mohanty > On 26 April 2013 18:38, Furkan KAMACI wrote: > > I am new to Solr and try to index rich files. I have defined that at my > > schema: > [...] > > > > This will not work: Please see http://wiki.apache.org/solr/UniqueKey > for different use case

Re: Solr Indexing Rich Documents

2013-04-27 Thread Furkan KAMACI
Yes, file system 2013/4/27 Ahmet Arslan > hi, > > Where do you store your rich documents? File system? > > > > > - Original Message - > From: Furkan KAMACI > To: solr-user@lucene.apache.org > Cc: > Sent: Friday, April 26, 2013 6:19 PM > Subject:

Re: Solr Indexing Rich Documents

2013-04-27 Thread Ahmet Arslan
hi, Where do you store your rich documents? File system? - Original Message - From: Furkan KAMACI To: solr-user@lucene.apache.org Cc: Sent: Friday, April 26, 2013 6:19 PM Subject: Re: Solr Indexing Rich Documents Is there any example at wiki for Manifold? 2013/4/26 Ahmet Arslan

Re: Solr Indexing Rich Documents

2013-04-26 Thread Furkan KAMACI
Is there any example at wiki for Manifold? 2013/4/26 Ahmet Arslan > Hi Furkan, > > post.jar meant to be used as example, quick start etc. For production > (incremental updates, deletes) consider using http://manifoldcf.apache.org for > indexing rich documen

Re: Solr Indexing Rich Documents

2013-04-26 Thread Ahmet Arslan
Hi Furkan, post.jar is meant to be used as an example, quick start, etc. For production (incremental updates, deletes) consider using http://manifoldcf.apache.org for indexing rich documents. It utilises the ExtractingRequestHandler feature of Solr. --- On Fri, 4/26/13, Furkan KAMACI wrote: > F
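For readers wondering what a request to the ExtractingRequestHandler looks like, here is a minimal sketch that builds (but does not send) such a request with the Python standard library. The handler path `/update/extract` and the `literal.id` and `commit` parameters are documented on the ExtractingRequestHandler wiki page. The function name, the Solr base URL, the document id, and the placeholder payload are assumptions made up for illustration, not anything from the thread.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_extract_request(solr_url, doc_id, payload, content_type="application/pdf"):
    """Build (but do not send) a POST to Solr's ExtractingRequestHandler."""
    # literal.id supplies the value of the uniqueKey field for this document;
    # commit=true asks Solr to commit right after the update.
    params = urlencode({"literal.id": doc_id, "commit": "true"})
    return Request(
        url=f"{solr_url}/update/extract?{params}",
        data=payload,  # raw bytes of the rich document (PDF, DOC, ...)
        headers={"Content-Type": content_type},
        method="POST",
    )


if __name__ == "__main__":
    # Hypothetical Solr URL, id, and payload, for illustration only.
    req = build_extract_request("http://localhost:8983/solr", "doc1", b"placeholder bytes")
    print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would send it to a running Solr instance. As the message above notes, this kind of one-off posting suits quick starts; incremental updates and deletes need more machinery around it.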

Re: uniqueKey required false for multivalued id when indexing rich documents

2013-04-26 Thread Gora Mohanty
On 26 April 2013 18:38, Furkan KAMACI wrote: > I am new to Solr and try to index rich files. I have defined that at my > schema: [...] > This will not work: Please see http://wiki.apache.org/solr/UniqueKey for different use cases for the uniqueKey. For documents, I usually use the document name
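The suggestion above (a single-valued id taken from the document name) can be sketched as follows. This is an illustration under assumptions, not the poster's code: the helper name and the example path are hypothetical, and using only the file name assumes names do not collide across directories.

```python
from pathlib import PurePosixPath


def doc_id_from_path(path):
    """Return one string id per file: here, simply the file name."""
    # A uniqueKey needs exactly one value per document, so return a single
    # string rather than a list of candidate ids.
    return PurePosixPath(path).name


if __name__ == "__main__":
    # Hypothetical path, for illustration only.
    print(doc_id_from_path("/data/docs/report.pdf"))
```

If the same file name can appear in several directories, using the full path as the id is one way to avoid collisions.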

uniqueKey required false for multivalued id when indexing rich documents

2013-04-26 Thread Furkan KAMACI
I am new to Solr and am trying to index rich files. I have defined this in my schema: [...] and there is this line in my schema: <uniqueKey>id</uniqueKey> Should I change it like this: [...] for my purpose?

Re: Solr Indexing Rich Documents

2013-04-26 Thread Furkan KAMACI
le types: > http://wiki.apache.org/solr/ExtractingRequestHandler > > -- Jack Krupansky > > -Original Message- From: Furkan KAMACI > Sent: Friday, April 26, 2013 4:48 AM > To: solr-user@lucene.apache.org > Subject: Solr Indexing Rich Documents > > > I have a larg

Re: Solr Indexing Rich Documents

2013-04-26 Thread Jack Krupansky
AM To: solr-user@lucene.apache.org Subject: Solr Indexing Rich Documents I have a large corpus of rich documents i.e. pdf and doc files. I think that I can use directly the example jar of Solr. However for a real time environment what should I care? Also how do you send such kind of documents into So

Solr Indexing Rich Documents

2013-04-26 Thread Furkan KAMACI
I have a large corpus of rich documents, i.e. pdf and doc files. I think I can use the example jar of Solr directly. However, for a real-time environment, what should I take care of? Also, how do you send such documents to Solr for indexing? I think post.jar does not handle that file type? I should

Re: indexing rich documents

2010-07-16 Thread Lance Norskog
The libraries are searched in the solr/lib directory, not solr home. If using multicore, solr/core/lib. These are searched automatically. You can also tell Solr to search in other directories with the <lib> directive in solrconfig.xml. On Tue, Jul 13, 2010 at 11:48 PM, satya swaroop wrote: > > here

Re: indexing rich documents

2010-07-13 Thread satya swaroop
Yes, I checked the extraction request handler but couldn't get the info. I installed tika-0.7 and copied the jar files into the Solr home library. When I started sending the pdf/html files, I got a lazy error. I am using Tomcat and Solr 1.4

Re: indexing rich documents

2010-07-13 Thread satya swaroop
Hi, yes, I followed the wiki. Can you now tell me the procedure for it? Regards, swaroop

Re: indexing rich documents

2010-07-13 Thread Markus Jelsma
Hi, Are you sure you followed the wiki [1] on this subject? There is an example there but you need Solr 1.4.0 or higher. I am unsure if just patching 1.3.0 will really do the trick. The patch must then also include Apache Tika, which sits under the hood, extracting content and metadata from vario

Re: indexing rich documents

2010-07-13 Thread Nikola Garafolic
On 07/13/2010 02:11 PM, satya swaroop wrote: Hi all, I am new to Solr. I followed the wiki and got the Solr admin running successfully. It works well for XML files, but I am unable to index rich documents. I followed the wiki for rich documents too, but I didn't

indexing rich documents

2010-07-13 Thread satya swaroop
Hi all, I am new to Solr. I followed the wiki and got the Solr admin running successfully. It works well for XML files, but I am unable to index rich documents. I followed the wiki for rich documents too, but I didn't get it. The error comes when I send a pdf/html

Re: Indexing rich documents from websites using ExtractingRequestHandler

2009-07-08 Thread Jay Hill
tribute doesn't > support web addresses. Is there another attribute that does? > -- > View this message in context: > http://www.nabble.com/Indexing--rich-documents-from-websites-using-ExtractingRequestHandler-tp24392809p24392809.html > Sent from the Solr - User mailing list archive at Nabble.com. > >

Re: Indexing rich documents from websites using ExtractingRequestHandler

2009-07-08 Thread Glen Newton
etwork path > was not found) > > I sort of understand why I get those errors. What are the alternative > methods of doing this? I am guessing that the stream.file attribute doesn't > support web addresses. Is there another attribute that does? > -- > View this message in c
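On the question quoted above: Solr's content-stream parameters include `stream.url` alongside `stream.file`, and `stream.url` takes a web address. The replies in this archive are truncated, so the sketch below is not necessarily what they recommended. It only builds the request URL. The function name, Solr base URL, document id, and document address are hypothetical, and the approach assumes remote streaming is enabled in solrconfig.xml.

```python
from urllib.parse import urlencode


def build_remote_extract_url(solr_url, doc_id, document_url):
    """Build an ExtractingRequestHandler URL that makes Solr fetch a remote file."""
    # stream.url tells Solr to download the document itself; this only works
    # when remote streaming is enabled on the Solr side (assumption).
    params = urlencode({
        "literal.id": doc_id,
        "stream.url": document_url,
        "commit": "true",
    })
    return f"{solr_url}/update/extract?{params}"


if __name__ == "__main__":
    # Hypothetical addresses, for illustration only.
    print(build_remote_extract_url(
        "http://localhost:8983/solr", "doc1", "http://example.com/a.pdf"))
```

An alternative that avoids remote streaming is to download the file on the client side and post its bytes to the handler directly.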

Indexing rich documents from websites using ExtractingRequestHandler

2009-07-08 Thread ahammad
http://www.nabble.com/Indexing--rich-documents-from-websites-using-ExtractingRequestHandler-tp24392809p24392809.html Sent from the Solr - User mailing list archive at Nabble.com.