Indexing Rich Documents
Yes, file system
2013/4/27 Ahmet Arslan
> hi,
>
> Where do you store your rich documents? File system?
>
>
>
>
> - Original Message -
> From: Furkan KAMACI
> To: solr-user@lucene.apache.org
> Cc:
> Sent: Friday, April 26, 2013 6:19
Ok, thanks for the answer.
2013/4/26 Gora Mohanty
> On 26 April 2013 18:38, Furkan KAMACI wrote:
> > I am new to Solr and try to index rich files. I have defined that at my
> > schema:
> [...]
> >
>
> This will not work: Please see http://wiki.apache.org/solr/UniqueKey
> for different use case
Yes, file system
2013/4/27 Ahmet Arslan
> hi,
>
> Where do you store your rich documents? File system?
>
>
>
>
> - Original Message -
> From: Furkan KAMACI
> To: solr-user@lucene.apache.org
> Cc:
> Sent: Friday, April 26, 2013 6:19 PM
> Subject:
hi,
Where do you store your rich documents? File system?
- Original Message -
From: Furkan KAMACI
To: solr-user@lucene.apache.org
Cc:
Sent: Friday, April 26, 2013 6:19 PM
Subject: Re: Solr Indexing Rich Documents
Is there any example at wiki for Manifold?
2013/4/26 Ahmet Arslan
Is there any example at wiki for Manifold?
2013/4/26 Ahmet Arslan
> Hi Furkan,
>
> post.jar meant to be used as example, quick start etc. For production
> (incremental updates, deletes) consider using http://manifoldcf.apache.orgfor
> indexing rich documen
Hi Furkan,
post.jar meant to be used as example, quick start etc. For production
(incremental updates, deletes) consider using http://manifoldcf.apache.org for
indexing rich documents. It utilises ExtractingRequestHandler feature of solr.
--- On Fri, 4/26/13, Furkan KAMACI wrote:
> F
On 26 April 2013 18:38, Furkan KAMACI wrote:
> I am new to Solr and try to index rich files. I have defined that at my
> schema:
[...]
>
This will not work: Please see http://wiki.apache.org/solr/UniqueKey
for different use cases for the uniqueKey.
For documents, I usually use the document name
I am new to Solr and try to index rich files. I have defined that at my
schema:
and there is a line at my schema:
id
should I make it like that:
for my purpose?
le types:
> http://wiki.apache.org/solr/ExtractingRequestHandler
>
> -- Jack Krupansky
>
> -Original Message- From: Furkan KAMACI
> Sent: Friday, April 26, 2013 4:48 AM
> To: solr-user@lucene.apache.org
> Subject: Solr Indexing Rich Documents
>
>
> I have a larg
AM
To: solr-user@lucene.apache.org
Subject: Solr Indexing Rich Documents
I have a large corpus of rich documents i.e. pdf and doc files. I think
that I can use directly the example jar of Solr. However for a real time
environment what should I care? Also how do you send such kind of documents
into So
I have a large corpus of rich documents i.e. pdf and doc files. I think
that I can use directly the example jar of Solr. However for a real time
environment what should I care? Also how do you send such kind of documents
into Solr to index, I think post.jar does not handle that file type? I
should
The libraries are searched in the solr/llib directory, not solr home.
If using multicore, solr/core/lib.
These are searched automatically. You can also tell Solr to search in
other directories with the directive in solrconfig.xml.
On Tue, Jul 13, 2010 at 11:48 PM, satya swaroop wrote:
>
> here
ya i checked the extraction request handler but couldnt get the
info... i installed tika-0.7 and copied the jar files into the solr
home library.. i started sending the pdf/html files then i get a lazy
error. i am using tomcat and solr 1.4
hi,
yes i followed the wiki and can now tell me the procedure for it
regards,
swaroop
Hi,
Are you sure you followed the wiki [1] on this subject? There is an example
there but you need Solr 1.4.0 or higher. I unsure if just patching 1.3.0 will
really do the trick. The patch must then also include Apache Tika, which sits
under the hood, extracting content and meta data from vario
On 07/13/2010 02:11 PM, satya swaroop wrote:
Hi all,
i am new to solr and followed with the wiki and got the solr admin
run sucessfully. It is good going for xml files. But to index the rich
documents i am unable to get it. I followed wiki to make the richer
documents also, but i didnt
Hi all,
i am new to solr and followed with the wiki and got the solr admin
run sucessfully. It is good going for xml files. But to index the rich
documents i am unable to get it. I followed wiki to make the richer
documents also, but i didnt get it.The error comes when i send an pdf/html
Hi all,
i am new to solr and followed with the wiki and got the solr admin
run sucessfully. It is good going for xml files. But to index the rich
documents i am unable to get it. I followed wiki to make the richer
documents also, but i didnt get it.The error comes when i send an pdf/html
tribute doesn't
> support web addresses. Is there another attribute that does?
> --
> View this message in context:
> http://www.nabble.com/Indexing--rich-documents-from-websites-using-ExtractingRequestHandler-tp24392809p24392809.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>
etwork path
> was not found)
>
> I sort of understand why I get those errors. What are the alternative
> methods of doing this? I am guessing that the stream.file attribute doesn't
> support web addresses. Is there another attribute that does?
> --
> View this message in c
http://www.nabble.com/Indexing--rich-documents-from-websites-using-ExtractingRequestHandler-tp24392809p24392809.html
Sent from the Solr - User mailing list archive at Nabble.com.
21 matches
Mail list logo