Re: How to create a new index file automatically

2009-09-16 Thread busbus



 It can import documents in certain other formats using the 
 http://wiki.apache.org/solr/ExtractingRequestHandler
 

1) According to my inference.Solr uses Apache Tikka to convert other rich
document format files to Text Files, so that the Class ExtractRequestHandler
use the output text file to create the Index files.

2. If Point 1 is correct,then I think this could suit my requirements since
I need to index rich documents files especially .xls format.
But i cant find the class ExtractRequestHandler which has to be configured
in SOLRCONFIG.xml file, so that i can import XLS documents through the
servlet

ttp://localhost:8983/solr/update/extract?=
-- 
View this message in context: 
http://www.nabble.com/How-to-create-a-new-index-file-automatically-tp25455045p25466714.html
Sent from the Solr - User mailing list archive at Nabble.com.



New to Solr : How to create solr index for rich documents especially .xls

2009-09-15 Thread busbus

Hi

I am a newbie to Solr. Right now I have to do a task of converting rich
documents to Solr readable index format so that I can use the index for
searching.

I learnt about Solr and got a rough idea of what has to be done.

Requirement 1: 

1)  I have to index the rich document format files like .xls,.pdf,doc,ppt 

Information that I know:

For this as far as I searched in Internet I came to know that we can use
Data Import Handler, Apache Tika. (  but how to do that with this ).Should I
code with the Data Import Handler ?

So far I have downloaded a sample document from net and tried running that.
The application runs on a Jetty Web Server and when I query in I get an xml
file as output.

Problems faced:

Since I am very new to java I am not able to get a clear picture of what has
to be done and what is this Ant tool used for.

Requirement 2:

I need to change the Web server from Jetty to Jboss Application server. What
has to be done for this?



Solution tried:

I tried copying the solr.war in to the web app directory and tried running
the application. Since I am very new to java I might have made some basic
mistake too. Please guide me.

Thanks in advance.


-- 
View this message in context: 
http://www.nabble.com/New-to-Solr-%3A-How-to-create-solr-index-for-rich-documents-especially-.xls-tp25451164p25451164.html
Sent from the Solr - User mailing list archive at Nabble.com.



How to create a new index file automatically

2009-09-15 Thread busbus

Hi all,

I am newbie to Solr.

I have downloaded and used the solr  example and I have a basic doubt.

There are some xml documents present in
apache-solr-1.3.0\example\exampledocs.
These are the input files to solr index and I found that by giving this
command 

java –jar post.jar *.xml 

. All these xml documents have basic structure schema.

Say for example

add
doc
  field name= “ “ abc /field
…
….

/doc
/add

I want to index some more files. Then in that case should I have to create a
new xml file manually or what should I do to create it automatically. 

Please give me a solution. I am very new to Solr and so please make it as
simple as possible.

Thanks a lot...

-- 
View this message in context: 
http://www.nabble.com/How-to-create-a-new-index-file-automatically-tp25455045p25455045.html
Sent from the Solr - User mailing list archive at Nabble.com.



How to Convert Lucene index files to XML Format

2009-09-10 Thread busbus

Hello All,
I have a set of Files indexed by Lucene. Now i want to use the indexed files
in SOLR. The file .cfx an .cfs are not readable by Solr, as it supports only
.fds and .fdx.

So i decided to Add/update the index by just loading a XML File using the
post.jar funtion.

java -jar post.jar newFile.XML - Loads the XML and Updates the index.

Now i want to Convert all the cfx files to XML so that i can Use them in
SOLR.

Advice Needed.

Any other suggestions are most welcomed.

- Balaji
-- 
View this message in context: 
http://www.nabble.com/How-to-Convert-Lucene-index-files-to-XML-Format-tp25381017p25381017.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: How to Convert Lucene index files to XML Format

2009-09-10 Thread busbus

Thanks for your reply





 On Sep 10, 2009, at 6:41 AM, busbus wrote:
 Solr defers to Lucene on reading the index.  You just need to tell  
 Solr whether the index is a compound file or not and make sure the 
 versions are compatible.
 

This part seems to be the point.
How to make solr to read lucene index files.
There is a tag in Solrconfig.xml
useCompundFile false useCompundFile

Enable it to true does not seem to be working.

What else need to be done.

Should i change the config file or add new tag.

Also how to check the compatibility of Lucen and solr

Thanks in advance

-- 
View this message in context: 
http://www.nabble.com/How-to-Convert-Lucene-index-files-to-XML-Format-tp25381017p25382367.html
Sent from the Solr - User mailing list archive at Nabble.com.