Re: How to create a new index file automatically
It can import documents in certain other formats using the http://wiki.apache.org/solr/ExtractingRequestHandler 1) According to my inference.Solr uses Apache Tikka to convert other rich document format files to Text Files, so that the Class ExtractRequestHandler use the output text file to create the Index files. 2. If Point 1 is correct,then I think this could suit my requirements since I need to index rich documents files especially .xls format. But i cant find the class ExtractRequestHandler which has to be configured in SOLRCONFIG.xml file, so that i can import XLS documents through the servlet ttp://localhost:8983/solr/update/extract?= -- View this message in context: http://www.nabble.com/How-to-create-a-new-index-file-automatically-tp25455045p25466714.html Sent from the Solr - User mailing list archive at Nabble.com.
New to Solr : How to create solr index for rich documents especially .xls
Hi I am a newbie to Solr. Right now I have to do a task of converting rich documents to Solr readable index format so that I can use the index for searching. I learnt about Solr and got a rough idea of what has to be done. Requirement 1: 1) I have to index the rich document format files like .xls,.pdf,doc,ppt Information that I know: For this as far as I searched in Internet I came to know that we can use Data Import Handler, Apache Tika. ( but how to do that with this ).Should I code with the Data Import Handler ? So far I have downloaded a sample document from net and tried running that. The application runs on a Jetty Web Server and when I query in I get an xml file as output. Problems faced: Since I am very new to java I am not able to get a clear picture of what has to be done and what is this Ant tool used for. Requirement 2: I need to change the Web server from Jetty to Jboss Application server. What has to be done for this? Solution tried: I tried copying the solr.war in to the web app directory and tried running the application. Since I am very new to java I might have made some basic mistake too. Please guide me. Thanks in advance. -- View this message in context: http://www.nabble.com/New-to-Solr-%3A-How-to-create-solr-index-for-rich-documents-especially-.xls-tp25451164p25451164.html Sent from the Solr - User mailing list archive at Nabble.com.
How to create a new index file automatically
Hi all, I am newbie to Solr. I have downloaded and used the solr example and I have a basic doubt. There are some xml documents present in apache-solr-1.3.0\example\exampledocs. These are the input files to solr index and I found that by giving this command java –jar post.jar *.xml . All these xml documents have basic structure schema. Say for example add doc field name= “ “ abc /field … …. /doc /add I want to index some more files. Then in that case should I have to create a new xml file manually or what should I do to create it automatically. Please give me a solution. I am very new to Solr and so please make it as simple as possible. Thanks a lot... -- View this message in context: http://www.nabble.com/How-to-create-a-new-index-file-automatically-tp25455045p25455045.html Sent from the Solr - User mailing list archive at Nabble.com.
How to Convert Lucene index files to XML Format
Hello All, I have a set of Files indexed by Lucene. Now i want to use the indexed files in SOLR. The file .cfx an .cfs are not readable by Solr, as it supports only .fds and .fdx. So i decided to Add/update the index by just loading a XML File using the post.jar funtion. java -jar post.jar newFile.XML - Loads the XML and Updates the index. Now i want to Convert all the cfx files to XML so that i can Use them in SOLR. Advice Needed. Any other suggestions are most welcomed. - Balaji -- View this message in context: http://www.nabble.com/How-to-Convert-Lucene-index-files-to-XML-Format-tp25381017p25381017.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: How to Convert Lucene index files to XML Format
Thanks for your reply On Sep 10, 2009, at 6:41 AM, busbus wrote: Solr defers to Lucene on reading the index. You just need to tell Solr whether the index is a compound file or not and make sure the versions are compatible. This part seems to be the point. How to make solr to read lucene index files. There is a tag in Solrconfig.xml useCompundFile false useCompundFile Enable it to true does not seem to be working. What else need to be done. Should i change the config file or add new tag. Also how to check the compatibility of Lucen and solr Thanks in advance -- View this message in context: http://www.nabble.com/How-to-Convert-Lucene-index-files-to-XML-Format-tp25381017p25382367.html Sent from the Solr - User mailing list archive at Nabble.com.