Indexing data from multiple datasources

2011-06-09 Thread Greg Georges
Hello all, I have checked the forums to see if it is possible to create an index from multiple datasources. I have found references to SOLR-1358, but I don't think this fits my scenario. In all, we have an application where we upload files. On the file upload, I use the Tika extract handler

RE: Indexing data from multiple datasources

2011-06-09 Thread Greg Georges
implementing a Transformer to do the database querying and modify the document as necessary. This is pretty simple to do; we can chat a bit more depending on whether either approach makes sense. Best, Erick On Thu, Jun 9, 2011 at 10:43 AM, Greg Georges greg.geor...@biztree.com wrote: Hello all

RE: Indexing data from multiple datasources

2011-06-09 Thread Greg Georges
On Thu, Jun 9, 2011 at 1:20 PM, Greg Georges greg.geor...@biztree.com wrote: Hello Erick, Thanks for the response. No, I am using the extract handler to extract the data from my text files. In your second approach, you say I could use a DIH to update the index which would have been

Limit data stored from fmap.content with Solr cell

2011-06-01 Thread Greg Georges
Hello everyone, I have just gotten information extraction from files working with Solr Cell. Some of the files we are indexing are large and contain a lot of content. I would like to limit the amount of data I index to a specified number of characters (for example, 300 chars) which I will use as a document

Indexing files Solr cell and Amazon S3

2011-05-30 Thread Greg Georges
Hello everyone, We have our infrastructure on Amazon cloud servers, and we use the S3 file system. We need to index files using Solr Cell. From what I have read, we need to stream files to Solr in order for it to extract the metadata into the index. If we stream data through a public URL there

Question concerning the updating of my solr index

2011-05-02 Thread Greg Georges
Hello all, I have integrated Solr into my project with success. I use a DataImportHandler to first import the data, mapping the fields to my schema.xml. I use SolrJ to query the data and also use faceting. Works great. The question I have now is a general one on updating the index and how it

RE: Question concerning the updating of my solr index

2011-05-02 Thread Greg Georges
search :: http://search-lucene.com/ - Original Message From: Greg Georges greg.geor...@biztree.com To: solr-user@lucene.apache.org solr-user@lucene.apache.org Sent: Mon, May 2, 2011 1:33:30 PM Subject: Question concerning the updating of my solr index Hello all, I have

RE: Question concerning the updating of my solr index

2011-05-02 Thread Greg Georges
); server.commit(); server.optimize(); Greg -Original Message- From: Greg Georges [mailto:greg.geor...@biztree.com] Sent: 2 mai 2011 14:44 To: solr-user@lucene.apache.org Subject: RE: Question concerning the updating of my solr index Ok I had seen this in the wiki

RE: Question concerning the updating of my solr index

2011-05-02 Thread Greg Georges
Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ - Original Message From: Greg Georges greg.geor...@biztree.com To: solr-user@lucene.apache.org solr-user@lucene.apache.org Sent: Mon, May 2, 2011 2:45:40 PM Subject: RE: Question

DataImportHandler on 2 tables

2011-05-02 Thread Greg Georges
Hello all, I have a system where I have a DataImportHandler defined for one table in my database. I also need to index data from another table, so I will need another index to search on. Does this mean I must configure another Solr instance (another schema.xml file, dataimporthandler

RE: DataImportHandler on 2 tables

2011-05-02 Thread Greg Georges
Do you want to search on the data from the tables together or separately? Is there a join between the two tables? Ludovic. 2011/5/2 Greg Georges [via Lucene] ml-node+2891256-222073995-383...@n3.nabble.com Hello all, I have a system where I have a dataimporthandler defined for one table

RE: DataImportHandler on 2 tables

2011-05-02 Thread Greg Georges
: DataImportHandler on 2 tables OK, so it seems you should create a new index and core, as you said. See here for core management: http://wiki.apache.org/solr/CoreAdmin But it seems that is a problem for you. Is it? Ludovic. 2011/5/2 Greg Georges [via Lucene] ml-node+2891277-472183207-383...@n3

Embedded Solr

2011-03-21 Thread Greg Georges
Hello all, I am using Solr in a Java architecture right now, and the results are great. The app development team has asked me if it is possible to embed Solr, but the request is to embed it into a C++ app and a Mac app using Objective-C. I do not have much knowledge of embedded Solr. Does it

Indexing a text string for faceting

2011-03-09 Thread Greg Georges
Hello all, I have a small problem with my faceting fields. In all I create a new faceting field which is indexed and not stored, and use copyField. The problem is I facet on category names which have examples like this Policies Documentation

Indexing languages, dataimporthandler

2011-02-22 Thread Greg Georges
Hello all, I have just gone through the mailing list and have set up my different field type analysers for my 6 different languages in my schema.xml. Here is my question. I am using the DataImportHandler to import data from my database into my index. In my table, the documentname column's data

Question regarding indexing multiple languages, stopwords, etc.

2011-02-21 Thread Greg Georges
Hello all, I have gotten my DataImportHandler to index my data from my MySQL database. I was looking at the schema tool and noticed that stopwords in different languages are being indexed as terms. The 6 languages we have are English, French, Spanish, Chinese, German and Italian. Right now I

Question regarding inner entity in dataimporthandler

2011-02-15 Thread Greg Georges
Hello all, I have searched the forums for the question I am about to ask, but never found any concrete results. This is my case. I am defining the data config file with the document and entity tags. I have successfully defined a basic entity mapped to my MySQL database, and I then add some inner

RE: Question regarding inner entity in dataimporthandler

2011-02-15 Thread Greg Georges
OK, I think I found some information. Supposedly TemplateTransformer will return an empty string if the value of a variable is null. Some people say to use the regex transformer instead. Can anyone clarify this? Thanks -Original Message- From: Greg Georges [mailto:greg.geor

Solr design decisions

2011-02-11 Thread Greg Georges
Hello all, I have just finished the book Solr 1.4 Enterprise Search Server. I now understand most of the basics of Solr and also how we can scale the solution. Our goal is to have a centralized search service for a multitude of apps. The first application we want to index is a system in

Difference between Solr and Lucidworks distribution

2011-02-11 Thread Greg Georges
Hello all, I just started watching the webinars from Lucidworks, and they mention their distribution, which has an installer, etc. Are there any other differences? Is it a good idea to use this free distribution? Greg

Architecture decisions with Solr

2011-02-09 Thread Greg Georges
Hello all, I am looking into an enterprise search solution for our architecture and I am very pleased to see all the features Solr provides. In our case, we will have a need for a highly scalable application for multiple clients. This application will be built to serve many users who each will

RE: Architecture decisions with Solr

2011-02-09 Thread Greg Georges
client? If there's no data sharing across clients, then using the same Solr server/index doesn't seem necessary. Solr will easily meet your needs though; it's the best there is. On Wed, 2011-02-09 at 14:23 -0500, Greg Georges wrote: Hello all, I am looking into an enterprise search solution