Tagging and searching on tagged indexes.

2009-07-07 Thread Rakhi Khatwani
Hi, How do we tag solr indexes and search on those indexes, there is not much information on wiki. all i could find is this: http://wiki.apache.org/solr/UserTagDesign has anyone tried it? (using solr API) One more question, can we change the schema dynamically at runtime? (while solr

Re: Is there any other way to load the index beside using http connection?

2009-07-07 Thread Marcus Herou
Out of my head... but are you not supposed to active the stream-handler in SOLR ? Think it is documented... Cheers //Marcus On Mon, Jul 6, 2009 at 8:55 PM, Francis Yakin fya...@liquid.com wrote: Yes, I uploaded the CSV file that I get it from Database then I ran that cmd and I have the

Re: Tagging and searching on tagged indexes.

2009-07-07 Thread Shalin Shekhar Mangar
On Tue, Jul 7, 2009 at 11:37 AM, Rakhi Khatwani rkhatw...@gmail.com wrote: Hi, How do we tag solr indexes and search on those indexes, there is not much information on wiki. all i could find is this: http://wiki.apache.org/solr/UserTagDesign has anyone tried it? (using solr API) That

Can´t use wildcard * on alphanumeric values?

2009-07-07 Thread gateway0
Hi, I indexed my data and defined a defaultsearchfield named text: (field name=text type=text indexed=true stored=false multiValued=true/). I copied all my other field values into that field. Now my problem: Lets say I have 2 values indexed 1.value ABCD 2.value ABCD3456 Now when I do a

Re: Filtering MoreLikeThis results

2009-07-07 Thread Marc Sturlese
Using MoreLikeThisHandler you can use fq to filter your results. As far as I know bq are not allowed. Bill Au wrote: I have been trying to restrict MoreLikeThis results without any luck also. In additional to restricting the results, I am also looking to influence the scores similar to

Re: Can´t use wildcard * on alphanumeric values?

2009-07-07 Thread Shalin Shekhar Mangar
On Tue, Jul 7, 2009 at 2:10 PM, gateway0 reiterwo...@yahoo.de wrote: I indexed my data and defined a defaultsearchfield named text: (field name=text type=text indexed=true stored=false multiValued=true/). Lets say I have 2 values indexed 1.value ABCD 2.value ABCD3456 Now when I do a

spell checker's collate values

2009-07-07 Thread Licinio Fernández Maurelo
Hi all, i'm still trying to tune my spellchecker to get the results i expect I've created a dictionary and currently i want to get an special behaviour from the spellchecker. The fact is that when i introduce the query 'Fernandox Alonso' i get what i expect : bool

Re: reindexed data on master not replicated to slave

2009-07-07 Thread Noble Paul നോബിള്‍ नोब्ळ्
Jay , I am opening an issue SOLR-1264 https://issues.apache.org/jira/browse/SOLR-1264 I have attached a patch as well . I guess that is the fix. could you please confirm that. On Tue, Jul 7, 2009 at 12:59 AM, solr jaysolr...@gmail.com wrote: It looks that the problem is here or before that in

Can't limit return fields in custom request handler

2009-07-07 Thread Osman İZBAT
Hi. I'm writing my custom faceted request handler. But I have a problem like this; when i call http://localhost:8983/solr/select/?qt=cfacetq=%2BitemTitle:nokia%20%2BcategoryId:130start=0limit=3fl=id, itemTitle i'm getiing all fields instead of only id and itemTitle. Also i'm gettting no result

Re: Is there any other way to load the index beside using http connection?

2009-07-07 Thread Yonik Seeley
Look at the error - it's bash (your command line shell) complaining. The '' terminates one command and puts it in the background. Surrounding the command with quotes will get you one step closer: curl

Re: Loading Data into Solr without HTTP

2009-07-07 Thread Yonik Seeley
On Tue, Jul 7, 2009 at 8:41 AM, Anand Kumar Prabhakaranand2...@gmail.com wrote: Is there any way so that we can read the data from the CSV file and load it into the Solr database without using /update/csv That *is* the right way to load a CSV file into Solr. How many records are in the CSV

Re: Loading Data into Solr without HTTP

2009-07-07 Thread Anand Kumar Prabhakar
Thank you for the Reply Yonik, I have already tried with smaller CSV files, currently we are trying to load a CSV file of 400 MB but this is taking too much time(more than half an hour). I want to know is there any method to do it much faster, we have overcome the OutOfMemoryException by

Re: Can´t use wildcard * on alphanumeric values?

2009-07-07 Thread gateway0
Thank you, that was it. Why is the preserveOriginal=1 option nowhere documented? Shalin Shekhar Mangar wrote: On Tue, Jul 7, 2009 at 2:10 PM, gateway0 reiterwo...@yahoo.de wrote: I indexed my data and defined a defaultsearchfield named text: (field name=text type=text indexed=true

Re: Loading Data into Solr without HTTP

2009-07-07 Thread Yonik Seeley
On Tue, Jul 7, 2009 at 9:14 AM, Anand Kumar Prabhakaranand2...@gmail.com wrote: I want to know is there any method to do it much faster, we have overcome the OutOfMemoryException by increasing heap space. Optimize your schema - eliminate all unnecessary copyFields and default values. The

Re: Loading Data into Solr without HTTP

2009-07-07 Thread Yonik Seeley
Also make sure you don't have any autocommit rules enabled in solrconfig.xml How many documents are in the 400MB CSV file, and how long does it take to index now? -Yonik http://www.lucidimagination.com On Tue, Jul 7, 2009 at 10:03 AM, Anand Kumar Prabhakaranand2...@gmail.com wrote: Hi

Re: Loading Data into Solr without HTTP

2009-07-07 Thread Anand Kumar Prabhakar
Hi Yonik, Currently our Schema has very few fields and we don't have any copy fields also. Please find the below Schema.xml we are using: ?xml version=1.0 encoding=UTF-8 ? schema name=cmps version=1.1 !-- attribute name is the name of this schema and is only used for display purposes.

Re: Indexing XML

2009-07-07 Thread Matt Mitchell
Saeli, Solr expects a certain XML structure when adding documents. You'll need to come up with a mapping, that translates the original structure to one that solr understands. You can then search solr and get those solr documents back. If you want to keep the original XML, you can store it in a

Re: Filtering MoreLikeThis results

2009-07-07 Thread Bill Au
I think fq only works on the main response, not the mlt matches. I found a couple of releated jira: http://issues.apache.org/jira/browse/SOLR-295 http://issues.apache.org/jira/browse/SOLR-281 If I am reading them correctly, I should be able to use DIsMax and MoreLikeThis together. I will give

Indexing XML

2009-07-07 Thread Saeli Mathieu
Hello. I'm a new user of Solr, I already used Lucene to index files and search. But my programme was too slow, it's why I was looking for another solution, and I thought I found it. I said I thought because I don't know if it's possible to use solar with this kind of XML files. lom

Question regarding ExtractingRequestHandler

2009-07-07 Thread ahammad
Hello, I've recently started using this handler to index MS Word and PDF files. When I set ext.extract.only=true, I get back all the metadata that is associated with that file. If I want to index, I need to set ext.extract.only=false. If I want to index all that metadata along with the

Re: SynonymFilterFactory usage

2009-07-07 Thread Mani Kumar
anyone? ps: my apologies if you guys think its spamming. but i really need some help here. thanks! mani On Sun, Jul 5, 2009 at 12:49 PM, Mani Kumar manikumarchau...@gmail.comwrote: hi all, i am confused a bit about how to use synonym filter configs. i am using solr 1.4. default config is

Browse indexed terms in a field

2009-07-07 Thread Pierre-Yves LANDRON
Hello, Here is what I would like to achieve : in an indexed document there's a fulltext indexed field ; I'd like to browse the terms in this field, ie. get all the terms that match the begining of a given word, for example. I can get all the field's facets for this document, but that's a lot

Re: Filtering MoreLikeThis results

2009-07-07 Thread Marc Sturlese
At least in trunk, if you request for: http://localhost:8084/solr/core_A/mlt?q=id:7468365fq=price[100 TO 200] It will filter the MoreLikeThis results Bill Au wrote: I think fq only works on the main response, not the mlt matches. I found a couple of releated jira:

Re: Browse indexed terms in a field

2009-07-07 Thread Bill Au
You can use facet.perfix to match the beginning of a given word: http://wiki.apache.org/solr/SimpleFacetParameters#head-579914ef3a14d775a5ac64d2c17a53f3364e3cf6 Bill On Tue, Jul 7, 2009 at 11:02 AM, Pierre-Yves LANDRON pland...@hotmail.comwrote: Hello, Here is what I would like to achieve

Re: Filtering MoreLikeThis results

2009-07-07 Thread Bill Au
I have been using the StandardRequestHandler (ie /solr/select). fq does work with the MoreLikeThisHandler. I will switch to use that. Thanks. Bill On Tue, Jul 7, 2009 at 11:02 AM, Marc Sturlese marc.sturl...@gmail.comwrote: At least in trunk, if you request for:

Solr set up

2009-07-07 Thread G T
Hi, I was interested in creating a test environment where i can make use of solr/ lucene .My objective is to be able to test various features of solr .(replication , performance, indexing , searching and so on) I wanted someone to give me a start on above.I am well versed with lucene/solr

Re: solr health check

2009-07-07 Thread Koji Sekiguchi
solr jay wrote: Hi, I am looking at this piece of configuration in solrconfig.xml admin defaultQuerysolr/defaultQuery gettableFiles solrconfig.xml schema.xml /gettableFiles pingQueryq=solramp;version=2.0amp;start=0amp;rows=0/pingQuery !-- configure a

posting binary file and metadata in two separate documents

2009-07-07 Thread rossputin
Hi. I am currently using Solr Cell to extract content from binary files, and I am passing along some additional metadata with ext.literal params. Sample below: curl

KStem download

2009-07-07 Thread Pascal Dimassimo
Hi, I want to try KStem. I'm following the instructions on this page: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters/Kstem ... but the download link doesn't work. Is anyone know the new location to download KStem? -- View this message in context:

Re: Query on the updation of synonym and stopword file.

2009-07-07 Thread Koji Sekiguchi
Sagar, I am facing a problem here that even after the core reload and re-indexing the documents the new updated synonym or stop words are not loaded. Seems so the filters are not aware that these files are updated so the solution to me is to restart the whole container in which I have

Re: Multiple values for custom fields provided in SOLR query

2009-07-07 Thread Suryasnat Das
Hi Otis, Thanks for replying to my query. My query is, if multiple values are provided for a custom field then how can it be represented in a SOLR query. So if my field is fileID and its values are 111, 222 and 333 and my search string is ‘product’ then how can this be represented in a SOLR

Re: Filtering MoreLikeThis results

2009-07-07 Thread Yao Ge
I am not sure about the parameters for MLT the requestHandler plugin. Can one of you share the solrconfig.xml entry for MLT? Thanks in advance. -Yao Bill Au wrote: I have been using the StandardRequestHandler (ie /solr/select). fq does work with the MoreLikeThisHandler. I will switch to

Re: Indexing XML

2009-07-07 Thread Jay Hill
Mathieu, have a look at Solr's DataImportHandler. It provides a configuration-based approach to index different types of datasources including relational databases and XML files. In particular have a look at the XpathEntityProcessor (

Solr Set Up

2009-07-07 Thread G T
Hi, I was interested in creating a test environment where i can make use of solr/ lucene .My objective is to be able to test various features of solr .(replication , performance, indexing , searching and so on) I wanted someone to give me a start on above.I am well versed with lucene/solr

How to get various records in the result set

2009-07-07 Thread fei dong
Hi buddy, I am working on a music search project and I have a special requirement about the ranking when querying the artist name. Ex: When I query the artist ne yo, there are 500results and maybe 100 song names are repeated. So the ideal thing is to let users get more different songs in on page

RE: Is there any other way to load the index beside using http connection?

2009-07-07 Thread Francis Yakin
I did try: curl 'http://localhost:8983/solr/update/csv?stream.file=/opt/apache-1.2.0/example/exampledocs/test.csvstream.contentType=text/plain;charset=utf-8' It doesn't work Francis -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent:

RE: Is there any other way to load the index beside using http connection?

2009-07-07 Thread Francis Yakin
With curl 'http://localhost:8983/solr/update/csv?stream.file=/opt/apache-1.2.0/example/exampledocs/test.csvstream.contentType=text/plain;charset=utf-8' No errors now. But , how can I verify if the update happening? Thanks Francis -Original Message- From: Francis Yakin

Re: Is there any other way to load the index beside using http connection?

2009-07-07 Thread Yonik Seeley
The double quotes around the ampersand don't belong there. I think that UTF8 should also be the default, so the following should also work: curl 'http://localhost:8983/solr/update/csv?stream.file=/opt/apache-1.2.0/example/exampledocs/test.csv' -Yonik http://www.lucidimagination.com On Tue, Jul

RE: Is there any other way to load the index beside using http connection?

2009-07-07 Thread Francis Yakin
yeah, It works now. How can I verify if the new CSV file get uploaded? Thanks Francis -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Tuesday, July 07, 2009 10:49 AM To: solr-user@lucene.apache.org Cc: Norberto Meijome Subject: Re:

Re: Is there any other way to load the index beside using http connection?

2009-07-07 Thread Yonik Seeley
On Tue, Jul 7, 2009 at 1:50 PM, Francis Yakinfya...@liquid.com wrote:  yeah, It works now. How can I verify if the new CSV file get uploaded? point your browser at http://localhost:8983/solr/admin/stats.jsp Check out the UPDATE HANDLERS section -Yonik http://www.lucidimagination.com

Re: reindexed data on master not replicated to slave

2009-07-07 Thread solr jay
It seemed that the patch fixed the symptom, but not the problem itself. Now the log messages looks good. After one download and installed the index, it printed out *Jul 7, 2009 10:35:10 AM org.apache.solr.handler.SnapPuller fetchLatestIndex INFO: Slave in sync with master.* but the files inside

Re: Solr slave Heap space error and index size issue

2009-07-07 Thread Chris Hostetter
: 5-6 days after fresh index index size suddenly increased (no optimization in : between) by 150GB and then query takes long time and java heap error comes. : I run optimize in this index Its takes long time and result it increase : index size more more then 200GB and it didn't show about

Re: Can´t use wildcard * on alphanumeric values?

2009-07-07 Thread Shalin Shekhar Mangar
On Tue, Jul 7, 2009 at 6:45 PM, gateway0 reiterwo...@yahoo.de wrote: Thank you, that was it. Why is the preserveOriginal=1 option nowhere documented? A simple case of oversight :) I've added a note on preserveOriginal and splitOnNumerics (another omission) to the wiki page

Re: how to shuffle the result while follow some priority rules at the same time

2009-07-07 Thread Chris Hostetter
: I want to implement that effect that the results had better differ from each : other in one page, but I want to show some results first like those contains : more attributes. there is a RandomSortField that you can use as a tie breaker when all other fields are equal. info baout using that

Re: reindexed data on master not replicated to slave

2009-07-07 Thread Shalin Shekhar Mangar
On Tue, Jul 7, 2009 at 11:50 PM, solr jay solr...@gmail.com wrote: It seemed that the patch fixed the symptom, but not the problem itself. Now the log messages looks good. After one download and installed the index, it printed out *Jul 7, 2009 10:35:10 AM org.apache.solr.handler.SnapPuller

Re: reindexed data on master not replicated to slave

2009-07-07 Thread solr jay
I see. So I tried it again. Now index.properties has #index properties #Tue Jul 07 12:13:49 PDT 2009 index=index.20090707121349 but there is no such directory index.20090707121349 under the data directory. Thanks, J On Tue, Jul 7, 2009 at 11:50 AM, Shalin Shekhar Mangar

Re: facets and stopwords

2009-07-07 Thread Chris Hostetter
: http://projecte01.development.barcelonamedia.org/fonetic/ : you will see a Top Words list (in Spanish and stemmed) in the list there : is the word si which is in 20649 documents. : If you click at this word, the system will perform the query : (x) content:si, with no answers at all :

Re: Indexing XML

2009-07-07 Thread Saeli Mathieu
I'm sorry I almost finish my script to format my xml in Solr's xml. I'll give it to you later, I think that can help some people like me in the future :) I just need to formate my output text and everything will be fine :) Cheers for your help guys ;) On Tue, Jul 7, 2009 at 7:06 PM, Jay Hill

Re: How to get various records in the result set

2009-07-07 Thread Chris Hostetter
duplicate post? http://www.nabble.com/how-to-shuffle-the-result-while-follow-some-priority-rules-at-the--same-time-to24282025.html#a24282025 FYI: reposting the same question twice doesn't tend to get responses faster, it just increases the total volume of mail and slows down everyones ability

Re: reindexed data on master not replicated to slave

2009-07-07 Thread solr jay
Ok, Here is the problem. In the function, the two directories tmpIndexDir and indexDir are the same (in this case only?), and then at the end of the function, the directory tmpIndexDir is deleted, which deletes the new index directory. } finally { delTree(tmpIndexDir); } On

Re: reindexed data on master not replicated to slave

2009-07-07 Thread solr jay
In fact, I saw the directory was created and then deleted. On Tue, Jul 7, 2009 at 12:29 PM, solr jay solr...@gmail.com wrote: Ok, Here is the problem. In the function, the two directories tmpIndexDir and indexDir are the same (in this case only?), and then at the end of the function, the

Re: Filtering MoreLikeThis results

2009-07-07 Thread Yao Ge
The answer to my owner question: ... requestHandler name=mlt class=solr.MoreLikeThisHandler lst name=defaults/ /requestHandler ... would work. -Yao Yao Ge wrote: I am not sure about the parameters for MLT the requestHandler plugin. Can one of you share the solrconfig.xml entry

Re: Faceting with MoreLikeThis

2009-07-07 Thread Yao Ge
Faceting on MLT request the use of MoreLikeThisHandler. The standard request handler, while provide support to MLT via a search component, does not return facets on MLT results. To enable MLT handler, add an entry like below to your solrconfig.xml requestHandler name=mlt

Re: Indexing XML

2009-07-07 Thread Saeli Mathieu
And here it's my code :) If you need some explanation feel free to ask :) You can test it on the first test file I gave you when I open the thread. At the moment that works only on one file, I have to change it a bit to make it works on repertory with lots of xml files, See you later guys :-)

Re: Can't limit return fields in custom request handler

2009-07-07 Thread Chris Hostetter
: But I have a problem like this; when i call : http://localhost:8983/solr/select/?qt=cfacetq=%2BitemTitle:nokia%20%2BcategoryId:130start=0limit=3fl=id, : itemTitle : i'm getiing all fields instead of only id and itemTitle. Your custom handler is responsible for checking the fl and setting

RE: Is there any other way to load the index beside using http connection?

2009-07-07 Thread Francis Yakin
Norberto, You said last week: why not generate your SQL output directly into your oracle server as a file, upload the file to your SOLR server? Then the data file is local to your SOLR server , you will bypass any WAN and firewall you may be having. (or some variation of it, sql - SOLR server

about defaultSearchField

2009-07-07 Thread Yang Lin
Hi, I have some problems. For my solr progame, I want to type only the Query String and get all field result that includ the Query String. But now I can't get any result without specified field. For example, query with tina get nothing, but Sentence:tina could. I hava adjusted the *schema.xml*

Re: reindexed data on master not replicated to slave

2009-07-07 Thread solr jay
I guess in this case it doesn't matter whether the two directories tmpIndexDir and indexDir are the same or not. It looks that the index directory is switched to tmpIndexDir and then it is deleted inside finally. On Tue, Jul 7, 2009 at 12:31 PM, solr jay solr...@gmail.com wrote: In fact, I saw

Re: Stopwords when facetting

2009-07-07 Thread Chris Hostetter
: When indexing or querying text, i'm using the solr.StopFilterFactory ; it seems to works just fine... : : But I want to use the text field as a facet, and get all the commonly : used words in a set of results, without the stopwords. As far as I : tried, I always get stopwords, and numerical

Re: Preparing the ground for a real multilang index

2009-07-07 Thread Jan Høydahl
When using stemming, you have to know the query language. For your project, perhaps you should look into switching to a lemmatizer instead. I believe Lucid can provide integration with a commercial lemmatizer. This way you can expand the document field itself and do not need to know the

Re: Preparing the ground for a real multilang index

2009-07-07 Thread Benson Margulies
There is an alternative to knowing the language at query: multiply-process for stems or lemmas of all the possible languages. This may well be a cure much worse than the disease. Yes, LI can sell you our lemma-production capability. --benson margulies basis technology On Tue, Jul 7, 2009 at

A big question about Solr and SolrJ range query ?

2009-07-07 Thread huenzhao
Hi all: Suppose that my index have 3 fields: title, x and y. I know one range(10 x 100) can query liks this: http://localhost:8983/solr/select?q=x:[10 TO 100]fl=title If I want to two range(10 x 100 AND 20 y 300) query like SQL(select title where x10 and x 100 and y 20 and y 300)

Re: A big question about Solr and SolrJ range query ?

2009-07-07 Thread Yao Ge
use Solr's Filter Query parameter fq: fq=x:[10 TO 100]fq=y:[20 TO 300]fl=title -Yao huenzhao wrote: Hi all: Suppose that my index have 3 fields: title, x and y. I know one range(10 x 100) can query liks this: http://localhost:8983/solr/select?q=x:[10 TO 100]fl=title If I want

Re: about defaultSearchField

2009-07-07 Thread Yao Ge
Try with fl=* or fl=*,score added to your request string. -Yao Yang Lin-2 wrote: Hi, I have some problems. For my solr progame, I want to type only the Query String and get all field result that includ the Query String. But now I can't get any result without specified field. For

RE: Query on the updation of synonym and stopword file.

2009-07-07 Thread Sagar Khetkade
I am using Solr1.3 version.. Date: Wed, 8 Jul 2009 01:12:02 +0900 From: k...@r.email.ne.jp To: solr-user@lucene.apache.org Subject: Re: Query on the updation of synonym and stopword file. Sagar, I am facing a problem here that even after the core reload and re-indexing the

Updating Solr index from XML files

2009-07-07 Thread Francis Yakin
I have the following curl cmd to update and doing commit to Solr ( I have 10 xml files just for testing) curl http://solr00:7001/solr/update --data-binary @xml_Artist-100170.txt -H 'Content-type:text/plain; charset=utf-8' curl http://solr00:7001/solr/update --data-binary

Re: reindexed data on master not replicated to slave

2009-07-07 Thread Noble Paul നോബിള്‍ नोब्ळ्
jay, Thanks. The testcase was not enough. I have given a new patch . I guess that should solve this On Wed, Jul 8, 2009 at 3:48 AM, solr jaysolr...@gmail.com wrote: I guess in this case it doesn't matter whether the two directories tmpIndexDir and indexDir are the same or not. It looks that the

Re: Updating Solr index from XML files

2009-07-07 Thread Otis Gospodnetic
If Perl is you choice: http://search.cpan.org/~bricas/WebService-Solr-0.07/lib/WebService/Solr.pm Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Francis Yakin fya...@liquid.com To: solr-user@lucene.apache.org solr-user@lucene.apache.org