Replication failed without an error =(

2012-04-24 Thread stockii
hello.. anyone a idea how i can figure out why my replication failed ? i got no errors =( my configuratio is. 2 server! both are master and slave at the same time. only one server makes updates and is so the master. on slave is started via cron a replication. is one server crashed, i can

Re: Multi-words synonyms matching

2012-04-24 Thread elisabeth benoit
Hello, I'd like to resume this post. The only way I found to do not split synonyms in words in synonyms.txt it to use the line filter class=solr.SynonymFilterFactory synonyms=synonyms.txt ignoreCase=true expand=true tokenizerFactory=solr.KeywordTokenizerFactory/ in schema.xml where

Re: Replication failed without an error =(

2012-04-24 Thread stockii
bevore this problem i got this problem https://issues.apache.org/jira/browse/SOLR-1781 - --- System One Server, 12 GB RAM, 2 Solr Instances, 8 Cores, 1 Core with 45 Million Documents other Cores 200.000 - Solr1 for

Re: Group by distance

2012-04-24 Thread ravicv
Use group=true and group.field in your query. And your solr version should be solr 3.4 and above. Thanks, Ravi -- View this message in context: http://lucene.472066.n3.nabble.com/Group-by-distance-tp3934876p3934886.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Group by distance

2012-04-24 Thread ViruS
I think this only can works when I have many records in same position. My problem is to group witch short distance... like I say in last mail... about 10km. I need put markers on Poland country and display this. Now I have 100k records, but in future I will have about 2mln records so I must send

Auto suggest on indexed file content filtered based on user

2012-04-24 Thread prakash_ajp
I am trying to implement an auto-suggest feature. The search feature already exists and searches on file content in user's allotted workspace. The following is from my schema that will be used for search indexing: field name=Text type=text indexed=true stored=false multiValued=false/ field

Re: Deciding whether to stem at query time

2012-04-24 Thread Andrew Wagner
Ah, this is a really good point. Still seems like it has the downsides of #2, though, much bigger space requirements and possibly some time lost on queries. On Mon, Apr 23, 2012 at 3:35 PM, Walter Underwood wun...@wunderwood.orgwrote: There is a third approach. Create two fields and always

Searching on fields with White Spaces

2012-04-24 Thread Shubham Srivastava
I have a custom fieldtype with the below config fieldType name=text_ngram class=solr.TextField positionIncrementGap=100 autoGeneratePhraseQueries=true analyzer type=index tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.LowerCaseFilterFactory/

Re: Multi-words synonyms matching

2012-04-24 Thread Jeevanandam
usage of q and fq q = is typically the main query for the search request fq = is Filter Query; generally used to restrict the super set of documents without influencing score (more info. http://wiki.apache.org/solr/CommonQueryParameters#q) For example: q=hotel de ville ===

Re: Auto suggest on indexed file content filtered based on user

2012-04-24 Thread Jeevanandam
can you please share a sample query? -Jeevanandam On 24-04-2012 1:49 pm, prakash_ajp wrote: I am trying to implement an auto-suggest feature. The search feature already exists and searches on file content in user's allotted workspace. The following is from my schema that will be used for

Recovery - too many updates received since start

2012-04-24 Thread Trym R. Møller
Hi I experience that a Solr looses its connection with Zookeeper and re-establish it. After Solr is reconnection to Zookeeper it begins to recover. It has been missing the connection approximately 10 seconds and meanwhile the leader slice has received some documents (maybe about 1000

JDBC import yields no data

2012-04-24 Thread Hasan Diwan
I'm trying to migrate from RDBMS to the Lucene ecosystem. To do this, I'm trying to use the JDBC importer[1]. My configuration is given below: dataConfig dataSource driver=net.sf.log4jdbc.DriverSpy user=sa url=jdbc:log4jdbc:h2:tcp://192.168.1.6/finance/ !-- dataSource driver=org.h2.Driver

Recover - Read timed out

2012-04-24 Thread Trym R. Møller
Hi I experience that a Solr looses its connection with Zookeeper and re-establish it. After Solr is reconnection to Zookeeper it begins to recover its replicas. It has been missing the connection approximately 10 seconds and meanwhile the leader slice has received some documents (maybe about

Re: Multi-words synonyms matching

2012-04-24 Thread elisabeth benoit
yes, thanks, but this is NOT my question. I was wondering why I have multiple matches with q=hotel de ville and no match with fq=CATEGORY_ANALYZED:hotel de ville, since in both case I'm searching in the same solr fieldType. Why is q parameter behaving differently in that case? Why do the quotes

debugging junit test with eclipse

2012-04-24 Thread Bernd Fehling
I have tried all hints from internet for debugging a junit test of solr 3.6 under eclipse but didn't succeed. eclipse and everything is running, compiling, debugging with runjettyrun. Tests have no errors. Ant from command line ist also running with ivy, e.g. ant -Dtestmethod=testUserFields

Re: Deciding whether to stem at query time

2012-04-24 Thread Otis Gospodnetic
Hi Andrew, This would not necessarily increase the size of your index that much - you don't to store both fields, just 1 of them if you really need it for highlighting or displaying.  If not, just index. Otis  Performance Monitoring for Solr -

Query parsing VS marshalling/unmarshalling

2012-04-24 Thread Mindaugas Žakšauskas
Hi, I maintain a distributed system which Solr is part of. The data which is kept is Solr is permissioned and permissions are currently implemented by taking the original user query, adding certain bits to it which would make it return less data in the search results. Now I am at the point where

Re: Query parsing VS marshalling/unmarshalling

2012-04-24 Thread Benson Margulies
2012/4/24 Mindaugas Žakšauskas min...@gmail.com: Hi, I maintain a distributed system which Solr is part of. The data which is kept is Solr is permissioned and permissions are currently implemented by taking the original user query, adding certain bits to it which would make it return less

Re: Deciding whether to stem at query time

2012-04-24 Thread Paul Libbrecht
Le 24 avr. 2012 à 17:16, Otis Gospodnetic a écrit : This would not necessarily increase the size of your index that much - you don't to store both fields, just 1 of them if you really need it for highlighting or displaying. If not, just index. I second this. The query expansion process is

Re: Deciding whether to stem at query time

2012-04-24 Thread Andrew Wagner
I'm sorry, I'm missing something. What's the difference between storing and indexing a field? On Tue, Apr 24, 2012 at 10:28 AM, Paul Libbrecht p...@hoplahup.net wrote: Le 24 avr. 2012 à 17:16, Otis Gospodnetic a écrit : This would not necessarily increase the size of your index that much -

Re: Query parsing VS marshalling/unmarshalling

2012-04-24 Thread Mindaugas Žakšauskas
On Tue, Apr 24, 2012 at 3:27 PM, Benson Margulies bimargul...@gmail.com wrote: I'm about to try out a contribution for serializing queries in Javascript using Jackson. I've previously done this by serializing my own data structure and putting the JSON into a custom query parameter. Thanks for

RE: JDBC import yields no data

2012-04-24 Thread Dyer, James
You might also want to show us your dataimport handler configuration from solrconfig.xml and also the url you're using to start the data import. When its complete, browsing to http://192.168.1.6:8995/solr/db/dataimport; (or whatever the DIH handler name is in your config) should say indexing

Re: Group by distance

2012-04-24 Thread Erick Erickson
What do you mean by grouped? It's relatively easy to return only documents within a certain radius, and it's also easy to return the results ordered by distance. Here's a good place to start: http://wiki.apache.org/solr/SpatialSearch#geofilt_-_The_distance_filter Best Erick On Tue, Apr 24, 2012

Stats Component and solrj

2012-04-24 Thread Erik Fäßler
Hey all, I'd like to know how many terms I have in a particular field in a search. In other words, I want to know how many facets I have in that field. I use string fields, there are no numbers. I wanted to use the Stats Component and use its count value. When trying this out in the browser,

correct location in chain for EdgeNGramFilterFactory ?

2012-04-24 Thread geeky2
hello all, i want to experiment with the EdgeNGramFilterFactory at index time. i believe this needs to go in post tokenization - but i am doing a pattern replace as well as other things. should the EdgeNGramFilterFactory go in right after the pattern replace? fieldType

Re: Multi-words synonyms matching

2012-04-24 Thread Erick Erickson
Elisabeth: What shows up in the debug section of the response when you add debugQuery=on? There should be some bit of that section like: parsed_filter_queries My other question is are you absolutely sure that your CATEGORY_ANALYZED field has the correct content?. How does it get populated?

Re: Deciding whether to stem at query time

2012-04-24 Thread Erick Erickson
When you set store=true in your schema, a verbatim copy of the raw input is placed in the *.fdt file. That is the information returned when you specify the fl parameter for instance. When you set index=true, the input is analyzed and the resulting terms are placed in the inverted index and are

Re: Query parsing VS marshalling/unmarshalling

2012-04-24 Thread Erick Erickson
In general, query parsing is such a small fraction of the total time that, almost no matter how complex, it's not worth worrying about. To see this, attach debugQuery=on to your query and look at the timings in the pepare and process portions of the response. I'd be very sure that it was a

Re: solr replication failing with error: Master at: is not available. Index fetch failed

2012-04-24 Thread geeky2
hello, thank you for the reply, yes - master has been indexed. ok - makes sense - the polling interval needs to change i did check the solr war file on both boxes (master and slave). they are identical. actually - if they were not indentical - this would point to a different issue altogether

Re: Query parsing VS marshalling/unmarshalling

2012-04-24 Thread Mindaugas Žakšauskas
Hi Erick, Thanks for looking into this and for the tips you've sent. I am leaning towards custom query component at the moment, the primary reason for it would be to be able to squeeze the amount of data that is sent over to Solr. A single round trip within the same datacenter is worth around

Re: Auto suggest on indexed file content filtered based on user

2012-04-24 Thread prakash_ajp
Right now, the query is a very simple one, something like q=text. Basically, it would return ['textview', 'textviewer', ..] But the issue is, the 'textviewer' could be from a file that is out of bounds for this user. So, ultimately I would like to include the userName in the query. As mentioned

Re: solr replication failing with error: Master at: is not available. Index fetch failed

2012-04-24 Thread Rahul Warawdekar
Hi, In Solr wiki, for replication, the master url is defined as follows str name=masterUrlhttp://master_host:port /solr/corename/replication/str This url does not contain admin in its path where as in the master url provided by you, you have an additional admin in the url. Not very sure if this

Re: Auto suggest on indexed file content filtered based on user

2012-04-24 Thread Jeevanandam Madanagopal
On Apr 24, 2012, at 9:37 PM, prakash_ajp wrote: Right now, the query is a very simple one, something like q=text. Basically, it would return ['textview', 'textviewer', ..] hmm, so you're using default query field But the issue is, the 'textviewer' could be from a file that is out of

Re: JDBC import yields no data

2012-04-24 Thread Hasan Diwan
On 24 April 2012 07:49, Dyer, James james.d...@ingrambook.com wrote: You might also want to show us your dataimport handler configuration from solrconfig.xml and also the url you're using to start the data import. When its complete, browsing to http://192.168.1.6:8995/solr/db/dataimport;

Re: JDBC import yields no data

2012-04-24 Thread Gora Mohanty
On 24 April 2012 22:22, Hasan Diwan hasan.di...@gmail.com wrote: [...] The dataimport url I'm using is http://192.168.1.6:8995/solr/db/dataimport?command=full-import And, does it show you any output? As James mentions, it should say busy while the data import is running, and indexing completed

RE: JDBC import yields no data

2012-04-24 Thread Dyer, James
After you issue the full-import command with the url you gave: http://192.168.1.6:8995/solr/db/dataimport?command=full-import Paste the url in a web browser without the command http://192.168.1.6:8995/solr/db/dataimport It should be giving you status as to how many database calls its made, how

Re: Query parsing VS marshalling/unmarshalling

2012-04-24 Thread Erick Erickson
If you're assembling an fq clause, this is all done or you, although you need to take some care to form the fq clause _exactly_ the same way each time. Think of the filterCache as a key/value map where the key is the raw fq text and the value is the docs satisfying that query. So fq=acl:(a OR a)

Re: solr replication failing with error: Master at: is not available. Index fetch failed

2012-04-24 Thread geeky2
that was it! thank you. i did notice something else in the logs now ... what is the meaning or implication of the message, Connection reset.? 2012-04-24 12:59:19,996 INFO [org.apache.solr.handler.SnapPuller] (pool-12-thread-1) Slave in sync with master. 2012-04-24 12:59:39,998 INFO

RE: Auto suggest on indexed file content filtered based on user

2012-04-24 Thread Klostermeyer, Michael
I'm new to Solr, but I would think the fq=[username] would work here. http://wiki.apache.org/solr/CommonQueryParameters#fq Mike -Original Message- From: prakash_ajp [mailto:prakash_...@yahoo.com] Sent: Tuesday, April 24, 2012 11:07 AM To: solr-user@lucene.apache.org Subject: Re: Auto

Re: Auto suggest on indexed file content filtered based on user

2012-04-24 Thread prakash_ajp
I read on a couple of other web pages that fq is not supported for suggester. I even tried the query and it doesn't help. My understanding was, when the suggest (spellcheck) index is built, only the field chosen is considered for queries and the other fields from the main index are not available

Re: Auto suggest on indexed file content filtered based on user

2012-04-24 Thread Jeevanandam Madanagopal
yes only spellcheck indexed build field is for suggest query I believe, filtering a documents on search handler using fq parameter and spell suggest are two part we are discussing here. lets say you have field for spellcheck - used to build spell dictionary field name=spell type=textSpell …. …

Field names w/ leading digits cause strange behavior

2012-04-24 Thread bleakley
When specifying a field name that starts with a digit (or digits) in the fl parameter solr returns both the field name and field value as the those digits. For example, using nightly build apache-solr-4.0-2012-04-24_08-27-47 I run: java -jar start.jar and java -jar post.jar solr.xml monitor.xml

embedded solr populating field of type LatLonType

2012-04-24 Thread Jason Cunning
Hi, I have a question concerning the spatial field type LatLonType and populating it via an embedded solr server in java. So far I've only ever had to index simple types like boolean, float, and string. This is the first complex type. So I'd like to use the following field definition for

Re: correct location in chain for EdgeNGramFilterFactory ?

2012-04-24 Thread Erick Erickson
Well, what effect do you _want_? I'd probably put it after the PorterStemFilterFactory. As it is, it'll form a bunch of ngrams, then WordDelimiterFilterFactory will try to break them up according to _its_ rules and eventually you'll be sending absolute gibberish to the stemmer. I mean what is the

faceted searches - design question - facet field not part of qf search fields

2012-04-24 Thread geeky2
hello all, this is more of a design / newbie question on how others combine faceted search fields in to their requestHandlers. say you have a request handler set up like below. does it make sense (from a design perspective) to add a faceted search field that is NOT part of the main search

Re: Auto suggest on indexed file content filtered based on user

2012-04-24 Thread Erick Erickson
I don't know if there is a really good solution here. The problem is that suggester (and the trunk FST version) simply traverse the terms in the index. there's not even a real concept of those terms belonging to any document. Since your security level is on a document basis, that makes things

Re: faceted searches - design question - facet field not part of qf search fields

2012-04-24 Thread Erick Erickson
No problem here at all, it's done all the time. Consider a popular facet series in the last day, in the last week, in the last month... There's no reason you have to facet on the fields that are searched on. The user as search terms like my dog has fleas and your query looks like q=my dog has

Re: Field names w/ leading digits cause strange behavior

2012-04-24 Thread Erick Erickson
Hmmm, this does NOT happen on 3.6, and it DOES happen on trunk. Sure sounds like a JIRA to me, would you mind raising one? I can't imagine this is desired behavior, it's just weird. Thanks for pointing this out! Erick On Tue, Apr 24, 2012 at 3:38 PM, bleakley bleak...@factual.com wrote: When

Re: faceted searches - design question - facet field not part of qf search fields

2012-04-24 Thread Chris Hostetter
: : The user as search terms like my dog has fleas and your query : looks like : q=my dog has fleasfq=timestamp:[NOW/DAY TO NOW/DAY+1DAY] : and the user sees all documents with those terms added since midnight : last night. No confusion at all... right ... wether the facets are useful or

Re: Field names w/ leading digits cause strange behavior

2012-04-24 Thread bleakley
Thank you for verifying the issue. I've created a ticket at https://issues.apache.org/jira/browse/SOLR-3407 -- View this message in context: http://lucene.472066.n3.nabble.com/Field-names-w-leading-digits-cause-strange-behavior-tp3936354p3936599.html Sent from the Solr - User mailing list

Title Boosting and IDF

2012-04-24 Thread Tavi Nathanson
Hey everyone, I field documents by title and body. The title field often has far fewer terms than the body field. IDF, as a result, will have a profound effect in the title field compared to the body field. I currently have the title field boosted by 4x relative to the body field. While I want

Re: Auto suggest on indexed file content filtered based on user

2012-04-24 Thread Doug Mittendorf
Another option is to use faceting (via the facet.prefix param) for your auto-suggest. It's not as fast and scalable as using one of the Suggester implementations, but it does allow arbitrary fq parameters to be included in the request to limit the results.

QueryElevationComponent and distributed search

2012-04-24 Thread srinir
Hi, I am using solr 3.6. I saw in Solr wiki that QueryElevationComponent is not supported for distributed search. https://issues.apache.org/jira/browse/SOLR-2949 When I checked the above ticket, it looks like its fixed in Solr 4.0. Does anyone have any idea when a stable version of solr 4.0

Re: Auto suggest on indexed file content filtered based on user

2012-04-24 Thread prakash_ajp
The first one may not work because the number of users can be big. Besides, the users can simply register themselves and start using it. It won't work if an admin has to intervene in the registration process. The second could work I guess. But the problem would be data duplication as users might

Re: Auto suggest on indexed file content filtered based on user

2012-04-24 Thread prakash_ajp
Is it true that faceting is case sensitive? That would be disastrous for our requirement :( -- View this message in context: http://lucene.472066.n3.nabble.com/Auto-suggest-on-indexed-file-content-filtered-based-on-user-tp3934565p3937370.html Sent from the Solr - User mailing list archive at