Multiple schemas in the same SolrCloud ?

2013-10-09 Thread xinwu
Hi all, I want to use the multiple schemas in the same solrCloud, is it allowed? If it is allowed,how? These schemas may have no relation. Thank You. Dai. -- View this message in context: http://lucene.472066.n3.nabble.com/Multiple-schemas-in-the-same-SolrCloud-tp4094279.html Sent from

synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
Hi: I'm involved in a process o upgrade solr from 1.4 to 4.4 and I'm having a problem using SynonymFilterFactory within the process chain SynonymFilterFactory, StopFilterFactory . I have configured synonyms.txt to expand the word AIO as: all-in-one. Well, when using solr 1.4 I get the following

Re: Multiple schemas in the same SolrCloud ?

2013-10-09 Thread Anshum Gupta
You can simply have multiple collections, each independent of each other on the schema but could run on the same instance/jvm if you want. On Wed, Oct 9, 2013 at 12:36 PM, xinwu xinwu0...@gmail.com wrote: Hi all, I want to use the multiple schemas in the same solrCloud, is it allowed? If

Re: dynamic field question

2013-10-09 Thread Aloke Ghoshal
Hi David, A separate Solr document for each section is a good option if you also need to handle phrases, case, special characters, etc. within the title field. How do you map them to dynamic fields? E.g.: Appendix for cities, APPENDIX 1: Cities Regards, Aloke On Wed, Oct 9, 2013 at 9:45 AM,

Re: Multiple schemas in the same SolrCloud ?

2013-10-09 Thread xinwu
I remember I must put the -Dbootstrap_confdir=/opt/Solr_home/collection1/conf -Dcollection.configName=solrConfig in the catalina.sh . Is it means that solrCloud must have one ,and only one, schema? -- View this message in context:

Re: SolrJ best pratices

2013-10-09 Thread Furkan KAMACI
I suggest you to look at here: http://wiki.apache.org/solr/Solrj?action=fullsearchcontext=180value=cloudsolrservertitlesearch=Titles#Using_with_SolrCloud 2013/10/9 Shawn Heisey s...@elyograg.org On 10/7/2013 3:08 PM, Mark wrote: Some specific questions: - When working with HttpSolrServer

Re: SolrCloud High Availability during indexing operation

2013-10-09 Thread Furkan KAMACI
Hi Saurabh, Your link does not work (it is broken). 2013/10/9 Saurabh Saxena ssax...@gopivotal.com Pastbin link http://pastebin.com/cnkXhz7A I am doing a bulk request. I am uploading 100 files, each file having 100 docs. -Saurabh On Tue, Oct 8, 2013 at 7:39 PM, Mark Miller

Re: synonyms and term position

2013-10-09 Thread Furkan KAMACI
Could you send screenshot of admin Analysis page when trying to analyze that words? 2013/10/9 Alvaro Cabrerizo topor...@gmail.com Hi: I'm involved in a process o upgrade solr from 1.4 to 4.4 and I'm having a problem using SynonymFilterFactory within the process chain SynonymFilterFactory,

Re: Multiple schemas in the same SolrCloud ?

2013-10-09 Thread Furkan KAMACI
You can have more information from here: https://cwiki.apache.org/confluence/display/solr/Using+ZooKeeper+to+Manage+Configuration+Files 2013/10/9 xinwu xinwu0...@gmail.com I remember I must put the -Dbootstrap_confdir=/opt/Solr_home/collection1/conf -Dcollection.configName=solrConfig in the

Re: synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
Sure, Find attached the screenshots with almost all the analysis, (dont worry about the lowercase and the porter stemmer) Regards. On Wed, Oct 9, 2013 at 10:17 AM, Furkan KAMACI furkankam...@gmail.comwrote: Could you send screenshot of admin Analysis page when trying to analyze that

Re: no such field error:smaller big block size details while indexing doc files

2013-10-09 Thread sweety
I will try using solrj.Thanks. but I tried to index .docx file I am getting  some different error: SEVERE: null:java.lang.RuntimeException: java.lang.VerifyError: (class: org/apache/poi/extractor/ExtractorFactory, method: createExtractor signature:

Re: no such field error:smaller big block size details while indexing doc files

2013-10-09 Thread sweety
I will try using solrJ. Now I tried indexing .docx files and I get some different error,logs are: SEVERE: null:java.lang.RuntimeException: java.lang.VerifyError: (class: org/apache/poi/extractor/ExtractorFactory, method: createExtractor signature:

Re: synonyms and term position

2013-10-09 Thread Furkan KAMACI
Does two has a synonym of in and one? 2013/10/9 Furkan KAMACI furkankam...@gmail.com Does two has a synonym of in and one? 2013/10/9 Alvaro Cabrerizo topor...@gmail.com Sure, Find attached the screenshots with almost all the analysis, (dont worry about the lowercase and the porter

Re: synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
No, it has no synonyms. On Wed, Oct 9, 2013 at 10:48 AM, Furkan KAMACI furkankam...@gmail.comwrote: Does two has a synonym of in and one? 2013/10/9 Furkan KAMACI furkankam...@gmail.com Does two has a synonym of in and one? 2013/10/9 Alvaro Cabrerizo topor...@gmail.com Sure, Find

Re: synonyms and term position

2013-10-09 Thread Alvaro Cabrerizo
The synonyms.txt has defined the next associations defined. AIO=All in one aio=all-in-one Regards. On Wed, Oct 9, 2013 at 11:05 AM, Alvaro Cabrerizo topor...@gmail.comwrote: No, it has no synonyms. On Wed, Oct 9, 2013 at 10:48 AM, Furkan KAMACI furkankam...@gmail.comwrote: Does two has

Collection API wrong configuration

2013-10-09 Thread maephisto
I'm experimenting with SolrCloud using Solr 4.5.0 and the Collection API What i did was: 1. upload configuration to ZK zkcli.sh -cmd upconfig -zkhost 127.0.0.1:8993 -d solr/my_custom_collection/conf/ -n my_custom_collection 2. create a collection using the api:

Re: dynamic field question

2013-10-09 Thread Twomey, David
OK. Then the JSON returned would contain a lot of documents that are really sections. This would work fine for the use-case I mentioned but I also use the index for full-text search of the whole document. Therefore, I would need to parse the result JSON in a way that combines the solr docs

Re: Collection API wrong configuration

2013-10-09 Thread maephisto
Using Solr 4.4.0 the same scenarion behaves as expected. Can anyone else try this, to check if it this only happens with 4.5.0 and if so, is this a desired behaviour or a bug? -- View this message in context:

Re: Collection API wrong configuration

2013-10-09 Thread Shalin Shekhar Mangar
This may be a bug in 4.5 Another user has also reported this bug: https://issues.apache.org/jira/browse/SOLR-5307 On Wed, Oct 9, 2013 at 3:51 PM, maephisto my_sky...@yahoo.com wrote: Using Solr 4.4.0 the same scenarion behaves as expected. Can anyone else try this, to check if it this only

Re: Collection API wrong configuration

2013-10-09 Thread primoz . skale
Works fine at my end. I use Solr 4.5.0 on Windows 7. I tried: zkcli.bat -cmd upconfig -zkhost localhost:9000 -d ..\solr\collection2\conf -n my_custom_collection java -Djetty.port=8001 -DzkHost=localhost:9000 -jar start.jar and finally

Find documents that are composed of % words

2013-10-09 Thread shahzad73
Is there a way that in Solr Query i find documents that is composed of n number of words. for example here is the list of words - Love - Ice - Cream - Sunny - I - To - A - On - Elephant - Balloon And a percentage such as: 80% Let’s assume you’re analyzing the text of the following sentence.

Re: Collection API wrong configuration

2013-10-09 Thread maephisto
Yes, the problem described in the ticket is what I'm also confronting with. -- View this message in context: http://lucene.472066.n3.nabble.com/Collection-API-wrong-configuration-in-4-5-0-tp4094319p4094335.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: dynamic field question

2013-10-09 Thread Otis Gospodnetic
David, Yes. Document grouping (aka field collapsing) will help you here. It should also allow you to create a better search experience on the front end - it's often better to narrow down where in a large document a match is than give users a large doc and say: we know the match is in here

Re: Find documents that are composed of % words

2013-10-09 Thread Otis Gospodnetic
Hi, You can take your words, combine some % of them with AND. Then take another set of them OR it with the previous set, and so on. Otis Solr ElasticSearch Support http://sematext.com/ On Oct 9, 2013 6:54 AM, shahzad73 shahzad...@yahoo.com wrote: Is there a way that in Solr Query i find

Re: dynamically adding core with auto-discovery in Solr 4.5

2013-10-09 Thread Erick Erickson
Jan: This worked for me if I do NOT have a core.properties at all in my new core. Personally I think the behavior in 4.4 was dangerous, what happens if you mis-type the command for instance? You could do Bad Things to the old core you were inadvertently re-creating. The core.properties file

Re: {soft}Commit and cache flusing

2013-10-09 Thread Erick Erickson
Tim: I think you're mis-interpreting. By replying to a post with the subject: {soft}Commit and cache flushing but going in a different direction, it's easy for people to think I'm not interested in that thread, I'll ignore it, thereby missing the fact that you're asking a somewhat different

Re: How to share Schema between multicore on Solr 4.4

2013-10-09 Thread Erick Erickson
Shawn: Hmmm, I hadn't thought about that before. The shareSchema stuff is keyed off the absolute directory (and timestamp) of the schema.xml file associated with a core and is about sharing the internal object that holds the parsed schema. Do you know for sure if the fact that this is coming

Re: dynamically adding core with auto-discovery in Solr 4.5

2013-10-09 Thread Jan Van Besien
On Wed, Oct 9, 2013 at 2:15 PM, Erick Erickson erickerick...@gmail.com wrote: This worked for me if I do NOT have a core.properties at all in my new core. Personally I think the behavior in 4.4 was dangerous, what happens if you mis-type the command for instance? You could do Bad Things to

Re: Find documents that are composed of % words

2013-10-09 Thread Aloke Ghoshal
Hi Shahzad, Have you tried with the Minimum Should Match feature: http://wiki.apache.org/solr/ExtendedDisMax#mm_.28Minimum_.27Should.27_Match.29 Regards, Aloke On Wed, Oct 9, 2013 at 4:55 PM, Otis Gospodnetic otis.gospodne...@gmail.com wrote: Hi, You can take your words, combine some % of

Permisions didn't check when call discoverUnder

2013-10-09 Thread Said Chavkin
Hello. When in solr/home directory exists directory to which solr do not have rights, then solr failed to start with exception: 2108 [main] INFO org.apache.solr.core.CoresLocator - Looking for core definitions underneath /var/lib/solr 2109 [main] ERROR org.apache.solr.servlet.SolrDispatchFilter

Re: dynamically adding core with auto-discovery in Solr 4.5

2013-10-09 Thread Erick Erickson
If you create a Wiki login, I'll be happy to add you to the contributors list. It's always valuable to have fresh eyes update docs while the ambiguities are still fresh! Erick On Wed, Oct 9, 2013 at 8:37 AM, Jan Van Besien ja...@ngdata.com wrote: On Wed, Oct 9, 2013 at 2:15 PM, Erick Erickson

Re: Permisions didn't check when call discoverUnder

2013-10-09 Thread Erick Erickson
What do you think Solr should do in this case? If the process doesn't have permission to the dir, it can't write to it. You need to set the permissions, or the authority of the process that Solr is running as appropriately. Best, Erick On Wed, Oct 9, 2013 at 8:54 AM, Said Chavkin

RE: Searching on (hyphenated/capitalized) word issue

2013-10-09 Thread Van Tassell, Kristian
Thank you Upayavira. I'm trying to figure out what will make Solr stem on multi in the word multicad so that any attempt to search on multicad, Multi-CAD or multiCAD will return results. The WordDelimiterFilterFactory helps with the case of multi followed by a dash or a capital letter, but I'm

Re: Permisions didn't check when call discoverUnder

2013-10-09 Thread Said Chavkin
I'm not sure, may be solr should skip inaccessible directory. Because it is standard rule to place service on separate filesystem. On the other hand it is possible to place solr/home not on the top of mounted fs. Anyway it would be better if error message was more clearly. 2013/10/9 Erick

Re: Shard split issue

2013-10-09 Thread Shalin Shekhar Mangar
I opened https://issues.apache.org/jira/browse/SOLR-5324 On Mon, Oct 7, 2013 at 2:20 PM, Yago Riveiro yago.rive...@gmail.com wrote: If the replica has 20G must probably the recovery will take more than 120 seconds. In my case I have ssd's and 120 it's not enough. -- Yago Riveiro Sent

Re: Searching on (hyphenated/capitalized) word issue

2013-10-09 Thread Furkan KAMACI
If you have that word to index: multicad and if you want to get result when you search that: multi you can use ngram filter. However you should consider pros and cons of using Ngram Filter. If you use ngrams you may find multicad from multi but your index size will be much more bigger. I suggest

Update existing documents when using ExtractingRequestHandler?

2013-10-09 Thread Jeroen Steggink
Hi, In a content management system I have a document and an attachment. The document contains the meta data and the attachment the actual data. I would like to combine data of both in one Solr document. I have thought of several options: 1. Using ExtractingRequestHandler I would extract the

Re: run filter queries after post filter

2013-10-09 Thread Rohit Harchandani
Hey, so the post filter logs the number of ids that it receives. With the above filter having cost=200, the post filter should have received the same number of ids as before ( when the filter was not present ). But that does not seem to be the case...with the filter query on the index, the number

Re: Find documents that are composed of % words

2013-10-09 Thread shahzad73
Please help me formulate the query that will be easy or do i have to build a custom filter for this ? Shahzad -- View this message in context: http://lucene.472066.n3.nabble.com/Find-documents-that-are-composed-of-words-tp4094264p4094372.html Sent from the Solr - User mailing list archive at

Re: Find documents that are composed of % words

2013-10-09 Thread shahzad73
my client has a strange requirement, he will give a list of 500 words and then set a percentage like 80% now he want to find those pages or documents which consist of the only those 80% of 500 and only 20% unknown. like we have this document word1 word2

Re: How to share Schema between multicore on Solr 4.4

2013-10-09 Thread Shawn Heisey
On 10/9/2013 6:24 AM, Erick Erickson wrote: Hmmm, I hadn't thought about that before. The shareSchema stuff is keyed off the absolute directory (and timestamp) of the schema.xml file associated with a core and is about sharing the internal object that holds the parsed schema. Do you know for

Solr's Filtering approaches

2013-10-09 Thread David Philip
Hi All, I have an issue in handling filters for one of our requirements and liked to get suggestion for the best approaches. *Use Case:* 1. We have List of groups and the number of groups can increase upto 1 million. Currently we have almost 90 thousand groups in the solr search system.

Re: Multiple schemas in the same SolrCloud ?

2013-10-09 Thread Shawn Heisey
On 10/9/2013 1:17 AM, xinwu wrote: I remember I must put the -Dbootstrap_confdir=/opt/Solr_home/collection1/conf -Dcollection.configName=solrConfig in the catalina.sh . Is it means that solrCloud must have one ,and only one, schema? Those bootstrap options are intended to be used ONCE, and

Re: SolrJ best pratices

2013-10-09 Thread Mark
Thanks for the clarification. In Solr Cloud just use 1 connection. In non-cloud environments you will need one per core. On Oct 8, 2013, at 5:58 PM, Shawn Heisey s...@elyograg.org wrote: On 10/7/2013 3:08 PM, Mark wrote: Some specific questions: - When working with HttpSolrServer should

Re: Find documents that are composed of % words

2013-10-09 Thread Furkan KAMACI
Are you asking something like that: http://wiki.apache.org/solr/TextProfileSignature 9 Ekim 2013 Çarşamba tarihinde shahzad73 shahzad...@yahoo.com adlı kullanıcı şöyle yazdı: Please help me formulate the query that will be easy or do i have to build a custom filter for this ? Shahzad --

Re: limiting deep pagination

2013-10-09 Thread Michael Sokolov
On 10/8/13 6:51 PM, Peter Keegan wrote: Is there a way to configure Solr 'defaults/appends/invariants' such that the product of the 'start' and 'rows' parameters doesn't exceed a given value? This would be to prevent deep pagination. Or would this require a custom requestHandler? Peter Just

Re: SolrCloud High Availability during indexing operation

2013-10-09 Thread Saurabh Saxena
@Furkan Pastebin link is working for me. Can you try again ? On Wed, Oct 9, 2013 at 1:15 AM, Furkan KAMACI furkankam...@gmail.comwrote: Hi Saurabh, Your link does not work (it is broken). 2013/10/9 Saurabh Saxena ssax...@gopivotal.com Pastbin link http://pastebin.com/cnkXhz7A I am

Re: {soft}Commit and cache flusing

2013-10-09 Thread Tim Vaillancourt
Apologies all. I think the suggestion that I was replying to get noticed is what erked me, otherwise I would have moved on. I'll follow this advice. Cheers, Tim On 9 October 2013 05:20, Erick Erickson erickerick...@gmail.com wrote: Tim: I think you're mis-interpreting. By replying to a

matching starts with only

2013-10-09 Thread adm1n
My index contains documents which could be a single word or a short sentence which contains up to 4-5 words. I need to return documents, which starts with only from the searched pattern. in regex it would be [^my_query]. for example, for a docs: black beautiful black cat cat cat is black black

Re: matching starts with only

2013-10-09 Thread Shawn Heisey
On 10/9/2013 12:57 PM, adm1n wrote: My index contains documents which could be a single word or a short sentence which contains up to 4-5 words. I need to return documents, which starts with only from the searched pattern. in regex it would be [^my_query]. for example, for a docs: black

Re: run filter queries after post filter

2013-10-09 Thread Erick Erickson
Ah, I think you're misunderstanding the nature of post-filters. Or I'm confused, which happens a lot! The whole point of post filters is that they're assumed to be expensive (think ACL calculation). So you want them to run on the fewest documents possible. So only docs that make it through the

Re: matching starts with only

2013-10-09 Thread adm1n
Shawn Heisey-4: thanks for the quick response. Why this field have to be copyField? Couldn't it be a single field, for example: fieldType name=text_general_long class=solr.TextField positionIncrementGap=100 analyzer type=index tokenizer class=solr.KeywordTokenizerFactory/

Re: How to share Schema between multicore on Solr 4.4

2013-10-09 Thread Erick Erickson
bq: ...in the sense that there's only one canonical copy. Agreed, and as you say that copy is kept in ZooKeeper. And I pretty much guarantee that the internal solrconfig object is NOT shared. I doubt the schema object is shared, but it seems like it could be with some work. But the savings

Re: matching starts with only

2013-10-09 Thread Shawn Heisey
On 10/9/2013 2:16 PM, adm1n wrote: Why this field have to be copyField? Couldn't it be a single field, for I always assume that people already are using the existing field and type for other purposes. Offering advice without making that assumption will usually result in people making a

Re: Searching on (hyphenated/capitalized) word issue

2013-10-09 Thread Erick Erickson
The admin/analysis page is definitely your friend. On the surface, [catenateWords=1] in WDFF should mash the split up bits of multiCAD into multicad and you should be. I suspect that StandardTokenizerFactory is somehow getting into the mix here. Under any circumstance, the admin/analysis page

Re: matching starts with only

2013-10-09 Thread adm1n
search by starts with is something new I have to add, as well as the data I have to index for this purpose, so it's ok to create a new field. But once I added the following field type: fieldType name=text_general_long class=solr.TextField positionIncrementGap=100 analyzer type=index

Re: run filter queries after post filter

2013-10-09 Thread Rohit Harchandani
yes i get that. actually i should have explained in more detail. - i have a query which gets certain documents. - the post filter gets these matched documents and does some processing on them and filters the results. - but after this is done i need to apply another filter - which is why i gave a

Re: Searching on (hyphenated/capitalized) word issue

2013-10-09 Thread Upayavira
It depends whether multicad is a special case, or whether you want micr to match the term microsoft. If it is a special case, you can use synonyms, so that multi and multicad are considered the same term. If it isn't a special case, then ngrams could work - your document would be indexed with:

Dynamically loading synonym dictionary for solr SynonymFilter

2013-10-09 Thread ALEX PKB
Hi, All of our synonyms are maintained in DB, we would like to fetch those synonym dynamically for query expansion (Not indexing time). Are there any code contribution? I saw some discussion years ago but without conclusion. Thanks a lot!

Re: run filter queries after post filter

2013-10-09 Thread jim ferenczi
Hi Rohit, The main problem is that if the query inside the filter does not have a PostFilter implementation then your post filter is silently transformed into a simple filter. The query field:value is based on the inverted lists and does not have a postfilter support. If your field is a numeric

Re: Solr 4.4 - Master/Slave configuration - Replication Issue with Commits after deleting documents using Delete by ID

2013-10-09 Thread Otis Gospodnetic
Bharat, Can you look at the logs on the Master when you issue the delete and the subsequent commits and share that? Otis -- Solr ElasticSearch Support -- http://sematext.com/ Performance Monitoring -- http://sematext.com/spm On Tue, Oct 8, 2013 at 3:57 PM, Akkinepalli, Bharat (ELS-CON)

Re: Dynamically loading synonym dictionary for solr SynonymFilter

2013-10-09 Thread Jan Høydahl
Hi, Not as I know of. You'd probably want to subclass SynonymFilter* with your own DB aware implementation, and of course contribute this back :) -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com 9. okt. 2013 kl. 23:31 skrev ALEX PKB alex...@gmail.com: Hi, All of

Field with default value and stored=false, will be reset back to the default value in case of updating other fields

2013-10-09 Thread deniz
hi all, I have encountered some problems and post it on stackoverflow here: http://stackoverflow.com/questions/19285251/solr-field-with-default-value-resets-itself-if-it-is-stored-false as you can see from the response, does it make sense to open a bug ticket for this? because, although i can

Re: Field with default value and stored=false, will be reset back to the default value in case of updating other fields

2013-10-09 Thread Bill Bell
You have to update the whole record including all fields... Bill Bell Sent from mobile On Oct 9, 2013, at 7:50 PM, deniz denizdurmu...@gmail.com wrote: hi all, I have encountered some problems and post it on stackoverflow here:

Re: Field with default value and stored=false, will be reset back to the default value in case of updating other fields

2013-10-09 Thread deniz
Billnbell wrote You have to update the whole record including all fields... so what is the point of having atomic updates if i need to update everything? - Zeki ama calismiyor... Calissa yapar... -- View this message in context:

Re: Field with default value and stored=false, will be reset back to the default value in case of updating other fields

2013-10-09 Thread Shawn Heisey
On 10/9/2013 8:39 PM, deniz wrote: Billnbell wrote You have to update the whole record including all fields... so what is the point of having atomic updates if i need to update everything? If you have any regular fields that are not stored, atomic updates will not work -- unstored field