date:20141211

Re: Join in SOLR

2014-12-11 Thread Tomoko Uchida

Hi, I cannot tell what 'entities' means in your context, but do you want some kind of RDB-like join functionality in Solr? Basically, Solr is not "relational". So first, you should consider denormalizing your RDB tables into one table/view (or issuing a SQL JOIN query in DIH) to import data into Solr. If

[Help] tab delimited gz file indexing steps

2014-12-11 Thread Sithik

Team, I have a compressed text file (gz) which holds tab-delimited data. Is it possible for me to index this file directly, without uncompressing it myself as a pre-processing step? If so, can you please tell me the steps/config changes I am supposed to follow? BTW, I am using solr-4.10
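
As a rough illustration of the client-side alternative to the question above (not a built-in Solr feature), a gzip-compressed, tab-delimited file can be streamed row by row without writing an uncompressed copy to disk. This is only a sketch: the function name `read_tsv_gz` and the field names are hypothetical, and sending the resulting dictionaries to Solr is left out.

```python
import csv
import gzip


def read_tsv_gz(path, fieldnames):
    """Yield one dict per row of a gzip-compressed, tab-delimited file.

    fieldnames is a caller-supplied list of column names (an assumption;
    the original message does not say whether the file has a header row).
    """
    # gzip.open in text mode decompresses on the fly while reading.
    with gzip.open(path, mode="rt", encoding="utf-8", newline="") as handle:
        reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        for row in reader:
            yield dict(zip(fieldnames, row))
```

Each yielded dictionary could then be turned into a Solr document by whatever client is already in use.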

Join in SOLR

2014-12-11 Thread Rajesh

I'm using Solr 4.10. While importing through DIH, I've configured 3 separate entities. I'm facing some problems with indexing and retrieval. 1) How can I define the unique key, given that the 3 entities have different fields? 2) Is there a join query with which I can join all 3 tables? -- View t

Documents with SOLR function "sort" are NOT sorted by score

2014-12-11 Thread eakarsu

I am having difficulty with my sort function. With the following sort, documents are not sorted by score, as you can see. Why is the sort function not able to sort them properly? I appreciate your prompt answer. This is my sort function: sort=map(and(termfreq(CustomersFavourite,852708),exists($exactqq)

Re: To understand SolrCloud configurations

2014-12-11 Thread Shawn Heisey

On 12/11/2014 8:09 PM, E S J wrote: > Thanks, I thought only option is default or schemaless because , When we > run bin/solr -e cloud you will get prompt like , > *Please choose a configuration for the gettingstarted collection, available > options are: default or schemaless [default]* I have

Re: To understand SolrCloud configurations

2014-12-11 Thread E S J

Thanks. I thought the only options were default or schemaless because, when we run bin/solr -e cloud, we get a prompt like: To begin, how many Solr nodes would you like to run in your local cluster? (specify 1-4 nodes) [2] 3 Ok, let's start up 3 Solr nodes for your example SolrCloud cluster. Please

Re: To understand SolrCloud configurations

2014-12-11 Thread Shawn Heisey

On 12/11/2014 6:31 PM, E S J wrote: > Thanks Eric, I understand your explanation. > Quick question, Are configurations sits under /configs/defaults because > -configname specified as default when I execute the following command? Can > I specify -configname as /c-ins/ > > zkcli.sh -zkhost zoo1.inte

Re: Details on why ConcurrentUpdateSolrServer is recommended for maximum index performance

2014-12-11 Thread Shawn Heisey

On 12/11/2014 9:19 AM, Michael Della Bitta wrote: > Only thing you have to worry about (in both the CUSS and the home grown > case) is a single bad document in a batch fails the whole batch. It's up > to you to fall back to writing them individually so the rest of the > batch makes it in. With CUS
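
The fallback described in the quoted text (retry documents individually when a batch is rejected) might look roughly like the sketch below. It is not SolrJ code: `send_batch` is a hypothetical caller-supplied function that raises an exception when Solr rejects an update.

```python
def index_with_fallback(docs, send_batch):
    """Send docs as one batch; if that fails, retry them one by one.

    send_batch is a hypothetical callable taking a list of documents and
    raising an exception if the update is rejected.
    Returns the documents that still failed when sent individually.
    """
    try:
        send_batch(docs)
        return []
    except Exception:
        failed = []
        for doc in docs:
            try:
                # Resend each document alone so one bad document
                # does not take the rest of the batch down with it.
                send_batch([doc])
            except Exception:
                failed.append(doc)
        return failed
```

The returned list can be logged or queued for later inspection.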

Re: To understand SolrCloud configurations

2014-12-11 Thread E S J

Thanks Eric, I understand your explanation. Quick question: do the configurations sit under /configs/defaults because -configname was specified as default when I executed the following command? Can I specify -configname as /c-ins/ zkcli.sh -zkhost zoo1.internal:2183,zoo2.internal:2183,zoo3.internal:2183

Re: To understand SolrCloud configurations

2014-12-11 Thread Erick Erickson

bq: 1. Like to know what I have done is correct ? Looks fine to me. bq: 2. Where to place the schema.xml's and other configurations. Because for the moment it's are under collection1/conf folder and collection1 is not an active collection for me. ( i'm using only c-ins core) I think you're a bit

Browse interface

2014-12-11 Thread tharpa

Is it possible to boost a query using the browse interface? How would one do this? -- View this message in context: http://lucene.472066.n3.nabble.com/Browse-interface-tp4173897.html Sent from the Solr - User mailing list archive at Nabble.com.

To understand SolrCloud configurations

2014-12-11 Thread E S J

Hello Team, I would like clarification on where to place schema.xml in a SolrCloud set-up. My SolrCloud set-up: 3 nodes, 3 shards and 3 replications, 3 ZooKeeper nodes. What I have done: 1. Taken a solr.war from solr default download ( solr-4.10.2/example/webapps/solr.war - 4.10.2) and placed /we

To understand SolrCloud configurations

2014-12-11 Thread E S J

Hello Team, I would like clarification on where to place schema.xml in a SolrCloud set-up. My SolrCloud set-up: 3 nodes, 3 shards and 3 replications, 3 ZooKeeper nodes. What I have done: 1. Taken a solr.war from solr default download ( solr-4.10.2/example/webapps/solr.war - 4.10.2) and placed /we

Re: Highlighting integer field

2014-12-11 Thread Michael Sokolov

So the short answer to your original question is "no." Highlighting is designed to find matches *within* a tokenized (text) field only. That is difficult because text gets processed and there are all sorts of complications, but for integers it should be pretty easy to match the values in the d

Re: Highlighting integer field

2014-12-11 Thread Pawel

Hi, Thanks for the response. It is quite important to me, for example, to highlight a multivalued field with many int or long tokens. -- Paweł On Thu, Dec 11, 2014 at 3:08 PM, Tomoko Uchida wrote: > > Hi Pawel, > > Essentially, highlighting is a feature to show "fragments of documents" > that matche use

Re: different fields for user-supplied phrases in edismax

2014-12-11 Thread alaa.abuzaghleh

explicit edismax 10 full_name,job_tree, company, city, state, country, first_name, last_name, id full_name_suggest^60 full_name_ngram^100.0 job_suggest^30 job_

Re: different fields for user-supplied phrases in edismax

2014-12-11 Thread Ahmet Arslan

Hi Mike, If I am not wrong, you are trying to simulate Google's behaviour. If you use quotes, Google returns exact matches. I think that makes perfect sense and will be a valuable addition. I remember some folks asked for this behaviour on the list. Ahmet On Thursday, December 11, 2014

Multilingual Suggester Solr 4.8

2014-12-11 Thread alaa.abuzaghleh

I am trying to create a suggester handler using Solr 4.8. Everything works fine, but when I try to get suggestions in different languages, Arabic or Japanese for example, I get results in mixed languages: when I search using only Japanese, I get Arabic results too. The following is my Schema.x

Re: Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-11 Thread Michael Sokolov

Have you rebooted the machine? (last refuge of the clueless, but often works) ... On 12/11/14 2:50 PM, solr-user wrote: yes, have triple checked the schema and solrconfig XML; various tools have indicated the XML is valid no missing types or dupes, and have not disabled the admin handler as m

different fields for user-supplied phrases in edismax

2014-12-11 Thread Michael Sokolov

I'd like to supply a different set of fields for phrases than for bare terms. Specifically, we'd like to treat phrases as more "exact" - probably turning off stemming and generally having a tighter analysis chain. Note: this is *not* what's done by configuring "pf" which controls fields for t

Re: Solr Error when making GeoPrefixTree polygon filter search

2014-12-11 Thread david.w.smi...@gmail.com

As in the layout shipped with Solr? Try putting the JTS ‘jar’ in lib/ext and let us know if that worked. I think it will but I forget. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Thu, Dec 11, 2014 at 12:40 PM, mathaix wrote

Re: Inconsistent doc value across two nodes - very simple test - what's the expected behavior?

2014-12-11 Thread Shalin Shekhar Mangar

I opened https://issues.apache.org/jira/browse/SOLR-6837 Probably best to have further conversations on the Jira issue. On Thu, Dec 11, 2014 at 6:46 PM, Shalin Shekhar Mangar < shalinman...@gmail.com> wrote: > Hi Gili, > > Great question! > > A write in Solr, by default, is only guaranteed to ex

Re: Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-11 Thread solr-user

Chris, will get the schema and solrconfig ready for uploading. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-4-10-2-Found-core-but-I-get-No-cores-available-in-dashboard-page-tp4173602p4173840.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-11 Thread solr-user

yes, have triple checked the schema and solrconfig XML; various tools have indicated the XML is valid no missing types or dupes, and have not disabled the admin handler as mentioned in my most recent response, I can see the coreX core (the renamed and unmodified collection1 core from the download

Re: Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-11 Thread Chris Hostetter

: coreA (which has our modified schema and solrconfig) is found by solr but is : not shown in the solr admin page: can you please include the *exact* solrconfig.xml & schema.xml you are using for coreA ... you've given us an overview of what you changed, but that's not enough for anyone to acta

Re: Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-11 Thread Chris Hostetter

: can you please include the *exact* solrconfig.xml & schema.xml you are : using for coreA ... you've given us an overview of what you changed, but : that's not enough for anyone to actually try and reproduce your problem. if it helps (since the list doesn't allow attachments) feel free to open a

Re: Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-11 Thread Alexandre Rafalovitch

And the XML is valid, lib references in solrconfig.xml point to the right libraries (if any), you don't have duplicate definitions of types, you don't have missing definitions of types? And you didn't disable the admin handler? And it's not just admin that's failing to find the core, right? If you

RE: Help with a Join Query

2014-12-11 Thread EXTERNAL Taminidi Ravi (ETI, Automotive-Service-Solutions)

Maybe you can try using an AND condition in a single join, something like q={!join from=parent to=id}(Id:xxx AND (Color:red OR Color:Blue)). I don't think this will cause a bigger performance issue. Thanks Ravi -Original Message- From: Darin Amos [mailto:dari...@gmail.com] Sent: Thursday

Re: Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-11 Thread solr-user

small correction; coreX (the one with the unmodified schema.xml and solrconfig.xml) IS seen by solr and appears on the solr admin page, but coreA (which has our modified schema and solrconfig) is found by solr but is not shown in the solr admin page: 1494 [main] INFO org.apache.solr.core.CoresLo

Re: Solr 4.10.2 "Found core" but I get "No cores available" in dashboard page

2014-12-11 Thread solr-user

my apologies for the lack of clarity our internal name for the project to upgrade solr from 4.0 to 4.10.2 is "helios" and so we named our test folder "heliosearch". I was not even aware of the github project Heliosearch, and nothing we are doing is related to it. to simplify things for this post

Re: Inconsistent doc value across two nodes - very simple test - what's the expected behavior?

2014-12-11 Thread Shalin Shekhar Mangar

Hi Gili, Great question! A write in Solr, by default, is only guaranteed to exist in 1 place i.e. the leader and the safety valves that we have to preserve these writes are: 1. The leaderVoteWait time for which leader election is suspended until enough live replicas are available 2. The two-way

Re: Help with a Join Query

2014-12-11 Thread Kydryavtsev Andrey

According to my experience, "query time join" has relatively poor performance. If you can cache these joins effectively (not so many unique color values in requests, cache doesn't invalidate) - it's ok. If not, it may be interesting to try "block join" instead - http://blog.griddynamics.com/201

Re: Help with a Join Query

2014-12-11 Thread Kydryavtsev Andrey

11.12.2014, 21:24, "Darin Amos" : > Thanks, > > That looks like a viable option, I could do something like the following: > > q={!join from=parent to=id} > &fq={!join from=parent to=id}color:red > &fq={!join from=parent to=id}color:blue > > With all these joins happening like this, what kind of p

Re: Help with a Join Query

2014-12-11 Thread Darin Amos

Thanks, That looks like a viable option, I could do something like the following: q={!join from=parent to=id} &fq={!join from=parent to=id}color:red &fq={!join from=parent to=id}color:blue With all these joins happening like this, what kind of performance concern is this? I would guess this wou
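
The request shape discussed above (one `{!join}` filter per color, so that only parents with a child of every listed color match) could be assembled like this. This is only a sketch: the helper names are hypothetical, and `q=*:*` is an assumption, since the main query in the message is cut off after the join prefix.

```python
from urllib.parse import urlencode


def join_filter(field, value, from_field="parent", to_field="id"):
    """Build a {!join} filter that selects parents with a matching child."""
    return "{!join from=%s to=%s}%s:%s" % (from_field, to_field, field, value)


def build_params(colors):
    """Return a URL-encoded query string with one fq per color.

    Solr intersects multiple fq parameters, so a parent has to satisfy
    every join filter. q=*:* is an assumed placeholder main query.
    """
    params = [("q", "*:*")]
    params += [("fq", join_filter("color", color)) for color in colors]
    return urlencode(params)
```

For example, `build_params(["red", "blue"])` produces a query string that can be appended to a select handler URL.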

RE: WordBreakSolrSpellChecker Usage

2014-12-11 Thread Dyer, James

Matt, There is no exact number here, but I would think most people would want "count" to be maybe 10-20. Increasing this incurs a very small performance penalty for each term it generates suggestions for, but you probably won't notice a difference. For "maxCollationTries", 5 is a reasonable n

Re: Help with a Join Query

2014-12-11 Thread Kydryavtsev Andrey

How about something like ({!join from=parent to=id}color:red) AND ({!join from=parent to=id}color:blue) ? 11.12.2014, 19:48, "Darin Amos" : > Hello, > > I am trying to execute a join query that I am not 100% sure how to execute. > Lets say I have a bunch of parent and child documents and every

Has anyone used Automatic Phrase Tokenization (AutoPhrasingTokenFilterFactory) ?

2014-12-11 Thread shamik

Hi, I'm trying to use AutoPhrasingTokenFilterFactory which seems to be a great solution to our phrase query issues. But doesn't seem to work as mentioned in the blog : https://lucidworks.com/blog/automatic-phrase-tokenization-improving-lucene-search-precision-by-more-precise-linguistic-anal

Re: Solr Error when making GeoPrefixTree polygon filter search

2014-12-11 Thread mathaix

Thank you. That was the issue. I am running Solr with Jetty. Is there a recommended way of including those jars in the Jetty configuration? -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-Error-when-making-GeoPrefixTree-polygon-filter-search-tp4173629p4173807.html

Re: WordBreakSolrSpellChecker Usage

2014-12-11 Thread Matt Mongeau

Is there a suggested value for this? I bumped them up to 20 and still nothing seems to have changed. On Thu, Dec 11, 2014 at 9:42 AM, Dyer, James wrote: > My first guess here, is seeing it works some of the time but not others, > is that these values are too low: > > 5 > 5 > > You know spellcheck

Re: Details on why ConcurrentUpdateSolrServer is recommended for maximum index performance

2014-12-11 Thread Yonik Seeley

On Thu, Dec 11, 2014 at 11:52 AM, Alexandre Rafalovitch wrote: > On 11 December 2014 at 11:40, Yonik Seeley wrote: >> So to Solr (server side), it looks like a single update request >> (assuming 1 thread) with a batch of multiple documents... but it was >> never actually "batched" on the client s

Re: Details on why ConcurrentUpdateSolrServer is recommended for maximum index performance

2014-12-11 Thread Alexandre Rafalovitch

On 11 December 2014 at 11:40, Yonik Seeley wrote: > So to Solr (server side), it looks like a single update request > (assuming 1 thread) with a batch of multiple documents... but it was > never actually "batched" on the client side. Does Solr also index them one-by-one as it parses them off th

Help with a Join Query

2014-12-11 Thread Darin Amos

Hello, I am trying to execute a join query that I am not 100% sure how to execute. Let's say I have a bunch of parent and child documents and every one of my child documents has a single value field “color”. If I want to search all parents that have a “red” child, this is very easy: {!join from

Re: Details on why ConcurrentUpdateSolrServer is recommended for maximum index performance

2014-12-11 Thread Yonik Seeley

On Wed, Dec 10, 2014 at 6:09 PM, Erick Erickson wrote: > So CUSS will do something like this: > 1> assemble a packet for Solr > 2> pass off the actual transmission > to Solr to a thread and immediately > go back to <1>. > > Basically, CUSS is doing async processing. The more important p
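
The hand-off pattern quoted above (assemble a packet, pass the transmission to a thread, go straight back to assembling) can be sketched generically as a queue with worker threads. This only illustrates the pattern and is not how ConcurrentUpdateSolrServer is actually implemented; `send` is a hypothetical callable that transmits one packet.

```python
import queue
import threading


def run_async_sender(packets, send, workers=2):
    """Hand packets to background threads so the producer never waits.

    send is a hypothetical callable that transmits one packet. Error
    handling is omitted: an exception in send stops that worker.
    """
    pending = queue.Queue()

    def worker():
        while True:
            packet = pending.get()
            if packet is None:  # sentinel: no more work for this worker
                break
            send(packet)

    threads = [threading.Thread(target=worker) for _ in range(workers)]
    for thread in threads:
        thread.start()
    for packet in packets:
        # Enqueue and return immediately; a worker does the transmission.
        pending.put(packet)
    for _ in threads:
        pending.put(None)  # one sentinel per worker
    for thread in threads:
        thread.join()
```

The producer loop only pays the cost of a queue insertion per packet, which is the point of the asynchronous design.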

Re: Details on why ConcurrentUpdateSolrServer is recommended for maximum index performance

2014-12-11 Thread Mikhail Khludnev

Agree with Erick. However, I suppose you can try to provide your own RequestWriter, and let it stream XML. btw, what's in them? How does Solr handle them right now? Why don't you want to start from the test? On Thu, Dec 11, 2014 at 7:04 PM, Erick Erickson wrote: > I don't think so, it uses SolrInpu

Inconsistent doc value across two nodes - very simple test - what's the expected behavior?

2014-12-11 Thread Gili Nachum

I know Solr CAP properties are CP, but I don't see it happening over a very basic test - doing something wrong? With two Solr nodes, I index doc1 to both, stop node2, update doc1, stop node1, start node2, start node1, and I get two different versions of the doc depending on which replica I query.

Re: Details on why ConcurrentUpdateSolrServer is recommended for maximum index performance

2014-12-11 Thread Michael Della Bitta

Tom: ConcurrentUpdateSolrServer isn't magic or anything. You could pretty trivially write something that takes batches of your XML documents and combines them into a single document (multiple <doc> tags in the <add> section) and sends them up to Solr and achieve some of the same speed benefits. If yo
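
The batching idea in this message could be sketched as follows, assuming each input is a Solr XML update message of the form `<add><doc>…</doc></add>` (or a bare `<doc>` element). The function name `combine_adds` is hypothetical, and posting the result to Solr is not shown.

```python
import xml.etree.ElementTree as ET


def combine_adds(xml_documents):
    """Merge several Solr XML update messages into a single <add> element.

    Each input string is assumed to be either <add><doc>...</doc></add>
    or a bare <doc>...</doc>.
    """
    combined = ET.Element("add")
    for text in xml_documents:
        root = ET.fromstring(text)
        docs = [root] if root.tag == "doc" else root.findall("doc")
        combined.extend(docs)
    return ET.tostring(combined, encoding="unicode")
```

Sending one combined message per batch saves one HTTP round trip per document, at the cost that a malformed document can fail the whole batch.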

Re: Details on why ConcurrentUpdateSolrServer is recommended for maximum index performance

2014-12-11 Thread Erick Erickson

I don't think so, it uses SolrInputDocuments and lists thereof. So if you parse the xml and then put things in SolrInputDocuments.. Or something like that. Erick On Thu, Dec 11, 2014 at 9:43 AM, Tom Burton-West wrote: > Thanks Eric, > > That is helpful. We already have a process that works

Re: Details on why ConcurrentUpdateSolrServer is recommended for maximum index performance

2014-12-11 Thread Tom Burton-West

Thanks Eric, That is helpful. We already have a process that works similarly. Each thread/process that sends a document to Solr waits until it gets a response in order to make sure that the document was indexed successfully (we log errors and retry docs that don't get indexed successfully), howe

RE: WordBreakSolrSpellChecker Usage

2014-12-11 Thread Dyer, James

My first guess here, is seeing it works some of the time but not others, is that these values are too low: 5 5 You know spellcheck.count is too low if the suggestion you want is not in the "suggestions" part of the response, but increasing it makes it get included. You know that spellcheck.ma

Re: Is it possible in Solr to have document field value, based on context during query time, by request parameter ?

2014-12-11 Thread Alexandre Rafalovitch

So, what did not work for you with the External File Field approach? What is the next gap you are trying to close? You seem to be aware of the possible extension points for Solr, so you are not looking for just a pointer to custom search components or whatever. Regards, Alex. Personal: http://

Re: Design optimal Solr Schema

2014-12-11 Thread Alexandre Rafalovitch

Ok. Make sure to post in the right topics. People get super confused when the conversation thread changes. Maybe ignore this last couple of messages and post the new one as appropriate (separate or in another thread). That way the right people will see it. Regards, Alex. Personal: http://www.o

Re: Length norm not functioning in solr queries.

2014-12-11 Thread S.L

Yes, I understand that reindexing is necessary; however, for some reason I was not able to invoke the js script from the updateprocessor, so I ended up using a Java-only solution at index time. Thanks. On Thu, Dec 11, 2014 at 7:18 AM, Ahmet Arslan wrote: > > Hi, > > No special steps to be taken fo

Is it possible in Solr to have document field value, based on context during query time, by request parameter ?

2014-12-11 Thread Nenko Ivanov

The Use Case: Very large and sharded index with articles with different categorization fields, pre populated with algorithmic estimated values (simple type, mostly Integer values). The index is accessed from multiple “clients” and each client can override article property based on his context,

Re: Design optimal Solr Schema

2014-12-11 Thread tomas.kalas

Oh no, I wanted to reply to this topic, where you helped me with the synonym filter: http://lucene.472066.n3.nabble.com/Alternative-searching-td4172339.html but I had this topic open too, and I was checking my answer in Google Translate and copied it here. Now, I have an edited task, I do not have to se

Re: Highlighting integer field

2014-12-11 Thread Tomoko Uchida

Hi Pawel, Essentially, highlighting is a feature to show "fragments of documents" that match user queries. With that, he/she can find occurrences of their query in long documents and can understand their results well. For tint or tlong fields (or other non-text field types), "fragments" usually h

Re: Design optimal Solr Schema

2014-12-11 Thread Alexandre Rafalovitch

Tomas, You have a difficult use case. You seem to have a speech recognition domain and you want to be able to search that transcribed text with reference back to timing. It's an interesting problem, but not an easy one. Certainly not something one can give you the answer all at once. The issue he

Suspicious message with attachment

2014-12-11 Thread help

The following message addressed to you was quarantined because it likely contains a virus: Subject: Inconsistent doc value across two nodes - very simple test - what's the expected behavior? From: Gili Nachum However, if you know the sender and are expecting an attachment, please reply to thi

Re: Length norm not functioning in solr queries.

2014-12-11 Thread Ahmet Arslan

Hi, No special steps to be taken for cloud setup. Please note that for both solutions, re-index is mandatory. Ahmet On Thursday, December 11, 2014 12:15 PM, S.L wrote: Ahmet, Thank you , as the configurations in SolrCloud are uploaded to zookeeper , are there any special steps that need to

Re: Priority in search an synonyms

2014-12-11 Thread Ahmet Arslan

Hi Antoine, By saying "The problem I have now is that ebc_libelle synonyms reported for the field are not show", you mean you have a synonym entry for the word Castaroma, and documents containing those synonym entries do not show up in the first 100 documents? If yes, play with boost values (5 versus

Histogram Facet and Aggregation Solr

2014-12-11 Thread Ankit Jain

Hi All, We have a use case where we want to build a histogram over 10-minute time periods and then, for each 10-minute time frame, perform a facet on some field. We are currently using version 4.7.2 of Solr. Please suggest how we can nest a facet within a histogram. -- Thanks, Ankit Jain

Re: Priority in search an synonyms

2014-12-11 Thread Antoine REBOUL

Hello, First of all thank you for your answers ! In my schema.xml file: - I created this field : - the type of this field is a "copyfiled" : I wonder if the following statement is required : ebc_libelle I test my results with the following settings : http:

Re: Length norm not functioning in solr queries.

2014-12-11 Thread S.L

Ahmet, Thank you , as the configurations in SolrCloud are uploaded to zookeeper , are there any special steps that need to be taken to make this work in SolrCloud ? On Wed, Dec 10, 2014 at 4:32 AM, Ahmet Arslan wrote: > > Hi, > > Or even better, you can use your new field for tie break purposes.

Highlighting integer field

2014-12-11 Thread Pawel Rog

Hi, Is it possible to highlight an int (TrieIntField) or long (TrieLongField) field in Solr? -- Paweł

Re: Length norm not functioning in solr queries.

2014-12-11 Thread S.L

Mikhail, Thank you for confirming this; however, Ahmet's proposal seems simpler to implement to me. On Wed, Dec 10, 2014 at 5:07 AM, Mikhail Khludnev < mkhlud...@griddynamics.com> wrote: > > S.L, > > I briefly skimmed Lucene50NormsConsumer.writeNormsField(), my conclusion > is: if you suppl

Alternative synonymum

2014-12-11 Thread tomas.kalas

Hello, I want to search within transcripts of phone conversations. The machine that transcribes the conversation to text produces some alternatives. For example, if we have the sentence: Hello how are you. 1. Segment Hello Halo Hollow 2.Segment How Bow When i want

Re: Design optimal Solr Schema

2014-12-11 Thread tomas.kalas

Thanks for the help, but as Alex wrote, I used the synonym filter and it is what I want. When I add to the synonyms, for example, Hello, Hi, and the sentence is Hello how are you and my query is Hi how are you, it finds it too. -- View this message in context: http://lucene.472066.n3.nabble.com/Design-optim

66 matches
