Hi,
I cannot guess what 'entities' means in your context, but do you want some
kind of join functionality, like an RDB's, in Solr?
Basically, Solr is not "relational". So at first, you should consider
denormalizing your RDB tables into one table/view (or issuing a SQL JOIN query
in DIH) to import data into Solr.
If
Team,
I have a compressed text file (gz) that holds tab-delimited data. Is it
possible to index this file directly, without uncompressing it myself
first? If so, can you please tell me the steps/config changes I am
supposed to follow?
BTW, I am using solr-4.10
I'm using Solr 4.10. While importing through DIH, I've configured 3 separate
entities. I'm facing some problems with indexing and retrieval.
1) How can I define the unique key, as the 3 entities will have different
fields?
2) Is there a join query with which I can join all 3 tables?
--
View t
I am having difficulty with my sort function. With the following sort,
documents are not sorted by score, as you can see. Why is the sort function
not able to sort them properly?
I appreciate your prompt answer
This is my sort function.
sort=map(and(termfreq(CustomersFavourite,852708),exists($exactqq)
On 12/11/2014 8:09 PM, E S J wrote:
> Thanks, I thought the only options were default or schemaless because, when we
> run bin/solr -e cloud, you get a prompt like:
> *Please choose a configuration for the gettingstarted collection, available
> options are: default or schemaless [default]*
I have
Thanks, I thought the only options were default or schemaless because, when we
run bin/solr -e cloud, you get a prompt like:
To begin, how many Solr nodes would you like to run in your local cluster?
(specify 1-4 nodes) [2] 3
Ok, let's start up 3 Solr nodes for your example SolrCloud cluster.
Please
On 12/11/2014 6:31 PM, E S J wrote:
> Thanks Eric, I understand your explanation.
> Quick question: do the configurations sit under /configs/defaults because
> -configname is specified as default when I execute the following command? Can
> I specify -configname as /c-ins/
>
> zkcli.sh -zkhost zoo1.inte
On 12/11/2014 9:19 AM, Michael Della Bitta wrote:
> Only thing you have to worry about (in both the CUSS and the home grown
> case) is a single bad document in a batch fails the whole batch. It's up
> to you to fall back to writing them individually so the rest of the
> batch makes it in.
With CUS
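The fallback described in the quoted message can be sketched as follows. This is a minimal illustration, not how CUSS or any Solr client actually behaves: `index_with_fallback` and its `send` callable are hypothetical names, and `send` is assumed to raise an exception when Solr rejects a batch.

```python
from typing import Callable, List, Sequence


def index_with_fallback(
    docs: Sequence[dict],
    send: Callable[[List[dict]], None],
    batch_size: int = 100,
) -> List[dict]:
    """Send docs in batches; if a batch fails, resend its docs one at a time.

    `send` is a hypothetical callable that posts a list of documents to Solr
    and raises on failure. Returns the documents that still failed when sent
    individually.
    """
    failed: List[dict] = []
    for start in range(0, len(docs), batch_size):
        batch = list(docs[start:start + batch_size])
        try:
            send(batch)
        except Exception:
            # The whole batch was rejected; resend one by one so the
            # good documents in it still make it in.
            for doc in batch:
                try:
                    send([doc])
                except Exception:
                    failed.append(doc)
    return failed
```

The returned list can then be logged or retried later, so a single bad document costs one extra round of single-document requests rather than the whole batch.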
Thanks Eric, I understand your explanation.
Quick question: do the configurations sit under /configs/defaults because
-configname is specified as default when I execute the following command? Can
I specify -configname as /c-ins/
zkcli.sh -zkhost zoo1.internal:2183,zoo2.internal:2183,zoo3.internal:2183
bq: 1. Like to know what I have done is correct ?
Looks fine to me.
bq: 2. Where to place the schema.xml's and other configurations. Because for
the moment they are under the collection1/conf folder and collection1 is not
an active collection for me. (I'm using only the c-ins core)
I think you're a bit
Is it possible to boost a query using the browse interface? How would one do
this?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Browse-interface-tp4173897.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hello Team,
I would like to clarify where to place schema.xml in a SolrCloud set-up.
My SolrCloud set-up: 3 nodes, 3 shards, 3 replicas, and 3 ZooKeeper nodes.
What I have done is:
1. Taken a solr.war from solr default download (
solr-4.10.2/example/webapps/solr.war - 4.10.2) and placed
/we
So the short answer to your original question is "no." Highlighting is
designed to find matches *within* a tokenized (text) field only. That
is difficult because text gets processed and there are all sorts of
complications, but for integers it should be pretty easy to match the
values in the d
Hi,
Thanks for the response. It is quite important to me, for example, to highlight
a multivalued field with many int or long tokens.
--
Paweł
On Thu, Dec 11, 2014 at 3:08 PM, Tomoko Uchida wrote:
>
> Hi Pawel,
>
> Essentially, highlighting is a feature to show "fragments of documents"
> that match use
explicit
edismax
10
full_name,job_tree, company, city,
state, country,
first_name, last_name, id
full_name_suggest^60
full_name_ngram^100.0 job_suggest^30
job_
Hi Mike,
If I am not wrong, you are trying to simulate Google's behaviour.
If you use quotes, Google returns exact matches. I think that makes perfect
sense and would be a valuable addition. I remember some folks asked for
this behaviour on the list.
Ahmet
On Thursday, December 11, 2014
I am trying to create a suggester handler using Solr 4.8. Everything works
fine, but when I try to get suggestions in a different language, Arabic or
Japanese for example, I get results in mixed languages: when I search using
only Japanese, I get Arabic too. The following is my Schema.x
Have you rebooted the machine? (last refuge of the clueless, but often
works) ...
On 12/11/14 2:50 PM, solr-user wrote:
yes, have triple checked the schema and solrconfig XML; various tools have
indicated the XML is valid
no missing types or dupes, and have not disabled the admin handler
as m
I'd like to supply a different set of fields for phrases than for bare
terms. Specifically, we'd like to treat phrases as more "exact" -
probably turning off stemming and generally having a tighter analysis
chain. Note: this is *not* what's done by configuring "pf" which
controls fields for t
As in the layout shipped with Solr? Try putting the JTS ‘jar’ in lib/ext
and let us know if that worked. I think it will but I forget.
~ David Smiley
Freelance Apache Lucene/Solr Search Consultant/Developer
http://www.linkedin.com/in/davidwsmiley
On Thu, Dec 11, 2014 at 12:40 PM, mathaix wrote
I opened https://issues.apache.org/jira/browse/SOLR-6837
Probably best to have further conversations on the Jira issue.
On Thu, Dec 11, 2014 at 6:46 PM, Shalin Shekhar Mangar <
shalinman...@gmail.com> wrote:
> Hi Gili,
>
> Great question!
>
> A write in Solr, by default, is only guaranteed to ex
Chris, will get the schema and solrconfig ready for uploading.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-4-10-2-Found-core-but-I-get-No-cores-available-in-dashboard-page-tp4173602p4173840.html
Sent from the Solr - User mailing list archive at Nabble.com.
yes, have triple checked the schema and solrconfig XML; various tools have
indicated the XML is valid
no missing types or dupes, and have not disabled the admin handler
as mentioned in my most recent response, I can see the coreX core (the
renamed and unmodified collection1 core from the download
: coreA (which has our modified schema and solrconfig) is found by solr but is
: not shown in the solr admin page:
can you please include the *exact* solrconfig.xml & schema.xml you are
using for coreA ... you've given us an overview of what you changed, but
that's not enough for anyone to acta
: can you please include the *exact* solrconfig.xml & schema.xml you are
: using for coreA ... you've given us an overview of what you changed, but
: that's not enough for anyone to actually try and reproduce your problem.
if it helps (since the list doesn't allow attachments) feel free to open a
And the XML is valid, lib references in solrconfig.xml point to the
right libraries (if any), you don't have duplicate definitions of
types, you don't have missing definitions of types? And you didn't
disable the admin handler?
And it's not just admin that's failing to find the core, right? If you
Maybe you can try using an AND condition in a single join, something like
q={!join from=parent to=id}(Id:xxx AND (Color:red OR Color:Blue)). I don't
think this will cause a bigger performance issue.
Thanks
Ravi
-Original Message-
From: Darin Amos [mailto:dari...@gmail.com]
Sent: Thursday
small correction; coreX (the one with the unmodified schema.xml and
solrconfig.xml) IS seen by solr and appears on the solr admin page, but
coreA (which has our modified schema and solrconfig) is found by solr but is
not shown in the solr admin page:
1494 [main] INFO org.apache.solr.core.CoresLo
my apologies for the lack of clarity
our internal name for the project to upgrade solr from 4.0 to 4.10.2 is
"helios" and so we named our test folder "heliosearch". I was not even
aware of the github project Heliosearch, and nothing we are doing is related
to it.
to simplify things for this post
Hi Gili,
Great question!
A write in Solr, by default, is only guaranteed to exist in 1 place i.e.
the leader and the safety valves that we have to preserve these writes are:
1. The leaderVoteWait time for which leader election is suspended until
enough live replicas are available
2. The two-way
In my experience, "query time join" has relatively poor performance.
If you can cache these joins effectively (not many unique color values in
requests, cache doesn't get invalidated), it's OK. If not, it may be interesting
to try "block join" instead -
http://blog.griddynamics.com/201
11.12.2014, 21:24, "Darin Amos" :
> Thanks,
>
> That looks like a viable option, I could do something like the following:
>
> q={!join from=parent to=id}
> &fq={!join from=parent to=id}color:red
> &fq={!join from=parent to=id}color:blue
>
> With all these joins happening like this, what kind of p
Thanks,
That looks like a viable option, I could do something like the following:
q={!join from=parent to=id}
&fq={!join from=parent to=id}color:red
&fq={!join from=parent to=id}color:blue
With all these joins happening like this, what kind of performance concern is
this? I would guess this wou
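As a rough sketch of the request above, the parameters can be assembled like this. The helper name `parent_join_params` is hypothetical, and `q=*:*` is an assumption on my part, since the main query body in the message is empty. Solr intersects multiple `fq` clauses, so only parents with both a red and a blue child should match.

```python
from urllib.parse import urlencode


def parent_join_params(colors, from_field="parent", to_field="id"):
    """Build one join filter query per colour (hypothetical helper).

    Each fq joins from child documents to their parent; Solr intersects
    all fq clauses, so a parent must have a child of every colour.
    """
    join = "{!join from=%s to=%s}" % (from_field, to_field)
    params = [("q", "*:*")]  # assumed main query: match all documents
    for color in colors:
        params.append(("fq", f"{join}color:{color}"))
    return params


# URL-encoded query string, ready to append to a /select request.
query_string = urlencode(parent_join_params(["red", "blue"]))
```

Because each `fq` is a separate filter, each join result can be cached on its own, which helps when the same colours recur across requests.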
Matt,
There is no exact number here, but I would think most people would want "count"
to be maybe 10-20. Increasing this incurs a very small performance penalty for
each term it generates suggestions for, but you probably won't notice a
difference. For "maxCollationTries", 5 is a reasonable n
How about something like
({!join from=parent to=id}color:red) AND ({!join from=parent to=id}color:blue) ?
11.12.2014, 19:48, "Darin Amos" :
> Hello,
>
> I am trying to execute a join query that I am not 100% sure how to execute.
> Lets say I have a bunch of parent and child documents and every
Hi,
I'm trying to use AutoPhrasingTokenFilterFactory, which seems to be a
great solution to our phrase query issues. But it doesn't seem to work as
described in the blog:
https://lucidworks.com/blog/automatic-phrase-tokenization-improving-lucene-search-precision-by-more-precise-linguistic-anal
Thank you. That was the issue.
I am running Solr with Jetty. Is there a recommended way of including
those jars in the Jetty configuration?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Solr-Error-when-making-GeoPrefixTree-polygon-filter-search-tp4173629p4173807.html
Is there a suggested value for this? I bumped them up to 20 and still
nothing seems to have changed.
On Thu, Dec 11, 2014 at 9:42 AM, Dyer, James
wrote:
> My first guess here, seeing that it works some of the time but not others,
> is that these values are too low:
>
> 5
> 5
>
> You know spellcheck
On Thu, Dec 11, 2014 at 11:52 AM, Alexandre Rafalovitch
wrote:
> On 11 December 2014 at 11:40, Yonik Seeley wrote:
>> So to Solr (server side), it looks like a single update request
>> (assuming 1 thread) with a batch of multiple documents... but it was
>> never actually "batched" on the client s
On 11 December 2014 at 11:40, Yonik Seeley wrote:
> So to Solr (server side), it looks like a single update request
> (assuming 1 thread) with a batch of multiple documents... but it was
> never actually "batched" on the client side.
Does Solr also index them one-by-one as it parses them off th
Hello,
I am trying to execute a join query that I am not 100% sure how to execute.
Let's say I have a bunch of parent and child documents and every one of my child
documents has a single value field “color”.
If I want to search all parents that have a “red” child, this is very easy:
{!join from
On Wed, Dec 10, 2014 at 6:09 PM, Erick Erickson wrote:
> So CUSS will do something like this:
> 1> assemble a packet for Solr
> 2> pass off the actual transmission
> to Solr to a thread and immediately
> go back to <1>.
>
> Basically, CUSS is doing async processing.
The more important p
Agree with Erick.
However, I suppose you can try to provide your own RequestWriter, and let
it stream XML. BTW, what's in them? How does Solr handle them right now? Why
don't you want to start from the test?
On Thu, Dec 11, 2014 at 7:04 PM, Erick Erickson
wrote:
> I don't think so, it uses SolrInpu
I know Solr's CAP properties are CP, but I don't see that in a very
basic test. Am I doing something wrong?
With two Solr nodes, I index doc1 to both, stop node2, update doc1, stop
node1, start node2, start node1, and I get two different versions of the
doc depending on which replica I query.
Tom:
ConcurrentUpdateSolrServer isn't magic or anything. You could pretty
trivially write something that takes batches of your XML documents and
combines them into a single document (multiple <doc> tags in the
<add> section) and sends them up to Solr and achieve some of the same speed
benefits.
If yo
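A minimal sketch of that batching idea, assuming the standard Solr XML update format in which `<doc>` elements sit inside a single `<add>` element. The function name `combine_docs` is hypothetical, and posting the result to Solr is left out.

```python
import xml.etree.ElementTree as ET


def combine_docs(doc_xml_strings):
    """Merge several single-document XML messages into one <add> message.

    Each input is assumed to be either a bare <doc>...</doc> element or a
    full <add><doc>...</doc></add> message.
    """
    add = ET.Element("add")
    for xml_text in doc_xml_strings:
        root = ET.fromstring(xml_text)
        if root.tag == "add":
            # Unwrap an existing <add> and keep its <doc> children.
            add.extend(root.findall("doc"))
        elif root.tag == "doc":
            add.append(root)
        else:
            raise ValueError(f"unexpected root element: {root.tag}")
    return ET.tostring(add, encoding="unicode")
```

The resulting string is one update request body carrying many documents, which is where the speed benefit of batching would come from.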
I don't think so, it uses SolrInputDocuments and
lists thereof. So if you parse the xml and then
put things in SolrInputDocuments..
Or something like that.
Erick
On Thu, Dec 11, 2014 at 9:43 AM, Tom Burton-West wrote:
> Thanks Erick,
>
> That is helpful. We already have a process that works
Thanks Erick,
That is helpful. We already have a process that works similarly. Each
thread/process that sends a document to Solr waits until it gets a response
in order to make sure that the document was indexed successfully (we log
errors and retry docs that don't get indexed successfully), howe
My first guess here, seeing that it works some of the time but not others, is
that these values are too low:
5
5
You know spellcheck.count is too low if the suggestion you want is not in the
"suggestions" part of the response, but increasing it makes it get included.
You know that spellcheck.ma
So, what did not work for you with the External File Field approach?
What is the next gap you are trying to close?
You seem to be aware of the possible extension points for Solr, so you
are not looking for just a pointer to custom search components or
whatever.
Regards,
Alex.
Personal: http://
Ok. Make sure to post in the right topics. People get super confused
when the conversation thread changes.
Maybe ignore this last couple of messages and post the new one as
appropriate (separate or in another thread). That way the right people
will see it.
Regards,
Alex.
Personal: http://www.o
Yes, I understand that reindexing is necessary; however, for some reason I
was not able to invoke the JS script from the update processor, so I ended
up using a Java-only solution at index time.
Thanks.
On Thu, Dec 11, 2014 at 7:18 AM, Ahmet Arslan
wrote:
>
> Hi,
>
> No special steps to be taken fo
The Use Case:
A very large, sharded index of articles with different categorization
fields, prepopulated with algorithmically estimated values (simple types,
mostly Integer values). The index is accessed from multiple “clients”
and each client can override article properties based on its context,
Oh no, I wanted to reply to this topic, where you helped me with the synonym
filter:
http://lucene.472066.n3.nabble.com/Alternative-searching-td4172339.html
but I had this topic open too, and I was checking my answer in Google
Translate and copied it here.
Now, I have an edit task, I do not have to se
Hi Pawel,
Essentially, highlighting is a feature to show "fragments of documents"
that match user queries.
With that, users can find occurrences of their query in long documents and
can understand their results well.
For tint or tlong fields (or other non-text field types), "fragments"
usually h
Tomas,
You have a difficult use case. You seem to have a speech recognition
domain and you want to be able to search that transcribed text with
reference back to timing. It's an interesting problem, but not an easy
one. Certainly not something one can answer all at once.
The issue he
The following message addressed to you was quarantined because it likely
contains a virus:
Subject: Inconsistent doc value across two nodes - very simple test - what's
the expected behavior?
From: Gili Nachum
However, if you know the sender and are expecting an attachment, please reply
to thi
Hi,
No special steps to be taken for cloud setup. Please note that for both
solutions, re-index is mandatory.
Ahmet
On Thursday, December 11, 2014 12:15 PM, S.L wrote:
Ahmet,
Thank you. As the configurations in SolrCloud are uploaded to ZooKeeper,
are there any special steps that need to
Hi Antoine,
By saying "The problem I have now is that ebc_libelle synonyms reported for the
field are not show", do you mean you have a synonym entry for the word Castaroma,
and documents containing those synonym entries do not show up in the first 100
documents?
If yes, play with boost values (5 versus
Hi All,
We have a use case where we want to build a histogram over 10-minute time
periods and then, within each 10-minute time frame, facet on some
field.
We are currently using Solr version 4.7.2.
Please suggest how we can nest a facet within a histogram.
--
Thanks,
Ankit Jain
Hello,
First of all thank you for your answers !
In my schema.xml file:
- I created this field :
- the type of this field is a "copyfiled" :
I wonder if the following statement is required :
ebc_libelle
I test my results with the following settings :
http:
Ahmet,
Thank you. As the configurations in SolrCloud are uploaded to ZooKeeper,
are there any special steps that need to be taken to make this work in
SolrCloud?
On Wed, Dec 10, 2014 at 4:32 AM, Ahmet Arslan
wrote:
>
> Hi,
>
> Or even better, you can use your new field for tie break purposes.
Hi,
Is it possible to highlight int (TrieIntField) or long (TrieLongField)
fields in Solr?
--
Paweł
Mikhail,
Thank you for confirming this; however, Ahmet's proposal seems simpler
to implement to me.
On Wed, Dec 10, 2014 at 5:07 AM, Mikhail Khludnev <
mkhlud...@griddynamics.com> wrote:
>
> S.L,
>
> I briefly skimmed Lucene50NormsConsumer.writeNormsField(), my conclusion
> is: if you suppl
Hello, I want to search within transcripts of phone conversations. The
machine that transcribes the conversation to text produces some
alternatives. For example, if we have the sentence:
Hello how are you.
1. Segment
Hello
Halo
Hollow
2.Segment
How
Bow
When i want
Thanks for the help, but as Alex wrote, I used the synonym filter and it is
what I want. When I add a synonym entry, for example Hello, Hi, and the sentence
is "Hello how are you" and my query is "Hi how are you", it finds it too.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Design-optim