Question on SOLR join query

2017-09-25 Thread Jaimin Patel
I am facing a performance problem and could narrow it down to a join query
that we are using. The join is on a unique field.

We have a person profile stored in RDB in a relational way. Like person
name table , address table etc. SOLR indexes are build using this RDB
data,Each children is stored as separate document with parent's unique id.
At query time , unique id of parent is joined with same in child
documents({!join
to=par_id from=par_id }) to allow search with AND condition for search
terms involving children data

I am reading similar issues and it says "initial join implementation is
O(nterms)".. What does this mean ? I could not find any reference
explaining meaning of 0 num_terms_in_field.

 Regards,
Jai


Not getting appropriate spell suggestions

2018-02-14 Thread Jaimin Patel
Here is link to solrconfig.xml
 and
managed-schema
.

I have document with following value in suggestions (field used for spell
check), same information as information available in text field.

"suggestion":"Amylase Production, Part 1\nIn humans, salivary α-amylase is
produced by the AMY1 gene on chromosome 1. \n\n;;br\nHumans are diploid
organisms, meaning that they generally have two copies of genes that are
not present on the X and Y chromosomes -- one copy inherited from each
parent. \n\n;;br\nHowever, genetic studies show that people can have
anywhere from two to 15 copies of the AMY1 gene on each chromosome 1,
suggesting that the gene has been duplicated during human
evolution.\n\n;;br\nWhy would humans evolve multiple copies of a gene?"

*Search term and result*
"amylase" - works and provides appropriate result
"amylas" - No result, it identify as correct spelling
"amylos" - No result, spell suggestion is "amylas" (without e)


Similar situation with another term -
"declaratoin" - Correct spell for it is "declaration", I have document with
suggestions field value "declaration of independence", but the spell
suggestion returns "declar", I would imagine that it would return the close
match one instead of something like "declar" (which is also without e, so I
am suspecting it has something to do with PorterStemFilterFactory, but even
after I commented it, it shows same result.

Can someone help what I am not understanding correctly?How can I improve
when there is one character difference (missing or swap postion)

Thanks.


-- 
Jaimin

ᐧ