Re: question regarding wildcard-searches

2018-03-16 Thread jagdish vasani
Hi paesen,

The value EO.1954.53.1 is indexed as the following tokens:
Eo
1954
53
1
The dots are removed. Try the single-character wildcard ?,
for example EO.1954.53.?? if the last part has only two digits.

I have not tried this myself, so please check it.
Hope it solves your problem.
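
A toy model of why the wildcard finds nothing, assuming the analyzer really does lowercase the value and split it on dots as described above. Python's fnmatch only stands in for Solr's wildcard matching here; no Solr API is involved.

```python
from fnmatch import fnmatchcase

value = "EO.1954.53.1"

# Assumption: the field's analyzer lowercases and splits on dots.
tokens = value.lower().split(".")

def wildcard_hits(pattern, terms):
    """Return the terms that the wildcard pattern matches as a whole."""
    return [t for t in terms if fnmatchcase(t, pattern.lower())]

# A wildcard is compared against single indexed terms, and no single
# token starts with 'eo.1954.53.'.
print(wildcard_hits("EO.1954.53.*", tokens))

# Against an untokenized term (for example a string-type field),
# the same pattern matches.
print(wildcard_hits("EO.1954.53.*", [value.lower()]))
```

If the model holds, copying the field into an untokenized field and running the wildcard there would be one way out, but that is untested as well.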

Thanks,
Jagdish
On 16-Mar-2018 3:51 pm, "Paesen Roel"  wrote:

> Hi everybody,
>
> We are experimenting with solr, and I have a (I think) basic-level
> question:
> we have multiple fields, all copied into a generic field so we can
> search everything at once.
> However we have a (for us) strange situation doing wildcard searches for
> the contents of one specific field.
>
> Given in the schema:
>
>  multiValued="true"/>
>
>  stored="true"/>
> 
> and a lot of other fields exactly like 'genormaliseerdInventarisnummer'.
>
>
> Now, we are certain that the field 'genormaliseerdInventarisnummer'
> contains entries like 'EO.1954.53.1', 'EO.1954.53.2', 'EO.1954.53.3', all
> the way up to '.19'. We can query these directly by passing these exact
> texts to the query on field '_text_' (our default search field).
> Problem is: wildcard searches for these don't work, like 'EO.1954.53.*'
> for example returns zero results.
>
> Why is that?
> What needs to be adjusted? (and how?)
>
> Thanks already,
> Roel
>
>


Re: Preserve order during indexing

2018-01-18 Thread jagdish vasani
Hi Ashish,
I think that is not possible, because Solr builds an inverted index. You can,
however, get the documents back in a given order by passing a sort parameter
with asc or desc.
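
A minimal sketch of such a request, assuming a core called mycore and a numeric field db_seq that stores the database row order at index time. Both names are hypothetical and not from this thread.

```python
from urllib.parse import urlencode

def select_url(base, **params):
    """Build a Solr /select URL from keyword parameters."""
    return f"{base}/select?{urlencode(params)}"

# 'mycore' and 'db_seq' are made-up names for illustration only.
url = select_url("http://localhost:8983/solr/mycore",
                 q="*:*", sort="db_seq asc")
print(url)
```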

Thanks,
Jagdish Vasani
On 19-Jan-2018 9:22 am, "Aashish Agarwal"  wrote:

> Hi,
>
> I need to index documents in Solr so that they are stored in the same order as
> in the database, i.e. *:* returns results in DB order. Is that possible?
>
> Thanks,
> Aashish
>


RE: Search on field returns documents that should not match

2017-09-14 Thread Jagdish Vasani
Hi Chang Wang,

The query is wrong here: q=title_lemma: fit you handgun
It is parsed as title_lemma:fit OR _text:you OR _text:handgun (if stop words
are not excluded).
Here _text is the default search field; yours may be different.

Try searching with q=title_lemma:(fit you handgun)
That searches for any of the words in the field title_lemma.

Run the query below and you should get the proper result:
http://localhost:8983/solr/bkb4/select?fq=doc_type:document&indent=on&q=title_lemma:(fit you handgun)&wt=json
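
A small sketch of building that grouped query in code, so every word stays attached to the field. The helper name field_query is made up for illustration, and the indent and wt parameters are assumed from the usual admin UI request.

```python
from urllib.parse import urlencode

def field_query(field, words):
    """Group all words under one field: field:(w1 w2 ...)."""
    return f"{field}:({' '.join(words)})"

q = field_query("title_lemma", ["fit", "you", "handgun"])
params = {"fq": "doc_type:document", "indent": "on", "q": q, "wt": "json"}
url = "http://localhost:8983/solr/bkb4/select?" + urlencode(params)

print(q)
print(url)
```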

Thanks,
Jagdish
From: Chang Wang [mailto:changwan...@gmail.com]
Sent: Friday, September 15, 2017 8:12 AM
To: solr-user@lucene.apache.org
Subject: Search on field returns documents that should not match

Hello All,


I am confused by the field search behavior of solr (6.6), and hope someone
can help me understand the results.

For example,
I search "fit you handgun" on the field of "title_lemma".
http://localhost:8983/solr/bkb4/select?fq=doc_type:document&indent=on&q=title_lemma:%20fit%20you%20handgun&wt=json

The first returned result is a good one which contains "fit you handgun" in 
"title_lemma" field, the 2nd and 3rd results do not contain any of those query 
words at all. Why are these documents still returned?

I attach the screen shot.


There is a related question. When I do not index the field "title_lemma", the
Solr GUI still allows me to search on that field and returns the result. Why
does that happen?
My understanding is that if a field is not indexed, it should not be searchable
at all.

Thank you,
Chang


NOTICE TO RECIPIENT(s):This e-mail message may contain confidential or legally 
privileged information and is intended only for the use of the intended 
recipient(s). Any unauthorized disclosure, dissemination, distribution, copying 
or the taking of any action in reliance on the information herein is 
prohibited. E-mails are not secure and cannot be guaranteed to be error free as 
they can be intercepted, amended, or contain viruses. Although The Digital 
Group has taken reasonable precautions to ensure no viruses are present in this 
email, the company cannot accept responsibility for any loss or damage arising 
from the use of this email or attachments. Any opinion defamatory or deemed to 
be defamatory or any material which could be reasonably branded to be a species 
of plagiarism and other statements contained in this message and any attachment 
are solely those of the author and do not necessarily represent those of the 
company.


RE: FastVector does not highlight for phrase query when it contains stop word/s

2017-08-22 Thread Jagdish Vasani
Hi Rick,

Thanks for the response.
I understand that not using StopFilterFactory, or not excluding stop words,
would solve the problem.
But here stop words are excluded and search works well with a stop word in the
phrase query; only the FastVectorHighlighter does not highlight.

I debugged the code and found that the FastVectorHighlighter strictly checks the
positions of the phrase terms. It does not skip stop word positions.
It should skip those positions when stop words are present in phrase queries.
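
To make the point concrete, here is a toy model in Python. It is not the Lucene code; it assumes the stop filter leaves position gaps where the stop words were, and it contrasts a strict consecutive-position check with a gap-aware one.

```python
STOP_WORDS = {"is", "the"}

def analyze(text):
    """Return (term, position) pairs, leaving gaps where stop words were."""
    return [(w, i) for i, w in enumerate(text.lower().split())
            if w not in STOP_WORDS]

def strict_match(doc, query):
    """Expect the query terms at consecutive positions, ignoring gaps."""
    terms = [t for t, _ in query]
    for start in [p for t, p in doc if t == terms[0]]:
        if all((t, start + k) in doc for k, t in enumerate(terms)):
            return True
    return False

def gap_aware_match(doc, query):
    """Expect the query terms at the same relative offsets as in the query."""
    first_pos = query[0][1]
    for start in [p for t, p in doc if t == query[0][0]]:
        if all((t, start + (p - first_pos)) in doc for t, p in query):
            return True
    return False

doc = analyze("blue is the sky")     # blue at position 0, sky at position 3
query = analyze("blue is the sky")

print(strict_match(doc, query))      # strict check misses the phrase
print(gap_aware_match(doc, query))   # gap-aware check finds it
```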

Thanks,
Jagdish
-Original Message-
From: Rick Leir [mailto:rl...@leirtech.com]
Sent: Tuesday, August 22, 2017 3:05 AM
To: solr-user@lucene.apache.org
Subject: Re: FastVector does not highlight for phrase query when it contains 
stop word/s

Recent discussions have recommended that you not use stop words in any case. 
Cheers -- Rick

On August 21, 2017 11:47:11 AM EDT, Jagdish Vasani 
<jagdish.vas...@thedigitalgroup.com> wrote:
>Hi  Solr Users,
>
>I have come across an issue where the FastVectorHighlighter does not highlight
>field values when the phrase query contains a stop word.
>For example, the query "blue is the sky" returns results, but
>highlighting is not available for this field.
>
>I have applied,
>hl.usePhraseHighlighter=true
>hl.preserveMulti=true
>hl.highlightMultiTerm=true
>hl.fragsize=1500
>hl.snippets=5
>hl=on
>hl.fl=
>hl.tag.pre=
>hl.tag.post==
>hl.method = fastVector
>
>In schema.xml, the fields have:
>indexed="true" termOffsets="true" stored="true" termPositions="true"
>termVectors="true" multiValued="true"
>
>I am using Solr 6.4.2.
>
>Is this correct, or am I missing something?
>
>Thanks,
>Jagdish

--
Sorry for being brief. Alternate email is rickleir at yahoo dot com





FastVector does not highlight for phrase query when it contains stop word/s

2017-08-21 Thread Jagdish Vasani
Hi  Solr Users,

I have come across an issue where the FastVectorHighlighter does not highlight
field values when the phrase query contains a stop word.
For example, the query "blue is the sky" returns results, but
highlighting is not available for this field.

I have applied,
hl.usePhraseHighlighter=true
hl.preserveMulti=true
hl.highlightMultiTerm=true
hl.fragsize=1500
hl.snippets=5
hl=on
hl.fl=
hl.tag.pre=
hl.tag.post==
hl.method = fastVector

In schema.xml, the fields have:
indexed="true" termOffsets="true" stored="true" termPositions="true" 
termVectors="true" multiValued="true"
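
For reference, a sketch of how these parameters could be put together into a request. The field name content and the em tags are assumptions for illustration only, since the actual hl.fl and tag values are not shown above.

```python
from urllib.parse import urlencode

params = {
    "q": '"blue is the sky"',
    "hl": "on",
    "hl.method": "fastVector",
    "hl.fl": "content",          # hypothetical field name
    "hl.usePhraseHighlighter": "true",
    "hl.highlightMultiTerm": "true",
    "hl.preserveMulti": "true",
    "hl.fragsize": 1500,
    "hl.snippets": 5,
    "hl.tag.pre": "<em>",        # assumed tag, not from this mail
    "hl.tag.post": "</em>",      # assumed tag, not from this mail
}

query_string = urlencode(params)
print(query_string)
```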

I am using Solr 6.4.2.

Is this correct, or am I missing something?

Thanks,
Jagdish






RE: LogTransformer

2015-07-09 Thread Jagdish Vasani
One thing I noticed is that you need to give the fully qualified class name
when specifying the transformer.
For example, I added the following:
entity transformer=org.apache.solr.handler.dataimport.LogTransformer...
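
As a sketch of what the full entity element might look like, the snippet below builds it with Python's ElementTree so the attribute quoting is visible. The SQL text is a made-up placeholder, and the other attribute values are copied from the question quoted below.

```python
import xml.etree.ElementTree as ET

entity = ET.Element("entity", {
    "name": "products",
    "pk": "product_id",
    "transformer": "org.apache.solr.handler.dataimport.LogTransformer",
    "logTemplate": "Query: ${products.query}",
    "logLevel": "info",
    # Placeholder SQL for illustration; the real query is not shown in full.
    "query": "SELECT p.product_id FROM products p",
})

xml_text = ET.tostring(entity, encoding="unicode")
print(xml_text)
```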

Hope this helps.

Thanks,
Jagdish
-Original Message-
From: Midas A [mailto:test.mi...@gmail.com] 
Sent: Friday, July 10, 2015 11:08 AM
To: solr-user@lucene.apache.org
Subject: LogTransformer

I want to log the query running through DIH. Should I use LogTransformer to do that?


entity transformer= LogTransformer logTemplate=Query: ${products.query}
logLevel=info name=products pk=product_id query = SELECT p.product_id

In the log I get the text "Query:" but not the query variable.

My Solr version: 4.2


Please tell me what is wrong, or whether there are other ways to do this.

Regards,
Abhishek


How to stop long running query ?

2014-04-17 Thread Jagdish Vasani
Hi,

I have a long query (a full-text search of about 1K lines, each containing a
Boolean query) that runs for a long time, sometimes using 99% CPU.
Apart from the TimeLimitingCollector option, is there any other functionality
to stop such a query?

Thanks,
Jagdish


RE: Need Suggestion

2011-07-19 Thread Jagdish Vasani
Have a look at this link: http://wiki.apache.org/solr/DistributedSearch

This will help when you have a large index.
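
A minimal sketch of a distributed request using the shards parameter described on that page. The two shard addresses are hypothetical; replace them with your own hosts.

```python
from urllib.parse import urlencode

# Hypothetical shard addresses, for illustration only.
shards = ["localhost:8983/solr", "localhost:7574/solr"]

params = {"q": "*:*", "shards": ",".join(shards)}
url = "http://localhost:8983/solr/select?" + urlencode(params)
print(url)
```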

-Original Message-
From: Rohit Gupta [mailto:ro...@in-rev.com] 
Sent: Friday, July 15, 2011 11:37 PM
To: solr-user@lucene.apache.org
Subject: Re: Need Suggestion

I am using -Xms2g and -Xmx6g 

What would be the ideal JVM size?

Regards,
Rohit




From: Mohammad Shariq shariqn...@gmail.com
To: solr-user@lucene.apache.org
Sent: Fri, 15 July, 2011 7:27:38 PM
Subject: Re: Need Suggestion

Below are some things you can do to reduce search latency.
1) Do bulk inserts.
2) Commit after every ~5000 docs.
3) Optimize once a day.
4) Use the fq parameter in search queries.
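
Points 1 and 2 can be sketched as below. The add and commit callables are placeholders for whatever Solr client you use; they are not a real client API.

```python
from itertools import islice

def batches(docs, size=5000):
    """Yield lists of at most `size` documents."""
    it = iter(docs)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk

def index_all(docs, add, commit, size=5000):
    """Send documents in bulk and commit once per batch.

    `add` and `commit` are hypothetical callables supplied by the caller.
    """
    n = 0
    for chunk in batches(docs, size):
        add(chunk)
        commit()
        n += 1
    return n

# Dry run with stand-ins that only record what would be sent.
sent, commits = [], []
n = index_all(range(12000), sent.append, lambda: commits.append(1))
print(n, [len(c) for c in sent])
```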

What JVM heap size are you using?





On 15 July 2011 17:44, Rohit ro...@in-rev.com wrote:

 I am facing some performance issues on my Solr installation (three-core server).
 I am indexing live Twitter data based on certain keywords. As you can
 imagine, the rate at which documents are received is very high, so the
 updates to the cores are frequent and regular. Given below are the document
 counts for my three cores.



 Twitter  - 26874747

 Core2-  3027800

 Core3-  6074253



 My server has 8GB RAM, but we are now experiencing a drop in server
 performance. What can be done to improve this?  Also, I have a few
 questions.

 1.  Does a high number of commits use a lot of memory? Will reducing the
 number of commits per hour help?
 2.  Most of my queries are based on field or date faceting. How can I
 improve those?



 Regards,

 Rohit

 Mobile: +91-9901768202

 About Me:  http://about.me/rohitg






-- 
Thanks and Regards
Mohammad Shariq


RE: ' invisible ' words

2011-07-15 Thread Jagdish Vasani
Hi deniz 

You can use Luke (http://www.getopt.org/luke/) to see how that field is
indexed and which words are in it. That may help you figure out how
you indexed your field.

Thanks.
Jagdish



-Original Message-
From: deniz [mailto:denizdurmu...@gmail.com] 
Sent: Thursday, July 14, 2011 2:57 PM
To: solr-user@lucene.apache.org
Subject: Re: ' invisible ' words

well i know it is totally weird... i have tried many things , including the
ones in this forum, but the result is the same... somehow some words are
just invisible... 



-
Smart, but doesn't work... If he worked, he would manage it...
--
View this message in context: 
http://lucene.472066.n3.nabble.com/invisible-words-tp3158060p3168598.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Is it possible to extract all the tokens from solr?

2011-07-15 Thread Jagdish Vasani
Check the LukeRequestHandler at http://wiki.apache.org/solr/LukeRequestHandler

This will give you all you need.
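
A sketch of a request to that handler, assuming it is registered at /admin/luke and that the fl and numTerms parameters behave as described on the wiki page. The field name title is only an example.

```python
from urllib.parse import urlencode

def luke_url(solr_url, field, num_terms=20):
    """Build a LukeRequestHandler URL listing the top terms of one field."""
    params = {"fl": field, "numTerms": num_terms, "wt": "json"}
    return f"{solr_url}/admin/luke?{urlencode(params)}"

# 'title' is an example field name, not taken from this thread.
url = luke_url("http://localhost:8983/solr", "title", num_terms=50)
print(url)
```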

Thanks,
Jagdish



-Original Message-
From: pravesh [mailto:suyalprav...@yahoo.com] 
Sent: Thursday, July 14, 2011 2:50 PM
To: solr-user@lucene.apache.org
Subject: Re: Is it possible to extract all the tokens from solr?

You can use Lucene for this. It provides the TermEnum API to enumerate all
terms of the field(s).
Solr 1.4+ also provides a special request handler for this purpose. Check
whether that helps.

Thanx
Pravesh

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Is-it-possible-to-extract-all-the-tokens-from-solr-tp3168362p3168589.html
Sent from the Solr - User mailing list archive at Nabble.com.


RE: Turn off caching

2011-02-11 Thread Jagdish Vasani IN
I don't think there is an option to disable the cache in solrconfig.xml in Solr
1.4. You would need to change the code where the SolrIndexSearcher instance
is created in the class SolrCore.

Thanks,
Jagdish

-Original Message-
From: Isan Fulia [mailto:isan.fu...@germinait.com] 
Sent: Tuesday, February 08, 2011 5:02 PM
To: solr-user@lucene.apache.org
Subject: Turn off caching

Hi,
My solrConfig file looks like

<config>
  <updateHandler class="solr.DirectUpdateHandler2" />

  <requestDispatcher handleSelect="true">
    <requestParsers enableRemoteStreaming="false"
                    multipartUploadLimitInKB="2048" />
  </requestDispatcher>

  <requestHandler name="standard" class="solr.StandardRequestHandler"
                  default="true" />
  <requestHandler name="/update" class="solr.XmlUpdateRequestHandler" />
  <requestHandler name="/admin/"
                  class="org.apache.solr.handler.admin.AdminHandlers" />


  <queryResponseWriter name="xslt"
                       class="org.apache.solr.request.XSLTResponseWriter">
  </queryResponseWriter>
  <!-- config for the admin interface -->
  <admin>
    <defaultQuery>*:*</defaultQuery>
  </admin>
</config>


Every time I fire the same query to compare the results for different
configurations, the query time gets shorter because of caching.
So I want to turn off caching, or clear the cache, before I fire the
same query.
Does anyone know how to do this?


-- 
Thanks & Regards,
Isan Fulia.


Re: optimize is taking too much time

2010-02-18 Thread Jagdish Vasani
Hi,

You should not optimize the index after each document insert. Instead, you
should optimize it after inserting a good number of documents,
because an optimize merges all segments into one, according to the settings
of the Lucene index.
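
A trivial sketch of the difference, counting how many optimize calls each policy would trigger. No Solr client is involved; the function only counts.

```python
def count_optimizes(n_docs, optimize_every):
    """Count optimize calls when optimizing after every `optimize_every` docs."""
    optimizes = 0
    for i in range(1, n_docs + 1):
        # The document would be added here.
        if i % optimize_every == 0:
            optimizes += 1   # the optimize call would go here
    return optimizes

per_insert = count_optimizes(10_000, 1)
per_batch = count_optimizes(10_000, 5_000)
print(per_insert, per_batch)
```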

thanks,
Jagdish
On Fri, Feb 12, 2010 at 4:01 PM, mklprasad mklpra...@gmail.com wrote:


 Hi,
 in my Solr I have 1,42,45,223 records totalling about 50GB.
 Now when I load a new record and it tries to optimize the docs, it
 takes too much memory and time.


 Can anybody please tell me whether there is a property in Solr to avoid
 this?

 Thanks in advance

 --
 View this message in context:
 http://old.nabble.com/optimize-is-taking-too-much-time-tp27561570p27561570.html
 Sent from the Solr - User mailing list archive at Nabble.com.