Re: increasing number of threads for faceting in JSON format

2020-12-24 Thread Arturas Mazeika
reat that I can get things faster by reducing the gap and by > > > increasing the number of threads. How to reduce gaps I know: one can > > > replace "gap": "+1HOUR" with "gap": "+1MONTH" What should I > > change > > >

impressive improvement in documentation

2020-12-11 Thread Arturas Mazeika
Hi Solr fans, I am impressed to see that documentation of solr improves so nicely over time. If one compares the 7.1 version of json api with the current (8.7) one sees that additional fields are documented: https://lucene.apache.org/solr/guide/8_7/json-request-api.html Query parametersJSON

Re: nested facets of query and terms type in JSON format

2020-12-10 Thread Arturas Mazeika
(or other readers) might find this "Query Facet" example handy > - it uses the "type": "query" syntax that MIchael mentioned. [1] > > [1] > https://lucene.apache.org/solr/guide/8_5/json-facet-api.html#query-facet > > Best, > Jason > > On Thu, De

Re: increasing number of threads for faceting in JSON format

2020-12-10 Thread Arturas Mazeika
text below to increase the number of threads from one to 20? > > > > Cheers, > > Arturas > > > > On Thu, Dec 3, 2020 at 1:54 PM Munendra S N > > wrote: > > > > > Hi, > > > > > > Currently, JSON facets have support for specifying the numb

Re: increasing number of threads for faceting in JSON format

2020-12-03 Thread Arturas Mazeika
. By reducing the number of buckets, computation should become > much faster > > Regards, > Munendra S N > > > > On Thu, Dec 3, 2020 at 1:52 PM Arturas Mazeika wrote: > > > Hi Solr-Users, > > > > I am trying to better understand the solr capabilities, how o

Re: nested facets of query and terms type in JSON format

2020-12-03 Thread Arturas Mazeika
"t_buckets": { > "type": "range", > "field": "t", > "sort": { "t": "asc" }, > "start": "2018-05-02T1

Re: nested facets of query and terms type in JSON format

2020-12-03 Thread Arturas Mazeika
acet": { "type": "range", "field": "t", "sort": { "t": "asc" }, "start": "2018-05-02T17:00:00.000Z", "end": "2020-11-16T

nested facets of query and terms type in JSON format

2020-12-03 Thread Arturas Mazeika
Hi Solr Team, I am trying to check how I can formulate facet queries using JSON format. I can successfully formulate query, range, term queries, as well as nested term queries. How can I formulate a nested facet query involving "query" as well as "range" formulations? The following does not work:

increasing number of threads for faceting in JSON format

2020-12-03 Thread Arturas Mazeika
Hi Solr-Users, I am trying to better understand the solr capabilities, how one can formulate queries in JSON format as well as tweak parameters. Currently I have a logs collection (ca 6GB large) with a dozen of attributes running in single server mode (F:\solr_deployment\solr-8.7.0\bin\solr.cmd

Re: chaining charFilter

2020-12-02 Thread Arturas Mazeika
enter values, you’ll see the transformations work. Although > that screen doesn’t show the CharFitler transformations correctly, > but the tokens at the end are chained. > > Best, > Erick > > > On Dec 2, 2020, at 9:18 AM, Arturas Mazeika wrote: > > > > Hi Solr-Team, &g

chaining charFilter

2020-12-02 Thread Arturas Mazeika
Hi Solr-Team, The manual of charfilters says that one can chain them: (from https://lucene.apache.org/solr/guide/6_6/charfilterfactories.html#CharFilterFactories-solr.MappingCharFilterFactory ): CharFilters can be chained like Token Filters and placed in front of a Tokenizer. CharFilters can

Re: example for "parse-date" processor

2019-09-03 Thread Arturas Mazeika
as default? Cheers, Arturas On Tue, Sep 3, 2019 at 1:09 PM Jörn Franke wrote: > How do you send the request? You need to specify the update.chain > parameter with the name of the Update chain or define it as default > > > Am 03.09.2019 um 12:14 schrieb Arturas Mazeika : > >

example for "parse-date" processor

2019-09-03 Thread Arturas Mazeika
Hi Solr Fans, I am trying to figure out how to use the parse-date processor for pdates. I am able to insert data with this python code to a solr collection/core: solr = pysolr.Solr('http://localhost:/solr/core1', timeout=10) solr.add([ { "t": '2017-08-19T21:00:42.043Z', } ])

Re: %solr_logs_dir% does not like spaces

2019-02-26 Thread Arturas Mazeika
stead of ‘Program Files (x86)’ use ‘PROGRA~2’ > > And don’t have spaces in your subdirectory… > > > > NB: Depending on your Windows Version you may Have another alias for > ‘Program Files (x86)’; use «dir /X» to view the aliases. > > > > Gesendet von Mail<https://go.micr

%solr_logs_dir% does not like spaces

2019-02-26 Thread Arturas Mazeika
Hi All, I am testing solr 7.7 (and 7.6) under windows. My aim is to set logging into a subdirectory that contains spaces of a directory that contains spaces. If I set on windows: setx /m SOLR_LOGS_DIR "f:\solr_deployment\logs" and start a solr instance:

Re: how to use HTMLStripCharFilter in solrJ?

2018-07-06 Thread Arturas Mazeika
ssor factory. > > 3) Create a custom Lucene analyzer using html strip char filter and white > space tokenizer. Use the "invoking the analyzer" example given in > http://lucene.apache.org/core/7_4_0/core/org/apache/lucene/a > nalysis/package-summary.html > > Ahmet &g

Re: how to use HTMLStripCharFilter in solrJ?

2018-07-06 Thread Arturas Mazeika
: > I am confused. Why you do not just add the CharFilter definition to the > field type you need? > > You see to be trying to do it completely on the cliwnt side? No sure. > > Regards, > Alex > > On Thu, Jul 5, 2018, 2:53 AM Arturas Mazeika, wrote: > > &

Re: push to the limit without going over

2018-07-05 Thread Arturas Mazeika
rror. > > You will get significantly better throughput if you batch your > docs and use the client.add(list_of_documents) BTW. > > Another possibility is to use the new metrics (since Solr 6.4). They > provide over 200 metrics you can query, and it's quite > possible that

Re: Querying in Solrcloud

2018-07-05 Thread Arturas Mazeika
Hi Erick, wow. This Email had a such a profound effect and filled so many gaps in my head. I was wondering how master-slave (through replication) and (quorum based or whatever the name is) distribution live under the same hood in solr. And in such a concise manner! Good Job indeed. I wonder

how to use HTMLStripCharFilter in solrJ?

2018-07-05 Thread Arturas Mazeika
Hi Solr Folk, What would be the easiest way to use some of the Solr and Lucene components in SolrJ? I am pretty amazed how much thought and careful engineering went into some individual components to cover the wild real world effectively. And I wonder whether one could re-use some of them in

push to the limit without going over

2018-07-04 Thread Arturas Mazeika
Hi Solr Folk, I am trying to push solr to the limit and sometimes I succeed. The questions is how to not go over it, e.g., avoid: java.lang.RuntimeException: Tried fetching cluster state using the node names we knew of, i.e. [192.168.56.1:9998_solr, 192.168.56.1:9997_solr,

Re: /replication?command=details does not show infos for all replicas on the core

2018-07-02 Thread Arturas Mazeika
indeed! Cheers, Arturas On Fri, Jun 29, 2018 at 5:44 PM, Shawn Heisey wrote: > On 6/29/2018 8:47 AM, Arturas Mazeika wrote: > >> Out of curiosity: some cores give infos for both shards (through >> replication query) and some only for one (if you still be able to see the >&

Re: /replication?command=details does not show infos for all replicas on the core

2018-06-29 Thread Arturas Mazeika
). I wonder why.. Cheers, Arturas On Fri, Jun 29, 2018 at 4:30 PM, Shawn Heisey wrote: > On 6/29/2018 7:53 AM, Arturas Mazeika wrote: > >> but the query reports infos on only one shard: >> >> F:\solr_server\solr-7.2.1>curl -s >> http://localhost:9996/solr/de_wik

/replication?command=details does not show infos for all replicas on the core

2018-06-29 Thread Arturas Mazeika
Hi Solr-Team, I am benchmarking solr with the German Wikipedia pages on 4 nodes (Running on ports , 9998, 9997 and 9996), 4 shards, replication factor 2): "F:\solr_server\solr-7.2.1\bin\solr.cmd" start -m 3g -cloud -p -s "F:\solr_server\solr-7.2.1\example\cloud\node1\solr"

Re: cannot download the config directory with zkcli.bat from embedded zookeeper

2018-06-28 Thread Arturas Mazeika
highly puzzled. Comments are very welcome. I'll keep an eye on what's going on and keep you guys informed. Thanks for the support!! Cheers, Arturas On Thu, Jun 28, 2018 at 6:07 PM, Arturas Mazeika wrote: > Hi Shawn, et al, > > I haven't thought about ip6. Interesting idea. > &g

Re: cannot download the config directory with zkcli.bat from embedded zookeeper

2018-06-28 Thread Arturas Mazeika
solr.cmd" start -m 1g -cloud -p -s "F:\solr_server\solr-7.2.1\example\cloud\node1\solr" "F:\solr_server\solr-7.2.1\bin\solr.cmd" start -m 1g -cloud -p 9998 -s "F:\solr_server\solr-7.2.1\example\cloud\node2\solr" -z 127.0.0.1:10999 "F:\solr_server\sol

cannot download the config directory with zkcli.bat from embedded zookeeper

2018-06-28 Thread Arturas Mazeika
Hi Solr Folks, I was benchmarking solrCloud with the German Wikipedia docs under Windows 10 Pro, noticed that Windows defender was grabbing resources substantially, changed the policy (so windows defender would not kick in). Since then I am not able to download the config files from the solrCloud

Re: top 10 query overall vs shard

2018-06-22 Thread Arturas Mazeika
Hi Shawn et al, Thanks a lot for the prompt answer. It looks to me that I made quite a few mistakes in formulating those solr queries. Setting shards.qt to the name of the core was completely wrong. I tried to search for shards.qt in http://lucene.apache.org/solr/guide/7_3/ but it did not give

top 10 query overall vs shard

2018-06-22 Thread Arturas Mazeika
Hi Solr-Team, I am familiarizing myself with solr cloud and I am trying out and compare different processing setups. Short story: term-query ran on shard gives lower numbers compared querying the complete index. I wonder why. Long story: I grabbed the 2.7.1 version of solr, created a 4 core

Re: solrj (admin) requests

2018-05-04 Thread Arturas Mazeika
8 9:07 AM, Arturas Mazeika wrote: > > Short question: > > > > How can I systematically explore the solrj functionality/API? > > As Erick said, there is not an extensive programming guide. The > javadocs for SolrJ classes are pretty decent, but figuring out precisely

solrj (admin) requests

2018-05-03 Thread Arturas Mazeika
Hi Solr Team, Short question: How can I systematically explore the solrj functionality/API? Long question: I am discovering solrj functionality and I am pretty much impressed what solrj can do. What I am less impressed is my knowledge how to find what I am looking for. On the positive side,

Re: versions of documentation: suggestion for improvement

2018-04-24 Thread Arturas Mazeika
0595 into the > issue LUCENE-7924 ... but so far no one has attempted the > tooling/scripting work needed to make it happen. > > Pathes certainly welcome. > > > > : Date: Mon, 23 Apr 2018 09:55:35 +0200 > : From: Arturas Mazeika <maze...@gmail.com> > : Reply-To: s

versions of documentation: suggestion for improvement

2018-04-23 Thread Arturas Mazeika
Hi Solr-Team, If I google for specific features for solr, I usually get redirected to 6.6 version of the documentation, like this one: https://lucene.apache.org/solr/guide/6_6/overview-of-documents-fields-and-schema-design.html Since I am playing with 7.2 version of solr, I almost always need

Re: Writing config directly to zookeeper

2018-04-18 Thread Arturas Mazeika
Hi Walter, Thanks for the message. Would you care to share the tool with us? I would be interested.. Or have you shared it already? Cheers, Arturas On Wed, Apr 18, 2018 at 5:09 PM, Walter Underwood wrote: > I wrote a Python tool to do this. I use the kazoo package to

Re: schema-api: modifying schema in xml format

2018-04-18 Thread Arturas Mazeika
Schema API only understands commands in JSON. I looked through the > ref guide page, and I’m surprised that this isn’t stated directly; I’ll try > to fix that. > > -- > Steve > www.lucidworks.com > > > On Apr 18, 2018, at 4:12 AM, Arturas Mazeika <maze...@gmail.com> wrot

schema-api: modifying schema in xml format

2018-04-18 Thread Arturas Mazeika
Hi solr-users, is it possible to modify the managed schema using schema api and submit the commands in XML format? I am able to add a data type using: curl -X POST -H 'Content-type:application/json' --data-binary '{ "add-field-type": { "name":"text_de_ph", "class":"solr.TextField",

querying for positions (with term vector component)

2018-04-16 Thread Arturas Mazeika
Hi All, Term vector component allows to store and retrieve quite a bit of very useful information, e.g., for a term one can get: "verlag",[ "tf",6, "positions",["position",5, "position",42, "position",64, "position",85, "position",106, "position",127], "offsets",[ "start",35, "end",41,

Re: some parent documents

2018-04-06 Thread Arturas Mazeika
10 vendors. Consider then answer to be 1% large (there are 10 000 documents satisfying the query). What would be the complexity of answering it? Cheers, Arturas On Thu, Apr 5, 2018 at 11:47 AM, Arturas Mazeika <maze...@gmail.com> wrote: > Hi Mikhail et al, > > Thanks a lot for s

Re: some parent documents

2018-04-05 Thread Arturas Mazeika
[1] from [2]? > > https://github.com/apache/lucene-solr/blob/master/ > lucene/join/src/java/org/apache/lucene/search/join/ > ToParentBlockJoinQuery.java#L178 > > On Wed, Apr 4, 2018 at 3:39 PM, Arturas Mazeika <maze...@gmail.com> wrote: > > > Hi Mikha

Re: some parent documents

2018-04-04 Thread Arturas Mazeika
org> wrote: > q=+{!parent which=ntype:p v='+msg:Hello +person:Arturas'} +{!parent which= > ntype:p v='+msg:ciao +person:Vai'} > > On Wed, Apr 4, 2018 at 12:19 PM, Arturas Mazeika <maze...@gmail.com> > wrote: > > > Hi Mikhail et al, > > > > It seems to

Re: some parent documents

2018-04-04 Thread Arturas Mazeika
quot;_childDocuments_": [ {"id":"2_1", "ntype":"c", "person":"Vai", "time":"4:14", "msg":"Hi"}, {"id":"2_2", "ntype":"c", &qu

Re: some parent documents

2018-04-03 Thread Arturas Mazeika
s" , "time":"4:14", "msg":"IBM Watson"}, {"id":"2_3", "person":"Vai" , "time":"4:15", "msg":"need to retain content"}, {"id":"2

Re: querying vs. highlighting: complete freedom?

2018-04-03 Thread Arturas Mazeika
t;david.w.smi...@gmail.com> wrote: > Thanks for your review! > > On Tue, Apr 3, 2018 at 6:56 AM Arturas Mazeika <maze...@gmail.com> wrote: > ... > > > What I missed at the beginning of the documentation is the minimal set of > > requirements that is reacquired

some parent documents

2018-04-03 Thread Arturas Mazeika
Hi Solr Fans, I am trying to make sense of information retrieval using expressions like "some parent", "*only parent*", " *all parent*". I am also trying to understand the syntax "!parent which" and "!child of". On the technical level, I am reading the following documents: [1]

Re: querying vs. highlighting: complete freedom?

2018-04-03 Thread Arturas Mazeika
proving the documentation here. I hope it's > clearer. > https://builds.apache.org/job/Solr-reference-guide-master/ > javadoc/highlighting.html > The docs for hl.fl, hl.q, hl.qparser were all updated. The meat of the > change was a new note in hl.fl including an example. It's ki

Re: querying vs. highlighting: complete freedom?

2018-03-26 Thread Arturas Mazeika
hlighted. > > On the surface, this does seem like something that should be > changed, I'll go ahead and ping the dev list. > > NOTE: I was trying this on Solr 7.1 > > Best, > Erick > > On Fri, Mar 23, 2018 at 12:03 PM, Arturas Mazeika <maze...@gmail.com> > wro

Re: querying vs. highlighting: complete freedom?

2018-03-23 Thread Arturas Mazeika
il.com> > > wrote: > > > >> Perhaps we try it the other way round .. what's your use case for this? > I'm > >> trying to think of a situation where I'd need this a as user? > >> > >> The only reason I see myself doing this is CTRL+F in a page when the

Re: querying vs. highlighting: complete freedom?

2018-03-23 Thread Arturas Mazeika
ere I'd need this a as user? > > The only reason I see myself doing this is CTRL+F in a page when the search > result is not immediately visible for me ;) > > On Mar 23, 2018 9:41 AM, "Arturas Mazeika" <maze...@gmail.com> wrote: > > > H

Re: querying vs. highlighting: complete freedom?

2018-03-23 Thread Arturas Mazeika
s to specify > field2... > > The above does not bloat your index at all since the cost of > stored="true" indexed="true" is the same as if you use two fields, > each with only one option turned on. > > The second approach if you want to use FastVectorHighlig

querying vs. highlighting: complete freedom?

2018-03-22 Thread Arturas Mazeika
Hi Solr-Users, I've been playing with a german collection of documents, where I tried to search for one word (q=Tag) and highlighted another: (hl.q=Kundigung). Is this a "legal" use case? My key question is how can I tell solr which query analyzer to use for highlighting? Strictly speaking, I