Re: Programmatic Basic Auth on CloudSolrClient

2021-03-04 Thread Tomás Fernández Löbbe
lient? you will have to make sure to close all the clients that are created externally after done, since the Solr client won't in this case. On Thu, Mar 4, 2021 at 1:22 PM Mark H. Wood wrote: > On Wed, Mar 03, 2021 at 10:34:50AM -0800, Tomás Fernández Löbbe wrote: > > As far as I

Re: Programmatic Basic Auth on CloudSolrClient

2021-03-03 Thread Tomás Fernández Löbbe
igure the client > with your workaround? > > From: Tomás Fernández Löbbe<mailto:tomasflo...@gmail.com> > Sent: 04 March 2021 12:05 AM > To: solr-user@lucene.apache.org<mailto:solr-user@lucene.apache.org> > Subject: Re: Programmatic Basic Auth on CloudSolrClient > &g

Re: NPE in QueryComponent.mergeIds when using timeAllowed and sorting SOLR 8.7

2021-03-03 Thread Tomás Fernández Löbbe
Patch looks good to me. Since it's a bugfix it can be committed to 8_8 branch and released on the next bugfix release, though I don't think it should trigger one. In the meantime, if you can patch your environment and confirm that it fixes your problem, that's a good comment to leave in

Re: Programmatic Basic Auth on CloudSolrClient

2021-03-03 Thread Tomás Fernández Löbbe
As far as I know the current OOTB options are system properties or per-request (which would allow you to use different per collection, but probably not ideal if you do different types of requests from different parts of your code). A workaround (which I've used in the past) is to have a custom

Re: How pull replica works

2021-01-06 Thread Tomás Fernández Löbbe
Hi Abhishek, The pull replicas uses the "/replication" endpoint to copy full segment files (sections of the index) from the leader. It works in a similar way to the legacy leader/follower replication. This[1] talk tries to explain the different replica types and how they work. HTH, Tomás [1]

Re: [CVE-2020-13957] The checks added to unauthenticated configset uploads in Apache Solr can be circumvented

2020-10-13 Thread Tomás Fernández Löbbe
; * No Solr API, including the Admin UI, is designed to be exposed to > > non-trusted parties. Tune your firewall so that only trusted computers > and > > people are allowed access > > > > Credit: > > Tomás Fernández Löbbe, András Salamon > > > > R

Re: Updating configset

2020-09-11 Thread Tomás Fernández Löbbe
uns an async command to RELOAD. > > I really should open source that thing (in my copious free time). > > wunder > Walter Underwood > wun...@wunderwood.org > http://observer.wunderwood.org/ (my blog) > > > On Sep 11, 2020, at 9:35 AM, Tomás Fernández Löbbe &

Re: Updating configset

2020-09-11 Thread Tomás Fernández Löbbe
> > > > > >> On Fri, Sep 11, 2020, 13:26 Houston Putman > > wrote: > > >> > > >> I completely agree, there should be a way to overwrite an existing > > >> configSet. > > >> > > >> Looks like https://is

Re: Updating configset

2020-09-11 Thread Tomás Fernández Löbbe
I was in the same situation recently. I think it would be nice to have the configset UPLOAD command be able to override the existing configset instead of just fail (with a parameter such as override=true or something). We need to be careful with the trusted/unstrusted flag there, but that should

Re: Pull Replica compaints about UpdateLog being disabled when DocBasedVersionConstraintsProcessorFactory

2020-08-05 Thread Tomás Fernández Löbbe
This is an interesting bug. I’m wondering if we can completely skip the initialization of UpdateRequestProcessorFactories in PULL replicas... On Wed, Aug 5, 2020 at 8:40 AM Erick Erickson wrote: > Offhand, this looks like a bug, please raise a JIRA. > > You said: " We also have

Re: Multiple fq vs combined fq performance

2020-07-10 Thread Tomás Fernández Löbbe
All non-cached filters will be executed together (leapfrog between them) and will be sorted by the filter cost (I guess that, since you aren't setting a cost, then the order of the input matters). You can try setting a cost in your filters (lower than 100, so that they don't become post filters)

Re: [EXTERNAL] Getting rid of Master/Slave nomenclature in Solr

2020-06-23 Thread Tomás Fernández Löbbe
I agree in general with what Trey and Jan said and have suggested. I personally like to use "leader/follower". It's true that somewhat collides with SolrCloud terminology, but that's not a problem IMO, now that replica types exist, the “role” of the replica (leader vs. non-leader/follower) doesn’t

Re: Unbalanced shard requests

2020-05-16 Thread Tomás Fernández Löbbe
I just backported Michael’s fix to be released in 8.5.2 On Fri, May 15, 2020 at 6:38 AM Michael Gibney wrote: > Hi Wei, > SOLR-14471 has been merged, so this issue should be fixed in 8.6. > Thanks for reporting the problem! > Michael > > On Mon, May 11, 2020 at 7:51 PM Wei wrote: > > > >

Re: shard.preference for single shard queries

2019-12-05 Thread Tomás Fernández Löbbe
Look at SOLR-12217, it explains the limitation and has a patch for SolrJ cases. Should be merged soon. Note that the combination of replica types you are describing is not recommended. See

Re: Lucene optimization to disable hit count

2019-11-20 Thread Tomás Fernández Löbbe
Not yet: https://issues.apache.org/jira/browse/SOLR-13289 On Wed, Nov 20, 2019 at 4:57 PM Wei wrote: > Hi, > > I see this lucene optimization to disable hit counts for better query > performance: > > https://issues.apache.org/jira/browse/LUCENE-8060 > > Is the feature available in Solr 8.3? > >

Re: fq pfloat_field:* returns no documents, tfloat:* does

2019-11-20 Thread Tomás Fernández Löbbe
Hi Webster, > The fq facet_melting_point:* "Point" numeric fields don't support that syntax currently, and the way to retrieve "docs with any value in field foo" is "foo:[* TO *]". See https://issues.apache.org/jira/browse/SOLR-11746 On Wed, Nov 20, 2019 at 2:21 PM Webster Homer <

Re: NPE during spell checking when result collapsing is activated and local parameters are used

2019-11-15 Thread Tomás Fernández Löbbe
roblem with the collations, of > course. Therefore I never questioned it... > > Thank you! > Stefan > > > Am 15. November 2019 um 00:01:52, Tomás Fernández Löbbe ( > tomasflo...@gmail.com) schrieb: > > I believe your syntax is incorrect. I believe local params must all be > included i

Re: NPE during spell checking when result collapsing is activated and local parameters are used

2019-11-14 Thread Tomás Fernández Löbbe
I believe your syntax is incorrect. I believe local params must all be included in between the same {!...}, and "{!" can only be at the beginning have you tried: ={!collapse tag=collapser field=productId sort='merchantOrder asc, price asc, id asc'} On Thu, Nov 14, 2019 at 4:54 AM Stefan

Re: Does Solr replicate data securely

2019-11-13 Thread Tomás Fernández Löbbe
Yes, if you are using TLS for running Solr, the replication will happen using TLS On Wed, Nov 13, 2019 at 2:45 PM Pushkar Raste wrote: > Hi, > Can some help me with my question. > > On Tue, Nov 12, 2019 at 10:20 AM Pushkar Raste > wrote: > > > Hi, > > How about in the master/slave set up. If I

Lucene/Solr swag

2019-09-10 Thread Tomás Fernández Löbbe
If you are interested, Apache Comdev team added Lucene and Solr items to RedBubble: Lucene: https://www.redbubble.com/people/comdev/works/40953165-apache-lucene?asc=u Solr: https://www.redbubble.com/people/comdev/works/40952682-apache-solr?asc=u

Re: Mistake assert tips in FST builder ?

2019-04-18 Thread Tomás Fernández Löbbe
The Lucene list is probably better for this question. I'd try java-u...@lucene.apache.org On Mon, Apr 15, 2019 at 9:04 PM zhenyuan wei wrote: > Hi, >With current newest version, 9.0.0-snapshot,In > Builder.UnCompileNode.addArc() function, > found this line: > > assert numArcs == 0 || label

Re: Intervals vs Span guidance

2019-03-26 Thread Tomás Fernández Löbbe
While solr-user is not a bad place to ask this question, I suspect you'll get more answers in java-u...@lucene.apache.org since there is a lot going on at the Lucene level right now. On Tue, Mar 26, 2019 at 9:09 AM Ramsey Haddad (BLOOMBERG/ LONDON) < rhadda...@bloomberg.net> wrote: > We are

Re: cve-2017-

2019-02-28 Thread Tomás Fernández Löbbe
I updated the description of SOLR-12770 a bit. The problem stated is that, since the "shards" parameter allows any URL, someone could make an insecure Solr instance hit some other (secure) web endpoint. Solr would throw an exception, but the error

Re: Re: High CPU usage with Solr 7.7.0

2019-02-27 Thread Tomás Fernández Löbbe
Maybe a thread dump would be useful if you still have some instance running on 7.7 On Wed, Feb 27, 2019 at 7:28 AM Lukas Weiss wrote: > I can confirm this. Downgrading to 7.6.0 solved the issue. > Thanks for the hint. > > > > Von:"Joe Obernberger" > An: solr-user@lucene.apache.org,

Re: Reporting security vulnerability in Solr

2019-02-20 Thread Tomás Fernández Löbbe
Hi Krzysztof, There is some information on the past CVEs and dependency issues in https://wiki.apache.org/solr/SolrSecurity. For reporting, creating a private Jira is good, or following the guidelines here: https://www.apache.org/security/ (email secur...@apache.org or secur...@lucene.apache.org)

Re: Soft commit and new replica types

2018-12-14 Thread Tomás Fernández Löbbe
to submit a PR to the Reference Guide with those > explanations? I can take a stab at it. > > Regards, > Edward > > On Fri, Dec 14, 2018 at 3:08 AM Tomás Fernández Löbbe < > tomasflo...@gmail.com> > wrote: > > > > > > > > > No, I am not see

Re: Soft commit and new replica types

2018-12-13 Thread Tomás Fernández Löbbe
ceive the documents anyway (only replicates). Commit is skipped anyway (see DistributedUpdateProcessor.processCommit) The transaction log is only used for recovery purposes (or realtime get). I hope that clarifies things. > > > PS: congratulations on the Berlin Buzzwords' talk. :) > Thanks! >

Re: Soft commit and new replica types

2018-12-10 Thread Tomás Fernández Löbbe
I think this is a good point. The tricky part is that if TLOG replicas don't replicate often, their transaction logs will get too big too, so you want the replication interval of TLOG replicas to be tied to the auto(hard)Commit interval (by default at least). If you are using them for search, you

Re: TolerantUpdateProcessorFactory maxErrors=-1 issue

2018-09-21 Thread Tomás Fernández Löbbe
Hi Derek, I suspect you need to move the TolerantUpdateProcessorFactory to the beginning of the chain On Thu, Sep 20, 2018 at 6:17 PM Derek Poh wrote: > Does any one have any idea whatcould be the causeof this? > > On 19/9/2018 11:40 AM, Derek Poh wrote: > > In addition, I tried withmaxErrors=3

Re: 7.3 appears to leak

2018-09-05 Thread Tomás Fernández Löbbe
I created SOLR-12743 to track this. On Mon, Jul 16, 2018 at 12:30 PM Markus Jelsma wrote: > Hello Thomas, > > To be absolutely sure you suffer from the same problem as one of our > collections, can you confirm that your Solr cores are leaking a > SolrIndexSearcher instance on each commit? If

Re: Heap Memory Problem after Upgrading to 7.4.0

2018-09-05 Thread Tomás Fernández Löbbe
I think this is pretty bad. I created https://issues.apache.org/jira/browse/SOLR-12743. Feel free to add any more details you have there. On Mon, Sep 3, 2018 at 1:50 PM Markus Jelsma wrote: > Hello Björn, > > Take great care, 7.2.1 cannot read an index written by 7.4.0, so you > cannot roll

Re: Solr Cloud not routing to PULL replicas

2018-08-28 Thread Tomás Fernández Löbbe
Hi Ash, Do you see all shard queries going to the TLOG replicas or “most” (are there some going to the PULL replicas). You can confirm this by looking in the logs for queries with “isShard=true” parameter. Are the PULL replicas active (since you are using a load balancer I’m guessing you are not

Re: code v.s. schema for BEST_COMPRESSION mode

2018-06-17 Thread Tomás Fernández Löbbe
The schema configuration in your first link is the way to tell Solr to use a particular compression mode in the Lucene index (the second link). So yes, the schema change should be enough. On Sun, Jun 17, 2018 at 6:39 AM Zahra Aminolroaya wrote: > I want to reduce the size of indexed and stored

Re: collection properties

2018-04-13 Thread Tomás Fernández Löbbe
Yes... Unfortunately there is no GET API :S Can you open a Jira? Patch should be trivial On Fri, Apr 13, 2018 at 3:05 PM, Hendrik Haddorp wrote: > Hi, > > with Solr 7.3 it is possible to set arbitrary collection properties using >

Re: Solr performance on EC2 linux

2017-05-02 Thread Tomás Fernández Löbbe
I remember seeing some performance impact (even when not using it) and it was attributed to the calls to System.nanoTime. See SOLR-7875 and SOLR-7876 (fixed for 5.3 and 5.4). Those two Jiras fix the impact when timeAllowed is not used, but I don't know if there were more changes to improve the

Re: Interval Facets with JSON

2017-02-23 Thread Tomás Fernández Löbbe
Hi Deniz, Interval Facets is currently not supported with JSON Facets as Tom said. Could you create a Jira issue? On Fri, Feb 10, 2017 at 6:16 AM, Tom Evans wrote: > On Wed, Feb 8, 2017 at 11:26 PM, deniz wrote: > > Tom Evans-2 wrote > >> I

Re: NumericDocValues only supports long?

2017-02-14 Thread Tomás Fernández Löbbe
I think you should use FloatFieldSource. Solr uses Float.floatToIntBits(floatValue) when adding the DV field, so you could use Float.intBitsToFloat((int)longValue) when reading (See TrieField.createFields(...)), but FloatFieldSource is already doing that for you. On Tue, Feb 14, 2017 at 10:37 AM,

Re: Limit = 0? Does it still calculate facet ?

2016-12-22 Thread Tomás Fernández Löbbe
Yes, facet.limit will short circuit and not calculate the facet for the field. I'm assuming you can't just use facet=false? Tomas On Thu, Dec 22, 2016 at 1:00 PM, William Bell wrote: > We have a qt=provider and it sets facets. > > We want to short circuit the facet. Can we

CREATEALIAS to non-existing collections

2016-12-09 Thread Tomás Fernández Löbbe
We currently support requests to CREATEALIAS to collections that don’t exist. Requests to this alias later result in 404s. If the target collection is later created, requests to the alias will begin to work. I’m wondering if someone is relying on this behavior, or if we should validate the

Re: Solr suddenly starts creating .cfs (compound) segments during indexing

2016-09-27 Thread Tomás Fernández Löbbe
By default, TieredMergePolicy uses CFS for segments that are less than 10% of the index[1]. If you set the "useCompoundFile" element in solrconfig (to either true or false) you can override this[2]. TMP also has some other limits and logic on when to (and when not to) use CFS. You can take a look

Re: Faceting search issues

2016-09-27 Thread Tomás Fernández Löbbe
I wonder why in the "facet_field" section of the first query it says: "facet_fields": {"id": []} when it should be saying "facet_fields": {"name": []} Also, why is the second query not including the fq in the echoParams section. What is that other query with fq=aggregationname:story? This is

Re: Viewing the Cache Stats [SOLR 6.1.0]

2016-09-24 Thread Tomás Fernández Löbbe
That thread is pretty old and probably talking about the old(est) admin UI (before 4.0). The cache stats can be found selecting the core in the dropdown and then "Plugin/Stats". See https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=32604180 Tomás On Sat, Sep 24, 2016 at 12:14

Re: AW: group.facet=true and facet on field of type int -> org.apache.solr.common.SolrException: Exception during facet.field

2016-07-19 Thread Tomás Fernández Löbbe
Hi Sebastian, This looks like https://issues.apache.org/jira/browse/SOLR-7495 On Jul 19, 2016 3:46 AM, "Sebastian Riemer" wrote: > May I respectfully refer again to a question I posted last week? > > Thank you very much and a nice day to you all! > > Sebastian >

Re: deploy solr on cloud providers

2016-07-06 Thread Tomás Fernández Löbbe
On Wed, Jul 6, 2016 at 2:30 AM, Lorenzo Fundaró < lorenzo.fund...@dawandamail.com> wrote: > On 6 July 2016 at 00:00, Tomás Fernández Löbbe <tomasflo...@gmail.com> > wrote: > > > The leader will do the replication before responding to the client, so > lets > >

Re: deploy solr on cloud providers

2016-07-05 Thread Tomás Fernández Löbbe
The leader will do the replication before responding to the client, so lets say the leader gets to update it's local copy, but it's terminated before sending the request to the replicas, the client should get either an HTTP 500 or no http response. From the client code you can take action (log,

Re: deploy solr on cloud providers

2016-07-05 Thread Tomás Fernández Löbbe
I think there are two parts to this question: * If a node just disappears you should be fine in terms of data availability, since Solr in "SolrCloud" replicates the data as it comes it (before sending the http response). Even if the leader disappears and never comes back as long as you have one

Re: OOM script executed

2016-05-03 Thread Tomás Fernández Löbbe
ryError > -XX:HeapDumpPath=/var/solr/logs/OOM_Heap_dump/" > ...but this doesn't seem to be really helpful... > > Kind regards, > Bastien > > > On 02/05/2016 22:55, Tomás Fernández Löbbe wrote: > >> You could, but before that I'd try to see what's using your me

Re: OOM script executed

2016-05-02 Thread Tomás Fernández Löbbe
You could, but before that I'd try to see what's using your memory and see if you can decrease that. Maybe identify why you are running OOM now and not with your previous Solr version (assuming you weren't, and that you are running with the same JVM settings). A bigger heap usually means more work

Re: What does the "Max Doc" means in Admin interface?

2016-05-02 Thread Tomás Fernández Löbbe
"Max Docs" is a confusing. It's not really the maximum number of docs you can have, it's just the total amount of docs in your index INCLUDING DELETED DOCS that haven't been cleared by a merge. "Heap Memory Usage" is currently broken. See https://issues.apache.org/jira/browse/SOLR-7475 On Sun,

Re: Next Solr Release - 5.5.1 or 6.0 ?

2016-03-24 Thread Tomás Fernández Löbbe
> > > Not to mention the fact that Solr 6 is using deprecated Lucene 6 > numeric types if those are removed in Lucene 7, then what? > > I believe this is going to be an issue. We have SOLR-8396 open, but it doesn't look like it's going to make

Re: SolrCloud: published host/port

2016-03-24 Thread Tomás Fernández Löbbe
I believe this can be done by setting the "host" and "hostPort" elements in solr.xml. In the default solr.xml they are configured in a way to support also setting them via System properties: ${host:} ${jetty.port:8983} Tomás On Wed, Mar 23, 2016 at 11:26 PM, Hendrik Haddorp

Re: New comer - Benoit Vanalderweireldt

2016-02-27 Thread Tomás Fernández Löbbe
Yes, you can create a new Jira if there isn't one already. I believe you can create a new pull request with the Jira number in the title, and that gets automatically appended to the Jira issue. Other option is to create a patch and upload it to the Jira manually. On Sat, Feb 27, 2016 at 11:52 AM,

Re: SolrCloud replicas out of sync

2016-01-28 Thread Tomás Fernández Löbbe
Maybe you are hitting the reordering issue described in SOLR-8129? Tomás On Wed, Jan 27, 2016 at 11:32 AM, David Smith wrote: > Sure. Here is our SolrCloud cluster: > >+ Three (3) instances of Zookeeper on three separate (physical) > servers. The ZK servers

Re: SOLR replicas performance

2016-01-08 Thread Tomás Fernández Löbbe
Hi Luca, It looks like your queries are complex wildcard queries. My theory is that you are CPU-bounded, for a single query one CPU core for each shard will be at 100% for the duration of the sub-query. Smaller shards make these sub-queries faster which is why 16 shards is better than 8 in your

Re: Facet shows deleted values...

2015-12-29 Thread Tomás Fernández Löbbe
I believe the problem here is that terms from the deleted docs still appear in the facets, even with a doc count of 0, is that it? Can you use facet.mincount=1 or would that not be a good fit for your use case?

Re: Solr index segment level merge

2015-12-29 Thread Tomás Fernández Löbbe
Would collection aliases be an option (assuming you are using SolrCloud mode)? https://cwiki.apache.org/confluence/display/solr/Collections+API#CollectionsAPI-api4 On Tue, Dec 29, 2015 at 9:21 PM, Erick Erickson wrote: > Could you simply add the new documents to the

Re: How turn on logging for segment merging

2015-11-01 Thread Tomás Fernández Löbbe
You can turn on "infoStream" from the solrconfig: https://cwiki.apache.org/confluence/display/solr/IndexConfig+in+SolrConfig#IndexConfiginSolrConfig-OtherIndexingSettings Tomás On Sun, Nov 1, 2015 at 8:59 AM, Pushkar Raste wrote: > Is segment merging information logged

Re: SolrCloud Admin UI shows node is Down, but state.json says it's active/up

2015-09-08 Thread Tomás Fernández Löbbe
I believe this is expected in the current code. From Replica.State javadoc: /** * The replica's state. In general, if the node the replica is hosted on is * not under {@code /live_nodes} in ZK, the replica's state should be * discarded. */ public enum State { /** * The

Re: Order of hosts in zkHost

2015-09-04 Thread Tomás Fernández Löbbe
I believe Arcadius has a point, but I still think the answer is no. ZooKeeper clients (Solr/SolrJ) connect to a single ZooKeeper server instance at a time, and keep that session open to that same server as long as they can/need. During this time, all interactions between the client and the ZK

Re: Lucene/Solr 5.0 and custom FieldCahe implementation

2015-08-31 Thread Tomás Fernández Löbbe
ation as described @ > > >> > https://issues.apache.org/jira/browse/LUCENE-2394. I'm not > endorsing > > >> the > > >> > patch that is provided (I haven't even looked at it) just the > concept > > in > > >> > general. > > >>

Re: Lucene/Solr 5.0 and custom FieldCahe implementation

2015-08-27 Thread Tomás Fernández Löbbe
I don't think there is a way to do this now. Maybe we should separate the logic of creating the SolrIndexSearcher to a factory. Moving this logic away from SolrCore is already a win, plus it will make it easier to unit test and extend for advanced use cases. Tomás On Wed, Aug 26, 2015 at 8:10

Re: SolrCloud Core Reload

2015-04-17 Thread Tomás Fernández Löbbe
Optimize will be distributed to all shards/replicas. I believe reload will only reload the specific core. For reloading the complete collection use the Collections API: https://cwiki.apache.org/confluence/display/solr/Collections+API On Thu, Apr 16, 2015 at 5:15 PM, Vincenzo D'Amore

Re: 5.1 'unique' facet function / calcDistinct

2015-04-17 Thread Tomás Fernández Löbbe
II. Is there a way to use the stats.calcdistinct functionality and only return the countDistinct portion of the response and not the full list of distinct values -- as provided in the distinctValues portion of the response. In a field with high cardinality the response size becomes too

Re: Range facets in sharded search

2015-04-16 Thread Tomás Fernández Löbbe
This looks like a bug. The logic to merge range facets from shards seems to only be merging counts, not the first level elements. Could you create a Jira? On Thu, Apr 16, 2015 at 2:38 PM, Will Miller wmil...@fbbrands.com wrote: I am seeing some some odd behavior with range facets across

Re: Range facets in sharded search

2015-04-16 Thread Tomás Fernández Löbbe
Should be fixed in 5.2. See https://issues.apache.org/jira/browse/SOLR-7412 On Thu, Apr 16, 2015 at 3:18 PM, Tomás Fernández Löbbe tomasflo...@gmail.com wrote: This looks like a bug. The logic to merge range facets from shards seems to only be merging counts, not the first level elements

Re: Suggester Example In Documentation Not Working

2015-01-22 Thread Tomás Fernández Löbbe
I see that the docs say that the doc needs to be indexed only, but for Fuzzy or Analyzed, I think the field needs to be stored. On the other side, not sure how much sense it makes to use any of those two implementations if the field type you want to have is string. Tomás On Thu, Jan 22, 2015 at

Re: Slow faceting performance on a docValues field

2015-01-13 Thread Tomás Fernández Löbbe
Range Faceting won't use the DocValues even if they are there set, it translates each gap to a filter. This means that it will end up using the FilterCache, which should cause faster followup queries if you repeat the same gaps (and don't commit). You may also want to try interval faceting, it

Re: Slow faceting performance on a docValues field

2015-01-13 Thread Tomás Fernández Löbbe
possible in your use case that may be faster. Tomás On Tue, Jan 13, 2015 at 11:12 AM, Tomás Fernández Löbbe tomasflo...@gmail.com wrote: No, you are not misreading, right now there is no automatic way of generating the intervals on the server side similar to range faceting... I guess it won't work

Re: Slow faceting performance on a docValues field

2015-01-13 Thread Tomás Fernández Löbbe
structure it this way. I assume I must be mis-understanding how to use Interval faceting with dates. Are there any concrete examples you know of? A google search did not come up with much. Kind regards, Dave On Tuesday, January 13, 2015 12:16 PM, Tomás Fernández Löbbe tomasflo

Re: Slow faceting performance on a docValues field

2015-01-13 Thread Tomás Fernández Löbbe
. If a search yielded millions of hits, I could understand 24 seconds to calculate the facets. But not for a search with only 3 hits. What am I missing? Regards, David On Tuesday, January 13, 2015 1:12 PM, Tomás Fernández Löbbe tomasflo...@gmail.com wrote: No, you are not misreading

Re: import solr source to eclipse

2014-10-12 Thread Tomás Fernández Löbbe
The way I do this: From a terminal: svn checkout https://svn.apache.org/repos/asf/lucene/dev/trunk/ lucene-solr-trunk cd lucene-solr-trunk ant eclipse ... And then, from your Eclipse import existing java project, and select the directory where you placed lucene-solr-trunk On Sun, Oct 12, 2014 at

Re: Turn off suggester

2014-09-25 Thread Tomás Fernández Löbbe
The SuggestComponent is not in the default components list. There must be a request handler with this component added explicitly in the solrconfig.xml Tomás On Thu, Sep 25, 2014 at 12:22 PM, Alexandre Rafalovitch arafa...@gmail.com wrote: Isn't it one of the Solr components? Can it be just

Re: Solr Faceting issue

2014-08-04 Thread Tomás Fernández Löbbe
If I understand correctly, you are looking for multi-select faceting: https://wiki.apache.org/solr/SimpleFacetParameters#Multi-Select_Faceting_and_LocalParams On Mon, Aug 4, 2014 at 9:46 PM, Smitha Rajiv smitharaji...@gmail.com wrote: Hi Solr Experts, Request you to please help me in fixing

Re: Memory leak for debugQuery?

2014-07-16 Thread Tomás Fernández Löbbe
Also, is this trunk? Solr 4.x? Single shard, right? On Wed, Jul 16, 2014 at 2:24 PM, Erik Hatcher erik.hatc...@gmail.com wrote: Tom - You could maybe isolate it a little further by seeing using the “debug parameter with values of timing|query|results Erik On May 15, 2014, at

Re: Continue indexing doc after error

2014-07-01 Thread Tomás Fernández Löbbe
I think what you want is what’s described in https://issues.apache.org/jira/browse/SOLR-445 This has not been committed because it still doesn’t work with SolrCloud. Hoss gave me the hint to look at DistributingUpdateProcessorFactory to solve the problem described in the last comments, but I

Re: How to Configure Solr For Test Purposes?

2014-05-27 Thread Tomás Fernández Löbbe
What do you suggest for my purpose? If a test case fails re-running it for some times maybe a solution? What kind of configuration do you suggest for my Solr configuration? From the snippet of test that you showed, it looks like it's testing only Solr functionality. So, first make sure this

Re: SolrMeter is dead?

2014-05-16 Thread Tomás Fernández Löbbe
It didn't have any improvements for a long time now (It doesn't have any SolrCloud-related feautes for example), I just added a note on Solr wiki to alert users about that. Feel free to ask on the solrmeter mailing list if you have any other questions. Tomás On Wed, May 14, 2014 at 3:56 AM,

Re: Search Suggestion Filtering

2014-01-15 Thread Tomás Fernández Löbbe
I think your use case is the one described in LUCENE-5350, maybe you want to take a look to the patch and comments there. Tomás On Wed, Jan 15, 2014 at 12:58 PM, Hamish Campbell hamish.campb...@koordinates.com wrote: Hi all, I'm looking into options for filtering the search suggestions

Re: distributed search is significantly slower than direct search

2013-11-17 Thread Tomás Fernández Löbbe
Hi Yuval, quick question. You say that your code has 750k docs and around 400mb? Is this some kind of test dataset and you expect it to grow significantly? For an index of this size, I wouldn't use distributed search, single shard should be fine. Tomás On Sun, Nov 17, 2013 at 6:50 AM, Yuval

Re: limiting deep pagination

2013-10-08 Thread Tomás Fernández Löbbe
I don't know of any OOTB way to do that, I'd write a custom request handler as you suggested. Tomás On Tue, Oct 8, 2013 at 3:51 PM, Peter Keegan peterlkee...@gmail.com wrote: Is there a way to configure Solr 'defaults/appends/invariants' such that the product of the 'start' and 'rows'

Re: SolrCloud distribute search question.

2013-10-04 Thread Tomás Fernández Löbbe
Yes, the machine that gets the initial request is the one that distributes to the shards and the aggregates the results. On Fri, Oct 4, 2013 at 9:55 AM, yriveiro yago.rive...@gmail.com wrote: Hi, When a distributed search is done, the inital query is forwarded to all shards that are part

Re: App server?

2013-10-03 Thread Tomás Fernández Löbbe
You may also want to take a look at this Jira: https://issues.apache.org/jira/browse/SOLR-4792 for Solr 5.0 (trunk) Tomás On Thu, Oct 3, 2013 at 10:41 AM, Michael Sokolov msoko...@safaribooksonline.com wrote: On 10/02/2013 06:44 PM, Mark wrote: Is Jetty sufficient for running Solr or

Re: Top 10 Terms in Index (by date)

2013-04-02 Thread Tomás Fernández Löbbe
in a document it needs to be counted that many times. That seemed to rule out faceting like you mentioned as well as the TermsComponent (which as I understand also only counts documents). Thanks, Andy Pickler On Mon, Apr 1, 2013 at 4:31 PM, Tomás Fernández Löbbe tomasflo...@gmail.com wrote: So

Re: Top 10 Terms in Index (by date)

2013-04-01 Thread Tomás Fernández Löbbe
So you have one document per user comment? Why not use faceting plus filtering on the dateCreated field? That would count number of documents for each term (so, in your case, if a term is used twice in one comment it would only count once). Is that what you are looking for? Tomás On Mon, Apr 1,

Re: Urgent:Solr cloud issue

2013-03-28 Thread Tomás Fernández Löbbe
Could you give more details on what's not working? Have you followed the instructions here: http://wiki.apache.org/solr/SolrCloud#Getting_Started Are you using an embedded Zookeeper or an external server? How many of them? Are you using numShards=1?2? What do you see in the Solr UI, in the cloud

Re: [ANNOUNCE] Solr wiki editing change

2013-03-28 Thread Tomás Fernández Löbbe
Steve, could you add me to the contrib group? TomasFernandezLobbe Thanks! Tomás On Thu, Mar 28, 2013 at 1:04 PM, Steve Rowe sar...@gmail.com wrote: On Mar 28, 2013, at 11:57 AM, Jilal Oussama jilal.ouss...@gmail.com wrote: Please add OussamaJilal to the group. Added to solr

Re: [Beginner] wants to contribute in open source project

2013-03-11 Thread Tomás Fernández Löbbe
You can also take a look at http://wiki.apache.org/solr/HowToContribute Tomás On Mon, Mar 11, 2013 at 9:20 AM, Andy Lester a...@petdance.com wrote: On Mar 11, 2013, at 11:14 AM, chandresh pancholi chandreshpancholi...@gmail.com wrote: I am beginner in this field. It would be great if

Re: Upgrade Solr3.5 to Solr4.1 - Index Reformat ?

2013-03-11 Thread Tomás Fernández Löbbe
Hi Feroz, due to Lucene's backward compatibility policy ( http://wiki.apache.org/lucene-java/BackwardsCompatibility ), a Solr 4.1 instance should be able to read an index generated by a Solr 3.5 instance. This would not be true if you need to change the schema. Also, be careful because Solr 4.1

Re: SolrCloud: port out of range:-1

2013-03-08 Thread Tomás Fernández Löbbe
A couple of comments about your deployment architecture too. You'll need to change the zoo.cfg to make the Zookeeper ensemble work with two instances as you are trying to do, have you? The example configuration with the zoo.cfg is intended for a single ZK instance as described in the SolrCloud

Re: Query parsing issue

2013-03-06 Thread Tomás Fernández Löbbe
It should be easy to extend ExtendedDismaxQParser and do your pre-processing in the parse() method before calling edismax's parse. Or maybe you could change the way EDismax is splitting the input query into clauses by extending the splitIntoClauses method? Tomás On Wed, Mar 6, 2013 at 6:37 AM,

Re: Solr cloud distributed queries, what goes on in the consolidation step?

2013-02-15 Thread Tomás Fernández Löbbe
In step 4, once the node 1 gets all the responses, it merges and sorts them: Lets say you requested 15 docs from each shard (because the rows parameter is 15), at this point node 1 merges the results from all the responses and gets the top 15 across all of them. The second request is only to get

Re: Eject a node from SolrCloud

2013-02-07 Thread Tomás Fernández Löbbe
Yes, currently the only option is to shutdown the node. Maybe not the cleanest way to remove a node. See this jira too: https://issues.apache.org/jira/browse/SOLR-3512 On Thu, Feb 7, 2013 at 7:20 AM, yriveiro yago.rive...@gmail.com wrote: Hi, Exists any way to eject a node from a solr

Re: Is Solr Cloud will be helpful in case of Load balancing

2013-02-01 Thread Tomás Fernández Löbbe
Yes and no. SolrCloud won't do it automatically. But it will make it easier for you to add/remove nodes from a collection. And if you use CloudSolrServer for queries, the new nodes will automatically be used for queries once they are ready to respond. Tomás On Fri, Feb 1, 2013 at 7:35 AM,

Re: copyField - copy only specific words

2013-01-25 Thread Tomás Fernández Löbbe
I think the best way will be to pre-process the document (or use a custom UpdateRequestProcessor). Other option, if you'll only use the cities field for faceting/sorting/searching (you don't need the stored content) would be to use a regular copyField and use a KeepWordFilter for the cities field.

Re: Solr cache considerations

2013-01-18 Thread Tomás Fernández Löbbe
(outside to Solr), an indexing a bulk every 10 minutes. Thanks. On Fri, Jan 18, 2013 at 2:15 AM, Tomás Fernández Löbbe tomasflo...@gmail.com wrote: I think fieldValueCache is not per segment, only fieldCache is. However, unless I'm missing something, this cache is only used for faceting

Re: group.ngroups behavior in response

2013-01-17 Thread Tomás Fernández Löbbe
Bu Amit is right, when you use group.main, the number of groups is not displayed, even if you set grop.ngroups. I think in this case NumFound should display the number of groups instead of the number of docs matching. Other option would be to keep numFound as the number of docs matching and add

Re: Solr cache considerations

2013-01-17 Thread Tomás Fernández Löbbe
I think fieldValueCache is not per segment, only fieldCache is. However, unless I'm missing something, this cache is only used for faceting on multivalued fields On Thu, Jan 17, 2013 at 8:58 PM, Erick Erickson erickerick...@gmail.comwrote: filterCache: This is bounded by 1M * (maxDoc) / 8 *

Re: Large transaction logs

2013-01-10 Thread Tomás Fernández Löbbe
Yes, you must issue hard commits. You can use autocommit and use openSearcher=false. Autocommit is not distributed, it has to be configured in every node (which will automatically be, because you are using the exact same solrconfig for all your nodes). Other option is to issue an explicit hard

Re: solr4.0 problem zkHost with multiple hosts throws out of range exception

2013-01-03 Thread Tomás Fernández Löbbe
I think it should be –DzkHost=zoo1:8983,zoo2:8983,zoo3:8983/solrroot Tomás On Thu, Jan 3, 2013 at 2:14 PM, Mark Miller markrmil...@gmail.com wrote: I don't really understand your question. More than one what? More than one external zk node? Start up an ensemble, and pass a comma sep list

Re: Upgrading from 3.6 to 4.0

2013-01-02 Thread Tomás Fernández Löbbe
AFAIK Solr 4 should be able to read Solr 3.6 indexes. Soon those files will be updated to 4.0 format and will not be readable by Solr 3.6 anymore. See http://wiki.apache.org/lucene-java/BackwardsCompatibility You should not use a a 3.6 SolrJ client with Solr 4 server. Tomás On Wed, Jan 2, 2013

  1   2   3   >