Re: Deprecate Schemaless Mode?

2020-08-05 Thread Alexandre Rafalovitch
As David said, I did a lot of breaking apart of the default configuration, and it is a bit of a mess in there. (If anybody wants to review the breakdown for Solr 6: https://www.slideshare.net/arafalov/rebuilding-solr-6-examples-layer-by-layer-lucenesolrrevolution-2016, slide 19 is the kicker.) I

Re: Deprecate Schemaless Mode?

2020-08-04 Thread David Smiley
Thanks for starting this thread, Marcus! For a historical note, the current _default configSet being "data driven" (aka "schemaless", a worse name) is largely because of SOLR-10272. Maybe I should have fought harder against it then. I threatened

Re: Deprecate Schemaless Mode?

2020-08-04 Thread Jan Høydahl
Learning mode won’t work if you have 10 existing collections and want to create #11. We could instead have a SchemaLearningUpdateHandler so people could explicitly post documents to, say, /schema-guess to modify the schema. We could even have this implicit. Then the _default config would have

Re: Deprecate Schemaless Mode?

2020-08-04 Thread Gus Heck
Interesting read. Might have changed now that we have authentication capabilities... but let's not thread jack :) On Tue, Aug 4, 2020 at 8:28 AM Erick Erickson wrote: > Having the admin UI allow uploads may not be secure. When I had a similar > idea a long time ago it got shot down, see the

Re: Deprecate Schemaless Mode?

2020-08-04 Thread Erick Erickson
Having the admin UI allow uploads may not be secure. When I had a similar idea a long time ago it got shot down, see the discussion at: https://issues.apache.org/jira/browse/SOLR-5287. I _think_ this is a different issue if the configs have to be residing on the system, not coming in from

Re: Deprecate Schemaless Mode?

2020-08-03 Thread Gus Heck
On Mon, Aug 3, 2020 at 5:03 PM Erick Erickson wrote: > Gus’s point about implementing something before removing it is well taken, > but we can deprecate it immediately without removing it. Gus’s point about > dynamic fields not being found until later in the cycle is well taken, but > not enough

Re: Deprecate Schemaless Mode?

2020-08-03 Thread Erick Erickson
Putting this up top so people will read it ;) Perhaps this is all just overthinking. Is the crux of the matter that schemaless is the default? Would it suffice to make it something that had to be explicitly enabled, rather than be something in solrconfig? In essence, flip the current way we do

Re: Deprecate Schemaless Mode?

2020-08-03 Thread Marcus Eagan
Typo*, I meant deprecate vs. remove, which we obviously cannot do. On Mon, Aug 3, 2020 at 12:05 Marcus Eagan wrote: > Furthermore, just to be clear, I opened a discussion about deprecating and > not replacing schemaless mode for two reasons: > > (1) the pain it has inflicted on Solr users and

Re: Deprecate Schemaless Mode?

2020-08-03 Thread Marcus Eagan
Furthermore, just to be clear, I opened a discussion about deprecating and not replacing schemaless mode for two reasons: (1) the pain it has inflicted on Solr users and reputation of Solr — deprecation logs speak volumes. (2) to get a better understanding of what engineers and others in the

Re: Deprecate Schemaless Mode?

2020-08-03 Thread Marcus Eagan
I know a person using it in production today. It's causing problems. They could abandon Solr altogether. It seems like a schema creation wizard is the right getting started motion if we know that schemaless doesn't do what people think it does. It's misleading. It's also a false representation of

Re: Deprecate Schemaless Mode?

2020-08-03 Thread Anshum Gupta
+1 Jason. Here's some context on how this came into being. Users find it difficult to understand and create a basic schema when just trying out Solr. This mode was supposed to help them bootstrap, and once they had a better understanding of how things worked, they'd tune it before using the

Re: Deprecate Schemaless Mode?

2020-08-03 Thread Gus Heck
I almost never use schemaless mode (better named "schema guessing mode") and I would never recommend it for use beyond prototyping. The primary use I see for it is to throw a bunch of data at it to get a starting point for a schema... say, for example, you want to see what Tika's going to produce

Re: Deprecate Schemaless Mode?

2020-08-03 Thread Jan Høydahl
I’m against deprecating it. Can we rename the feature as SchemaGuessing or FieldGuessing mode? That would set expectations right from the start. You may want to ask the user community too, but ask if they use it in development, and if they like it, since it is not made for prod use :) Jan

Re: Deprecate Schemaless Mode?

2020-08-03 Thread Tomás Fernández Löbbe
Agree with Jason. It's useful for prototyping and developing. I remember seeing some warnings about it (in the logs?), but maybe we need more? On Mon, Aug 3, 2020 at 10:41 AM Jason Gerlowski wrote: > > Is anyone on this list using schemaless mode in production or have you > tried to? > >

Re: Deprecate Schemaless Mode?

2020-08-03 Thread Jason Gerlowski
> Is anyone on this list using schemaless mode in production or have you tried > to? Schemaless mode is one of a group of Solr features present for convenience but not intended for production usage. It's in the same boat as "bin/post", and SolrCell, and others. These features do cause

Deprecate Schemaless Mode?

2020-08-03 Thread Marcus Eagan
Community, There are many of us who have had to deal with the pain of managing the schemaless mode of operation in Solr. I'm curious to get others' thoughts about how well it is working for them and if they would like to continue to use it. I for one don't think Schemaless works as intended and