Re: When to use StringField and when to use FacetField for categorization?

2023-10-23 Thread Cody Amen
The Jesuit mind is always justifying itself and trying to seem like there is popularity and hype around there ultimate disgraceful mislead. Sent from my iPhone > On Oct 23, 2023, at 2:26 PM, Cody Amen wrote: > > Hey man dont advertise or opinion. Lucene is just fine the way it is. Your > jus

Re: When to use StringField and when to use FacetField for categorization?

2023-10-23 Thread Cody Amen
Hey man dont advertise or opinion. Lucene is just fine the way it is. Your just idolating some Jesuit opinion to try and hurt people and disinformation. Sent from my iPhone > On Oct 23, 2023, at 2:25 PM, Cody Amen wrote: > > As oppossed to like i want to find everything less than < 6.00 > >

Re: When to use StringField and when to use FacetField for categorization?

2023-10-23 Thread Cody Amen
As oppossed to like i want to find everything less than < 6.00 Sent from my iPhone > On Oct 20, 2023, at 7:05 AM, Michael Wechner > wrote: > > Hi Adrien > > Thank you very much for your feedback as well! > > I just replaced the StringField by KeywordField :-) > > Thanks > > Michael > >>

Re: When to use StringField and when to use FacetField for categorization?

2023-10-23 Thread Cody Amen
Yo, a facet can be booleon or coordinate or currency,,, so maybe all your facets are the currency and then your fields would be an integer. And then you could say... i just want yen, or pesos or whatever Sent from my iPhone > On Oct 20, 2023, at 7:05 AM, Michael Wechner > wrote: > > Hi Adri

Re: When to use StringField and when to use FacetField for categorization?

2023-10-23 Thread Michael Wechner
Hi Greg Thank you very much for your additional information, really very much appreciated! Yes, generally speaking I think Lucene has many great features, which unfortunately are not so obvious for various reasons. Documentation could of course always be better, but I guess it is also beca

Re: When to use StringField and when to use FacetField for categorization?

2023-10-23 Thread Cody Amen
Not all of your fields might be strings Sent from my iPhone > On Oct 23, 2023, at 1:10 PM, Greg Miller wrote: > > Hey Michael- > > You've gotten a lot of great information here already. I'll point you to > one more implementation as well: StringValueFacetCounts. This > implementation lets you

Re: When to use StringField and when to use FacetField for categorization?

2023-10-23 Thread Greg Miller
Hey Michael- You've gotten a lot of great information here already. I'll point you to one more implementation as well: StringValueFacetCounts. This implementation lets you do faceting over arbitrary "string-like" doc value fields (SORTED and SORTED_SET). So if you already have a field of this type

Re: When to use StringField and when to use FacetField for categorization?

2023-10-20 Thread Michael Wechner
thanks very much for this additional information, Marc! Am 20.10.23 um 20:30 schrieb Marc D'Mello: Just following up on Mike's comment: It used to be that the "doc values" based faceting did not support arbitrary hierarchy, but I think that was fixed at some point. Yeah it was fixed a yea

Re: When to use StringField and when to use FacetField for categorization?

2023-10-20 Thread Marc D'Mello
Just following up on Mike's comment: > It used to be that the "doc values" based faceting did not support > arbitrary hierarchy, but I think that was fixed at some point. Yeah it was fixed a year or two ago, SortedSetDocValuesFacetField supports hierarchical faceting, I think you just need to e

Re: When to use StringField and when to use FacetField for categorization?

2023-10-20 Thread Michael Wechner
cool, thank you very much! Michael Am 20.10.23 um 15:44 schrieb Michael McCandless: You can use either the "doc values" implementation for facets (SortedSetDocValuesFacetField), or the "taxonomy" implementation (FacetField, in which case, yes, you need to create a TaxonomyWriter). It used to

Re: When to use StringField and when to use FacetField for categorization?

2023-10-20 Thread Michael McCandless
You can use either the "doc values" implementation for facets (SortedSetDocValuesFacetField), or the "taxonomy" implementation (FacetField, in which case, yes, you need to create a TaxonomyWriter). It used to be that the "doc values" based faceting did not support arbitrary hierarchy, but I think

Re: When to use StringField and when to use FacetField for categorization?

2023-10-20 Thread Michael Wechner
Hi Adrien Thank you very much for your feedback as well! I just replaced the StringField by KeywordField :-) Thanks Michael Am 20.10.23 um 14:13 schrieb Adrien Grand: FYI there is also KeywordField, which combines StringField and SortedSetDocValuesField. It supports filtering, sorting, facet

Re: When to use StringField and when to use FacetField for categorization?

2023-10-20 Thread Michael Wechner
Hi Mike Thanks for your feedback! IIUC in order to have the actual advantages of Facets one has to "connect" it with a TaxonomyWriter FacetsConfig config = new FacetsConfig(); DirectoryTaxonomyWriter taxoWriter = new DirectoryTaxonomyWriter(taxoDir); indexWriter.addDocument(config.build(taxoW

Re: When to use StringField and when to use FacetField for categorization?

2023-10-20 Thread Adrien Grand
FYI there is also KeywordField, which combines StringField and SortedSetDocValuesField. It supports filtering, sorting, faceting and retrieval. It's my go-to field for string values. Le ven. 20 oct. 2023, 12:20, Michael McCandless a écrit : > There are some differences. > > StringField is indexe

Re: When to use StringField and when to use FacetField for categorization?

2023-10-20 Thread Michael McCandless
There are some differences. StringField is indexed into the inverted index (postings) so you can do efficient filtering. You can also store in stored fields to retrieve. FacetField does everything StringField does (filtering, storing (maybe?)), but in addition it stores data for faceting. I.e.

When to use StringField and when to use FacetField for categorization?

2023-10-20 Thread Michael Wechner
Hi I have found the following simple Facet Example https://github.com/apache/lucene/blob/main/lucene/demo/src/java/org/apache/lucene/demo/facet/SimpleFacetsExample.java whereas for a simple categorization of documents I currently use StringField, e.g. doc1.add(new StringField("category", "bo