Re: additional term meta data

2021-01-29 Thread John Wang
--- > *From:* Martin Gainty > *Sent:* Wednesday, January 6, 2021 6:28 AM > *To:* dev@lucene.apache.org > *Subject:* Re: additional term meta data > > how to access first and last? > which version will you be merging > > -- > *Fro

Re: additional term meta data

2021-01-09 Thread John Wang
ore > this information? > > simon > > On Wed, Jan 6, 2021 at 8:06 PM John Wang wrote: > > > > Thank you, Martin! > > > > You can apply the patch to the 8.7 build by just ignoring the changes to > Lucene90xxx. Appreciate the help and guidance! > > > > -Joh

Re: additional term meta data

2021-01-06 Thread John Wang
hopefully your > TestLucene84PostingsFormat will run w/o fail or error > > Thx > martin- > > -- > *From:* John Wang > *Sent:* Wednesday, January 6, 2021 10:15 AM > *To:* dev@lucene.apache.org > *Subject:* Re: additional term meta data &g

Re: additional term meta data

2021-01-06 Thread John Wang
appreciated. Thank you! -John On Wed, Jan 6, 2021 at 3:28 AM Martin Gainty wrote: > how to access first and last? > which version will you be merging > > -- > *From:* John Wang > *Sent:* Tuesday, January 5, 2021 8:19 PM > *To:* dev@lucene.a

additional term meta data

2021-01-05 Thread John Wang
Hi folks: We like to propose a feature to add additional per-term metadata to the term diction. Currently, the TermsEnum API returns docFreq as its only meta-data. We needed a way to quickly get the first and last doc id in the postings without having to scan through the entire postings list.

Re: clue

2020-06-13 Thread John Wang
Yeah, release naming convention ties with lucene versions On Fri, Jun 12, 2020 at 2:41 AM Bram Van Dam wrote: > This looks great! I'm assuming that the 8.5.0 in the release name > indicates that this will only work with Lucene 8.5.0 indexes? > > On 10/06/2020 20:15, John Wang w

clue

2020-06-10 Thread John Wang
Hi folks: A while ago I wrote a tool to manage a Lucene index: https://github.com/javasoze/clue. If there is any interest, I'd like to donate it to the Lucene community as I don't find myself having time to properly support it. Thank you -John

Re: ram estimate for docvalues is incorrect

2020-05-28 Thread John Wang
find this conference presentation I gave with my > colleagues interesting, which touch on this: > https://youtu.be/hqeYAnsxPH8?t=855 > > ~ David Smiley > Apache Lucene/Solr Search Developer > http://www.linkedin.com/in/davidwsmiley > > > On Wed, May 27, 2020

Re: ram estimate for docvalues is incorrect

2020-05-28 Thread John Wang
ore: there are also field >>> infos, segment infos, BufferedIndexInput buffers, ... not to mention things >>> that we cache in threadlocals, as we would now need to count the number of >>> threads that have a cache entry. >>> >>> I'd suggest looking in

Re: ram estimate for docvalues is incorrect

2020-05-27 Thread John Wang
memory? > > > On Wed, May 27, 2020 at 9:52 PM John Wang wrote: > >> Hello, >> >> We have a reader cache that depends on the memory usage for each reader. >> We found the calculation of reader size for doc values to be under counting. >> >> See line: &

ram estimate for docvalues is incorrect

2020-05-27 Thread John Wang
Hello, We have a reader cache that depends on the memory usage for each reader. We found the calculation of reader size for doc values to be under counting. See line:

Re: DenseNumericDocValues corner case issue

2020-05-06 Thread John Wang
.java#L29 > > Thanks. > > On Wed, May 6, 2020 at 11:04 PM John Wang wrote: > >> Hi folks: >> >> We ran into a problem with DenseNumericDocValues in Lucene 8.0 codec >> where advanceExact returns true even when advancing to an invalid docid. >> >&

DenseNumericDocValues corner case issue

2020-05-06 Thread John Wang
Hi folks: We ran into a problem with DenseNumericDocValues in Lucene 8.0 codec where advanceExact returns true even when advancing to an invalid docid. Our code looks like: if (docval.advanceExact(docid)) { var myVal = docVal.get(docid); } when docid == DocIdSetIterator.NO_MORE_DOCS, the

[jira] [Resolved] (LUCENE-8718) Add docValueCount support for SortedSetDocValues

2019-03-15 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang resolved LUCENE-8718. --- Resolution: Workaround > Add docValueCount support for SortedSetDocVal

[jira] [Commented] (LUCENE-8718) Add docValueCount support for SortedSetDocValues

2019-03-15 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16793753#comment-16793753 ] John Wang commented on LUCENE-8718: --- [~jpountz] Yes, this would yield the same result, nice! Thanks

[jira] [Commented] (LUCENE-8718) Add docValueCount support for SortedSetDocValues

2019-03-12 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16790707#comment-16790707 ] John Wang commented on LUCENE-8718: --- [~jpountz] The convention has been to return -1 when underlying

[jira] [Created] (LUCENE-8718) Add docValueCount support for SortedSetDocValues

2019-03-10 Thread John Wang (JIRA)
John Wang created LUCENE-8718: - Summary: Add docValueCount support for SortedSetDocValues Key: LUCENE-8718 URL: https://issues.apache.org/jira/browse/LUCENE-8718 Project: Lucene - Core Issue

SuRf FST implementation

2018-06-23 Thread John Wang
The SigMod paper describes a more compact FST implementation looks really interesting: http://www.cs.cmu.edu/~huanche1/publications/surf_paper.pdf (reference implementation: https://github.com/efficient/SuRF) Was wondering if Lucene's FST implementation used by term dictionary can take

[jira] [Commented] (LUCENE-7597) MultiFieldQueryParser does not honor default operator when there is a special character

2016-12-17 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15757257#comment-15757257 ] John Wang commented on LUCENE-7597: --- Made the following change to getMultiFieldQuery() method and seems

[jira] [Created] (LUCENE-7597) MultiFieldQueryParser does not honor default operator when there is a special character

2016-12-17 Thread John Wang (JIRA)
John Wang created LUCENE-7597: - Summary: MultiFieldQueryParser does not honor default operator when there is a special character Key: LUCENE-7597 URL: https://issues.apache.org/jira/browse/LUCENE-7597

StoredFields directly from a file

2016-06-18 Thread John Wang
Hi folks: A question on indexing StoredFields (using the compressioncodec): I have a flat file storing in sequential order contents I intend to add to stored fields. I keep a list of lengths in memory: (byte,byte),(byte,byte,byte)... <--- file [2,3,...] <-- memory where each element

[jira] [Commented] (LUCENE-7002) MultiCollector throws NPE when there is CollectTerminatedException is thrown by a subcollector

2016-01-30 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15125014#comment-15125014 ] John Wang commented on LUCENE-7002: --- [~jpountz] Looks like this might be related to LUCENE-6772

[jira] [Created] (LUCENE-7002) MultiCollector throws NPE when there is CollectTerminatedException is thrown by a subcollector

2016-01-29 Thread John Wang (JIRA)
John Wang created LUCENE-7002: - Summary: MultiCollector throws NPE when there is CollectTerminatedException is thrown by a subcollector Key: LUCENE-7002 URL: https://issues.apache.org/jira/browse/LUCENE-7002

[jira] [Updated] (LUCENE-6969) Exception in reading SortedDocValues

2016-01-09 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-6969: -- Description: While reading SortedDocValues, I am getting the following stacktrace: {noformat

[jira] [Updated] (LUCENE-6969) Exception in reading SortedDocValues

2016-01-09 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-6969: -- Description: While reading SortedDocValues, I am getting the following stacktrace: {noformat

[jira] [Created] (LUCENE-6969) Exception in reading SortedDocValues

2016-01-09 Thread John Wang (JIRA)
John Wang created LUCENE-6969: - Summary: Exception in reading SortedDocValues Key: LUCENE-6969 URL: https://issues.apache.org/jira/browse/LUCENE-6969 Project: Lucene - Core Issue Type: Bug

[jira] [Closed] (LUCENE-6969) Exception in reading SortedDocValues

2016-01-09 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang closed LUCENE-6969. - Resolution: Invalid False alarm. My index had multiple segments, and the docid sits on the border

Re: SortingLeafReader and IndexWriter.addIndexes

2015-12-16 Thread John Wang
Thanks Adrien! -John On Wed, Dec 16, 2015 at 1:09 AM, Adrien Grand <jpou...@gmail.com> wrote: > Hi John, > > Javadocs are outdated indeed. You can use SlowCodecReaderWrapper.wrap to > make the sorting reader implement the codec reader API. > > Le mer. 16 déc. 2015 à

SortingLeafReader and IndexWriter.addIndexes

2015-12-15 Thread John Wang
Hi folks: I am interested in using the SortingLeafReader to sort my index. According to examples, calling IndexWriter.addIndexes on the wrapper SortingLeafReader would do the trick. In the recent releases, IndexWriter.addIndexes api is now only taking a CodecReader. Is there another way to do

Lucene Benchmark

2014-09-24 Thread John Wang
Hi guys: Can you guys point me to some details on the Lucene Benchmark module? Specifically the grammar/syntax for the Algorithm files? Thanks -John

Re: Lucene Benchmark

2014-09-24 Thread John Wang
that it reports fancy tables which you can meet in performance optimization jiras. On Wed, Sep 24, 2014 at 10:45 AM, John Wang john.w...@gmail.com wrote: Hi guys: Can you guys point me to some details on the Lucene Benchmark module? Specifically the grammar/syntax for the Algorithm files

[jira] [Commented] (LUCENE-5495) Boolean Filter does not handle FilterClauses with only bits() implemented

2014-03-10 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13925923#comment-13925923 ] John Wang commented on LUCENE-5495: --- Hi Uwe: Looking at the Filter doc, I don't

[jira] [Commented] (LUCENE-5495) Boolean Filter does not handle FilterClauses with only bits() implemented

2014-03-10 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13926222#comment-13926222 ] John Wang commented on LUCENE-5495: --- Thanks Uwe for the feedback. In FilteredQuery

[jira] [Updated] (LUCENE-5488) FilteredQuery.explain does not honor FilterStrategy

2014-03-09 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5488: -- Attachment: LUCENE-5488.patch FilteredQuery.explain does not honor FilterStrategy

[jira] [Commented] (LUCENE-5488) FilteredQuery.explain does not honor FilterStrategy

2014-03-09 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13925353#comment-13925353 ] John Wang commented on LUCENE-5488: --- Thanks Lei for pointing this out. New patch

[jira] [Commented] (LUCENE-5495) Boolean Filter does not handle FilterClauses with only bits() implemented

2014-03-09 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13925466#comment-13925466 ] John Wang commented on LUCENE-5495: --- Thanks Lei! Comments addressed, see new patch

[jira] [Updated] (LUCENE-5495) Boolean Filter does not handle FilterClauses with only bits() implemented

2014-03-09 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5495: -- Attachment: LUCENE-5495.patch Boolean Filter does not handle FilterClauses with only bits

[jira] [Created] (LUCENE-5495) Boolean Filter does not handle FilterClauses with only bits() implemented

2014-03-06 Thread John Wang (JIRA)
John Wang created LUCENE-5495: - Summary: Boolean Filter does not handle FilterClauses with only bits() implemented Key: LUCENE-5495 URL: https://issues.apache.org/jira/browse/LUCENE-5495 Project: Lucene

[jira] [Updated] (LUCENE-5495) Boolean Filter does not handle FilterClauses with only bits() implemented

2014-03-06 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5495: -- Attachment: LUCENE-5495.patch Boolean Filter does not handle FilterClauses with only bits

[jira] [Updated] (LUCENE-5488) FilteredQuery.explain does not honor FilterStrategy

2014-03-06 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5488: -- Attachment: LUCENE-5488.patch FilteredQuery.explain does not honor FilterStrategy

[jira] [Commented] (LUCENE-5488) FilteredQuery.explain does not honor FilterStrategy

2014-03-06 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923174#comment-13923174 ] John Wang commented on LUCENE-5488: --- Thanks Michael. imports removed

[jira] [Comment Edited] (LUCENE-5488) FilteredQuery.explain does not honor FilterStrategy

2014-03-06 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13923174#comment-13923174 ] John Wang edited comment on LUCENE-5488 at 3/6/14 10:26 PM

[jira] [Created] (LUCENE-5488) FilteredQuery.explain does not honor FilterStrategy

2014-03-04 Thread John Wang (JIRA)
John Wang created LUCENE-5488: - Summary: FilteredQuery.explain does not honor FilterStrategy Key: LUCENE-5488 URL: https://issues.apache.org/jira/browse/LUCENE-5488 Project: Lucene - Core Issue

[jira] [Updated] (LUCENE-5488) FilteredQuery.explain does not honor FilterStrategy

2014-03-04 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5488: -- Attachment: LUCENE-5488.patch FilteredQuery.explain does not honor FilterStrategy

[jira] [Updated] (LUCENE-5446) ExtractWikipedia produces null strings

2014-02-14 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5446: -- Description: The ExtractWikipedia program produces null strings as output. This is because

[jira] [Commented] (LUCENE-5446) ExtractWikipedia produces null strings

2014-02-14 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13901533#comment-13901533 ] John Wang commented on LUCENE-5446: --- Maybe in ExtractWikipedia.extract(), we should

[jira] [Created] (LUCENE-5446) ExtractWikipedia produces null strings

2014-02-14 Thread John Wang (JIRA)
John Wang created LUCENE-5446: - Summary: ExtractWikipedia produces null strings Key: LUCENE-5446 URL: https://issues.apache.org/jira/browse/LUCENE-5446 Project: Lucene - Core Issue Type: Bug

Re: Getting ready for Lucene/Solr 4.7

2014-02-13 Thread John Wang
+1 -John On Thu, Feb 13, 2014 at 11:44 AM, Simon Willnauer simon.willna...@gmail.com wrote: I just looked through the CHANGE.TXT today and I think we should get 4.7 out of the door. I can do the release though if nobody wants to volunteer - I suggest to cut a release branch in next week

[jira] [Commented] (LUCENE-5426) Make SortedSetDocValuesReaderState customizable

2014-02-06 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13893469#comment-13893469 ] John Wang commented on LUCENE-5426: --- Thanks Michael! Can't wait for release of 4.7

[jira] [Commented] (LUCENE-5425) Make creation of FixedBitSet in FacetsCollector overridable

2014-02-05 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13892204#comment-13892204 ] John Wang commented on LUCENE-5425: --- Yes, this works great for us! Thanks Shai

[jira] [Commented] (LUCENE-5426) Make SortedSetDocValuesReaderState customizable

2014-02-05 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13893092#comment-13893092 ] John Wang commented on LUCENE-5426: --- Do you think this can make it to 4.7? Thanks

[jira] [Commented] (LUCENE-5425) Make creation of FixedBitSet in FacetsCollector overridable

2014-02-04 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13890911#comment-13890911 ] John Wang commented on LUCENE-5425: --- Hi Shai: Our motivation for generalization

[jira] [Commented] (LUCENE-5425) Make creation of FixedBitSet in FacetsCollector overridable

2014-02-03 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13889604#comment-13889604 ] John Wang commented on LUCENE-5425: --- I named it to newHitSet in case we do decide

[jira] [Commented] (LUCENE-5425) Make creation of FixedBitSet in FacetsCollector overridable

2014-02-01 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13888673#comment-13888673 ] John Wang commented on LUCENE-5425: --- Thanks Lei! Shai, given that using OBS Iterator

[jira] [Commented] (LUCENE-5428) Make Faceting counting array overridable

2014-02-01 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13888677#comment-13888677 ] John Wang commented on LUCENE-5428: --- Hi Shai: We are currently writing our own

[jira] [Commented] (LUCENE-5425) Make creation of FixedBitSet in FacetsCollector overridable

2014-02-01 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13888678#comment-13888678 ] John Wang commented on LUCENE-5425: --- Ok. I will revert to the previous patch applied

[jira] [Updated] (LUCENE-5425) Make creation of FixedBitSet in FacetsCollector overridable

2014-02-01 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5425: -- Attachment: fixbitset.patch Make creation of FixedBitSet in FacetsCollector overridable

[jira] [Commented] (LUCENE-5425) Make creation of FixedBitSet in FacetsCollector overridable

2014-02-01 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13888679#comment-13888679 ] John Wang commented on LUCENE-5425: --- Reverted patch attached. Make creation

[jira] [Commented] (LUCENE-5426) Make SortedSetDocValuesReaderState customizable

2014-01-31 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13887981#comment-13887981 ] John Wang commented on LUCENE-5426: --- This patch applies to 4.6 release branch. I

[jira] [Updated] (LUCENE-5426) Make SortedSetDocValuesReaderState customizable

2014-01-31 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5426: -- Attachment: sortedsetreaderstate.patch Make SortedSetDocValuesReaderState customizable

[jira] [Commented] (LUCENE-5426) Make SortedSetDocValuesReaderState customizable

2014-01-31 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13888104#comment-13888104 ] John Wang commented on LUCENE-5426: --- new patch diff'd against trunk. I incorporated

[jira] [Updated] (LUCENE-5426) Make SortedSetDocValuesReaderState customizable

2014-01-31 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5426: -- Attachment: sortedsetreaderstate.patch Make SortedSetDocValuesReaderState customizable

[jira] [Updated] (LUCENE-5426) Make SortedSetDocValuesReaderState customizable

2014-01-31 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5426: -- Attachment: (was: sortedsetreaderstate.patch) Make SortedSetDocValuesReaderState customizable

[jira] [Updated] (LUCENE-5426) Make SortedSetDocValuesReaderState customizable

2014-01-31 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5426: -- Attachment: (was: sortedsetreaderstate.patch) Make SortedSetDocValuesReaderState customizable

[jira] [Commented] (LUCENE-5425) Make creation of FixedBitSet in FacetsCollector overridable

2014-01-31 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13888135#comment-13888135 ] John Wang commented on LUCENE-5425: --- The API would be so much prettier if it returns

[jira] [Commented] (LUCENE-5425) Make creation of FixedBitSet in FacetsCollector overridable

2014-01-31 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13888213#comment-13888213 ] John Wang commented on LUCENE-5425: --- I realize if we make getting the docs abstract

[jira] [Commented] (LUCENE-5425) Make creation of FixedBitSet in FacetsCollector overridable

2014-01-31 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13888233#comment-13888233 ] John Wang commented on LUCENE-5425: --- new patch added: 1) created a MutableDocIdSet

[jira] [Updated] (LUCENE-5425) Make creation of FixedBitSet in FacetsCollector overridable

2014-01-31 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5425: -- Attachment: facetscollector.patch Make creation of FixedBitSet in FacetsCollector overridable

[jira] [Created] (LUCENE-5428) Make Faceting counting array overridable

2014-01-31 Thread John Wang (JIRA)
John Wang created LUCENE-5428: - Summary: Make Faceting counting array overridable Key: LUCENE-5428 URL: https://issues.apache.org/jira/browse/LUCENE-5428 Project: Lucene - Core Issue Type

[jira] [Updated] (LUCENE-5428) Make Faceting counting array overridable

2014-01-31 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5428: -- Attachment: facetcounter.patch Make Faceting counting array overridable

[jira] [Created] (LUCENE-5425) Make creation of FixedBitSet in FacetsCollector overridable

2014-01-30 Thread John Wang (JIRA)
John Wang created LUCENE-5425: - Summary: Make creation of FixedBitSet in FacetsCollector overridable Key: LUCENE-5425 URL: https://issues.apache.org/jira/browse/LUCENE-5425 Project: Lucene - Core

[jira] [Updated] (LUCENE-5425) Make creation of FixedBitSet in FacetsCollector overridable

2014-01-30 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5425: -- Attachment: facetscollector.patch Make creation of FixedBitSet in FacetsCollector overridable

[jira] [Updated] (LUCENE-5426) Make SortedSetDocValuesReaderState customizable

2014-01-30 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5426: -- Attachment: sortedsetreaderstate.patch Make SortedSetDocValuesReaderState customizable

[jira] [Created] (LUCENE-5426) Make SortedSetDocValuesReaderState customizable

2014-01-30 Thread John Wang (JIRA)
John Wang created LUCENE-5426: - Summary: Make SortedSetDocValuesReaderState customizable Key: LUCENE-5426 URL: https://issues.apache.org/jira/browse/LUCENE-5426 Project: Lucene - Core Issue Type

[jira] [Updated] (LUCENE-5426) Make SortedSetDocValuesReaderState customizable

2014-01-30 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5426: -- Attachment: sortedsetreaderstate.patch Make SortedSetDocValuesReaderState customizable

[jira] [Commented] (LUCENE-5426) Make SortedSetDocValuesReaderState customizable

2014-01-30 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13887438#comment-13887438 ] John Wang commented on LUCENE-5426: --- You are right. Re-attached. Make

[jira] [Commented] (LUCENE-5327) Expose getNumericDocValues and getBinaryDocValues at toplevel reader and searcher levels

2013-11-10 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13818392#comment-13818392 ] John Wang commented on LUCENE-5327: --- Perhaps I should have explained our usecase, which

[jira] [Commented] (LUCENE-5327) Expose getNumericDocValues and getBinaryDocValues at toplevel reader and searcher levels

2013-11-10 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13818402#comment-13818402 ] John Wang commented on LUCENE-5327: --- done, closed. Expose getNumericDocValues

[jira] [Closed] (LUCENE-5327) Expose getNumericDocValues and getBinaryDocValues at toplevel reader and searcher levels

2013-11-10 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang closed LUCENE-5327. - Resolution: Won't Fix Expose getNumericDocValues and getBinaryDocValues at toplevel reader

[jira] [Commented] (LUCENE-5327) Expose getNumericDocValues and getBinaryDocValues at toplevel reader and searcher levels

2013-11-09 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13818355#comment-13818355 ] John Wang commented on LUCENE-5327: --- Hi Uwe and Robert for the feedback. I like the API

[jira] [Commented] (LUCENE-5327) Expose getNumericDocValues and getBinaryDocValues at toplevel reader and searcher levels

2013-11-05 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13813973#comment-13813973 ] John Wang commented on LUCENE-5327: --- Oh, I didn't know about MultiDocValues. I am happy

[jira] [Created] (LUCENE-5327) Expose getNumericDocValues and getBinaryDocValues at toplevel reader and searcher levels

2013-11-04 Thread John Wang (JIRA)
John Wang created LUCENE-5327: - Summary: Expose getNumericDocValues and getBinaryDocValues at toplevel reader and searcher levels Key: LUCENE-5327 URL: https://issues.apache.org/jira/browse/LUCENE-5327

[jira] [Updated] (LUCENE-5327) Expose getNumericDocValues and getBinaryDocValues at toplevel reader and searcher levels

2013-11-04 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5327: -- Attachment: patch.diff Expose getNumericDocValues and getBinaryDocValues at toplevel reader

[jira] [Updated] (LUCENE-5328) Adding a AtomicReader implementation that supports injection of dynamic data at runtime

2013-11-04 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-5328: -- Attachment: patch.diff Adding a AtomicReader implementation that supports injection of dynamic data

Building a codec with terms of custom ComparatorBytesRef

2013-09-09 Thread John Wang
Hi guys: In the codec api, it seems you can set term order via an arbitrary ComparatorBytesRef. I tried to use this to create a term dictionary of an order dictated by my own ComparatorBytesRef. The problem arises when building the FST. Specifically BlockTreeTermsWriter.finishTerm() -

backward incompatibility with MockTokenFilter

2013-08-15 Thread John Wang
Hi folks: In release 4.3.1, MockTokenFilter has an api to turn on/off position increments, e.g. : set/getEnablePositionIncrements() In release 4.4.0 that was removed. And the default behavior in 4.4.0 is that it is assumed to be true. But I don't see this change documented or a migration

Re: lucene indexing and field configuration or schema

2013-08-04 Thread John Wang
, Adrien Grand jpou...@gmail.com wrote: Hi John, On Mon, Jun 10, 2013 at 8:17 PM, John Wang john.w...@gmail.com wrote: We found having the schema information up front allows us flexibilities in designing our posting list, also makes the indexingchain logic much simpler. Can you give

lucene indexing and field configuration or schema

2013-06-10 Thread John Wang
Hi folks: Solr has schemas that defined per field configuration for the entire corpus, whereas Lucene determines the information from each individual document. So on that level, it is inconsistent. We found having the schema information up front allows us flexibilities in designing our

[jira] [Commented] (LUCENE-4569) Allow customization of column stride field and norms via indexing chain

2013-06-03 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13673539#comment-13673539 ] John Wang commented on LUCENE-4569: --- Hi Simon: Looked at the latest code on trunk

[jira] [Resolved] (LUCENE-4569) Allow customization of column stride field and norms via indexing chain

2013-06-03 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang resolved LUCENE-4569. --- Resolution: Won't Fix Lucene Fields: (was: New,Patch Available) This seem to be fixed

[jira] [Commented] (LUCENE-4569) Allow customization of column stride field and norms via indexing chain

2013-05-30 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13671164#comment-13671164 ] John Wang commented on LUCENE-4569: --- Hey Simon: Was wondering if you had a chance

[jira] [Commented] (LUCENE-4992) ArrayOutOfBoundsException in BooleanScorer2

2013-05-12 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13655478#comment-13655478 ] John Wang commented on LUCENE-4992: --- makes senses, agreed

[jira] [Commented] (LUCENE-4992) ArrayOutOfBoundsException in BooleanScorer2

2013-05-11 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13655463#comment-13655463 ] John Wang commented on LUCENE-4992: --- Thanks Rob for the patch! This makes the api much

[jira] [Commented] (LUCENE-4992) ArrayOutOfBoundsException in BooleanScorer2

2013-05-10 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654822#comment-13654822 ] John Wang commented on LUCENE-4992: --- Thanks Rob for following up! I think the right

[jira] [Commented] (LUCENE-4992) ArrayOutOfBoundsException in BooleanScorer2

2013-05-10 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654857#comment-13654857 ] John Wang commented on LUCENE-4992: --- Oh! that's nice! We will just do that! Thanks

[jira] [Commented] (LUCENE-4992) ArrayOutOfBoundsException in BooleanScorer2

2013-05-10 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654940#comment-13654940 ] John Wang commented on LUCENE-4992: --- Hey Rob: Just verified your suggestion works

[jira] [Created] (LUCENE-4992) ArrayOutOfBoundsException in BooleanScorer2

2013-05-09 Thread John Wang (JIRA)
John Wang created LUCENE-4992: - Summary: ArrayOutOfBoundsException in BooleanScorer2 Key: LUCENE-4992 URL: https://issues.apache.org/jira/browse/LUCENE-4992 Project: Lucene - Core Issue Type

[jira] [Commented] (LUCENE-4992) ArrayOutOfBoundsException in BooleanScorer2

2013-05-09 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653074#comment-13653074 ] John Wang commented on LUCENE-4992: --- in DisjunctionSumScorer, line 96: private void

[jira] [Updated] (LUCENE-4992) ArrayOutOfBoundsException in BooleanScorer2

2013-05-09 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Wang updated LUCENE-4992: -- Attachment: patch.diff ArrayOutOfBoundsException in BooleanScorer2

[jira] [Commented] (LUCENE-4992) ArrayOutOfBoundsException in BooleanScorer2

2013-05-09 Thread John Wang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653083#comment-13653083 ] John Wang commented on LUCENE-4992: --- Hi Rob: I do have a test that reproduces

  1   2   3   4   5   >