Hello Rajib,
Check OffsetAttribute.

On Tue, Jan 30, 2024 at 1:31 PM Saha, Rajib <rajib.s...@sap.com.invalid>
wrote:

> Hi,
>
> In our project for Lucene migration from 2.4.0 to 8.11.2, we need your
> suggestion to address a case.
> With Lucene 2.4.0, we were using the kind of below code snippet.
> With Lucene 8.11.2[Written snippet below], we need to extract the
> startOffset & endOffset value for further some calculation similar to
> Lucene 2.4.0.
> Is there any easy way/API to extract the values from tokenStream?
>
> //Lucene 2.4.0
> ===========================================================================
> Token token;
> TokenStream valueStream = analyzer.tokenStream(new
> StringReader(fieldValue), false,true);
> while ((token = valueStream.next()) != null) {
>               int startOffset = token.startOffset();
>               int endOffset = token.endOffset();
>
>               //Do some calculation based on startOffset & endOffset
> }
>
> ============================================================================
>
>
> //Lucene 8.11.2
> ========================================================================
> TokenStream valueStream = analyzer.tokenStream(field, new
> StringReader(fieldValue));
> CharTermAttribute charValueTermAttribute =
> valueStream.addAttribute(CharTermAttribute.class);
> while (valueStream.incrementToken()) {
>               String termValueText = charValueTermAttribute.toString();
>
>               //How to get startOffset & endOffset as like in Lucene 2.4
>
>               //Do some calculation based on startOffset & endOffset
> }
>
> Please let me know, if there is any further information is required from
> my side.
>
> Regards
> Rajib
>


-- 
Sincerely yours
Mikhail Khludnev

Reply via email to