[ https://issues.apache.org/jira/browse/LUCY-295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15271805#comment-15271805 ]

ASF GitHub Bot commented on LUCY-295:
-------------------------------------

GitHub user rectang opened a pull request:

    https://github.com/apache/lucy/pull/47

    LUCY-295 Int widths for text/byte sizes

    Address -Wconversion warnings relating to string lengths, code point
    counts, and binary data lengths.
    
    In addition, minor refactoring of Inversion.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/rectang/lucy LUCY-295-text-sizes

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/lucy/pull/47.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #47
    
----
commit 2fe08b3686270d2367871b9425039c7b07b51d46
Author: Marvin Humphrey <[email protected]>
Date:   2016-05-02T23:32:31Z

    Refactor resizing of Inversion.

commit d6135b56c13257c044853aa5dd8fa16f1478018a
Author: Marvin Humphrey <[email protected]>
Date:   2016-05-02T23:41:36Z

    Disallow Token lengths over 2 GB.

commit 080c33acda88cb5d35a8a91541873b906ad38310
Author: Marvin Humphrey <[email protected]>
Date:   2016-05-04T01:59:04Z

    Fix end offsets for edge case highlight data.
    
    Under some circumstances (outside the most common code paths), the end
    offset for the last token in a field may have been too high, as a result
    of counting bytes rather than code points in UTF-8 source data.
    However, Highlighter only uses this data for heat mapping; it uses safe
    string iteration when actually choosing excerpt boundaries, and cannot
    overrun.

commit 619ec102499b0f7cc98006ce0ba71dd7f098f879
Author: Marvin Humphrey <[email protected]>
Date:   2016-05-04T02:17:49Z

    Change type to avoid integer promotion confusion.
    
    The compiler promotes `uint8_t` to `int` when performing bitwise
    operations, which then gets confusing when you assign to a `size_t`
    variable.  Avoid the whole mess by using `unsigned` instead of
    `uint8_t`.

commit a4b0b3b252f4bf253039756bad02b2fe80077114
Author: Marvin Humphrey <[email protected]>
Date:   2016-05-03T00:08:32Z

    Address -Wconversion for string/byte lengths.
    
    For text lengths, Unicode code point counts, and sometimes arbitrary
    byte lengths: add casts and address potential overflow issues with
    checks.

commit 5ba152510713a98c981e461e4102464a618f3807
Author: Marvin Humphrey <[email protected]>
Date:   2016-05-05T01:45:01Z

    Change width of size variables for RawPosting.

----
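
For readers unfamiliar with the promotion issue described in commit 619ec10, here is a minimal sketch in C. It is not the code from the commit; the function names `invert_narrow` and `invert_unsigned` are hypothetical and exist only to show how a `uint8_t` operand is promoted to `int` before a bitwise operator is applied, and what happens when the result is then converted to `size_t`.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper, not taken from the Lucy source.
 * `flags` is promoted to (signed) int before `~` is applied, so for
 * flags == 0x0F the intermediate result is the int value -16.
 * Converting that negative int to size_t wraps around, giving
 * SIZE_MAX - 15. */
size_t
invert_narrow(uint8_t flags) {
    return (size_t)~flags;
}

/* Hypothetical helper, not taken from the Lucy source.
 * With an `unsigned` operand no signed intermediate is involved:
 * ~0x0Fu is UINT_MAX - 15, and widening it to size_t keeps that
 * value unchanged. */
size_t
invert_unsigned(unsigned flags) {
    return (size_t)~flags;
}
```

On a platform where `size_t` is wider than `unsigned`, the two functions return different values for the same input, which is the kind of surprise that switching the variable to `unsigned` avoids.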


> Address integer conversion compiler warnings
> --------------------------------------------
>
>                 Key: LUCY-295
>                 URL: https://issues.apache.org/jira/browse/LUCY-295
>             Project: Lucy
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Marvin Humphrey
>            Assignee: Marvin Humphrey
>            Priority: Minor
>             Fix For: 0.6.0
>
>
> Enabling `-Wconversion` reveals a fairly large number of warnings.  Most are 
> harmless and just require an inconsequential type change or a cast.  Some are 
> best fixed with API changes. There may be some bugs.
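
As an illustration of the "cast plus check" style of fix mentioned above, here is a minimal sketch in C. It assumes a length that arrives as `size_t` and must be stored in an `int32_t`; the helper name `narrow_len` is hypothetical and does not come from the Lucy source. A bare cast would silence `-Wconversion` but could silently truncate, so the sketch checks the range first.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper, not taken from the Lucy source.
 * Narrow a size_t length to int32_t only when the value fits.
 * Returns false and leaves *out untouched when len exceeds
 * INT32_MAX (roughly 2 GB when len is a byte count). */
bool
narrow_len(size_t len, int32_t *out) {
    if (len > (size_t)INT32_MAX) {
        return false;
    }
    /* The explicit cast is safe here because the range was checked. */
    *out = (int32_t)len;
    return true;
}
```

A caller would typically treat a `false` return as an error rather than continuing with a truncated length.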



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
