This is an automated email from the ASF dual-hosted git repository.

hansbrende pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/any23.git.


    from 65d1a30  Merge pull request #134 from davidcockbill/ANY23-422
     add d64dac9  ANY23-418 improve TikaEncodingDetector
     add de8c887  Oops -- minor fixup
     add e9f11b4  ANY23-418 add additional unit tests
     add f0df840  simplifications, implemented more robust UTF-8 state machine
     add e355fb9  small improvement
     add 44a1255  use ternary operator to reduce verbosity
     add f25f805  reset state to valid on invalid
     add 94e700d  further improvements & simplifications
     add 58ff4e1  minor refactor
     add 74c0af9  additional simplifications and refinements
     add a500375  simplify utf8 byte statistics by including f8 artifact
     add c9b7706  minor refactor
     add 320be6a  Merge branch 'master' into ANY23-418
     add dce3c09  ANY23-418 update f8 artifact, cleanup
     new df18147  Merge branch 'master' into ANY23-418
     new e9c001f  ANY23-418 update NOTICE.txt

The 2 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 NOTICE.txt                                         |   5 +-
 encoding/pom.xml                                   |   6 +
 .../org/apache/any23/encoding/EncodingUtils.java   | 340 +++++++++++++++++++++
 .../any23/encoding/TikaEncodingDetector.java       | 261 +++++++++++-----
 .../any23/encoding/TikaEncodingDetectorTest.java   |  43 ++-
 5 files changed, 567 insertions(+), 88 deletions(-)
 create mode 100644 encoding/src/main/java/org/apache/any23/encoding/EncodingUtils.java
