This is an automated email from the ASF dual-hosted git repository.

tallison pushed a change to branch chardet-work
in repository https://gitbox.apache.org/repos/asf/tika.git


      at 534c72f369 chardet -

This branch includes the following new commits:

     new 5ba43d32cd Merge origin/main
     new efe463af88 TIKA-4662: add charset training data generation script
     new dfb587c82d TIKA-4662: train initial chardetect model (v1, 32 classes, 
65536 buckets)
     new fdb2392c4c TIKA-4662: retrain model v2 (28 classes), fix 
Shift_JIS/IBM424 false positive
     new 40ad000bfc TIKA-4662: add probe-length sweep and confusion matrix to 
EvalCharsetDetectors
     new ba5e8ea62c chardet - WIP
     new 33e9ce75f9 chardet - WIP
     new e884f9f8e3 chardet - WIP
     new 534c72f369 chardet -

The 9 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Reply via email to