commit python-charset-normalizer for openSUSE:Factory

Source-Sync Wed, 27 Oct 2021 13:21:30 -0700

Script 'mail_helper' called by obssrc
Hello community,

here is the log from the commit of package python-charset-normalizer for 
openSUSE:Factory checked in at 2021-10-27 22:21:05
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Comparing /work/SRC/openSUSE:Factory/python-charset-normalizer (Old)
 and      /work/SRC/openSUSE:Factory/.python-charset-normalizer.new.1890 (New)
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Package is "python-charset-normalizer"

Wed Oct 27 22:21:05 2021 rev:8 rq:927599 version:2.0.7

Changes:
--------
--- 
/work/SRC/openSUSE:Factory/python-charset-normalizer/python-charset-normalizer.changes
      2021-05-20 19:25:59.841711911 +0200
+++ 
/work/SRC/openSUSE:Factory/.python-charset-normalizer.new.1890/python-charset-normalizer.changes
    2021-10-27 22:21:17.695201405 +0200
@@ -1,0 +2,141 @@
+Sun Oct 17 14:01:59 UTC 2021 - Martin Hauke <[email protected]>
+
+- Update to version 2.0.7
+  * Addition: bento Add support for Kazakh (Cyrillic) language
+    detection
+  * Improvement: sparkle Further improve inferring the language
+    from a given code page (single-byte).
+  * Removed: fire Remove redundant logging entry about detected
+    language(s).
+  * Improvement: zap Refactoring for potential performance
+    improvements in loops.
+  * Improvement: sparkles Various detection improvement (MD+CD).
+  * Bugfix: bug Fix a minor inconsistency between Python 3.5 and
+    other versions regarding language detection.
+- Update to version 2.0.6
+  * Bugfix: bug Unforeseen regression with the loss of the
+    backward-compatibility with some older minor of Python 3.5.x.
+  * Bugfix: bug Fix CLI crash when using --minimal output in
+    certain cases.
+  * Improvement: sparkles Minor improvement to the detection
+    efficiency (less than 1%).
+- Update to version 2.0.5
+  * Improvement: sparkles The BC-support with v1.x was improved,
+    the old staticmethods are restored.
+  * Remove: fire The project no longer raise warning on tiny
+    content given for detection, will be simply logged as warning
+    instead.
+  * Improvement: sparkles The Unicode detection is slightly
+    improved, see #93
+  * Bugfix: bug In some rare case, the chunks extractor could cut
+    in the middle of a multi-byte character and could mislead the
+    mess detection.
+  * Bugfix: bug Some rare 'space' characters could trip up the
+    UnprintablePlugin/Mess detection.
+  * Improvement: art Add syntax sugar __bool__ for results
+    CharsetMatches list-container.
+- Update to version 2.0.4
+  * Improvement: sparkle Adjust the MD to lower the sensitivity,
+    thus improving the global detection reliability.
+  * Improvement: sparkle Allow fallback on specified encoding
+    if any.
+  * Bugfix: bug The CLI no longer raise an unexpected exception
+    when no encoding has been found.
+  * Bugfix: bug Fix accessing the 'alphabets' property when the
+    payload contains surrogate characters.
+  * Bugfix: bug pencil2 The logger could mislead (explain=True) on
+    detected languages and the impact of one MBCS match (in #72)
+  * Bugfix: bug Submatch factoring could be wrong in rare edge
+    cases (in #72)
+  * Bugfix: bug Multiple files given to the CLI were ignored when
+    publishing results to STDOUT. (After the first path) (in #72)
+  * Internal: art Fix line endings from CRLF to LF for certain
+    files.
+- Update to version 2.0.3
+  * Improvement: sparkles Part of the detection mechanism has been
+    improved to be less sensitive, resulting in more accurate
+    detection results. Especially ASCII. #63 Fix #62
+  * Improvement: sparklesAccording to the community wishes, the
+    detection will fall back on ASCII or UTF-8 in a last-resort
+    case.
+- Update to version 2.0.2
+  * Bugfix: bug Empty/Too small JSON payload miss-detection fixed.
+  * Improvement: sparkler Don't inject unicodedata2 into sys.modules
+- Update to version 2.0.1
+  * Bugfix: bug Make it work where there isn't a filesystem
+    available, dropping assets frequencies.json.
+  * Improvement: sparkles You may now use aliases in cp_isolation
+    and cp_exclusion arguments.
+  * Bugfix: bug Using explain=False permanently disable the verbose
+    output in the current runtime #47
+  * Bugfix: bug One log entry (language target preemptive) was not
+    show in logs when using explain=True #47
+  * Bugfix: bug Fix undesired exception (ValueError) on getitem of
+    instance CharsetMatches #52
+  * Improvement: wrench Public function normalize default args
+    values were not aligned with from_bytes #53
+- Update to version 2.0.0
+  * Performance: zap 4x to 5 times faster than the previous 1.4.0
+    release.
+  * Performance: zap At least 2x faster than Chardet.
+  * Performance: zap Accent has been made on UTF-8 detection,
+    should perform rather instantaneous.
+  * Improvement: back The backward compatibility with Chardet has
+    been greatly improved. The legacy detect function returns an
+    identical charset name whenever possible.
+  * Improvement: sparkle The detection mechanism has been slightly
+    improved, now Turkish content is detected correctly (most of
+    the time)
+  * Code: art The program has been rewritten to ease the
+    readability and maintainability. (+Using static typing)
+  * Tests: heavy_check_mark New workflows are now in place to
+    verify the following aspects: Performance, Backward-
+    Compatibility with Chardet, and Detection Coverage in addition#
+    to currents tests. (+CodeQL)
+  * Dependency: heavy_minus_sign This package no longer require
+    anything when used with Python 3.5 (Dropped cached_property)
+  * Docs: pencil2 Performance claims have been updated, the guide
+    to contributing, and the issue template.
+  * Improvement: sparkle Add --version argument to CLI
+  * Bugfix: bug The CLI output used the relative path of the
+    file(s). Should be absolute.
+  * Deprecation: red_circle Methods coherence_non_latin, w_counter,
+    chaos_secondary_pass of the class CharsetMatch are now
+    deprecated and scheduled for removal in v3.0
+  * Improvement: sparkle If no language was detected in content,
+    trying to infer it using the encoding name/alphabets used.
+  * Removal: fire Removed support for these languages: Catalan,
+    Esperanto, Kazakh, Baque, Volap??k, Azeri, Galician, Nynorsk,
+    Macedonian, and Serbocroatian.
+  * Improvement: sparkle utf_7 detection has been reinstated.
+  * Removal: fire The exception hook on UnicodeDecodeError has
+    been removed.
+- Update to version 1.4.1
+  * Improvement: art Logger configuration/usage no longer
+    conflict with others #44
+- Update to version 1.4.0
+  * Dependency: heavy_minus_sign Using standard logging instead
+    of using the package loguru.
+  * Dependency: heavy_minus_sign Dropping nose test framework in
+    favor of the maintained pytest.
+  * Dependency: heavy_minus_sign Choose to not use dragonmapper
+    package to help with gibberish Chinese/CJK text.
+  * Dependency: wrench heavy_minus_sign Require cached_property
+    only for Python 3.5 due to constraint. Dropping for every
+    other interpreter version.
+  * Bugfix: bug BOM marker in a CharsetNormalizerMatch instance
+    could be False in rare cases even if obviously present. Due
+    to the sub-match factoring process.
+  * Improvement: sparkler Return ASCII if given sequences fit.
+  * Performance: zap Huge improvement over the larges payload.
+  * Change: fire Stop support for UTF-7 that does not contain a
+    SIG. (Contributions are welcome to improve that point)
+  * Feature: sparkler CLI now produces JSON consumable output.
+  * Dependency: Dropping PrettyTable, replaced with pure JSON
+    output.
+  * Bugfix: bug Not searching properly for the BOM when trying
+    utf32/16 parent codec.
+  * Other: zap Improving the package final size by compressing
+    frequencies.json.
+
+-------------------------------------------------------------------

Old:
----
  charset_normalizer-1.3.9.tar.gz

New:
----
  charset_normalizer-2.0.7.tar.gz

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Other differences:
------------------
++++++ python-charset-normalizer.spec ++++++
--- /var/tmp/diff_new_pack.YWuQoY/_old  2021-10-27 22:21:18.179201653 +0200
+++ /var/tmp/diff_new_pack.YWuQoY/_new  2021-10-27 22:21:18.183201656 +0200
@@ -19,14 +19,13 @@
 %{?!python_module:%define python_module() python-%{**} python3-%{**}}
 %define skip_python2 1
 Name:           python-charset-normalizer
-Version:        1.3.9
+Version:        2.0.7
 Release:        0
 Summary:        Python Universal Charset detector
 License:        MIT
 URL:            https://github.com/ousret/charset_normalizer
-Source:         
https://files.pythonhosted.org/packages/source/c/charset_normalizer/charset_normalizer-%{version}.tar.gz
+Source:         
https://github.com/Ousret/charset_normalizer/archive/refs/tags/%{version}.tar.gz#/charset_normalizer-%{version}.tar.gz
 BuildRequires:  %{python_module setuptools}
-BuildRequires:  dos2unix
 BuildRequires:  fdupes
 BuildRequires:  python-rpm-macros
 Requires:       python-PrettyTable
@@ -45,6 +44,7 @@
 BuildRequires:  %{python_module cached-property >= 1.5}
 BuildRequires:  %{python_module dragonmapper >= 0.2}
 BuildRequires:  %{python_module loguru >= 0.5}
+BuildRequires:  %{python_module pytest-cov}
 BuildRequires:  %{python_module pytest}
 BuildRequires:  %{python_module zhon}
 # /SECTION
@@ -55,8 +55,6 @@
 
 %prep
 %setup -q -n charset_normalizer-%{version}
-dos2unix README.md
-chmod a-x charset_normalizer/assets/frequencies.json
 
 %build
 %python_build
@@ -79,6 +77,6 @@
 %doc README.md
 %license LICENSE
 %python_alternative %{_bindir}/normalizer
-%{python_sitelib}/*
+%{python_sitelib}/charset_normalizer*
 
 %changelog

++++++ charset_normalizer-1.3.9.tar.gz -> charset_normalizer-2.0.7.tar.gz ++++++
++++ 12455 lines of diff (skipped)

commit python-charset-normalizer for openSUSE:Factory

Reply via email to