Script 'mail_helper' called by obssrc
Hello community,

here is the log from the commit of package python-docx2txt for openSUSE:Factory checked in at 2025-04-10 22:00:00
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Comparing /work/SRC/openSUSE:Factory/python-docx2txt (Old)
 and /work/SRC/openSUSE:Factory/.python-docx2txt.new.1907 (New)
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Package is "python-docx2txt"

Thu Apr 10 22:00:00 2025 rev:2 rq:1268405 version:0.9

Changes:
--------
--- /work/SRC/openSUSE:Factory/python-docx2txt/python-docx2txt.changes 2024-08-21 23:27:26.933169800 +0200
+++ /work/SRC/openSUSE:Factory/.python-docx2txt.new.1907/python-docx2txt.changes 2025-04-10 22:00:55.624806243 +0200
@@ -1,0 +2,6 @@
+Mon Apr 7 10:49:07 UTC 2025 - John Paul Adrian Glaubitz <adrian.glaub...@suse.com>
+
+- Update to 0.9
+  * Updates to setup.cfg file
+
+-------------------------------------------------------------------

Old:
----
  docx2txt-0.8.tar.gz

New:
----
  docx2txt-0.9.tar.gz

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Other differences:
------------------
++++++ python-docx2txt.spec ++++++
--- /var/tmp/diff_new_pack.bm5vp2/_old 2025-04-10 22:00:56.196830132 +0200
+++ /var/tmp/diff_new_pack.bm5vp2/_new 2025-04-10 22:00:56.196830132 +0200
@@ -1,7 +1,7 @@
 #
 # spec file for package python-docx2txt
 #
-# Copyright (c) 2024 SUSE LLC
+# Copyright (c) 2025 SUSE LLC
 #
 # All modifications and additions to the file contributed by third parties
 # remain the property of their copyright owners, unless otherwise agreed
@@ -15,20 +15,21 @@
 # Please submit bugfixes or comments via https://bugs.opensuse.org/
 #
 
+
 %{?sle15_python_module_pythons}
 Name: python-docx2txt
-Version: 0.8
+Version: 0.9
 Release: 0
 Summary: A pure python-based utility to extract text and images from docx files
 License: MIT
 Group: Development/Languages/Python
 URL: https://github.com/ankushshah89/python-docx2txt
 Source: https://files.pythonhosted.org/packages/source/d/docx2txt/docx2txt-%{version}.tar.gz
-BuildRequires: python-rpm-macros
 BuildRequires: %{python_module pip}
 BuildRequires: %{python_module setuptools}
 BuildRequires: %{python_module wheel}
 BuildRequires: fdupes
+BuildRequires: python-rpm-macros
 BuildArch: noarch
 %python_subpackages
 

++++++ docx2txt-0.8.tar.gz -> docx2txt-0.9.tar.gz ++++++
diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/docx2txt-0.8/MANIFEST.in new/docx2txt-0.9/MANIFEST.in
--- old/docx2txt-0.8/MANIFEST.in 1970-01-01 01:00:00.000000000 +0100
+++ new/docx2txt-0.9/MANIFEST.in 2025-03-24 21:35:58.000000000 +0100
@@ -0,0 +1 @@
+include LICENSE.txt
diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/docx2txt-0.8/PKG-INFO new/docx2txt-0.9/PKG-INFO
--- old/docx2txt-0.8/PKG-INFO 2019-06-23 21:58:36.000000000 +0200
+++ new/docx2txt-0.9/PKG-INFO 2025-03-24 21:39:14.604485500 +0100
@@ -1,12 +1,17 @@
-Metadata-Version: 1.1
+Metadata-Version: 2.4
 Name: docx2txt
-Version: 0.8
+Version: 0.9
 Summary: A pure python-based utility to extract text and images from docx files.
 Home-page: https://github.com/ankushshah89/python-docx2txt
+Download-URL: https://github.com/ankushshah89/python-docx2txt/tarball/0.9
 Author: Ankush Shah
 Author-email: ankush.shah.n...@gmail.com
-License: UNKNOWN
-Download-URL: https://github.com/ankushshah89/python-docx2txt/tarball/0.8
-Description: UNKNOWN
 Keywords: python,docx,text,images,extract
-Platform: UNKNOWN
+License-File: LICENSE.txt
+Dynamic: author
+Dynamic: author-email
+Dynamic: download-url
+Dynamic: home-page
+Dynamic: keywords
+Dynamic: license-file
+Dynamic: summary
diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/docx2txt-0.8/README.md new/docx2txt-0.9/README.md
--- old/docx2txt-0.8/README.md 1970-01-01 01:00:00.000000000 +0100
+++ new/docx2txt-0.9/README.md 2025-03-24 21:35:58.000000000 +0100
@@ -0,0 +1,30 @@
+# python-docx2txt #
+
+A pure python-based utility to extract text from docx files.
+
+The code is taken and adapted from [python-docx](https://github.com/python-openxml/python-docx). It can however also extract text from header, footer and hyperlinks. __It can now also extract images.__
+
+## How to install? ##
+```bash
+pip install docx2txt
+```
+
+## How to run? ##
+
+a. From command line:
+```bash
+# extract text
+docx2txt file.docx
+# extract text and images
+docx2txt -i /tmp/img_dir file.docx
+```
+b. From python:
+```python
+import docx2txt
+
+# extract text
+text = docx2txt.process("file.docx")
+
+# extract text and write images in /tmp/img_dir
+text = docx2txt.process("file.docx", "/tmp/img_dir")
+```
diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/docx2txt-0.8/docx2txt/__init__.py new/docx2txt-0.9/docx2txt/__init__.py
--- old/docx2txt-0.8/docx2txt/__init__.py 2019-06-23 21:52:43.000000000 +0200
+++ new/docx2txt-0.9/docx2txt/__init__.py 2025-03-24 21:35:58.000000000 +0100
@@ -1,4 +1,4 @@
 from .docx2txt import process
 from .docx2txt import process_args
 
-VERSION = '0.8'
+VERSION = '0.9'
diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/docx2txt-0.8/docx2txt.egg-info/PKG-INFO new/docx2txt-0.9/docx2txt.egg-info/PKG-INFO
--- old/docx2txt-0.8/docx2txt.egg-info/PKG-INFO 1970-01-01 01:00:00.000000000 +0100
+++ new/docx2txt-0.9/docx2txt.egg-info/PKG-INFO 2025-03-24 21:39:14.000000000 +0100
@@ -0,0 +1,17 @@
+Metadata-Version: 2.4
+Name: docx2txt
+Version: 0.9
+Summary: A pure python-based utility to extract text and images from docx files.
+Home-page: https://github.com/ankushshah89/python-docx2txt
+Download-URL: https://github.com/ankushshah89/python-docx2txt/tarball/0.9
+Author: Ankush Shah
+Author-email: ankush.shah.n...@gmail.com
+Keywords: python,docx,text,images,extract
+License-File: LICENSE.txt
+Dynamic: author
+Dynamic: author-email
+Dynamic: download-url
+Dynamic: home-page
+Dynamic: keywords
+Dynamic: license-file
+Dynamic: summary
diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/docx2txt-0.8/docx2txt.egg-info/SOURCES.txt new/docx2txt-0.9/docx2txt.egg-info/SOURCES.txt
--- old/docx2txt-0.8/docx2txt.egg-info/SOURCES.txt 1970-01-01 01:00:00.000000000 +0100
+++ new/docx2txt-0.9/docx2txt.egg-info/SOURCES.txt 2025-03-24 21:39:14.000000000 +0100
@@ -0,0 +1,12 @@
+LICENSE.txt
+MANIFEST.in
+README.md
+setup.cfg
+setup.py
+bin/docx2txt
+docx2txt/__init__.py
+docx2txt/docx2txt.py
+docx2txt.egg-info/PKG-INFO
+docx2txt.egg-info/SOURCES.txt
+docx2txt.egg-info/dependency_links.txt
+docx2txt.egg-info/top_level.txt
\ No newline at end of file
diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/docx2txt-0.8/docx2txt.egg-info/dependency_links.txt new/docx2txt-0.9/docx2txt.egg-info/dependency_links.txt
--- old/docx2txt-0.8/docx2txt.egg-info/dependency_links.txt 1970-01-01 01:00:00.000000000 +0100
+++ new/docx2txt-0.9/docx2txt.egg-info/dependency_links.txt 2025-03-24 21:39:14.000000000 +0100
@@ -0,0 +1 @@
+
diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/docx2txt-0.8/docx2txt.egg-info/top_level.txt new/docx2txt-0.9/docx2txt.egg-info/top_level.txt
--- old/docx2txt-0.8/docx2txt.egg-info/top_level.txt 1970-01-01 01:00:00.000000000 +0100
+++ new/docx2txt-0.9/docx2txt.egg-info/top_level.txt 2025-03-24 21:39:14.000000000 +0100
@@ -0,0 +1 @@
+docx2txt
diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/docx2txt-0.8/setup.cfg new/docx2txt-0.9/setup.cfg
--- old/docx2txt-0.8/setup.cfg 2017-11-25 18:39:38.000000000 +0100
+++ new/docx2txt-0.9/setup.cfg 2025-03-24 21:39:14.604766800 +0100
@@ -1,2 +1,7 @@
 [metadata]
-description-file = README.md
+description_file = README.md
+
+[egg_info]
+tag_build =
+tag_date = 0
+
diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/docx2txt-0.8/setup.py new/docx2txt-0.9/setup.py
--- old/docx2txt-0.8/setup.py 2019-06-23 21:53:07.000000000 +0200
+++ new/docx2txt-0.9/setup.py 2025-03-24 21:35:58.000000000 +0100
@@ -7,13 +7,13 @@
 setup(
     name='docx2txt',
     packages=['docx2txt'],
-    version='0.8',
+    version='0.9',
     description='A pure python-based utility to extract text and images '
                 'from docx files.',
     author='Ankush Shah',
     author_email='ankush.shah.n...@gmail.com',
     url='https://github.com/ankushshah89/python-docx2txt',
-    download_url='https://github.com/ankushshah89/python-docx2txt/tarball/0.8',
+    download_url='https://github.com/ankushshah89/python-docx2txt/tarball/0.9',
     keywords=['python', 'docx', 'text', 'images', 'extract'],
     scripts=scripts,
     classifiers=[],