Am 21. August 2018 16:13:32 MESZ schrieb Andrei Popenta <andrei.pope...@visma.com>: >Hi, > >thank you guys, I managed to get everything working. > >I've noticed one issue with parseUbuntu.py - if the file is encoded in >UTF-8, parseUbuntu.py will not be able to identify all packages and in >this >case the resulting xml will not have packages for >( USN-3728-1,USN-3729-1,USN-3731-1,USN-3736-1,USN-3739-1 and >USN-3745-1) - >I've tested on the announcements from August only. >To fix this, before running parseUbuntu.py I converted the downloaded >file >to ascii. > >Hope this will help anybody else who encountered this issue. > >Andrei > > >On Tue, 14 Aug 2018 at 18:11, Thomas Müller <4thomasmuel...@gmail.com> >wrote: > >> not with the html2text package, but with python-html2text from >centos7 >> repo I had this issue too >> >> Von meinem iPad gesendet >> >> Thomas Müller >> >> >> >> > Am 14.08.2018 um 17:05 schrieb Robert Paschedag ><robert.pasche...@web.de >> >: >> > >> > Am 14. August 2018 16:41:53 MESZ schrieb "Thomas Müller" < >> 4thomasmuel...@gmail.com>: >> >> Hi Andrei, >> >> >> >> This package parses not correct. >> >> Take this: >> >> >> >https://centos.pkgs.org/7/nux-dextop-x86_64/html2text-1.3.2a-14.el7.nux.x86_64.rpm.html >> >> >> >> Von meinem iPad gesendet >> >> >> >> Thomas Müller >> >> >> >> >> >> >> >>> Am 14.08.2018 um 16:32 schrieb Andrei Popenta >> >> <andrei.pope...@visma.com>: >> >>> >> >>> Hi Robert, >> >>> >> >>> Thanks for your reply. >> >>> I tried the process but it's not quite working for me. >> >>> >> >>> I've executed getDebianAnnouncements.py which downloaded the >> >> announcements in /tmp/debian_security/ >> >>> Next I run parseDebian.py which outputs: >> >>> Processing patch 1/542 (file: 2018-msg00099.html) >> >>> Message not parseable >> >>> [...] >> >>> Processing patch 542/542 (file: 2018-msg00035.html) >> >>> Message not parseable >> >>> No security announcements available or parseable. Bye. >> >>> >> >>> Here is the content of one of the messages(2018-msg00200.html): >> >> https://pastebin.com/ktinBPLb >> >>> Has my html file been converted as expected or not? >> >>> I have python2-html2text version 2016.9.19 release 1.el7 >installed. >> >>> >> >>> Thanks, >> >>> Andrei >> >>> >> >>> >> >>>> On Mon, 13 Aug 2018 at 17:39, Robert Paschedag >> >> <robert.pasche...@web.de> wrote: >> >>>> >> >>>> Sorry.....copy and paste error. >> >>>> >> >>>> The link to @phil site is http://www.devops-blog.net/ >> >>>> >> >>>> Look for spacewalk tags. >> >>>> >> >>>> Robert >> >>>> >> >>>> Gesendet: Montag, 13. August 2018 um 16:31 Uhr >> >>>> Von: "Robert Paschedag" <robert.pasche...@web.de> >> >>>> An: spacewalk-list@redhat.com >> >>>> Cc: spacewalk-list@redhat.com >> >>>> Betreff: Aw: [Spacewalk-list] debian errata to spacewalk >> >>>> >> >>>> Based on the excellent work of @philicious (and others, of >course) >> >> (see https://github.com/rpasche/spacewalk-scripts), >> >>>> I was able to create several scripts to create erratas for >debian. >> >>>> >> >>>> See my repo (or @philicious) at >> >> https://github.com/rpasche/spacewalk-scripts (forked from @phil) >or >> >> https://github.com/philicious/spacewalk-scripts >> >>>> >> >>>> getDebianAnnouncements.py: This parses the HTML files from >> >> security.debian.com and saves them for later parsing >> >>>> parseDebian.py: parses the announcements and creates one XML >file >> >> with errata structure. >> >>>> errata-import.py: uses the XML file to finally create "errata" >> >> within spacewalk and link the packages. >> >>>> >> >>>> Edit: I just saw, that @phils repo is more up-to-date as I sent >him >> >> my "debian" scripts so there is *one* repo with all >> >>>> "debian/ubuntu" specific stuff. >> >>>> >> >>>> Robert >> >>>> >> >>>> Gesendet: Montag, 13. August 2018 um 15:30 Uhr >> >>>> Von: "Andrei Popenta" <andrei.pope...@visma.com> >> >>>> An: spacewalk-list@redhat.com >> >>>> Betreff: [Spacewalk-list] debian errata to spacewalk >> >>>> Hi guys, >> >>>> >> >>>> I am interested to know what methods do you use to get debian >errata >> >> into spacewalk. >> >>>> >> >>>> I was relying on the xml and script provided by >> >> http://oserrata.atix.de/ and http://defs.steve-meier.de/ , but the >xml >> >> hasn't been updated since 31st of May. >> >>>> >> >>>> debian doesn't publish the package name in their DSA emails, so >I >> >> was wondering how can I identify the right packages affected by >> >> security updates in order to write a script for errata import. >> >>>> >> >>>> regards, >> >>>> Andrei >> >>>> _______________________________________________ Spacewalk-list >> >> mailing list Spacewalk-list@redhat.com >> >> https://www.redhat.com/mailman/listinfo/spacewalk-list >> >>>> _______________________________________________ >> >>>> Spacewalk-list mailing list >> >>>> Spacewalk-list@redhat.com >> >>>> https://www.redhat.com/mailman/listinfo/spacewalk-list >> >>> _______________________________________________ >> >>> Spacewalk-list mailing list >> >>> Spacewalk-list@redhat.com >> >>> https://www.redhat.com/mailman/listinfo/spacewalk-list >> > >> > Huh? >> > >> > Did not expect to have problems with html2text. >> > >> > Robert >> > -- >> > sent from my mobile device >> >> _______________________________________________ >> Spacewalk-list mailing list >> Spacewalk-list@redhat.com >> https://www.redhat.com/mailman/listinfo/spacewalk-list
Feel free to create a pull request at https://github.com/philicious/spacewalk-scripts So everybody will gain from this. Robert -- sent from my mobile device _______________________________________________ Spacewalk-list mailing list Spacewalk-list@redhat.com https://www.redhat.com/mailman/listinfo/spacewalk-list