On Sun, Sep 20, 2020 at 1:54 PM Gavin Smith <[email protected]> wrote: > > Does anybody know of a way to test lots of Texinfo files, e.g. > everything linked from https://www.gnu.org/manual/? What I'd like to do > is test all of these files with the Texinfo XS parser enabled and also > with it disabled, and see what if any differences there are.
I went through all of the pages on that page and manually downloaded what I could. I have about 200 manuals that I've tested. It's an ad hoc collection and not all of them include all the necessary files, so running with --force and --no-validate is necessary. It's been worthwhile, though, and brought up some important bugs. I've been through them finding differences for Info output (with a script), and have moved on to HTML output now. Most of the manuals did not encounter issues, which is encouraging. There are many manuals that I couldn't get easily as there wasn't a link to download the Texinfo sources on the website (where there was, this was always because the web manuals were generated with the gendocs script). It would be a never-ending project to try to get everything and the dependencies for all the manuals, but if there are other manuals that are important to check that aren't listed below, you could let me know if you know how to get hold of them. The total corpus that I've collected so far is 254 MB, so too big to share in an email. If anybody is interested in accessing the corpus, though, let me know and we can discuss how we could share the files. 8sync accounting acm alive anubis aris artanis auctex autoconf autogen automake bash bc bison bpel2owfn cflow c-graph combine complexity coreutils cpio datamash dc dico diffutils direvent eintr emacs-manual find fog fontopia gama gawk gawkinet gdbm gettext g-golf gnats gnuastro gnubg gnubik gnuchess gnudos gnuit gnulib gnun gnu-pw-mgr gperf grep groff grub grub-dev gsasl gsrc gss gst gst-base gst-libs guile guile-cv guile-gnome-atk guile-gnome-clutter guile-gnome-clutter-glx guile-gnome-corba guile-gnome-gconf guile-gnome-gdk guile-gnome-glib guile-gnome-gnome-vfs guile-gnome-gobject guile-gnome-gtk guile-gnome-libglade guile-gnome-libgnome guile-gnome-libgnomecanvas guile-gnome-libgnomeui guile-gnome-pango guile-gnome-pangocairo guile-ncurses guile-opengl guile-rpc guile-sdl gv gwl gzip hello hurd idutils indent inetutils Libdbh libextractor libidn libmicrohttpd LibreDWG librejs libtool libunistring lightning liquidwar6 m4 mailutils maintain make mcron mdk mes mino misc mpria mtools multiboot muse orgadoc parted plotutils preview-latex prime proxyknife pspp pyconfigure radius rcs reftex remotecontrol rottlog rush screen sed serveez sharutils shepherd sqltutor standards stow tar termcap teseq tput txinfsnp units vc-dwim wdiff websocket4j web-trans wget xboard (extracted from the emacs sources): misc/auth.texi misc/autotype.texi misc/bovine.texi misc/calc.texi misc/cc-mode.texi misc/cl.texi misc/dbus.texi misc/dired-x.texi misc/ebrowse.texi misc/ede.texi misc/ediff.texi misc/edt.texi misc/efaq.texi misc/efaq-w32.texi misc/eieio.texi misc/emacs-gnutls.texi misc/emacs-mime.texi misc/epa.texi misc/erc.texi misc/ert.texi misc/eshell.texi misc/eudc.texi misc/eww.texi misc/flymake.texi misc/forms.texi misc/gnus-coding.texi misc/gnus-faq.texi misc/gnus.texi misc/htmlfontify.texi misc/idlwave.texi misc/ido.texi misc/info.texi misc/mairix-el.texi misc/message.texi misc/mh-e.texi misc/newsticker.texi misc/nxml-mode.texi misc/octave-mode.texi misc/org.texi misc/pcl-cvs.texi misc/pgg.texi misc/rcirc.texi misc/reftex.texi misc/remember.texi misc/sasl.texi misc/sc.texi misc/semantic.texi misc/sem-user.texi misc/ses.texi misc/sieve.texi misc/smtpmail.texi misc/speedbar.texi misc/srecode.texi misc/todo-mode.texi misc/tramp.texi misc/trampver.texi misc/url.texi misc/vhdl-mode.texi misc/viper.texi misc/vip.texi misc/widget.texi misc/wisent.texi misc/woman.texi
