On Tue, Aug 05, 2025 at 10:04:42AM +0100, Alex Bennée wrote: > Daniel P. Berrangé <berra...@redhat.com> writes: > > > On Mon, Aug 04, 2025 at 05:29:59PM +0100, Alex Bennée wrote: > >> We don't build the PDFs ourselves for the hosted docs and it looks > >> like rtd can't manage building PDFs now they have gone over a certain > >> size. Disable the extra formats so we can at least have the online > >> stuff again. > > > > Regardless of build problems, IMHO, we should not have been building > > the PDFs as no effort is being made to validate that the content is > > formatting well under the layout constraints of PDFs > > True. > > I will say the one thing I have found PDFs good for is uploading the > docs into a LLM context like NotebookLM. Otherwise you end up having to > add individual links which a) is a pain and b) is a potential DDoS > source if the model keeps hitting the host which as I'm sure everyone is > aware is a problem for FLOSS archives at the moment.
Is there a "single page HTML" option that would service that need ? In general PDFs are a pretty awful format for programatically consuming text, because they have no logical content structure like HTML docs, so I'd expect HTML is a better format to feed into any tool either LLM or not. With regards, Daniel -- |: https://berrange.com -o- https://www.flickr.com/photos/dberrange :| |: https://libvirt.org -o- https://fstop138.berrange.com :| |: https://entangle-photo.org -o- https://www.instagram.com/dberrange :|