Daniel P. Berrangé <berra...@redhat.com> writes: > On Tue, Aug 05, 2025 at 10:04:42AM +0100, Alex Bennée wrote: >> Daniel P. Berrangé <berra...@redhat.com> writes: >> >> > On Mon, Aug 04, 2025 at 05:29:59PM +0100, Alex Bennée wrote: >> >> We don't build the PDFs ourselves for the hosted docs and it looks >> >> like rtd can't manage building PDFs now they have gone over a certain >> >> size. Disable the extra formats so we can at least have the online >> >> stuff again. >> > >> > Regardless of build problems, IMHO, we should not have been building >> > the PDFs as no effort is being made to validate that the content is >> > formatting well under the layout constraints of PDFs >> >> True. >> >> I will say the one thing I have found PDFs good for is uploading the >> docs into a LLM context like NotebookLM. Otherwise you end up having to >> add individual links which a) is a pain and b) is a potential DDoS >> source if the model keeps hitting the host which as I'm sure everyone is >> aware is a problem for FLOSS archives at the moment. > > Is there a "single page HTML" option that would service that need ?
There is an htmlzip format but I think that is just a bundle of html files in a zip container. I'll see if there is an all one page option in the docs. > > In general PDFs are a pretty awful format for programatically > consuming text, because they have no logical content structure > like HTML docs, so I'd expect HTML is a better format to feed > into any tool either LLM or not. > > With regards, > Daniel -- Alex Bennée Virtualisation Tech Lead @ Linaro