Daniel P. Berrangé <berra...@redhat.com> writes:

> On Tue, Aug 05, 2025 at 10:04:42AM +0100, Alex Bennée wrote:
>> Daniel P. Berrangé <berra...@redhat.com> writes:
>> 
>> > On Mon, Aug 04, 2025 at 05:29:59PM +0100, Alex Bennée wrote:
>> >> We don't build the PDFs ourselves for the hosted docs and it looks
>> >> like rtd can't manage building PDFs now they have gone over a certain
>> >> size. Disable the extra formats so we can at least have the online
>> >> stuff again.
>> >
>> > Regardless of build problems, IMHO, we should not have been building
>> > the PDFs as no effort is being made to validate that the content is
>> > formatting well under the layout constraints of PDFs
>> 
>> True.
>> 
>> I will say the one thing I have found PDFs good for is uploading the
>> docs into a LLM context like NotebookLM. Otherwise you end up having to
>> add individual links which a) is a pain and b) is a potential DDoS
>> source if the model keeps hitting the host which as I'm sure everyone is
>> aware is a problem for FLOSS archives at the moment.
>
> Is there a "single page HTML" option that would service that need ?

There is an htmlzip format but I think that is just a bundle of html
files in a zip container. I'll see if there is an all one page option in
the docs.

>
> In general PDFs are a pretty awful format for programatically
> consuming text, because they have no logical content structure
> like HTML docs, so I'd expect HTML is a better format to feed
> into any tool either LLM or not.
>
> With regards,
> Daniel

-- 
Alex Bennée
Virtualisation Tech Lead @ Linaro

Reply via email to