On Tue, Aug 05, 2025 at 10:04:42AM +0100, Alex Bennée wrote:
> Daniel P. Berrangé <berra...@redhat.com> writes:
> 
> > On Mon, Aug 04, 2025 at 05:29:59PM +0100, Alex Bennée wrote:
> >> We don't build the PDFs ourselves for the hosted docs and it looks
> >> like rtd can't manage building PDFs now they have gone over a certain
> >> size. Disable the extra formats so we can at least have the online
> >> stuff again.
> >
> > Regardless of build problems, IMHO, we should not have been building
> > the PDFs as no effort is being made to validate that the content is
> > formatting well under the layout constraints of PDFs
> 
> True.
> 
> I will say the one thing I have found PDFs good for is uploading the
> docs into a LLM context like NotebookLM. Otherwise you end up having to
> add individual links which a) is a pain and b) is a potential DDoS
> source if the model keeps hitting the host which as I'm sure everyone is
> aware is a problem for FLOSS archives at the moment.

Is there a "single page HTML" option that would service that need ?

In general PDFs are a pretty awful format for programatically
consuming text, because they have no logical content structure
like HTML docs, so I'd expect HTML is a better format to feed
into any tool either LLM or not.

With regards,
Daniel
-- 
|: https://berrange.com      -o-    https://www.flickr.com/photos/dberrange :|
|: https://libvirt.org         -o-            https://fstop138.berrange.com :|
|: https://entangle-photo.org    -o-    https://www.instagram.com/dberrange :|


Reply via email to