Hi,

>
> I'd like to, but the way anarchism 15 is being uploaded right now makes
> me really feel uncomfortable:

I think what really is missing is to change the Debian package README to
clarify some points.
Please, have a look to the just pushed README [0] in "upstream".
We thought it could be a good idea to have separated the script to
download automatically (and even in real time running it in a box) the
authors' original and the original html extracted from the script plus
conversions to other formats. The repository with the html would be the
Debian upstream and the repository with scraper should be also packaged
for Debian.

>
> - there's tons of trailing whitespaces (at least in debian/changelog and
>   some html files).
>   If I open my editor on a file, I end up changing tons of lines. Shall
>   I use sed to change a typo, or shall you fix your whitespaces first?
>

The trailing whitespaces come from the original HTML.
Personally, i would not modify the original HTML, but we could include a
patch in the Debian package to do so.

> - the .html is not anymore readable by a human, it's just a bunch of
>   parsed content.

Hmm, you mean the HTML code or what is rendered in a browser?.
Usually people do not read HTML code, i guess that is why the Debian
package generated txt, and for more readable formats we have Markdown,
that is why is generated by the scraper.
The HTML displayed in a browser should be readable as in the original
Website. If it is not, then maybe there is some bug in the scraper.

>   Do I really have to create a patch of (at least) 8K just to add a
>   letter?

This is a good question, the idea with the separate scraper and source
repositories was that people could fix the Markdown directly in that
repository, but then there should probably be other Markdown directory
or branch, to do not end up overwrotten by the conversion from the HTML
but possible to merge and use that one for the Debian package.

>
> - there's now multiple directories having the same content, instead of
>   just one html/ from where to get all other file formats.
>   How am I supposed to fix this typo? Do I have to fix it in all 3
>   places?

The scraper generate the markdown and txt directories. Having markdown,
txt should probably not exists, it comes from the previous Debian script.

>
>   Even if I wanted to fix the problem at the root, the script you used
>   to generate it's not in the package itself, and is *big*.

Yeah, sorry, it has not been packaged yet, but the idea is to package it
independently of the HTML.

>   I'm not even sure anymore where I am contributing now.
>
>   Before anarchism 15 was released I tried to help a bit myself, and
>   ended up with the attached relevant debian/ files.
>   My solution has clearly more little errors than yours, but at least
>   anybody can reprocude it.

You can not reproduce the HTML, Markdown and txt from the scraper?. It
should, that was the main idea.
The "source" HTML used to generated the older Debian versions, was never
in any repository nor there was an script to download it from the
original Website.

Maybe you can find something helpful from
>   there to help me fix this issue.
>

Ok, i will look those files. It would be great if you look at the new
README.
A process like the following seems good to me:
1. the scraper obtain the HTML from the original Website
2. the scraper push the HTML to other repository, together with the
generated Markdown
3. Debian package uses the generated "source" as upstream.

Probably what is missing, as said above, is to have a separated Markdown
directory where to push any content change that does not reflect the
original and specify clearly that that is the directory where to fix
typos and content. At least would be easy to see differences with the
original and new original versions.

Thanks for your comments,
ju.

[0] https://0xacab.org/ju/afaq/blob/master/README.md

Reply via email to