[Wikitech-l] Re: My idea about wikipage parser (shared link on the article)

2022-09-03 Thread Dušan Kreheľ
Hi all.

I must rewrite the article, as the another style of the article, or
the format. The informations are okey..

Dušan Kreheľ

2022-09-03 14:19 GMT+02:00, Dušan Kreheľ :
> Hi Thiemo.
>
> I updated the document and the my basic idea is marked in text as
> bold. Or do You like the separate paragraph as answer in the document?
>
> Dušan Kreheľ
>
> 2022-09-03 11:13 GMT+02:00, Thiemo Kreuz :
>> Hello Dušan,
>>
>> It appears like the article is incomplete. What is the idea you want to
>> propose?
>>
>> Kind regards
>> Thiemo
>> ___
>> Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org
>> To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org
>> https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/
>
___
Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org
To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org
https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/

[Wikitech-l] Re: My idea about wikipage parser (shared link on the article)

2022-09-03 Thread Dušan Kreheľ
Hi Thiemo.

I updated the document and the my basic idea is marked in text as
bold. Or do You like the separate paragraph as answer in the document?

Dušan Kreheľ

2022-09-03 11:13 GMT+02:00, Thiemo Kreuz :
> Hello Dušan,
>
> It appears like the article is incomplete. What is the idea you want to
> propose?
>
> Kind regards
> Thiemo
> ___
> Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org
> To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org
> https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/
___
Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org
To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org
https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/

[Wikitech-l] Re: Reducing size of pageviews dump (shared link on the article)

2022-09-03 Thread Dušan Kreheľ
Hello Thiemo.

I updated the document. Look You the document or the document changes.

I think, for the low number values is better storing as text. Example,
one reason, the RAW data have lower memory size. Example for input "1
15 85" is the test size 7 B, but in memory format would be minimal 3
input values * (minimal) 4 bytes per one value = 12 bytes. The binary
dump data would be compress (for the zero bytes in RAW data) both as a
text data. The text format  is more human format as like to use,
Example for the programmer or to use in the spreadsheet calculator.

Dušan Kreheľ

2022-09-03 11:16 GMT+02:00, Thiemo Kreuz :
> Hello Dušan,
>
> I find this really fascinating. Unfortunately, it looks like the
> article doesn't explain the proposed format. Where is the domain in
> the new format? What does "DAY_HOUR" mean? What's the difference
> between "DAY_HOUR2", "DAY2_HOUR", and "DAY2_HOUR2"? What is the file
> naming scheme for the new format?
>
> Being fascinated by file formats myself I also wonder. Why not make it
> binary?
>
> Kind regards
> Thiemo
> ___
> Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org
> To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org
> https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/
___
Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org
To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org
https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/

[Wikitech-l] Re: Reducing size of pageviews dump (shared link on the article)

2022-09-03 Thread Thiemo Kreuz
Hello Dušan,

I find this really fascinating. Unfortunately, it looks like the
article doesn't explain the proposed format. Where is the domain in
the new format? What does "DAY_HOUR" mean? What's the difference
between "DAY_HOUR2", "DAY2_HOUR", and "DAY2_HOUR2"? What is the file
naming scheme for the new format?

Being fascinated by file formats myself I also wonder. Why not make it binary?

Kind regards
Thiemo
___
Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org
To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org
https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/

[Wikitech-l] Re: My idea about wikipage parser (shared link on the article)

2022-09-03 Thread Thiemo Kreuz
Hello Dušan,

It appears like the article is incomplete. What is the idea you want to propose?

Kind regards
Thiemo
___
Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org
To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org
https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/

[Wikitech-l] Reducing size of pageviews dump (shared link on the article)

2022-09-03 Thread Dušan Kreheľ
Hi,

i wanna share my idea (writed in the article) about the reducing size
of pageviews dump:
https://en.wikipedia.org/wiki/User:Du%C5%A1an_Krehe%C4%BE/Signpost_draft:New_pageview_dump_export_format_(concept)

The primary technical content would be done.

Dušan Kreheľ
___
Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org
To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org
https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/

[Wikitech-l] My idea about wikipage parser (shared link on the article)

2022-09-03 Thread Dušan Kreheľ
Hi,

i wanna share my idea (writed in the article) about the wikipage
parser: 
https://en.wikipedia.org/wiki/User:Du%C5%A1an_Krehe%C4%BE/Signpost_draft:My_idea_about_wikipage_parser

The primary technical content would be done.

Dušan Kreheľ
___
Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org
To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org
https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/