[Wikitech-l] Re: My idea about wikipage parser (shared link on the article)
Hi all. I must rewrite the article, as the another style of the article, or the format. The informations are okey.. Dušan Kreheľ 2022-09-03 14:19 GMT+02:00, Dušan Kreheľ : > Hi Thiemo. > > I updated the document and the my basic idea is marked in text as > bold. Or do You like the separate paragraph as answer in the document? > > Dušan Kreheľ > > 2022-09-03 11:13 GMT+02:00, Thiemo Kreuz : >> Hello Dušan, >> >> It appears like the article is incomplete. What is the idea you want to >> propose? >> >> Kind regards >> Thiemo >> ___ >> Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org >> To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org >> https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/ > ___ Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/
[Wikitech-l] Re: My idea about wikipage parser (shared link on the article)
Hi Thiemo. I updated the document and the my basic idea is marked in text as bold. Or do You like the separate paragraph as answer in the document? Dušan Kreheľ 2022-09-03 11:13 GMT+02:00, Thiemo Kreuz : > Hello Dušan, > > It appears like the article is incomplete. What is the idea you want to > propose? > > Kind regards > Thiemo > ___ > Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org > To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org > https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/ ___ Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/
[Wikitech-l] Re: Reducing size of pageviews dump (shared link on the article)
Hello Thiemo. I updated the document. Look You the document or the document changes. I think, for the low number values is better storing as text. Example, one reason, the RAW data have lower memory size. Example for input "1 15 85" is the test size 7 B, but in memory format would be minimal 3 input values * (minimal) 4 bytes per one value = 12 bytes. The binary dump data would be compress (for the zero bytes in RAW data) both as a text data. The text format is more human format as like to use, Example for the programmer or to use in the spreadsheet calculator. Dušan Kreheľ 2022-09-03 11:16 GMT+02:00, Thiemo Kreuz : > Hello Dušan, > > I find this really fascinating. Unfortunately, it looks like the > article doesn't explain the proposed format. Where is the domain in > the new format? What does "DAY_HOUR" mean? What's the difference > between "DAY_HOUR2", "DAY2_HOUR", and "DAY2_HOUR2"? What is the file > naming scheme for the new format? > > Being fascinated by file formats myself I also wonder. Why not make it > binary? > > Kind regards > Thiemo > ___ > Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org > To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org > https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/ ___ Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/
[Wikitech-l] Re: Reducing size of pageviews dump (shared link on the article)
Hello Dušan, I find this really fascinating. Unfortunately, it looks like the article doesn't explain the proposed format. Where is the domain in the new format? What does "DAY_HOUR" mean? What's the difference between "DAY_HOUR2", "DAY2_HOUR", and "DAY2_HOUR2"? What is the file naming scheme for the new format? Being fascinated by file formats myself I also wonder. Why not make it binary? Kind regards Thiemo ___ Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/
[Wikitech-l] Re: My idea about wikipage parser (shared link on the article)
Hello Dušan, It appears like the article is incomplete. What is the idea you want to propose? Kind regards Thiemo ___ Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/
[Wikitech-l] Reducing size of pageviews dump (shared link on the article)
Hi, i wanna share my idea (writed in the article) about the reducing size of pageviews dump: https://en.wikipedia.org/wiki/User:Du%C5%A1an_Krehe%C4%BE/Signpost_draft:New_pageview_dump_export_format_(concept) The primary technical content would be done. Dušan Kreheľ ___ Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/
[Wikitech-l] My idea about wikipage parser (shared link on the article)
Hi, i wanna share my idea (writed in the article) about the wikipage parser: https://en.wikipedia.org/wiki/User:Du%C5%A1an_Krehe%C4%BE/Signpost_draft:My_idea_about_wikipage_parser The primary technical content would be done. Dušan Kreheľ ___ Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/