Hi,

I just enrolled this list, thanks to Dan Andreescu, who let me know about it, and I have a question on processing clickstream data.

I downloaded a file for last month clickstream data (https://dumps.wikimedia.org/other/clickstream/2022-12/clickstream-eswiki-2022-12.tsv.gz) and have problems to open it and processing it.

The only programme I could open it was OpenRefine. Other programmes (Numbers and LibreOffice) just couldn't cope with it.

I can use OpenRefine to do some transformation and delete some rows I don't need, but even then, with some 1.5milion rows, I can not open it with numbers or libreoffice to do sum of the column 4.

Which tools do you use to work with such big files?

Thanks.
--
========================
Robert Garrigós i Castro
https://garrigos.cat
+34 620 91 87 01
_______________________________________________
Analytics mailing list -- [email protected]
To unsubscribe send an email to [email protected]

Reply via email to