On Tue, Jan 3, 2017 at 9:30 AM, Stas Malyshev <[email protected]>
wrote:

> Hi!
>
> >     1. Is there a unique key for the query log? The log I am refering to
> >     is the *wdqs_extract* table**from
> >     the hive database wmf.**We would like to be able to
> >     permanently link our own computed data with the log entry we
> >     computed it from.
>
> I think you can use hostname+sequence (from
> https://wikitech.wikimedia.org/wiki/Analytics/Data/Webrequest, assuming
> those are preserved in wdqs_extract) as a key.
>

​Adrian, you can also consider adding other fields to Stas' recommendation
to create the key, to be sure about uniqueness. For example, IP and UA
fields, in combination with hostname and sequence (or browser language, if
it's relevant in your case). Let us know what you end up using on this
thread, so we know the answer for the future. :)

Best,
Leila​



>
>
> --
> Stas Malyshev
> [email protected]
>
> _______________________________________________
> Analytics mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to