Hi!

>     1. Is there a unique key for the query log? The log I am refering to
>     is the *wdqs_extract* table**from
>     the hive database wmf.**We would like to be able to
>     permanently link our own computed data with the log entry we
>     computed it from.

I think you can use hostname+sequence (from
https://wikitech.wikimedia.org/wiki/Analytics/Data/Webrequest, assuming
those are preserved in wdqs_extract) as a key.

>     2. Is it possible to find out if a query in a given log entry was
>     accepted by the sparql endpoint as valid?

If it wasn't, the result code should be 400.

>     3. Is there any other database system besides hive installed on the
>     server?

I think the currently recommended interface is beeline, not sure about
other DB systems.

>     And finally a question on conventions for this mailing list: Am I
>     correct in sending one mail for multiple questions or should I send
>     separate mails for each question?

I think it's ok. For the questions regarding data and other WDQS
specifics you may also CC me or [email protected].

-- 
Stas Malyshev
[email protected]

_______________________________________________
Analytics mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/analytics

Reply via email to