[Wikidata-bugs] [Maniphest] [Commented On] T76509: Figure out why Titan gets OOM when importing data

2014-12-02 Thread GWicke
GWicke added a comment. Is 48k transaction records all transactions committed up to that point? TASK DETAIL https://phabricator.wikimedia.org/T76509 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. To: Smalyshev, GWicke Cc

[Wikidata-bugs] [Maniphest] [Commented On] T76373: Evaluate Titan as graph storage/query engine for Wikidata Query service

2014-12-02 Thread GWicke
GWicke added a comment. ! In T76373#799513, @Smalyshev wrote: Technical issues: # On import, titan sometimes slows down and gets into GC loops. # On querying, for vertices with a lot of edges (such as `wd(Q5).in(P31)`, i.e. humans, titan produces a backend exception: ``` Caused

[Wikidata-bugs] [Maniphest] [Commented On] T76373: Evaluate Titan as graph storage/query engine for Wikidata Query service

2014-12-02 Thread GWicke
GWicke added a comment. ! In T76373#802449, @Smalyshev wrote: Note, that running Titan with Cassandra embedded requires GC tuning. While embedded Cassandra can provide lower latency query answering, its GC behavior under load is less predictable. Yeah, agreed. GC scaling limits are kind

[Wikidata-bugs] [Maniphest] [Changed CC] T77897: Test host for Titan

2014-12-08 Thread GWicke
GWicke added a subscriber: wikidata-query-service. TASK DETAIL https://phabricator.wikimedia.org/T77897 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. To: GWicke Cc: Aklapper, GWicke, mark, RobH, Smalyshev, Eloquence, aude

[Wikidata-bugs] [Maniphest] [Edited] T77897: Test host for Titan

2014-12-08 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T77897 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. To: GWicke Cc: Aklapper, GWicke, mark, RobH, Smalyshev, Eloquence, aude, Tobi_WMDE_SW

[Wikidata-bugs] [Maniphest] [Changed CC] T75460: Sane Lua label access in non-content languages

2014-12-09 Thread GWicke
GWicke added a subscriber: GWicke. GWicke added a comment. Please make sure that the language(s) used are static for anything in the content of a page. In particular, it should not depend on the user interface language, as that would impose a heavy performance penalty on the user. TASK DETAIL

[Wikidata-bugs] [Maniphest] [Edited] T77897: Test host for Titan

2014-12-11 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T77897 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Commented On] T76372: Evaluate TinkerPop GraphSail frontend

2014-12-11 Thread GWicke
GWicke added a comment. Might be worth starting with Tinkerpop3 https://github.com/tinkerpop/tinkerpop3/blob/master/CHANGELOG.asciidoc. TASK DETAIL https://phabricator.wikimedia.org/T76372 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign

[Wikidata-bugs] [Maniphest] [Edited] T76372: Evaluate TinkerPop GraphSail frontend

2014-12-11 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T76372 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Commented On] T77897: Test host for Titan

2014-12-17 Thread GWicke
GWicke added a comment. @smalyshev started testing yesterday. Things are working fine so far. Thanks! TASK DETAIL https://phabricator.wikimedia.org/T77897 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES

[Wikidata-bugs] [Maniphest] [Created] T84923: Reliable publish / subscribe event bus

2014-12-18 Thread GWicke
GWicke created this task. GWicke added a subscriber: GWicke. GWicke added a project: wikidata-query-service. GWicke changed Security from none to none. TASK DESCRIPTION We need a reliable way to distribute a variety of update events emitted from MediaWiki core (and other services) to various

[Wikidata-bugs] [Maniphest] [Edited] T84923: Reliable publish / subscribe event bus

2014-12-18 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T84923 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Triaged] T84923: Reliable publish / subscribe event bus

2014-12-18 Thread GWicke
GWicke triaged this task as Normal priority. TASK DETAIL https://phabricator.wikimedia.org/T84923 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel

[Wikidata-bugs] [Maniphest] [Created] T85181: Investigate design public API, possibly using MQL

2014-12-22 Thread GWicke
GWicke created this task. GWicke added a subscriber: GWicke. GWicke added a project: wikidata-query-service. TASK DESCRIPTION We need to investigate design a public query API. Goals for this API are: - Easy to build queries with a variety of clients (probably JSON) without dealing

[Wikidata-bugs] [Maniphest] [Changed CC] T85181: Investigate design public API, possibly using MQL

2014-12-22 Thread GWicke
GWicke removed a subscriber: Aklapper. GWicke added a project: Services. GWicke set Security to none. TASK DETAIL https://phabricator.wikimedia.org/T85181 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https

[Wikidata-bugs] [Maniphest] [Commented On] T76373: Evaluate Titan as graph storage/query engine for Wikidata Query service

2014-12-22 Thread GWicke
GWicke added a comment. Re performance and indexing, from a mail thread: Earlier today Stas I were looking a bit into what is happening behind the scenes in some of the slower queries like https://www.mediawiki.org/wiki/Wikibase/Indexing/Benchmarks

[Wikidata-bugs] [Maniphest] [Commented On] T76373: Evaluate Titan as graph storage/query engine for Wikidata Query service

2014-12-23 Thread GWicke
GWicke added a comment. In https://phabricator.wikimedia.org/T76373#941917, @Manybubbles wrote: I'd spin up a new one - probably just on a single node. I think in the long run we probably can run this on the production search cluster but for now lets keep it off just in case it does

[Wikidata-bugs] [Maniphest] [Commented On] T84923: Reliable publish / subscribe event bus

2014-12-23 Thread GWicke
GWicke added a comment. In https://phabricator.wikimedia.org/T84923#940155, @JanZerebecki wrote: The nature of these event type candidates is such that they are changes with a log existing at the provider. Wikidata might be the exeception here. Most other events are not available

[Wikidata-bugs] [Maniphest] [Commented On] T85181: Investigate design public API, possibly using MQL

2014-12-23 Thread GWicke
GWicke added a comment. In https://phabricator.wikimedia.org/T85181#940893, @Smalyshev wrote: Are we considering supporting WDQ API mini-language as the option for the queries or it's not a viable option? The problem I see with the WDQ language is the need to perform error-prone custom

[Wikidata-bugs] [Maniphest] [Commented On] T76373: Evaluate Titan as graph storage/query engine for Wikidata Query service

2014-12-23 Thread GWicke
GWicke added a comment. Another fun article for dates: http://en.wikipedia.org/wiki/Timeline_of_the_far_future TASK DETAIL https://phabricator.wikimedia.org/T76373 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL

[Wikidata-bugs] [Maniphest] [Edited] T85181: Investigate design public API, possibly using MQL

2014-12-23 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T85181 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Commented On] T76373: Evaluate Titan as graph storage/query engine for Wikidata Query service

2014-12-24 Thread GWicke
GWicke added a comment. In https://phabricator.wikimedia.org/T76373#943426, @Smalyshev wrote: Proposed storage format for dates: 1. Dates are stored as long signed integers, representing number of seconds since 1970-01-01 00:00:00 UTC. 2. This gives us range of 292 bln years http

[Wikidata-bugs] [Maniphest] [Commented On] T76373: Evaluate Titan as graph storage/query engine for Wikidata Query service

2014-12-24 Thread GWicke
GWicke added a comment. In https://phabricator.wikimedia.org/T76373#943454, @Smalyshev wrote: seconds(292M-12-31T23:59:59) seconds(292M+1) It's pretty likely that there were more than 356 leap years between -292M and 1970, so it's very possible for the years to be non-monotonic if we don't

[Wikidata-bugs] [Maniphest] [Commented On] T76373: Evaluate Titan as graph storage/query engine for Wikidata Query service

2014-12-24 Thread GWicke
GWicke added a comment. In https://phabricator.wikimedia.org/T76373#943459, @Smalyshev wrote: For dates beyond real Gregorian calendar, the values more precise than years have little meaning anyway, so I don't think it matters too much as long as comparisons and lookups (i.e. which Greek

[Wikidata-bugs] [Maniphest] [Commented On] T85181: Investigate design public API, possibly using MQL

2014-12-29 Thread GWicke
GWicke added a comment. @jeroendedauw: I think it's possible to implement a simple query interface (such as the current wikibase API, as far as I understand it) on top of a more powerful one such as the one we are discussing here. The reverse is not necessarily true. What is your view

[Wikidata-bugs] [Maniphest] [Commented On] T84923: Reliable publish / subscribe event bus

2015-01-24 Thread GWicke
GWicke added a comment. In https://phabricator.wikimedia.org/T84923#968636, @JanZerebecki wrote: http://www.fedmsg.com might fit this need. It is used/developed by Fedora and Debian people and is a federated, reliable message bus with history of cryptographically authenticated json messages

[Wikidata-bugs] [Maniphest] [Edited] T84923: Reliable publish / subscribe event bus

2015-02-03 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T84923 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Commented On] T85181: Investigate design public API, possibly using MQL

2015-01-25 Thread GWicke
GWicke added a comment. In https://phabricator.wikimedia.org/T85181#971763, @JanZerebecki wrote: No, just a list of things that were brought up, when people talked about querying Wikidata. I don't like Ask. Only ever used Ask and Cypher. I think the way Cypher goes about the problem is neat

[Wikidata-bugs] [Maniphest] [Commented On] T84923: Reliable publish / subscribe event bus

2015-01-05 Thread GWicke
GWicke added a comment. In https://phabricator.wikimedia.org/T84923#943464, @JanZerebecki wrote: https://www.wikidata.org/w/api.php?action=helpmodules=wbgetentities and recent changes, see also https://phabricator.wikimedia.org/T85103 and https://phabricator.wikimedia.org/T85100

[Wikidata-bugs] [Maniphest] [Updated] T85159: Deploy a Wikidata complex query service into production

2015-01-12 Thread GWicke
GWicke added a blocking task: T85181: Investigate design public API, possibly using MQL. TASK DETAIL https://phabricator.wikimedia.org/T85159 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https

[Wikidata-bugs] [Maniphest] [Commented On] T86561: Wikidata Query Service hardware

2015-01-12 Thread GWicke
GWicke added a comment. In https://phabricator.wikimedia.org/T86561#971491, @JanZerebecki wrote: Do we want to have the Cassandra and Titan nodes be in the same rack as I assume that query performance is very latency sensitive? I don't think that there is a huge enough difference in latency

[Wikidata-bugs] [Maniphest] [Edited] T85181: Investigate design public API, possibly using MQL

2015-01-06 Thread GWicke
GWicke edited the task description. GWicke removed a subscriber: Manybubbles. TASK DETAIL https://phabricator.wikimedia.org/T85181 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https

[Wikidata-bugs] [Maniphest] [Changed CC] T86016: Simple Query Language against both Titan and SQL

2015-01-12 Thread GWicke
GWicke added a subscriber: GWicke. GWicke added a comment. I would actually argue for starting with a simple subset of whatever we plan to use in the longer term (MQL?), and then expand from there. Lets not expose n different interfaces for the same thing just because we can. Doing so would

[Wikidata-bugs] [Maniphest] [Commented On] T85181: Investigate design public API, possibly using MQL

2015-01-12 Thread GWicke
GWicke added a comment. @janzerebecki: Are there any in that list that you would prefer over MQL? If so, for which reasons? TASK DETAIL https://phabricator.wikimedia.org/T85181 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username

[Wikidata-bugs] [Maniphest] [Changed CC] T85181: Investigate design public API, possibly using MQL

2015-01-07 Thread GWicke
GWicke added a subscriber: Manybubbles. TASK DETAIL https://phabricator.wikimedia.org/T85181 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Commented On] T84923: Reliable publish / subscribe event bus

2015-01-07 Thread GWicke
GWicke added a comment. @halfak has written up very similar ideas at https://meta.wikimedia.org/wiki/Research:MediaWiki_events:_a_generalized_public_event_datasource TASK DETAIL https://phabricator.wikimedia.org/T84923 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close

[Wikidata-bugs] [Maniphest] [Edited] T84923: Reliable publish / subscribe event bus

2015-01-07 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T84923 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Edited] T84923: Reliable publish / subscribe event bus

2015-01-07 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T84923 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Updated] T84923: Reliable publish / subscribe event bus

2015-01-07 Thread GWicke
GWicke added a project: Analytics. GWicke added a comment. In https://phabricator.wikimedia.org/T84923#961622, @bd808 wrote: can support large delays (order of days) for individual consumers Do you have a strong use case to support this need? Yes. Hosts can go down for multiple days

[Wikidata-bugs] [Maniphest] [Commented On] T85181: Investigate design public API, possibly using MQL

2015-01-07 Thread GWicke
GWicke added a comment. In https://phabricator.wikimedia.org/T85181#958820, @JeroenDeDauw wrote: As far as I can tell, decisions are currently made based on the use case the WMF has, rather than also holding the Wkidata plans into account. I'm getting this impression because I'm not seeing

[Wikidata-bugs] [Maniphest] [Up For Grabs] T85181: Investigate design public API, possibly using MQL

2015-02-20 Thread GWicke
GWicke placed this task up for grabs. TASK DETAIL https://phabricator.wikimedia.org/T85181 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Changed Project Column] T84923: Reliable publish / subscribe event bus

2015-03-17 Thread GWicke
GWicke moved this task to Future on the Services workboard. TASK DETAIL https://phabricator.wikimedia.org/T84923 WORKBOARD https://phabricator.wikimedia.org/project/board/69/ REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username

[Wikidata-bugs] [Maniphest] [Edited] T84923: Reliable publish / subscribe event bus

2015-03-13 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T84923 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Changed Subscribers] T84923: Reliable publish / subscribe event bus

2015-03-10 Thread GWicke
GWicke added a subscriber: Eevans. TASK DETAIL https://phabricator.wikimedia.org/T84923 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Changed Subscribers] T90109: BlazeGraph Finalization: Zookeeper

2015-03-02 Thread GWicke
GWicke added a subscriber: GWicke. TASK DETAIL https://phabricator.wikimedia.org/T90109 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Edited] T85181: Investigate design public API, possibly using MQL

2015-01-25 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T85181 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Edited] T85181: Investigate design public API, possibly using MQL

2015-01-25 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T85181 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Updated] T93913: Investigate using RESTBase for batched entity lookups

2015-04-01 Thread GWicke
GWicke added a project: RESTBase-Usecase. TASK DETAIL https://phabricator.wikimedia.org/T93913 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Changed Subscribers] T93913: Investigate using RESTBase for batched entity lookups

2015-04-01 Thread GWicke
GWicke added subscribers: mobrovac, Eevans. TASK DETAIL https://phabricator.wikimedia.org/T93913 REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign username. EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel

[Wikidata-bugs] [Maniphest] [Edited] T84923: Reliable publish / subscribe event bus

2015-05-15 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T84923 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: mmodell, Eevans, chasemp, brion, Krenair, Halfak, JanZerebecki, bd808, MZMcBride, mobrovac, GWicke

[Wikidata-bugs] [Maniphest] [Edited] T84923: Reliable publish / subscribe event bus

2015-05-15 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T84923 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: mmodell, Eevans, chasemp, brion, Krenair, Halfak, JanZerebecki, bd808, MZMcBride, mobrovac, GWicke

[Wikidata-bugs] [Maniphest] [Edited] T84923: Reliable publish / subscribe event bus

2015-06-22 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T84923 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: Mattflaschen, Ottomata, mmodell, Eevans, chasemp, brion, Krenair, Halfak, JanZerebecki, bd808

[Wikidata-bugs] [Maniphest] [Edited] T84923: Reliable publish / subscribe event bus

2015-07-31 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T84923 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: Aklapper, Matanya, Mattflaschen, Ottomata, mmodell, Eevans, chasemp, brion, Krenair, Halfak

[Wikidata-bugs] [Maniphest] [Commented On] T107602: Set up a public interface to the wikidata query service

2015-08-04 Thread GWicke
GWicke added a comment. @jeroendedauw: https://en.wikipedia.org/api/rest_v1/?doc TASK DETAIL https://phabricator.wikimedia.org/T107602 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Joe, GWicke Cc: JohnLewis, hoo, GWicke, greg, Lydia_Pintscher

[Wikidata-bugs] [Maniphest] [Commented On] T107602: Set up a public interface to the wikidata query service

2015-08-04 Thread GWicke
GWicke added a subscriber: GWicke. GWicke added a comment. Will the query service return raw HTML or SVG content? If it's only returning other content types like JSON, then CORS might not end up mattering too much. An alternative to a separate domain could be to use `https://wikidata.org/api

[Wikidata-bugs] [Maniphest] [Edited] T84923: Reliable publish / subscribe event bus

2015-07-16 Thread GWicke
GWicke edited the task description. Herald added subscribers: Matanya, Aklapper. TASK DETAIL https://phabricator.wikimedia.org/T84923 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: Aklapper, Matanya, Mattflaschen, Ottomata, mmodell

[Wikidata-bugs] [Maniphest] [Commented On] T116247: Define edit related events for change propagation

2015-10-23 Thread GWicke
GWicke added a comment. @ottomata, UUIDs are described in https://en.wikipedia.org/wiki/Universally_unique_identifier. An example for a v1 UUID is `b54adc00-67f9-11d9-9669-0800200c9a66`. There are libraries to extract the high-resolution timestamp for most environments. Regarding a separate

[Wikidata-bugs] [Maniphest] [Commented On] T116247: Define edit related events for change propagation

2015-10-23 Thread GWicke
GWicke added a comment. @JanZerebecki: Suppression information would indeed be needed for public access to older events. One option would be to key this on the event's UUID. We could also consider superseding the message using Kafka's deduplication (compaction) based on the same UUID. TASK

[Wikidata-bugs] [Maniphest] [Changed Subscribers] T116247: Define edit related events for change propagation

2015-10-23 Thread GWicke
GWicke added a subscriber: EBernhardson. TASK DETAIL https://phabricator.wikimedia.org/T116247 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: EBernhardson, Smalyshev, yuvipanda, Hardikj, daniel, aaron, GWicke, mobrovac, MZMcBride, bd808

[Wikidata-bugs] [Maniphest] [Commented On] T116247: Define edit related events for change propagation

2015-10-22 Thread GWicke
GWicke added a comment. Some notes from the meeting: 1. Framing, for all events - **uri**: string; path or url. Example: /en.wikipedia.org/v1/page/title/San_Francisco - **reqid**: v1 UUID <https://en.wikipedia.org/wiki/Universally_unique_identifier#Version_1_.28MAC_address_.26_date-time

[Wikidata-bugs] [Maniphest] [Commented On] T116247: Define edit related events for change propagation

2015-10-26 Thread GWicke
GWicke added a comment. In https://phabricator.wikimedia.org/T116247#1754698, @Ottomata wrote: > > If we have a use case for emitting two secondary events *to the same topic* > > that were both triggered by the same primary event (user click / request > > id), then we can

[Wikidata-bugs] [Maniphest] [Commented On] T116247: Define edit related events for change propagation

2015-10-29 Thread GWicke
GWicke added a comment. @ottomata, they will be filled in somewhere, but I think we haven't necessarily decided on filling them in at production time. To me it seems that filling in either at production or consumption time will work, as long as defaults don't change. It sounds like you have

[Wikidata-bugs] [Maniphest] [Commented On] T116247: Define edit related events for change propagation

2015-10-29 Thread GWicke
GWicke added a comment. @ottomata: Based on our backwards-compatibility rules, the latest schema will be a superset of previous schemas. This means that you will be able to understand both old and new data in a given topic using the //latest// schema. TASK DETAIL https

[Wikidata-bugs] [Maniphest] [Commented On] T116247: Define edit related events for change propagation

2015-10-29 Thread GWicke
GWicke added a comment. @ottomata, you are basically making the case for filling in the defaults at consumption time. TASK DETAIL https://phabricator.wikimedia.org/T116247 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mobrovac, GWicke Cc

[Wikidata-bugs] [Maniphest] [Commented On] T116247: Define edit related events for change propagation

2015-10-29 Thread GWicke
GWicke added a comment. @ottomata: If you fill in the defaults at consumption time, then you have a choice of how you want to treat old events. You can either fill in the defaults from the latest schema (probably what you want in most cases), or choose to explicitly distinguish fields

[Wikidata-bugs] [Maniphest] [Commented On] T116247: Define edit related events for change propagation

2015-10-27 Thread GWicke
GWicke added a comment. > I've been thinking about it too. Ideally, we could leave these fields out of > schema defs, simply reference them. But, that seems not to be in correlation > with storing them in a git repo. What I see as a possible solution is to put > these c

[Wikidata-bugs] [Maniphest] [Commented On] T116247: Define edit related events for change propagation

2015-10-28 Thread GWicke
GWicke added a comment. @ottomata, I think understanding the semantics of an event primarily requires knowledge of the topic. The topic in turn provides access to the schema, which describes the structure of the events. It is likely that we'll have multiple topics record similarly-structured

[Wikidata-bugs] [Maniphest] [Commented On] T116247: Define edit related events for change propagation

2015-10-26 Thread GWicke
GWicke added a comment. > If we adopt a convention of always storing schema name and/or revision in the > schemas themselves, then we can do like EventLogging does and infer and > validate the schema based on this value. This would especially be helpful in > associating a message

[Wikidata-bugs] [Maniphest] [Commented On] T116247: Define edit related events for change propagation

2015-10-26 Thread GWicke
GWicke added a comment. > I'm not so sure actually that these will always be redundant. I think the > request ID should be persisted to track the same event throughout the system. > Imagine a user clicks on something which produces an event in the queue and > that event triggers

[Wikidata-bugs] [Maniphest] [Edited] T116247: Define edit related events for change propagation

2015-10-23 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T116247 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: EBernhardson, Smalyshev, yuvipanda, Hardikj, daniel, aaron, GWicke, mobrovac, MZMcBride, bd808

[Wikidata-bugs] [Maniphest] [Commented On] T116247: Define edit related events for change propagation

2015-10-23 Thread GWicke
GWicke added a comment. > Right, but how would you do this in say, Hive? Or in bash? Timestamp logic > should be easy and immediate. Yeah, Hive really seems to be lacking built-in support for UUIDs. There seems to be UDF code to deal with them, but it's definitely not as conv

[Wikidata-bugs] [Maniphest] [Commented On] T116247: Define edit related events for change propagation

2015-10-23 Thread GWicke
GWicke added a comment. I went ahead and updated the task description with the current framing / per-event schema. I renamed the `reqid` to just `id`, and added a `ts` field containing the same timestamp in ISO 8601 format. TASK DETAIL https://phabricator.wikimedia.org/T116247 EMAIL

[Wikidata-bugs] [Maniphest] [Commented On] T114443: EventBus MVP

2015-10-21 Thread GWicke
GWicke added a comment. We are having a hangout meeting tomorrow (Thursday, 22nd) between 11&12am SF time. Please let us know if you'd like to join. TASK DETAIL https://phabricator.wikimedia.org/T114443 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailprefere

[Wikidata-bugs] [Maniphest] [Edited] T84923: Reliable publish / subscribe event bus

2015-10-21 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T84923 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: Aklapper, Matanya, Mattflaschen, Ottomata, mmodell, Eevans, chasemp, brion, Krenair, Halfak

[Wikidata-bugs] [Maniphest] [Updated] T116247: Define edit related events for change propagation

2015-10-21 Thread GWicke
GWicke added a blocked task: T102476: RFC: Requirements for change propagation. TASK DETAIL https://phabricator.wikimedia.org/T116247 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: yuvipanda, Hardikj, daniel, aaron, GWicke, mobrovac

[Wikidata-bugs] [Maniphest] [Updated] T84923: Reliable publish / subscribe event bus

2015-10-21 Thread GWicke
GWicke added a blocked task: T102476: RFC: Requirements for change propagation. TASK DETAIL https://phabricator.wikimedia.org/T84923 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: Aklapper, Matanya, Mattflaschen, Ottomata, mmodell, Eevans

[Wikidata-bugs] [Maniphest] [Created] T116247: Define edit related events for change propagation

2015-10-21 Thread GWicke
GWicke created this task. GWicke added subscribers: Aklapper, Matanya, Mattflaschen, Ottomata, mmodell, Eevans, chasemp, brion, Krenair, Halfak, JanZerebecki, bd808, MZMcBride, mobrovac, GWicke, aaron, daniel, Hardikj, yuvipanda. GWicke added projects: operations, EventBus, Discovery, Epic

[Wikidata-bugs] [Maniphest] [Raised Priority] T116247: Define edit related events for change propagation

2015-10-21 Thread GWicke
GWicke raised the priority of this task from "Normal" to "High". GWicke set Security to None. TASK DETAIL https://phabricator.wikimedia.org/T116247 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: yuvipanda, Hardikj, d

[Wikidata-bugs] [Maniphest] [Edited] T116247: Define edit related events for change propagation

2015-10-21 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T116247 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: yuvipanda, Hardikj, daniel, aaron, GWicke, mobrovac, MZMcBride, bd808, JanZerebecki, Halfak, Krenair

[Wikidata-bugs] [Maniphest] [Edited] T116247: Define edit related events for change propagation

2015-10-21 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T116247 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: yuvipanda, Hardikj, daniel, aaron, GWicke, mobrovac, MZMcBride, bd808, JanZerebecki, Halfak, Krenair

[Wikidata-bugs] [Maniphest] [Commented On] T114443: EventBus MVP

2015-11-04 Thread GWicke
GWicke added a comment. @faidon: Until very recently (last days), there wasn't actually any REST proxy with schema validation in the EventLogging repository. @ottomata now has a patch implementing such a service <https://gerrit.wikimedia.org/r/#/c/235671/24/server/bin/eventlogging-serv

[Wikidata-bugs] [Maniphest] [Updated] T114474: More flexible and modernized Recent Changes code

2015-11-05 Thread GWicke
GWicke added a subscriber: GWicke. GWicke added a comment. There is some related high-level discussion about recent changes and page history as event streams in https://phabricator.wikimedia.org/T107595. One idea is to layer event streams, which would potentially let us integrate related

[Wikidata-bugs] [Maniphest] [Changed Subscribers] T105638: RFC: Streamlining Composer usage

2015-11-06 Thread GWicke
GWicke added a subscriber: mobrovac. TASK DETAIL https://phabricator.wikimedia.org/T105638 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: JanZerebecki, GWicke Cc: mobrovac, GWicke, Addshore, Qgil, Spage, greg, tstarling, aude, hoo, daniel

[Wikidata-bugs] [Maniphest] [Commented On] T105638: RFC: Streamlining Composer usage

2015-11-06 Thread GWicke
GWicke added a subscriber: GWicke. GWicke added a comment. Here is an idea for a workflow-based solution that would work for nodejs as well: 1. Each code project has a corresponding deploy repository. For nodejs, current practice is to have the code as a submodule of the deploy repository

[Wikidata-bugs] [Maniphest] [Commented On] T114443: EventBus MVP

2015-11-02 Thread GWicke
GWicke added a comment. @ottomata: In my recollection of the discussion & the log you linked to, the question of which REST producer proxy to use was left open. Our priority is to get basic events into Kafka before the end of this month, so that we can start building on top of this for ch

[Wikidata-bugs] [Maniphest] [Commented On] T114443: EventBus MVP

2015-10-16 Thread GWicke
GWicke added a comment. In https://phabricator.wikimedia.org/T114443#1731399, @ori wrote: > In https://phabricator.wikimedia.org/T114443#1731284, @GWicke wrote: > > > See https://phabricator.wikimedia.org/T88459#1604768. tl;dr: It's not > > necessarily clear that saving ver

[Wikidata-bugs] [Maniphest] [Commented On] T114443: EventBus MVP

2015-10-16 Thread GWicke
GWicke added a comment. In https://phabricator.wikimedia.org/T114443#1730753, @Eevans wrote: > 1. Already leverages a (really slick) JSON schema registry > <https://meta.wikimedia.org/wiki/Category:Schemas_%28active%29?status=active> Optionally fetching schemas from a URL isn

[Wikidata-bugs] [Maniphest] [Commented On] T114443: EventBus MVP

2015-10-16 Thread GWicke
GWicke added a comment. > For starters, it means that we have alternatives for environments where Kafka > is overkill (small third-party installations, dev environments, mw-vagrant, > etc). Using, for example, sqlite instead of Kafka is already something > supported. As far

[Wikidata-bugs] [Maniphest] [Commented On] T114443: EventBus MVP

2015-10-19 Thread GWicke
GWicke added a comment. A PR adding remote schema support to the nodejs frontend is now available at https://github.com/wikimedia/restevent/pull/1. This means that we can now choose to use local or remote schemas per-topic in the configuration. TASK DETAIL https://phabricator.wikimedia.org

[Wikidata-bugs] [Maniphest] [Updated] T114443: EventBus MVP

2015-10-07 Thread GWicke
GWicke added a project: MediaWiki-RfCs. TASK DETAIL https://phabricator.wikimedia.org/T114443 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: EBernhardson, bd808, Joe, dr0ptp4kt, madhuvishy, Nuria, ori, faidon, aaron, GWicke, mobrovac

[Wikidata-bugs] [Maniphest] [Changed Project Column] T114443: EventBus MVP

2015-10-07 Thread GWicke
GWicke moved this task to Ready for RFC meeting on the MediaWiki-RfCs workboard. TASK DETAIL https://phabricator.wikimedia.org/T114443 WORKBOARD https://phabricator.wikimedia.org/project/board/52/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences

[Wikidata-bugs] [Maniphest] [Commented On] T84923: Reliable publish / subscribe event bus

2015-08-31 Thread GWicke
GWicke added a comment. Etherpad notes: https://etherpad.wikimedia.org/p/scalable_events_system TASK DETAIL https://phabricator.wikimedia.org/T84923 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: Aklapper, Matanya, Mattflaschen

[Wikidata-bugs] [Maniphest] [Edited] T84923: Reliable publish / subscribe event bus

2015-08-31 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T84923 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: Aklapper, Matanya, Mattflaschen, Ottomata, mmodell, Eevans, chasemp, brion, Krenair, Halfak

[Wikidata-bugs] [Maniphest] [Commented On] T114443: EventBus MVP

2015-10-01 Thread GWicke
GWicke added a comment. I have now integrated some of those changes into the description. TASK DETAIL https://phabricator.wikimedia.org/T114443 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: dr0ptp4kt, madhuvishy, Nuria, ori, faidon

[Wikidata-bugs] [Maniphest] [Edited] T114443: EventBus MVP

2015-10-01 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T114443 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: dr0ptp4kt, madhuvishy, Nuria, ori, faidon, aaron, GWicke, mobrovac, Halfak, Eevans, Ottomata

[Wikidata-bugs] [Maniphest] [Edited] T114443: EventBus MVP

2015-10-04 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T114443 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: EBernhardson, bd808, Joe, dr0ptp4kt, madhuvishy, Nuria, ori, faidon, aaron, GWicke, mobrovac, Halfak

[Wikidata-bugs] [Maniphest] [Commented On] T114443: EventBus MVP

2015-10-04 Thread GWicke
GWicke added a comment. @ori, I changed the text to clarify which of those are potential, and which are concrete plans for this quarter. Please follow the provided links if things are still unclear. TASK DETAIL https://phabricator.wikimedia.org/T114443 EMAIL PREFERENCES https

[Wikidata-bugs] [Maniphest] [Edited] T114443: EventBus MVP

2015-10-04 Thread GWicke
GWicke edited the task description. TASK DETAIL https://phabricator.wikimedia.org/T114443 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: EBernhardson, bd808, Joe, dr0ptp4kt, madhuvishy, Nuria, ori, faidon, aaron, GWicke, mobrovac, Halfak

[Wikidata-bugs] [Maniphest] [Commented On] T114443: EventBus MVP

2015-10-04 Thread GWicke
GWicke added a comment. @Nuria, see the task description, heading "Initial use cases". TASK DETAIL https://phabricator.wikimedia.org/T114443 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: GWicke Cc: EBernhardson, bd808, Joe, dr0ptp4kt,

[Wikidata-bugs] [Maniphest] [Commented On] T114443: EventBus MVP

2015-10-02 Thread GWicke
GWicke added a comment. @ottomata, main reason would be the ability to work with $simple_queue, $binary_kafka, $amazon_queue and so on without changes in MW code. This isn't so theoretical. We'll want a lighter-weight queue for testing, developers and third party users rather soon. TASK

  1   2   >