dcausse updated the task description.
TASK DETAIL
https://phabricator.wikimedia.org/T239414
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Smalyshev, Lucas_Werkmeister_WMDE, Igorkim78, dcausse, Aklapper, Hook696,
Daryl-TTMG
dcausse created this task.
dcausse added projects: Wikidata-Query-Service, Discovery-Search (Current work).
Restricted Application added a subscriber: Aklapper.
Restricted Application added a project: Wikidata.
TASK DESCRIPTION
The current workflow of the updater requires loading the triples
dcausse claimed this task.
dcausse triaged this task as "Medium" priority.
TASK DETAIL
https://phabricator.wikimedia.org/T239687
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: dcausse, Aklapper, darthmon_wmde, DannyS712, Nan
dcausse updated the task description.
TASK DETAIL
https://phabricator.wikimedia.org/T239414
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Smalyshev, Lucas_Werkmeister_WMDE, Igorkim78, dcausse, Aklapper,
darthmon_wmde, DannyS712
dcausse updated the task description.
TASK DETAIL
https://phabricator.wikimedia.org/T239414
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Smalyshev, Lucas_Werkmeister_WMDE, Igorkim78, dcausse, Aklapper,
darthmon_wmde, DannyS712
dcausse created this task.
dcausse added projects: Wikidata-Query-Service, Discovery-Search (Current work).
Restricted Application added a subscriber: Aklapper.
Restricted Application added a project: Wikidata.
TASK DESCRIPTION
Seen on wdqs1004 after enabling async imports
20:29:03.200
dcausse triaged this task as "Medium" priority.
TASK DETAIL
https://phabricator.wikimedia.org/T239750
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: dcausse, Aklapper, darthmon_wmde, DannyS712, Nandana,
dcausse added a comment.
References and values are identified by a hash computed over their
properties. It is not a stable ID as it is always generated on the fly when
extracting the entity data.
The current RDF projection makes it a resource that is referenced from other
triples. The
dcausse added a comment.
Some numbers extracted from a dump:
- number of values: 20,659,551
- number of unique values: 11,028,526
- number of references: 60,078,314
- number of unique references: 58,876,057
So to the question:
> is it worthwhile to dedup values&ref
dcausse moved this task from To Be Deployed to Done on the Discovery-Search
(Current work) board.
dcausse closed this task as "Resolved".
TASK DETAIL
https://phabricator.wikimedia.org/T101013
WORKBOARD
https://phabricator.wikimedia.org/project/board/1227/
EMAIL PREFERENC
dcausse closed subtask T101013: Log Wikidata Query Service queries to the event
gate infrastructure as "Resolved".
TASK DETAIL
https://phabricator.wikimedia.org/T234968
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: J
dcausse created this task.
dcausse added a project: Wikidata-Query-Service.
Restricted Application added a subscriber: Aklapper.
Restricted Application added a project: Wikidata.
TASK DESCRIPTION
Seen in munged dumps:
- Nov 6 munged dump: 5 909 445 794
- Nov 15 dump (lexeme): 21 591 655
dcausse created this task.
dcausse added a project: Wikidata-Query-Service.
Restricted Application added a subscriber: Aklapper.
Restricted Application added a project: Wikidata.
TASK DESCRIPTION
As of today when we run an update to blazegraph we only extract the total
number of mutations
dcausse created this task.
dcausse added projects: Wikidata, CirrusSearch.
Restricted Application added a subscriber: Aklapper.
Restricted Application added a project: Discovery-Search.
TASK DESCRIPTION
The sanitizer seems to be a bit aggressive with wikidata causing significant
load on the
dcausse updated the task description.
TASK DETAIL
https://phabricator.wikimedia.org/T239931
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Ladsgroup, Addshore, Aklapper, dcausse, darthmon_wmde, DannyS712, Nandana,
Lahi, Gq86
dcausse added a comment.
We should export the triples from a production journal to try to understand
where are the differences. To do this we need to copy a journal and run some
tools provided by blazegraph.
The tool is ExportKB to run it we need all the jars present in the war (the
dcausse added a comment.
Does it mean that we would make WikbaseClient dependent on CirrusSearch and create all necessary query builders into this client?
Have we considered the possibility to run an actual API call to wbsearchentit...@wikidata.org?
I have no clue if the current API output would
dcausse added a comment.
moving back to in progress as the second patch generated some warnings on test servers:
[Wed Jun 27 13:49:51 2018] [hphp] [482:7f0a5afff700:37030:01] [] \nWarning: Invalid argument supplied for foreach() in /srv/mediawiki/php-1.32.0-wmf.8/extensions/CirrusSearch
dcausse added a subscriber: dcausse.
TASK DETAIL
https://phabricator.wikimedia.org/T88534
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: dcausse, Ricordisamoa, Addshore, Deskana, Manybubbles, Christopher,
Wikidata-bugs, hoo, daniel
dcausse added a comment.
Looks like the mapping will be changed (GD should use the
CirrusSearchMappingConfig hook no?) so yes I'm afraid that we'll have to
rebuild the index with the new mapping.
TASK DETAIL
https://phabricator.wikimedia.org/T115482
EMAIL PREFERENC
dcausse added a subscriber: dcausse.
dcausse added a comment.
//First of all: sorry for all the low level details in this comment but it's
always complex to tackle such relevance issues.//
I assume that `life` is the query.
Wikidata already uses `incoming_link` to boost the top-N results
dcausse added a comment.
Yes if you have numeric properties that are ready to use, we might be able to
use them soon.
With https://gerrit.wikimedia.org/r/#/c/249460/ we should be able to write a
custom rescore profile for wikidata and try to workaround the poor lucene
scores we have today.
PS
dcausse added a comment.
Sorry... I was completely wrong when analyzing lucene explain for Q3 (it's a
pain to debug scoring issues
<https://www.wikidata.org/w/index.php?title=Special:Search&limit=10&offset=850&profile=default&search=life&cirrusDumpResult&cirru
dcausse added a comment.
A big +1.
As far as I know it should be pretty straightforward, you just need to
implement 2 hooks (`CirrusSearchMappingConfig` and
`CirrusSearchBuildDocumentParse`).
The profiles (we may want to create multiple profiles with different weights
for testing purpose) can
dcausse edited blocking tasks, added: T120089: Add an internal completion or
suggestions API to core SearchEngine; removed: T112028: Implement completion
suggester as a Beta Feature.
TASK DETAIL
https://phabricator.wikimedia.org/T78157
EMAIL PREFERENCES
https://phabricator.wikimedia.org
dcausse added a comment.
@aude I can help to write the rescore profiles when you are ready.
Also I realized that the example profiles I wrote in Cirrus are wrong: they use
"multiply" to combine the scores but it makes no sense : `(weight1 * score1) *
(weight2 * score2)`. We might pre
dcausse added a comment.
We can inhibit tf/idf by setting the weight of the main query to 0 and use
either "max" or "add". Note that tf/idf will still play a role to extract the
top-N results that will be rescored. N is 8196*7 (number of shards) so if
shards are well bala
dcausse claimed this task.
TASK DETAIL
https://phabricator.wikimedia.org/T110648
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Sjoerddebruin, EBernhardson, aude, dcausse, Deskana, daniel, Mbch331,
Aklapper, Lydia_Pintscher, Wikidata
dcausse moved this task to In progress on the Discovery-Cirrus-Sprint workboard.
TASK DETAIL
https://phabricator.wikimedia.org/T110648
WORKBOARD
https://phabricator.wikimedia.org/project/board/1227/
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To
dcausse moved this task to Needs review on the Discovery-Cirrus-Sprint
workboard.
TASK DETAIL
https://phabricator.wikimedia.org/T110648
WORKBOARD
https://phabricator.wikimedia.org/project/board/1227/
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To
dcausse moved this task to Needs review on the Discovery-Cirrus-Sprint
workboard.
TASK DETAIL
https://phabricator.wikimedia.org/T110648
WORKBOARD
https://phabricator.wikimedia.org/project/board/1227/
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To
dcausse added a comment.
moving back to needs-review as all patches needed in wikidata have been merged.
TASK DETAIL
https://phabricator.wikimedia.org/T110648
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: gerritbot, Sjoerddebruin
dcausse added a comment.
Are we emitting exceptions when the HTTP status is not what we expect, e.g.
404? If yes this is worrisome and we definitely need to look into what entity
and revision is producing such RDF.
TASK DETAIL
https://phabricator.wikimedia.org/T249099
EMAIL PREFERENCES
dcausse added a project: Discovery-Search (Current work).
TASK DETAIL
https://phabricator.wikimedia.org/T249260
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: Addshore, dcausse
Cc: dcausse, Aklapper, DD063520, CBogen, Samantha_Alipio_WMDE
dcausse added a comment.
@DD063520 did you make any modification to the LocalSettings?
I tried a fresh install with a fix for T249496
<https://phabricator.wikimedia.org/T249496> applied and
`updateSearchIndexConfig.php` worked appropriately.
The error `Unknown Similarity type
dcausse edited projects, added Wikidata-Query-Service; removed WDQS-Optimizer.
Restricted Application added a project: Wikidata.
TASK DETAIL
https://phabricator.wikimedia.org/T249196
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc
dcausse claimed this task.
TASK DETAIL
https://phabricator.wikimedia.org/T249196
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Aklapper, dcausse, CBogen, darthmon_wmde, Nandana, Lahi, Gq86,
Lucas_Werkmeister_WMDE, GoranSMilovanovic
dcausse claimed this task.
dcausse added a project: Discovery-Search (Current work).
TASK DETAIL
https://phabricator.wikimedia.org/T248464
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Aklapper, Zbyszko, CBogen, darthmon_wmde, Nandana
dcausse moved this task from In Progress to To Be Deployed on the
Discovery-Search (Current work) board.
dcausse added a comment.
The lag on wdqs1007 has been absorbed much faster than other eqiad nodes.
F31741086: lag_wdqs1007.png <https://phabricator.wikimedia.org/F31741
dcausse triaged this task as "Medium" priority.
dcausse moved this task from needs triage to Wikidata Search on the
Discovery-Search board.
TASK DETAIL
https://phabricator.wikimedia.org/T248365
WORKBOARD
https://phabricator.wikimedia.org/project/board/1849/
EMAIL PREFERENC
dcausse triaged this task as "Medium" priority.
dcausse moved this task from needs triage to Wikidata Search on the
Discovery-Search board.
dcausse added a comment.
Aliases were put in the labels field for performance reasons, we need to
investigated whether it's feasible o
dcausse created this task.
dcausse added projects: Wikidata, Wikidata-Query-Service.
TASK DESCRIPTION
Currently when the check returns: `CHECK_NRPE STATE UNKNOWN: Socket timeout
after 10 seconds. for Query Service HTTP Port and NaN for WDQS high update lag`
we do not send an alert.
Being is
dcausse closed this task as "Declined".
TASK DETAIL
https://phabricator.wikimedia.org/T239397
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Lucas_Werkmeister_WMDE, dcausse, Aklapper, darthmon_wmde, Nandana, L
dcausse added a comment.
In T244341#6062795 <https://phabricator.wikimedia.org/T244341#6062795>,
@Dipsacus_fullonum wrote:
> Yes, `isLiteral` should still work for properties where the real values
are literals. Without knowing the internal workings of Blazegraph I would guess
dcausse added a comment.
In T244341#6064237 <https://phabricator.wikimedia.org/T244341#6064237>,
@Dipsacus_fullonum wrote:
> Many queries use the optimizer hint `hint:Prior hint:rangeSafe true. ` when
e.g. comparing date or number values with constants in a filter as suggested
dcausse added a comment.
In T242453#6071767 <https://phabricator.wikimedia.org/T242453#6071767>,
@Addshore wrote:
> This just happened again, so depooled and restarted 1006, and switched
traffic over to codfw.
> Seems to always be 1006?
I don't think so, it happe
dcausse removed a parent task: T244590: EPIC: Rework the WDQS updater as an
event driven application.
TASK DETAIL
https://phabricator.wikimedia.org/T243603
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: Zbyszko, dcausse
Cc: Aklapper, Zbyszko
dcausse removed a subtask: T243603: Create a way to deploy WDQS artifacts to
Archiva with Jenkins.
TASK DETAIL
https://phabricator.wikimedia.org/T244590
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: revi, Mholloway, Ladsgroup
dcausse closed this task as a duplicate of T246568: Deepcategory returns only
very few results.
TASK DETAIL
https://phabricator.wikimedia.org/T228348
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Smalyshev, Mathew.onipe, Gehel
dcausse added a comment.
merged in T246568 <https://phabricator.wikimedia.org/T246568> which is where
we'll announce that the full reload has been done.
TASK DETAIL
https://phabricator.wikimedia.org/T228348
EMAIL PREFERENCES
https://phabricator.wikimedia.org/set
dcausse added a comment.
@ArielGlenn no not yet, this is still blocked on T243292
<https://phabricator.wikimedia.org/T243292> which requires some investigation
to determine which component (dump or the wdqs transformation process) is wrong.
TASK DETAIL
https://phabricator.wikimed
dcausse claimed this task.
dcausse added a project: Discovery-Search (Current work).
TASK DETAIL
https://phabricator.wikimedia.org/T245728
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Gehel, Zbyszko, Aklapper, dcausse, CBogen
dcausse triaged this task as "Medium" priority.
TASK DETAIL
https://phabricator.wikimedia.org/T245728
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Gehel, Zbyszko, Aklapper, dcausse, CBogen, darthmon_wmde, Nandana, L
dcausse triaged this task as "Medium" priority.
TASK DETAIL
https://phabricator.wikimedia.org/T248464
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Aklapper, Zbyszko, Blissjay007, Oblanco79, Alter-paule, Beast1978, CBog
dcausse triaged this task as "High" priority.
TASK DETAIL
https://phabricator.wikimedia.org/T250140
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: William_Avery, Aklapper, Addshore, dcausse, darthmon_wmde, Nandana, L
dcausse added a parent task: T251149: [epic] Ryan's onboarding to the Search
Platform team.
TASK DETAIL
https://phabricator.wikimedia.org/T250140
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: William_Avery, Aklapper, Addshore, dc
dcausse triaged this task as "High" priority.
dcausse added a comment.
Raising to high, this issue might be hard to solve as it sounds related to
the blazegraph design flaw of running with unbounded thread pools.
We might perhaps at least try to add some debugging code to i
dcausse created this task.
dcausse added a project: Wikidata-Query-Service.
Restricted Application added a subscriber: Aklapper.
Restricted Application added a project: Wikidata.
TASK DESCRIPTION
AC:
- the streaming updater should produce its events to kafka
- the events should remain
dcausse added a parent task: T244590: EPIC: Rework the WDQS updater as an event
driven application.
TASK DETAIL
https://phabricator.wikimedia.org/T251270
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Aklapper, dcausse, darthmon_wmde
dcausse triaged this task as "Medium" priority.
dcausse added a project: Discovery-Search (Current work).
TASK DETAIL
https://phabricator.wikimedia.org/T251270
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Aklapper, dcaus
dcausse added a subtask: T251270: The streaming updater should produce its
events to kafka.
TASK DETAIL
https://phabricator.wikimedia.org/T244590
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: revi, Mholloway, Ladsgroup, Multichill
dcausse created this task.
dcausse added a project: Wikidata-Query-Service.
Restricted Application added a subscriber: Aklapper.
Restricted Application added a project: Wikidata.
TASK DESCRIPTION
AC:
- a component running close to blazegraph should read the content produced in
T251270
dcausse added a subtask: T251275: Add a new updater component to update
blazegraph based on the content present in the streaming updater output kafka
stream.
TASK DETAIL
https://phabricator.wikimedia.org/T244590
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel
dcausse added a parent task: T244590: EPIC: Rework the WDQS updater as an event
driven application.
TASK DETAIL
https://phabricator.wikimedia.org/T251275
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: dcausse, Aklapper, darthmon_wmde
dcausse renamed this task from "Add a new updater component to update
blazegraph based on the content present in the streaming updater output kafka
stream" to "Update blazegraph based on the content present in the streaming
updater output kafka stream".
dcausse updated
dcausse created this task.
dcausse added a project: Wikidata-Query-Service.
Restricted Application added subscribers: Strainu, Cosine02, revi, Aklapper.
Restricted Application added a project: Wikidata.
TASK DESCRIPTION
Some sitelinks are missing for Q5084390.
at the time of writing
https
dcausse updated the task description.
TASK DETAIL
https://phabricator.wikimedia.org/T251387
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Aklapper, revi, Cosine02, Strainu, dcausse, darthmon_wmde, Nandana, Lahi,
Gq86
dcausse updated the task description.
TASK DETAIL
https://phabricator.wikimedia.org/T251387
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Aklapper, revi, Cosine02, Strainu, dcausse, darthmon_wmde, Nandana, Lahi,
Gq86
dcausse added a comment.
@Peter_James thanks! The current update strategy assumes that entity <>
sitelink pairs are unique and thus when a sitelink is removed it blindly
assumes that it's not used elsewhere. Not doing so would require a much more
costly update process that wo
dcausse added a comment.
I think the best approach here is to wait for the cleanup in T249613
<https://phabricator.wikimedia.org/T249613> and its report then make sure that
true duplicates are removed and then schedule a new full reload of all the
servers.
In the meantime ite
dcausse triaged this task as "High" priority.
TASK DETAIL
https://phabricator.wikimedia.org/T251387
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Peter_James, Aklapper, revi, Cosine02, Strainu, dcausse, darthmon_wmde,
Nan
dcausse added a comment.
@Multichill the discussion
<https://www.wikidata.org/wiki/Wikidata:Contact_the_development_team/Query_Service_and_search#Blank_node_deprecation_in_WDQS_&_Wikibase_RDF_model>
seems to have stalled. Thanks to Peter the pros and cons has been well
summari
dcausse renamed this task from "Wikibase RDF dump: stop using blank nodes for
encoding SomeValue and OWL constraints" to "Stop using blank nodes for encoding
SomeValue and OWL constraints in WDQS".
dcausse updated the task description.
TASK DETAIL
https://phabricator.w
dcausse added a comment.
In T244341#6097277 <https://phabricator.wikimedia.org/T244341#6097277>, @Pfps
wrote:
> I don't understand why it was considered necessary to make a breaking
change the RDF dump to improve WDQS performance when there is a solution that
does not m
dcausse updated the task description.
TASK DETAIL
https://phabricator.wikimedia.org/T245541
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Addshore, Aklapper, Lucas_Werkmeister_WMDE, mkroetzsch, Daniel_Mietchen,
Jheald, dcausse
dcausse updated the task description.
TASK DETAIL
https://phabricator.wikimedia.org/T245541
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Addshore, Aklapper, Lucas_Werkmeister_WMDE, mkroetzsch, Daniel_Mietchen,
Jheald, dcausse
dcausse added a comment.
happened a couple of times on a test run:
FailedOp(FullImport(Q93246620,2020-05-04T12:57:49Z,1173447691),org.wikidata.query.rdf.tool.exception.ContainedException:
Didn't get a revision id for [])
FailedOp(FullImport(Q12439094,2020-05-04T13:2
dcausse closed this task as "Declined".
dcausse added a comment.
checkpointing works as expected now
TASK DETAIL
https://phabricator.wikimedia.org/T249097
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: dcausse, Aklappe
dcausse closed subtask T249097: [WDQS Streaming Updater] Fix pipeline
checkpointing as "Declined".
TASK DETAIL
https://phabricator.wikimedia.org/T244590
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: revi, Mholloway,
dcausse claimed this task.
dcausse triaged this task as "Medium" priority.
dcausse added a project: Discovery-Search (Current work).
TASK DETAIL
https://phabricator.wikimedia.org/T251275
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To:
dcausse added a comment.
In T249260#6112298 <https://phabricator.wikimedia.org/T249260#6112298>,
@DD063520 wrote:
> Is there a way to reindex?
If the index already exists perhaps forcing a reindex might help.
For this you need to run:
`php updateSearchIndexC
dcausse added subscribers: JAllemandou, dcausse.
dcausse closed this task as "Declined".
dcausse added a comment.
Closing this, @JAllemandou has done plenty of work on this already.
TASK DETAIL
https://phabricator.wikimedia.org/T169798
EMAIL PREFERENCES
https://phabricator.wik
dcausse closed subtask T169798: Create UDFs for analyzing SPARQL queries as
"Declined".
TASK DETAIL
https://phabricator.wikimedia.org/T143819
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Andrawaag, Esc3300, JAllemandou, mpop
dcausse added a comment.
In T244341#6113236 <https://phabricator.wikimedia.org/T244341#6113236>,
@Lucas_Werkmeister_WMDE wrote:
>> Is anyone proposing a change to Wikibase (or Wikidata)?
>
> Yes – the goal is that the RDF in the query service, the RDF dumps,
dcausse added a comment.
In T244341#6124638 <https://phabricator.wikimedia.org/T244341#6124638>, @Pfps
wrote:
> If 'unskolemizing' is a trivial step then that should be implemented by
WDQS, instead of pushing it to every consumer (including indirect consumers) of
W
dcausse added a comment.
In T244341#6124894 <https://phabricator.wikimedia.org/T244341#6124894>, @Pfps
wrote:
> I was completely unaware that WDQS is so integrated into the inner workings
of Wikidata. Where is this described? Was this mentioned in the announcement
of the
dcausse added a comment.
In T244341#6129321 <https://phabricator.wikimedia.org/T244341#6129321>, @Pfps
wrote:
> I thus view it misleading to state in this Phabricator ticket that
"performance issues [of the WDQS] cause edits on wikidata to be throttled",
which gives
dcausse assigned this task to Zbyszko.
dcausse added a project: Discovery-Search (Current work).
TASK DETAIL
https://phabricator.wikimedia.org/T243292
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: Zbyszko, dcausse
Cc: Mahir256, Physikerwelt
dcausse assigned this task to EBernhardson.
dcausse closed this task as "Resolved".
TASK DETAIL
https://phabricator.wikimedia.org/T230754
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: EBernhardson, dcausse
Cc: Aklapper, Gehel, Smalysh
dcausse added a comment.
The munger should exclude rdf:type statement by default:
SELECT ?o {
wd:M19705716 a ?o .
}
returns :
schema:ImageObject
schema:MediaObject
wikibase:Mediainfo
similar query on query.wikidata.org do not return such statements
dcausse created this task.
dcausse added projects: Wikidata-Query-Service, Analytics.
Restricted Application added a subscriber: Aklapper.
Restricted Application added a project: Wikidata.
TASK DESCRIPTION
Generating the initial state of the wdqs streaming update requires parsing
the TTL dumps
dcausse added a subtask: T253753: Increase retention for
mediawiki.revision-create on the kafka jumbo cluster.
TASK DETAIL
https://phabricator.wikimedia.org/T244590
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: revi, Mholloway
dcausse added a parent task: T244590: EPIC: Rework the WDQS updater as an event
driven application.
TASK DETAIL
https://phabricator.wikimedia.org/T253753
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: dcausse
Cc: Ottomata, dcausse, Aklapper, CBogen
dcausse added a comment.
@ArielGlenn we plan to make a subtle change to the dump (prefixes), this
won't be technically a breaking change but could cause some confusion if users
start to assume the presence of some prefixes. Would it be possible to pause
the publication of the dumps whi
dcausse added a comment.
Just a note on the current problem:
the prefixes defined in ttl dumps are identical to the ones used by wikidata
e.g.:
@prefix wdt: <http://commons.wikimedia.org/prop/direct/> .
This is perfectly valid but might cause some confusions because when
dcausse created this task.
dcausse added projects: WikibaseMediaInfo, Wikidata.
Restricted Application added a subscriber: Aklapper.
TASK DESCRIPTION
Currently the RDF output of commons is using the same set of prefixes as the
ones used by wikidata. This is confusing as someone reading
dcausse added a subscriber: CBogen.
dcausse added a comment.
Looks like it was decided not to use wikidata specific prefixes for MediaInfo
exports but uses a more specific `sdc` for these (see: T222995
<https://phabricator.wikimedia.org/T222995>).
The code does still look to be har
dcausse added a comment.
@WMDE-leszek oops, sorry I replied before reading you comment and was reading
an old code base... if this is just a config change it can hopefully be merged
soon. Thanks!
TASK DETAIL
https://phabricator.wikimedia.org/T221917
EMAIL PREFERENCES
https
dcausse closed this task as "Declined".
dcausse added a comment.
Wikibase has now a way to override the default namespaces, for commons it
should happen thanks to
https://gerrit.wikimedia.org/r/c/operations/mediawiki-config/+/569260 .
TASK DETAIL
https://phabricator.wikimedia.o
dcausse added a comment.
@JAllemandou I think that is an option as well, the thing is that is it is
transitional to help to bootstrap a test of the full pipeline. In the end we
won't be using jumbo and thus won't be able to rely on a 30days retention on
main so hopefully we
201 - 300 of 1262 matches
Mail list logo