Thadguidry added a comment.
Hi @AndySeaborne What is the latest benchmarks for loading Wikidata all and
truthy with Jena 4.4.0 release annd the new TDB2 xloader with "--threads"
argument? I noticed the release notes said this:
> == Improved bulk loader
>
> This r
Thadguidry added a comment.
@BenAtOlive I think for bikeshedding or hand-waving discussions, you can just
start an new discussion thread in Oxigraph's GitHub Discussions (not Issues).
Here: https://github.com/oxigraph/oxigraph/discussions
TASK DETAIL
https://phabricator.wikimedi
Thadguidry added a comment.
As someone who has "been there, done that" (even with Apache Geode)... I can
tell you that **data locality** is very important when you want to maximize
performance. But if the data is maintained as distributed, then the only way
to squeeze ou
Thadguidry added a comment.
@Addshore That's what I figured. :-) This issue did feel old and sort of in
a dustbin. Agree it should be closed.
TASK DETAIL
https://phabricator.wikimedia.org/T220823
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences
Thadguidry added a comment.
@Tpt Looks great! The ROADMAP file was a suggested alternative to the
Milestones, sorry didn't make that clear. I much prefer grouping or tagging
issues against Milestones as you have done! You have the right idea regarding
a single source of truth and ex
Thadguidry added a comment.
Hi @Tpt Can you elaborate more in your Milestones and create more Milestone
as necessary for your future vision? Like what you mean by "no storage format
stability for now", and what that really means to users and what you are
thinking about in the
Thadguidry updated the task description.
TASK DETAIL
https://phabricator.wikimedia.org/T289428
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: Thadguidry
Cc: Aklapper, Thadguidry, Invadibot, MPhamWMF, maantietaja, Wilmanbeno, CBogen,
Akuckartz
Thadguidry updated the task description.
TASK DETAIL
https://phabricator.wikimedia.org/T289428
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: Thadguidry
Cc: Aklapper, Thadguidry, Invadibot, MPhamWMF, maantietaja, Wilmanbeno, CBogen,
Akuckartz
Thadguidry created this task.
Thadguidry added projects: Wikidata, CirrusSearch, Elasticsearch.
Restricted Application added a subscriber: Aklapper.
Restricted Application added a project: Discovery-Search.
TASK DESCRIPTION
(Lydia asked that I write this up, just in case)
I thought that
Thadguidry added a comment.
I'd suggest adding **replica shards** (copies of primary shards) that help to
both ensure redundancy to protect against failure, but they also vastly
increase the capacity for read requests such as searching, like Adam's entity
term lookup use case
Thadguidry added subscribers: Tpt, Thadguidry.
Thadguidry added a comment.
+1 for Oxigraph. @TPT has been putting in a ton of good effort, research,
features, and stability. Sponsoring him now in GitHub as well for his effort.
As it's being developed in Rust, it automatically
Thadguidry added a comment.
We'll also want to improve the Help:Ranking page
<https://www.wikidata.org/wiki/Help:Ranking#Deprecated_rank> once this proposal
task is implemented.
TASK DETAIL
https://phabricator.wikimedia.org/T210961
EMAIL PREFERENCES
https://phabricator.wi
Thadguidry added a comment.
Agree generally on this proposals' assertions. It makes sense to from a data
quality perspective, and since we are actively adding new tools to improve our
data quality, then having a new "outdated" rank to represent a "once upon a
time thi
Thadguidry added a comment.
Hi @aidhog Aidan in my opinion I would say "NO, not a good test-case for this
need". And the only reason is this... it's ASCII only (chars <128) and doesn't
let us unsure proper load handling for all data in all languages, multilingual
dat
Thadguidry added a comment.
Someone needs to add a Documentation task to this.
I assume all the new options available and perhaps a reference link to this
ticket would go somewhere in here?
https://www.mediawiki.org/wiki/Wikibase/Indexing/RDF_Dump_Format
TASK DETAIL
https
Thadguidry added a comment.
I'd like to see this made a bit higher priority? It seems it would be fairly
trivial to implement with a good impact.
TASK DETAIL
https://phabricator.wikimedia.org/T219037
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailprefer
Thadguidry added a comment.
To Reproduce:
1. Create a new Lexeme
2. **Lemma:** type `chevrette`
3. **Language of Lemma:** type `cajun` and look at dropdown listing
4. Notice that `Louisiana French` Q3083213 is at the bottom of dropdown list
instead of top of list.
TASK DETAIL
Thadguidry added a comment.
In Freebase, we offered word, phrase, and full (exact match). I think the
wbsearchentities API could offer something similar, although with a slight cost
of indexing.
Besides `name` we also supported `alias{full}`. Using alias: matched both
name and aliases
Thadguidry added a comment.
@Gehel Hi Guillaume Isn't the streaming updater work done now by @dcausse ?
Is it time for your tuning engineers to revisit some of this or not really?
TASK DETAIL
https://phabricator.wikimedia.org/T238362
EMAIL PREFERENCES
https://phabricator.wikimedi
Thadguidry added a comment.
@dcausse Dunno if this might help but could a simple window help or where you
use KeyedProcessFunction
<https://ci.apache.org/projects/flink/flink-docs-stable/dev/stream/operators/process_function.html>
on a KeyedStream? If the stream is unkeyed (or initia
Thadguidry added a comment.
> - the output of this is a simple event without any data saying: do a diff
between rev X and Y, fully delete entity QXYZ, ...
Is that supposed to be "data saving" ?
> rdf diff generation: materialize the command and fetch the data from
w
Thadguidry updated the task description.
TASK DETAIL
https://phabricator.wikimedia.org/T244590
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: Thadguidry
Cc: tfmorris, revi, Mholloway, Ladsgroup, Multichill, darthmon_wmde, Iamamz3,
Smalyshev
Thadguidry added a parent task: T261049: Propagate the error to UX for merge
failure when Lemma's do not exactly match. .
TASK DETAIL
https://phabricator.wikimedia.org/T203643
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: Lea_Lacroix
Thadguidry added a subtask: T203643: Sometimes Special:MergeLexemes gives
summary on target lexeme, and sometimes not.
TASK DETAIL
https://phabricator.wikimedia.org/T261049
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: Thadguidry
Cc: VIGNERON
Thadguidry created this task.
Thadguidry added a project: Wikidata Lexicographical data.
Restricted Application added a project: Wikidata.
TASK DESCRIPTION
**BUG:**
Merge dialog shows continuing...but no error is given to the user when trying
to merge lemma that do not match exactly.
Try
Thadguidry added a comment.
If it helps or is needed, the query that you can use is here:
SELECT ?wd ?wdLabel ?corrName ?schema
{
values (?corr ?corrName)
{(wdt:P2235 "superProp") (wdt:P2236 "subProp") (wdt:P1628 "equivProp")
(wdt:
Thadguidry added a comment.
@Lydia_Pintscher Oops! You forgot to include the main one also !!!
Equivalent Property P1628 <https://phabricator.wikimedia.org/P1628> :-)
TASK DETAIL
https://phabricator.wikimedia.org/T249868
EMAIL PREFERENCES
https://phabricator.wikimed
Thadguidry added a comment.
Is there anything inherently wrong or technically infeasible or undesirable,
if an id used 2 letters? ES45 versus E45
<https://phabricator.wikimedia.org/E45> ?
TASK DETAIL
https://phabricator.wikimedia.org/T214884
EMAIL PREFERENCES
Thadguidry updated the task description.
TASK DETAIL
https://phabricator.wikimedia.org/T237645
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: Thadguidry
Cc: dcausse, Aklapper, Thadguidry, darthmon_wmde, DannyS712, Nandana, Jony,
Prisshahlla, Lahi
Thadguidry added a comment.
Thanks, updated ticket.
TASK DETAIL
https://phabricator.wikimedia.org/T237645
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: Thadguidry
Cc: dcausse, Aklapper, Thadguidry, darthmon_wmde, DannyS712, Nandana, Jony
Thadguidry updated the task description.
TASK DETAIL
https://phabricator.wikimedia.org/T237645
EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: Thadguidry
Cc: dcausse, Aklapper, Thadguidry, darthmon_wmde, DannyS712, Nandana, Jony,
Prisshahlla, Lahi
Thadguidry added a comment.
@dcausse Yes, I mean running a full text search. Fulltext searches are cheap
when you index terms in multiple ways. Why would you not want to index terms
in multiple ways? Freebase was able to leverage this quite easily with
Lucene/Solr indexes and provided
Thadguidry added a comment.
TODO: Just wanted to highlight that once decisions are made... please ensure
to update the Glossary item <https://www.wikidata.org/wiki/Wikidata:Glossary> !
Currently it reads:
> EntitySchema is a special type of Wikidata page containing a document
Thadguidry added a comment.
@dbarratt in the Wikibase ontology I could not find those properties in the
OWL document returned. Sorry, I'm getting caught up with your schema layouts
as fast as I can :-) I expected my parser to retrieve information about their
description, range, domai
Thadguidry added a comment.
Something is amiss with these...not found.
"wikibase": "http://wikiba.se/ontology#";,
"statements": {
"@id": "wikibase:statements"
},
"ident
35 matches
Mail list logo