Lucas_Werkmeister_WMDE created this task.
Lucas_Werkmeister_WMDE added projects: Wikidata, Wikidata wb_terms Trailblazing.
Restricted Application added a subscriber: Aklapper.

TASK DESCRIPTION
  When trying to store terms longer than 255 bytes in the new normalized term 
store (in real-world data, these are typically descriptions in non-Latin 
scripts), a fail-safe exception is thrown in 
`ReplicaMasterAwareRecordIdsAcquirer`: the term text is implicitly truncated by 
the database (cf. T108255 <https://phabricator.wikimedia.org/T108255>), and 
then a subsequent select with `wbx_text = 'untruncated term text'` can’t find 
it. (If `ReplicaMasterAwareRecordIdsAcquirer` didn’t detect this case and throw 
a fail-safe exception, it would continue to attempt acquiring an ID for the 
same text in an infinite loop.)
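
  The failure mode described above can be reproduced in miniature. The following is only a sketch in Python, not Wikibase code: `store_truncating` and `find_exact` are hypothetical stand-ins for the insert and the follow-up select, and the sketch assumes the database cuts the value at 255 bytes without raising an error.

```python
MAX_BYTES = 255  # limit named in the task description


def store_truncating(table: set, text: str) -> None:
    """Hypothetical stand-in for an insert that silently truncates to MAX_BYTES."""
    table.add(text.encode("utf-8")[:MAX_BYTES])


def find_exact(table: set, text: str) -> bool:
    """Hypothetical stand-in for a select on wbx_text = <untruncated text>."""
    return text.encode("utf-8") in table


table = set()

# 200 Cyrillic letters take 2 bytes each in UTF-8, so 400 bytes in total.
long_description = "я" * 200
store_truncating(table, long_description)

# The stored value is the 255-byte prefix, so the exact-match lookup misses.
found_long = find_exact(table, long_description)

# A short term fits into the column and is found as expected.
store_truncating(table, "label")
found_short = find_exact(table, "label")
```

  In this sketch `found_long` is false although the insert appeared to succeed, which is the situation in which an acquirer without the fail-safe check would keep retrying.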
  
  We need to fix this somewhere between `DatabaseTermIdsAcquirer` and 
`ReplicaMasterAwareRecordIdsAcquirer`: callers of the `TermIdsAcquirer` 
interface shouldn’t be expected to truncate the terms to some store-specific 
length.
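
  One possible direction, sketched in Python under the assumption that the store decides to shorten over-long terms itself before both the insert and the lookup: `truncate_utf8` is a hypothetical helper, not an existing Wikibase function, and the task does not state which fix was chosen. The point is that insert and select then see the same text, and that the cut falls on a character boundary.

```python
def truncate_utf8(text: str, max_bytes: int = 255) -> str:
    """Shorten text so that its UTF-8 encoding fits into max_bytes.

    Hypothetical helper: a multi-byte character that would be split by
    the byte limit is dropped entirely instead of being stored half-way.
    """
    encoded = text.encode("utf-8")
    if len(encoded) <= max_bytes:
        return text
    # errors="ignore" discards the incomplete byte sequence left at the end.
    return encoded[:max_bytes].decode("utf-8", errors="ignore")


short_term = truncate_utf8("label")
ascii_term = truncate_utf8("a" * 300)
cyrillic_term = truncate_utf8("я" * 200)  # 2 bytes per character
```

  Here `cyrillic_term` ends up with 127 characters (254 bytes), because the 255th byte would be the first half of another character. Whether terms should be truncated at all, or rejected instead, is a separate design decision that this sketch does not settle.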

TASK DETAIL
  https://phabricator.wikimedia.org/T225805

To: Lucas_Werkmeister_WMDE
Cc: Lucas_Werkmeister_WMDE, Aklapper, darthmon_wmde, alaa_wmde, Nandana, Lahi, 
Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, 
Wikidata-bugs, aude, Lydia_Pintscher, Mbch331