Getting working inflection rules for even a single language is a major
task, and doing so for several hundred languages would be a
overwhelming task.  I can't see how this can be implemented as part of
the Wikidata project within a reasonable time frame.

There is a few shortcuts that can be made, and it is possible to make
some generalized tools. For an open source alternative take a look at
Apertium (http://en.wikipedia.org/wiki/Apertium). Usually it is only
the generation/disambiguation phase that is necessary, and this makes
the task somewhat simpler, but it is still a major undertaking.

Note that some of the basic tools already exist, we only need to
interface them to Mediawiki, but the tools needs definition files to
work (that is inflection rules for Northern Sami language for example,
or Norwegian bokmål and nynorsk, or Swedish) and it is those
definitions that is the major task.

John

On Thu, Oct 18, 2012 at 11:14 AM, Nikola Smolenski <[email protected]> wrote:
> On 18/10/12 11:06, John Erling Blad wrote:
>>
>> well-formed text automatically. One of the more common problems are
>> names that uses different inflection rules due to context and how they
>> are written. Such inflection rules are not part of the Wikidata
>> project and will probably be a major undertaking in itself.
>
>
> Why do you think that inflection rules will not be a part of Wikidata? They
> would be hugely needed on Wiktionary and there is no reason for Wikidata not
> being able to contain them.
>
>
> _______________________________________________
> Wikitech-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l

_______________________________________________
Wikitech-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Reply via email to