Re: [Wikisource-l] Automatic text tagging and annotation

2020-07-28 Thread Bodhisattwa Mandal
Amir,

Coming from a community with not much volunteer force, I actually want any
strategy which involves minimal human interference into the tagging
process, as we can't afford to spread our thin line.

Your first option looks more inclined to what I was trying to say. However,
I understand that there will be possibilities of errors or ambiguities and
need some level of human check system anyway.

Personally, I would love the third option but looks like it requires more
engineering than the other two, forgive me if I am wrong. So considering
the lack of initiatives in this area in the past, I would stick to the
first one as a more practical approach for now.

Regards,
Bodhisattwa


On Wed, Jul 29, 2020, 02:18 Amir E. Aharoni 
wrote:

> Do you mean that templates (or some other annotation syntax) will be added
> to wikitext, just not by humans?
>
> Or suggested by software, and added to wikitext after being confirmed by
> humans?
>
> Or not added to wikitext at all and stored separately somewhere?
>
> בתאריך יום ג׳, 28 ביולי 2020, 18:36, מאת Bodhisattwa Mandal ‏<
> bodhisattwa.rg...@gmail.com>:
>
>> Hello,
>>
>> I would like to know if any Wikisource community has moved forward to
>> *automatically[1]* tag or annotate Wikisource texts or has any plans to
>> do so.
>>
>> Regards,
>> Bodhisattwa
>>
>> [1] (without manually adding annotation templates)
>> ___
>> Wikisource-l mailing list
>> Wikisource-l@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/wikisource-l
>>
> ___
> Wikisource-l mailing list
> Wikisource-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikisource-l
>
___
Wikisource-l mailing list
Wikisource-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l


Re: [Wikisource-l] Automatic text tagging and annotation

2020-07-28 Thread Nicolas VIGNERON
Hi all,

Some tools exists outside Wikisource, for instance I know
https://www.textrazor.com/demo who find the Qid of words in a text (very
good quality but proprietary) or https://ordia.toolforge.org/text-to-lexemes
(crude but open, based on SPARQL and for Lexemes), that can generate
annotation on the fly.
It's not easy (there is a lot of questions) but I'm confident that some
things are doable (at least as a POC).

Cheers, ~nicolas
___
Wikisource-l mailing list
Wikisource-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l


Re: [Wikisource-l] Automatic text tagging and annotation

2020-07-28 Thread Amir E. Aharoni
Do you mean that templates (or some other annotation syntax) will be added
to wikitext, just not by humans?

Or suggested by software, and added to wikitext after being confirmed by
humans?

Or not added to wikitext at all and stored separately somewhere?

בתאריך יום ג׳, 28 ביולי 2020, 18:36, מאת Bodhisattwa Mandal ‏<
bodhisattwa.rg...@gmail.com>:

> Hello,
>
> I would like to know if any Wikisource community has moved forward to
> *automatically[1]* tag or annotate Wikisource texts or has any plans to
> do so.
>
> Regards,
> Bodhisattwa
>
> [1] (without manually adding annotation templates)
> ___
> Wikisource-l mailing list
> Wikisource-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikisource-l
>
___
Wikisource-l mailing list
Wikisource-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l


Re: [Wikisource-l] Automatic text tagging and annotation

2020-07-28 Thread Asaf Bartov
Indeed a very interesting direction.  I suggested it during the Wikisource
Conference in 2016 (Vienna) as one distinguishing feature Wikisource
*could* develop to differentiate itself from various other text
repositories like Project Gutenberg, HathiTrust, etc.  But WMF is not ready
to allocate engineering resources to this, and there hasn't been a
volunteer attempt, as far as I know.

   A.

Asaf Bartov (he/him/his)

Senior Program Officer, Emerging Wikimedia Communities

Wikimedia Foundation 

Imagine a world in which every single human being can freely share in the
sum of all knowledge. Help us make it a reality!
https://donate.wikimedia.org


On Tue, Jul 28, 2020 at 9:58 PM Bodhisattwa Mandal <
bodhisattwa.rg...@gmail.com> wrote:

> Thanks Nicolas.
>
> Nemo, for now, any persons, places, creative works, events etc. mentioned
> in the Wikisource texts and have Wikidata items.
>
> Regards,
> Bodhisattwa
>
> On Wed, Jul 29, 2020, 00:17 Federico Leva (Nemo) 
> wrote:
>
>> What kind of tagging and annotation do you have in mind?
>>
>> Federico
>>
> ___
> Wikisource-l mailing list
> Wikisource-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikisource-l
>
___
Wikisource-l mailing list
Wikisource-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l


Re: [Wikisource-l] Automatic text tagging and annotation

2020-07-28 Thread Bodhisattwa Mandal
Thanks Nicolas.

Nemo, for now, any persons, places, creative works, events etc. mentioned
in the Wikisource texts and have Wikidata items.

Regards,
Bodhisattwa

On Wed, Jul 29, 2020, 00:17 Federico Leva (Nemo)  wrote:

> What kind of tagging and annotation do you have in mind?
>
> Federico
>
___
Wikisource-l mailing list
Wikisource-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l


Re: [Wikisource-l] Automatic text tagging and annotation

2020-07-28 Thread Federico Leva (Nemo)
What kind of tagging and annotation do you have in mind?

Federico

___
Wikisource-l mailing list
Wikisource-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l


Re: [Wikisource-l] Automatic text tagging and annotation

2020-07-28 Thread Nicolas VIGNERON
Hi Bodhi,

I'm interested to know the answer too. There are a lot of untapped
potentials there but no real plans that I know of.

I'm cc-ing this to C. Scott Ananian who did a presentation on a related
subject during the last Wikimania (
https://wikimania.wikimedia.org/wiki/2019:Transcription/A_general_annotation_service
not automated tho... but as far as I know, this is the closest to you're
idea) and maybe you could provide some answers.

Cheers,
~nicolas

Le mar. 28 juil. 2020 à 17:36, Bodhisattwa Mandal <
bodhisattwa.rg...@gmail.com> a écrit :

> Hello,
>
> I would like to know if any Wikisource community has moved forward to
> *automatically[1]* tag or annotate Wikisource texts or has any plans to
> do so.
>
> Regards,
> Bodhisattwa
>
> [1] (without manually adding annotation templates)
> ___
> Wikisource-l mailing list
> Wikisource-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikisource-l
>
___
Wikisource-l mailing list
Wikisource-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l


[Wikisource-l] Automatic text tagging and annotation

2020-07-28 Thread Bodhisattwa Mandal
Hello,

I would like to know if any Wikisource community has moved forward to
*automatically[1]* tag or annotate Wikisource texts or has any plans to do
so.

Regards,
Bodhisattwa

[1] (without manually adding annotation templates)
___
Wikisource-l mailing list
Wikisource-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l