[Corpora-List] Re: Multilingual dictionary of phonetic spelling

Kilian Evang Thu, 15 Sep 2022 07:17:35 -0700

Hi Luis,

Another resource you might want to look into is WikiPron:


https://github.com/kylebgorman/wikipron

Cheers,
Kilian

Am Do., 15. Sept. 2022 um 15:58 Uhr schrieb Gilles Sérasset <
[email protected]>:

> Hi Luis,
>
> Don’t know if this could be useful to you, but currently, the DBnary
> dataset contains phonetic (IPA) transcription of many entries.
>
> DBnary is linked data and can be explored through its public endpoint
> using SPARQL language: http://kaiko.getalp.org/sparql
>
> For instance the following query will tell you how many phonetic reps are
> available in which languages.
>
> select ?lang count(?pr) where {
>   [] ontolex:phoneticRep ?pr.
>   BIND (lang(?pr) as ?lang)
> }
> GROUP BY ?lang ORDER BY DESC(COUNT(?pr))
>
> This will give you a long table (I only include the first lines (results
> are order on the number of phoneticRep).
>
> langcallret-1
>
> fr-fonipa
>
> 2657875
>
> en-fonipa
>
> 663697
>
> ru-fonipa
>
> 389891
>
> de-fonipa
>
> 230875
>
> fi-fonipa
>
> 199269
>
> es-fonipa
>
> 187090
>
> la-fonipa
>
> 171134
>
> it-fonipa
>
> 154881
>
> pl-fonipa
>
> 136446
>
> sh-fonipa
>
> 116478
>
> pt-fonipa
>
> 90199
>
> ca-fonipa
>
> 86385
>
> eo-fonipa
>
> 84626
>
> avk-fonipa
>
> 73459
>
> es-ipa
>
> 72652
>
> vi-fonipa
>
> 72147
>
> As the data is continuously extracted from wiktionaries, the numbers will
> evolve (and as several language extractors do not yet extract the phonetic
> representation, feel free to file a feature request on DBnary bug tracker).
>
> More info at :
>
> http://kaiko.getalp.org/about-dbnary/
>
> Regards,
>
> Gilles,
>
>
> On 7 Sep 2022, at 16:26, Luis Camacho Caballero <[email protected]>
> wrote:
>
> Dear colleagues
>
> I'm devoted to the revitalization and massification of the Andean
> Amazonian native language with computational processing as a key enabler.
>
> Among the many tasks to do, nowadays I'm dealing with the creation of
> neologisms. That is why I'm looking for the larger multilingual dictionary
> of phonetic spelling, even better if that database includes asian languages
> (mandarin, japanese, korean, hindi, urdu, etc).
>
> If you have this kind of database, I kindly ask you for bring me access,
> if you don't, I'd appreciate any clue about where and/or how access to it
>
> Kind regards
>
> Luis Camacho <https://orcid.org/0000-0001-6569-550X>
>
>
> ------------------------------
>
>
> _______________________________________________
> Corpora mailing list -- [email protected]
> https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
> To unsubscribe send an email to [email protected]
>
>
> _______________________________________________
> Corpora mailing list -- [email protected]
> https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
> To unsubscribe send an email to [email protected]
>

_______________________________________________
Corpora mailing list -- [email protected]
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to [email protected]

[Corpora-List] Re: Multilingual dictionary of phonetic spelling

Reply via email to