Re: Parch for CELI services update (added support for morphology analysis for Swedish, added engine for Sentiment Analysis for French and Italian)

Alessio Bosca Wed, 09 Jan 2013 01:44:11 -0800

Hi All,

I forgot to mention that the svn diff patch is created on the base ofstanbol-nlp-processing branch


Alessio

On 01/09/2013 10:22 AM, Alessio Bosca wrote:

Hi Rupert,
yesterday I updated the morphological Analysis service (We addedsupport for the sv language, fixed the umlauts issue for German andunified all the POS tagsets for the supported languages)Therefore currently the pos tags returned by the web service are notcoherent with the ones declared in the postag mappings in theenhancement engine. For this reasons the tests on the morphologicalengine should be failing.Concerning the umlauts (and /ß character)/, they are now recognized bythe system but currently the lemma produced by the morphologicalanalyzer converts them to sequences of characters (e.g. /ö -> oe, //ß-> ss)./In the future version of the system I would like to include both lemmawritings/.
/
The language identifier has been updated as well (adding a few newlanguages, now the list of supported languages includesen,fr,de,hu,pl,it,es,pt,el,et,lv,tr,pt,ru,ar,ro,da) without changesfor the engines
The patch you can find in the attachment contains the fixes for themorphological service updates (sv language addition and new pos tagmapping, one for all the languages)The patch also contains the client and the test classes for thesentiment analysis engine supporting fr and it
Let me know if you have any problem integrating the patch

Alessio

On 01/07/2013 05:44 PM, Alessio Bosca wrote:
Hi Rupert,

sure tomorrow I'll have a look into that and let you know

bests
    Alessio

On 01/04/2013 01:07 PM, Rupert Westenthaler wrote:
Hi Alessio, all

Thanks for looking into that. However with the Jenking build #1200
there is still one remaining issue
tesetEngine(org.apache.stanbol.enhancer.engines.celi.langid.impl.CeliLanguageIdentifierEnhancementEngineTest)
  Time elapsed: 0.296 sec  <<< FAILURE!
junit.framework.ComparisonFailure: The detected language for text
'Brigitte Bardot, née  le 28 septembre 1934 à Paris, est une actrice
de cinéma et chanteuse française.' MUST BE 'fr' expected:<[f]r> but
was:<[a]r>
         at junit.framework.Assert.assertEquals(Assert.java:100)
atorg.apache.stanbol.enhancer.engines.celi.langid.impl.CeliLanguageIdentifierEnhancementEngineTest.tesetEngine(CeliLanguageIdentifierEnhancementEngineTest.java:101)
Looks like the Language Identification engine detects the language
Arabic for the French text example. I am also able to reproduce this
issue locally.

Can you have a look into that?
best
Rupert
On Fri, Jan 4, 2013 at 10:03 AM, Alessio Bosca<[email protected]> wrote:
Hi Rupert,
thanks for the feedback, there was a problem with the the accesscontrol for
anonymous users in the services. Now it has been fixed .
PS: Next week I'll send you a patch for CELI engines with thesentiment
analysis and bug fixes for the German umlauts.
Sorry for the delay in the release.

Bests,
     Alessio

--
*************************************
Alessio Bosca, Ph.D.
CELI s.r.l.
Via San Quintino 31
10121 Torino
Tel. +39 011.562.71.15
Fax +39 011.506.40.86
http://www.celi.it
*************************************
--
*************************************
Alessio Bosca, Ph.D.
CELI s.r.l.
Via San Quintino 31
10121 Torino
Tel. +39 011.562.71.15
Fax +39 011.506.40.86
http://www.celi.it
*************************************



--
*************************************
Alessio Bosca, Ph.D.
CELI s.r.l.
Via San Quintino 31
10121 Torino
Tel. +39 011.562.71.15
Fax +39 011.506.40.86
http://www.celi.it
*************************************

Re: Parch for CELI services update (added support for morphology analysis for Swedish, added engine for Sentiment Analysis for French and Italian)

Reply via email to