Hello,

As Tino said, the problem associated with PCRE is actually a problem with
the regexp-trules.txt file, I thought that maybe the problem would be
because of I was using the en-es pair languages, and today I have tried
again to follow the guide but using this time the es-ca pair, and I got the
same error.

It seems is a problem with the command:
$ apertium-xtract-regex-trules trules.xml > regexp-trules.txt

This is the guide I'm following is this
http://wiki.apertium.org/wiki/Target-language_tagger_training

After use this command it works just if you escape the character #, and
also using just the t1x file and it seems works, I don't know If escape
manually that character is normal or if I'm doing something wrong

Gema, about the mockups I have been talking with Fran about some doubts I
had and I have the early versions of the input file UI, corpus tagger UI,
performance measure .prob UI and the TXS files manager UI.

Instead of do them with a mockup tool, I have implemented a dummy
interfaces with glade and launched with python because I think could be
better have them as we will have them at the end

The first one is really simple it is the *Input corpus file UI*:

http://imgur.com/PiZgmwa

How it works:

   - You are able to select a corpus file
   - Modify (or directly enter a new corpus from scratch) the actual corpus
   - Once you finish your edition you click on the apply button and it
   shows the corpus tagger UI


Here you can see the 2 interfaces versions to manage the manual supervision
*corpus tagger UI*:

   - First version you have the corpus and go word by word checking the
   correct word depending of the properties:
   http://imgur.com/hOXb9WK


   - Second version (version proposed by Fran) you have just the corpus
   tagged to be disambiguated, highlighed the ambiguous words and you choose
   the correct one by click on them, and let in the mouse hover the word you
   have more info about the preferences:
   http://imgur.com/5G2mqOl

The difference between them is that the first lets you introduce a corpus
file and modify it in the text view using the buttons, and the second one
let you change the ambigueti on the textview

Both design have 2 buttons, one to finish the supervision and the other to
lunch the performance mesure .prob files

For the *performance measure* .prop file UI I have this in mind:
http://imgur.com/RnSiZ7G

How it works:

   - First of all we introduce the tagged corpus base in the left text view
   (if you came from the previous window you already have it inserted)
   - Now you choose your .prob file
   - Some how (maybe by selecting or automaticatly) some part of the tagged
   corpus is selected to train the .prob file
   - Once all is set up you click on the "measure" button to measure the
   performance
   - When the measure is done, on the bottom left you get the accuracy and
   the information about the performance
   - After the measure it shows as well the output generated using the
   .prob file to see where the system has failed tagging

For the TSX management file I have the next idea (this is still in
dicussion with Fran as well because maybe it isn't as usefull as we
expected):

http://imgur.com/WZFEZLP

How it works:

   - First you choose a TSX file
   - it shows all the tags on it, with the name and every item
   - On the second column it says if it is closed or not
   - Also you are able to add new labels and items (the items before you
   have to select the parent label)
   - Once you finish the edition you click on save button

I still have to refine with Fran the Grammar contraint interface and how to
manage the multi word, I expect get them before Monday, but I think we can
start discuss with this interfaces and when I get the others follow with
the others

Regards, and sorry about the length of the email :P
------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to