Hi again,

I ran the apertium-eval-translator on my charlifter-restored Turkish files.
Feels great to have arrived at this point.

Now I have to plan as I try to digest the coding challenge on getting some
more information about the tool so I can align my expectations of the
application/coding challenge process with yours, the mentors'.

After spending some hours with the format-handling section of apertium wiki
and "sf.pl" source code, I don't see where to start yet in order to get the
charlifter to respect superblanks. What might you be looking for with
regards to timeline for completion of these task in the week I have to
submit the proposal, if I am still at the point of still digesting the "
sf.pl" code? How can I arrive at a solid understanding of the different
blocks? And where to look to see how superblanks gum up the gears in this
process...is this in how the training corpus is read in?

What are the hashes and global hash refs? Are these the "parameters" for
character replacement? I see 5 listed at line 287 of "sf.pl", but are there
others like the two word lists given for training a language pack?

Lastly, some Turkish text from the restoration process:

Input Turkish text:

\f0 FBI ajanlar\uc0\u305  ve teknik uzmanlar\u305  kay\u305 p uca\u287
\u305 n bulunmas\u305  cal\u305 smalar\u305 na destek olmalar\u305  icin
Malezya\'92ya g\'f6nderildi. {\field{\*\fldinst{HYPERLINK "
http://www.hurriyet.com.tr/index/abd"}}{\fldrslt
\f1\fs28 \cf4 ABD}}\'92li yetkililer Kuala Lumpur\'92dan Pekin\'92e giden
uca\uc0\u287 \u305 n kaybolmas\u305 nda bir ter\'f6r ba\u287 lant\u305
s\u305  olup olmad\u305 \u287 \u305 n\u305  anlamaya cal\u305 st\u305
klar\u305 n\u305  ac\u305 klad\u305 . \u304 ki yolcunun uca\u287 a cal\u305
nt\u305  pasaportlarla bindi\u287 inin ortaya c\u305 kmas\u305 n\u305 n
ard\u305 ndan yetkililer ter\'f6r ihtimalini de de\u287 erlendirmeye
baslam\u305 st\u305 .\

Output:
\f0 FBI ajanlar\üç0\u305  ve teknik uzmanlar\u305  kay\u305 p uca\u287
\u305 n bulunmas\u305  çal\u305 şmalar\u305 na destek olmalar\u305  için
Malezya\''92ya g\'f6nderildi. {\fi̇eld{\*\fldinst{HYPERLİNK "
http://www.hürriyet.com.tr/index/abd<http://www.xn--hrriyet-n2a.com.tr/index/abd>
"}}{\fldrslt
\f1\fs28 \çf4 ABD}}\''92lı yetkililer Kuala Lümpür\''92dan Pekin\''92e
giden uca\üç0\u287 \u305 n kaybolmas\u305 nda bir ter\'f6r ba\u287
lant\u305 s\u305  olup olmad\u305 \u287 \u305 n\u305  anlamaya çal\u305
st\u305 klar\u305 n\u305  aç\u305 klad\u305 . \u304 ki yolcünun uca\u287 a
çal\u305 nt\u305  pasaportlarla bindi\u287 inin ortaya c\u305 kmaş\u305
n\u305 n ard\u305 ndan yetkililer ter\'f6r ihtimalını de de\u287
erlendirmeye başlam\u305 st\u305 .\
}

: )


On 9 March 2014 09:05, Alex Aruj <alex.a...@gmail.com> wrote:

> Turkish language package worked within a minute. Could have been that I
> needed to wait longer for the Spanish. thanks and will get back to you on
> further progress!
>
>
> On 9 March 2014 07:18, Kevin Scannell <ksca...@gmail.com> wrote:
>
>> Do you have the latest charlifter from sourceforge (0.06)?
>>
>> Did you do a "make; sudo make install" to install sf.pl?
>>
>> The line you're looking at is line 185 in sf.pl.in  (before building),
>> not sf.pl.
>>
>> kps
>>
>>
>> On Sun, Mar 9, 2014 at 12:56 AM, Alex Aruj <alex.a...@gmail.com> wrote:
>> > I waited several minutes on a couple different tries before hitting
>> ctrl + c
>> > to quite. I will look at it again tomorrow and if no success, will try a
>> > different approach.
>> >
>> >
>> > On 8 March 2014 22:41, Kevin Scannell <ksca...@gmail.com> wrote:
>> >>
>> >> On Sun, Mar 9, 2014 at 12:01 AM, Alex Aruj <alex.a...@gmail.com>
>> wrote:
>> >> > Hi, this is an update on my adventures with charlifter.
>> >> >
>> >> > I finally found a few moments to dive into charlifter (on Mac OSX,
>> since
>> >> > I
>> >> > have not committed to working on my Windows machine just yet). I
>> have a
>> >> > Spanish corpus and a language package charlifter-es-0.01 but I can't
>> yet
>> >> > try
>> >> > them out together. B/c I downloaded the language pack, I figure I can
>> >> > skip
>> >> > training, however, I can't do anything now since I am at a roadblock.
>> >> > What I
>> >> > am getting stuck on is the installation process of my pre-trained
>> >> > language
>> >> > package downloaded from Lingala NLP on sourceforge.
>> >> >
>> >> >  I guess it is prompting me to enter in "$data", or sf.pl cannot
>> find
>> >> > what
>> >> > it needs. Do I need es-probs.txt or the entire charlifter-es-0.01
>> >> > package to
>> >> > be saved in a sf.pl-accessible location? I did not change any paths
>> in
>> >> > the
>> >> > makefile.
>> >> >
>> >> > Typing in "make install" within my language package directory at
>> >> > Terminal
>> >> > gives this...
>> >> >
>> >> > sf.pl -m -l es
>> >> > Reading in plain text hashes...
>> >> >
>> >> > which in sf.pl is...
>> >> >
>> >> > line 185 print "Reading in plain text hashes...\n"; $|++;
>> >> > my $data = do {
>> >> >     if( open my $fh, '<:utf8', "$lang_arg-probs.txt" ) { local $/;
>> <$fh>
>> >> > }
>> >> >    else { undef }
>> >> >    };
>> >> > eval $data;
>> >> >
>> >>
>> >> How long are you waiting?
>> >>
>> >> You should see this:
>> >>
>> >> charlifter-es$ make
>> >> sf.pl -m -l es
>> >> Reading in plain text hashes...
>> >> Saving storable hashes to disk...
>> >> charlifter-es$
>> >>
>> >> It only takes about 15 seconds to build on my machine.
>> >>
>> >> Then
>> >>
>> >> $ sudo make install
>> >>
>> >> Kevin
>> >
>> >
>> >
>> >
>> > --
>> > Alex
>>
>
>
>
> --
> Alex
>



-- 
Alex
------------------------------------------------------------------------------
Learn Graph Databases - Download FREE O'Reilly Book
"Graph Databases" is the definitive new guide to graph databases and their
applications. Written by three acclaimed leaders in the field,
this first edition is now available. Download your free book today!
http://p.sf.net/sfu/13534_NeoTech
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to