Hi,

it is certainly a good idea to try to get this data
into a useful format. I am sure, there will be
interest for that, especially for the otherwise
under-resourced languages.

-phi

On Mon, Feb 25, 2008 at 11:57 PM, J C Read <[EMAIL PROTECTED]> wrote:
> I thought this prospective source may be of interest to some of the list 
> members
>  in your various experiments.
>
>  Please ignore this message if and accept my apologies if this message does 
> not
>  interest you.
>
>  In my random browsing and searching for multilingual texts for training I
>  stumbled across the following:
>
>  http://www.watchtower.org/languages.htm
>
>  The good thing about the source is that a portion of the content is dynamic. 
> And
>  so, just like the europarl corpus, the potential multilingual corpus we could
>  harvest grows monthly.
>
>  Some of the languages have more support than others but I suppose that's 
> life.
>  I'm thinking of developing a perl script to scrape and paragraph/sentence 
> align
>  this stuff for training our systems. Is this something that any of you guys
>  would be interested in using and or participating in?
>
>  If so, please drop me an off-list mail. Even if it is just to express an
>  interest.
>  _______________________________________________
>  Moses-support mailing list
>  [email protected]
>  http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to