Hi,

Thank you for tou replies

yes i have a corpus which is not clean, it is so noisy.

it contained Html tags...
like this :
Do&n't show this message again
< > " + & % $ # * / \
(EULA)
...

Really i dont know can i proceed this, im for the first time in a problele
with this kind of corpus...

I am always taking if you propose an idea


Thqnk you

bests

Cyrine




2013/7/19 Philipp Koehn <[email protected]>

> Hi,
>
> given these lines:
> > Read classes: #words: 0  #classes: 1
> > ERROR: no word index for '#085a00;'
> > ERROR: no word index for '#999999;'
> > ERROR: no word index for '#ffffff;'
>
> I would guess that there is something wrong with the word class files
> that you are using -- there may have been an earlier mistake with the
> files generated by mkcls.
>
> Also: does your corpus really consist of RBG color codes?
>
> -phi
>



-- 
*Cyrine NASRI
Ph.D. Student in Computer Science*
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to