Re: [Moses-support] collins-parser only parse 2500 !

Philipp Koehn Thu, 21 Oct 2010 02:54:24 -0700

Hi,

I just submitted a update to the wrapper script that
takes care of the limit (by breaking up the input text into
smaller chunks) without the need to change the parser code.


-phi

On Mon, Oct 18, 2010 at 1:15 PM, Hwidong Na <[email protected]> wrote:
> Hi,
>
> You don't need to know the total number of the lines in advance. I
> attached my modified version of "main.c", downsizing the number of
> sentences in memory for each iteration to 250. I tested it using
> "examples/sec??.tagged", and the both input file is parsed without any
> segmentation fault.
>
> Best regards,
> --
> Hwidong Na <[email protected]>
> KLE lab, POSTECH, KOREA
>
>
> 2010-10-15 (금), 16:02 +0200, marco turchi:
>> Hi
>> thanks, I'm trying to modify the main.c in a way that it reads the
>> file twice, the first time to get the number of lines and the second
>> to run the parser. It is not the best solution, but if ti works it can
>> solve the problem.
>>
>> I do not yet take in account the segmentation fault.
>>
>> thanks
>> Marco
>>
>> On Fri, Oct 15, 2010 at 3:49 PM, Raphael Payen <[email protected]>
>> wrote:
>>         I also had the same problem, and I am also interested by the
>>         modifications to make to avoid the segmentation fault.
>>
>>         Since when I tried it was for a simple test and I didnt bother
>>         correcting, I made this script, which you might use also. It
>>         splits the
>>         input into chunks of 2500 lines, It is used like this:
>>         <file split-file-wrapper.py 2500 parse-en-collins >outfile
>>         (But it makes the processing much slower, modifying the source
>>         would be
>>         better).
>>
>>         --
>>         Raphael Payen
>>
>>
>>
>>         On Fri, 2010-10-15 at 14:40 +0200, marco turchi wrote:
>>         > Hi
>>         > I have the same problem with the Collins' parser. Do u know
>>         exactly
>>         > what I need to change in the source code of the parser? or u
>>         have a
>>         > modified version?
>>         >
>>         > Thanks a lot
>>         > Marco
>>         >
>>         > On Thu, Jun 3, 2010 at 5:03 PM, Hwidong Na
>>         <[email protected]>
>>         > wrote:
>>         >         Hi,
>>         >
>>         >         This is not because of the wrapper script, but the
>>         Collins'
>>         >         parser. You
>>         >         can modify the source to iterate the read_sentences
>>         function
>>         >         in the file
>>         >         "main.c". In addition, you need to modify defined
>>         values in
>>         >         "grammar.h"
>>         >         to avoid segmentation faults of long sentences.
>>         >
>>         >         --
>>         >         Hwidong Na <[email protected]>
>>         >         KLE lab, POSTECH, KOREA
>>         >
>>         >
>>         >         2010-05-27 (목), 19:20 +0800, dongxinghua0213:
>>         >
>>         >         > hello,
>>         >         > when  parsing sentences using
>>          parse-en-collins.perl,I find
>>         >         only 2500
>>         >         > parsed sentences are available ,but the number  of
>>         sentences
>>         >         are more
>>         >         > than  one hundred thousand , what can I do to
>>         parse all
>>         >         sentences ?
>>         >         >
>>         >         >  thank you !
>>         >         >
>>         >         >
>>         >         >
>>         >         >
>>         >
>>         
>> ______________________________________________________________________
>>         >         > 网易为中小企业免费提供企业邮箱（自主域名）
>>         >
>>         >         > _______________________________________________
>>         >         > Moses-support mailing list
>>         >         > [email protected]
>>         >         >
>>         http://mailman.mit.edu/mailman/listinfo/moses-support
>>         >
>>         >
>>         >
>>         >
>>         >
>>         >         _______________________________________________
>>         >         Moses-support mailing list
>>         >         [email protected]
>>         >
>>         http://mailman.mit.edu/mailman/listinfo/moses-support
>>         >
>>         > _______________________________________________
>>         > Moses-support mailing list
>>         > [email protected]
>>         > http://mailman.mit.edu/mailman/listinfo/moses-support
>>
>>
>>
>>         _______________________________________________
>>         Moses-support mailing list
>>         [email protected]
>>         http://mailman.mit.edu/mailman/listinfo/moses-support
>>
>>
>> _______________________________________________
>> Moses-support mailing list
>> [email protected]
>> http://mailman.mit.edu/mailman/listinfo/moses-support
>
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Re: [Moses-support] collins-parser only parse 2500 !

Reply via email to