As long as you're just splitting, keeping the weights consistent isn't
too hard: keep all the weight on one segment and give the remaining
segments a weight that has no effect when the weights are combined
(i.e., a probability of 1, or a cost of 0).  The OpenFst or AT&T FSM
tools can help you manipulate lattices if you want to do more
interesting things with weights, such as pushing them toward the start
of paths.
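
For what it's worth, here is a minimal sketch of that splitting in
Python. It is not taken from any existing tool: the Arc type, its
field names, and split_multiword_arcs are all made up for
illustration, and it assumes weights are log-probabilities, so the
neutral weight is 0.0.

```python
from typing import List, NamedTuple


class Arc(NamedTuple):
    """Hypothetical lattice arc; not the format of any particular tool."""
    start: int       # source node id
    end: int         # target node id
    words: str       # one or more space-separated words
    logprob: float   # log-probability; 0.0 has no effect when summed


def split_multiword_arcs(arcs: List[Arc], num_nodes: int) -> List[Arc]:
    """Replace each multi-word arc with a chain of single-word arcs.

    The full weight stays on the first arc of each chain; every later
    arc gets 0.0, so the total score of any path is unchanged.
    """
    out: List[Arc] = []
    next_node = num_nodes  # fresh ids for the intermediate nodes
    for arc in arcs:
        tokens = arc.words.split()
        if len(tokens) <= 1:
            out.append(arc)  # already a single-word arc
            continue
        src = arc.start
        for i, tok in enumerate(tokens):
            is_last = i == len(tokens) - 1
            dst = arc.end if is_last else next_node
            if not is_last:
                next_node += 1
            weight = arc.logprob if i == 0 else 0.0
            out.append(Arc(src, dst, tok, weight))
            src = dst
    return out


if __name__ == "__main__":
    lattice = [Arc(0, 1, "bill clinton ,", -3.0), Arc(1, 2, "?", -0.5)]
    for a in split_multiword_arcs(lattice, num_nodes=3):
        print(a)
```

Note that the new intermediate nodes get ids past the existing ones,
so the node ids are no longer in topological order; renumber them
before writing a format that expects that.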

Chris

On Mon, Mar 1, 2010 at 1:58 PM, Loïc BARRAULT
<[email protected]> wrote:
> Indeed, splitting is not hard, but the tricky part is deciding how
> much of the probability/score to give to each part of the split.
> Maybe it has no real impact in the end, or does it?
> Loïc
>
> 2010/3/1 Chris Dyer <[email protected]>
>>
>> I guess word-graph doesn't split phrases either (I was just guessing).
>> It appears to be in SLF format, which is used by a number of tools
>> (like HTK and the SRI tools).  SRILM can split lattices with
>> multi-word arcs into lattices with single-word arcs, or you can
>> write your own code to do it.  It's not terribly hard.
>>
>> Chris
>>
>> On Mon, Mar 1, 2010 at 12:32 PM, Joerg Tiedemann
>> <[email protected]> wrote:
>> >
>> > Ok thanks. I will use the output-word-graph option. However, I also get
>> > phrases with that option (in the w attribute), for example here:
>> >
>> > ....
>> > J=42    S=0     E=53    a=0, 0, 0, -0.693147, 0.999896  l=-13.695 r=-20, 0, -1.60944, 0, 0, 0     w=bill clinton , pC=0.0613498, c=-3.23392
>> > ...
>> >
>> > I'm not sure if I'm using the command line argument correctly:
>> > echo 'who is bill clinton ?' | \
>> > moses -f moses.ini -output-word-graph test.graph 0
>> >
>> > Jörg
>> >
>> >
>> > On 3/1/10 5:35 PM, Chris Dyer wrote:
>> >> I don't have such a tool, but it wouldn't be too difficult to write
>> >> one.  I think the difference between word graph and search graph is
>> >> the search graph has full phrases on the edges, whereas the word graph
>> >> has single words on the edges.  For the input, you need single word
>> >> edges.
>> >> -Chris
>> >>
>> >> 2010/3/1 Jörg Tiedemann<[email protected]>:
>> >>>
>> >>> Is there a tool to convert output search graphs to word lattices in
>> >>> PLF (the Moses lattice input format)? It's the option
>> >>> -output-search-graph that I should use for getting the relevant
>> >>> information, right? I'm not really sure I understand the difference
>> >>> between -output-word-graph and -output-search-graph.
>> >>> Thanks!
>> >>>
>> >>> Jörg
>> >>>
>> >>>
>> >>>
>> >>>
>> >>> *******/\/\/\/\/\/\/\/\/\/\/\******************************************
>> >>>   Jörg Tiedemann                      [email protected]
>> >>>   Visiting Professor                  http://stp.lingfil.uu.se/~joerg/
>> >>>   Dep. of Linguistics and Philology
>> >>>   Uppsala University                  tel: +46 (0)18 - 471 1412
>> >>>   Box 635, SE-751 26 Uppsala/SWEDEN   fax: +46 (0)18 - 471 1094
>> >>>
>> >>> *********************************/\/\/\/\/\/\/\/\/\/\/\****************
>> >>> _______________________________________________
>> >>> Moses-support mailing list
>> >>> [email protected]
>> >>> http://mailman.mit.edu/mailman/listinfo/moses-support
>> >>>
>> >
>>
>
>
>
> --
> ---
> Loïc BARRAULT
> Post-doctoral researcher
> LIUM - University of Le Mans
> Tél. +33/0 2 43 83 38 52
> http://www-lium.univ-lemans.fr/~barrault
> MANY : Open Source MT System Combination
> http://www-lium.univ-lemans.fr/~barrault/MANY
> ---
>

