There's nothing unusual about hand aligning data. If there was there wouldn't be so many tools for it! While it's true that you wouldn't do it for a million sentences to throw into a large-scale NIST system, it's quite common to hand align small sets for evaluation of techniques, etc. This paper reports hand aligning over 10K sentences: http://aclweb.org/anthology-new/H/H05/H05-1012.pdf
--which, while not much for Arabic-English, would represent a substantial portion of some impoverished data sets. Adam On Tue, Oct 27, 2009 at 10:36 PM, Miles Osborne <[email protected]> wrote: > well, alignment is a task that is really done en mass and not > sentence-by-sentence. apart from say teaching, there isn't really a > need for a GUI to do it. > > (convince me that you are ready to use this to align 8 million > sentence pairs and i'd be impressed) > > Miles > > 2009/10/27 Catalin Braescu <[email protected]>: >> Big thanks for the links! >> >> But I have to say I cannot believe my eyes... most of these programs >> are jar files launcged with parameters from the command line... and >> the way they work could be a textbook for user unfriendliness :-( >> >> How can people stand such primitive and bizarre apps? I am not bashing >> their authors, I am only surprised there weren't any authors of better >> programs... >> >> >> Catalin Braescu >> >> On Tue, Oct 27, 2009 at 9:57 PM, Adam Lopez <[email protected]> wrote: >>> There are several of these around. Note that I have not used any of them. >>> >>> http://www.cs.utah.edu/~hal/HandAlign/ >>> http://www.umiacs.umd.edu/~nmadnani/alignment/forclip.htm >>> http://www.d.umn.edu/~tpederse/parallel.html >>> http://www.let.rug.nl/~tiedeman/Uplug/ >>> >>> Ulrich Germann also demonstrated such an editor at last year's ACL, >>> although it does not seem to be online; perhaps email him. >>> >>> Adam >>> >>> >>> On Tue, Oct 27, 2009 at 6:25 PM, Catalin Braescu <[email protected]> >>> wrote: >>>> Ok, so what I'm looking for is a non-CLI alignment editor. Any ideas? >>>> >>>> >>>> Catalin Braescu >>>> Omlulu.com >>>> >>>> >>>> On Tue, Oct 27, 2009 at 1:41 PM, Catalin Braescu <[email protected]> >>>> wrote: >>>>> I am asking in advance for your forgiveness if my question is trivial >>>>> (or, rather, the answer). >>>>> >>>>> I am looking for a non-CLI tool that a not-very-technical person can >>>>> use to align 2 documents in different languages. >>>>> >>>>> When I'm saying "non--CLI" I mean anything that has a window and a >>>>> "visual" way of handling things: anything between a dual pane Notepad, >>>>> a php-backed web form, a Java Applet, whatever. as in, not a command >>>>> line "thing" - our newly hired PC operators won't be able to handle >>>>> it. >>>>> >>>>> Any suggestions? >>>>> >>>>> >>>>> >>>>> Catalin Braescu >>>>> Omlulu.com >>>>> >>>> _______________________________________________ >>>> Moses-support mailing list >>>> [email protected] >>>> http://mailman.mit.edu/mailman/listinfo/moses-support >>>> >>> >> >> _______________________________________________ >> Moses-support mailing list >> [email protected] >> http://mailman.mit.edu/mailman/listinfo/moses-support >> > > > > -- > The University of Edinburgh is a charitable body, registered in > Scotland, with registration number SC005336. > > _______________________________________________ > Moses-support mailing list > [email protected] > http://mailman.mit.edu/mailman/listinfo/moses-support > _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
