Re: incubator wiki

2016-06-07 Thread Matt Post
Science Department > University of Southern California, Los Angeles, CA 90089 USA > WWW: http://irds.usc.edu/ > ++ > > > > > > > > > > > On 6/6/16, 6:03 PM, "Matt Post"

***UNCHECKED*** Re: [jira] [Commented] (JOSHUA-270) pipeline.pl needs major refactoring

2016-05-25 Thread Matt Post
binCRkioZFHru.bin Description: PGP/MIME Versions Identification

Re: [jira] [Commented] (JOSHUA-270) pipeline.pl needs major refactoring

2016-05-25 Thread Matt Post
of the pipeline with a more versatile (and readable) tool like ducttape. matt > On May 24, 2016, at 7:27 PM, Matt Post (JIRA) <j...@apache.org> wrote: > > > [ > https://issues.apache.org/jira/browse/JOSHUA-270?page=com.atlassian.jira.plugin.system.issuetabpanels:comm

joshua API changes

2016-05-25 Thread Matt Post
Hi folks (especially Felix, Kellen, Tobi) — I made two moderate improvements to Joshua on the way home. The first was to get rid of all the specialized phrase handling; the packer now works as we discussed, packing everything into Hiero format, and the stack-based decoder uses this directly

too many emails

2016-05-25 Thread Matt Post
Does someone know how to turn off the mailing of all github comments to dev? The way I see it, we all have to be on dev, so it should be for people, not robots. I am getting every comment about three times. I would just do it but I don't know how.

incubator wiki

2016-06-06 Thread Matt Post
Hi everyone, I made the confluence page public (read-only), as part of transitioning the website there. It didn't seem to me that anything there was private, but if something should be, we can lock down individual pages to members only. (Does anyone know how to have a Confluence group

Re: Wiki access

2016-05-26 Thread Matt Post
Hi Tom — This is a dumb question, but where is the Joshua wiki? You're not talking about the confluence page, are you? I see you have access there. https://cwiki.apache.org/confluence/display/JOSHUA/Joshua+%28Incubating%29+Home matt > On May 25, 2016, at 5:20 PM, Tom Barber

Re: [GitHub] incubator-joshua pull request: JOSHUA-252 Make it possible to use ...

2016-05-26 Thread Matt Post
yeah this is really strange. I'm talking about the regression tests, not the unit tests. these are in src/test/resources. run for example test/bn-en/hiero/test.sh. 3 seconds on master, 18 on JOSHUA-252 (you might have to remove "-threads 2") matt (from my phone) > On May 26, 2016, at 9:15 PM,

junit in eclipse

2016-06-01 Thread Matt Post
Has anyone successfully run JUnit tests in Eclipse? I'd love to integrate them but am not sure how to go about setting it up. I thought I'd ask before burning the time on Google. I'll volunteer to write a wiki article if you can help me out :) matt

Re: [jira] [Commented] (JOSHUA-264) Remove system exits and replace with RuntimeExceptions

2016-06-14 Thread Matt Post
Go ahead :) > On Jun 14, 2016, at 2:10 PM, Thamme Gowda (JIRA) wrote: > > >[ > https://issues.apache.org/jira/browse/JOSHUA-264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15330600#comment-15330600 > ] > > Thamme Gowda commented on

Fwd: Build failed in Jenkins: joshua_master #71

2016-06-23 Thread Matt Post
This is going to be a problem — having a portion of the test suite depending on KenLM, which is not bundled, distributable, or platform-independent... > Begin forwarded message: > > From: Apache Jenkins Server > Subject: Build failed in Jenkins: joshua_master #71 >

Re: [IMPORTANT] Roadmap for 6.1 Release

2016-06-23 Thread Matt Post
Hi Lewis, Sorry for taking some time to get back to you. I think the roadmap looks great. One thing, though, is that the Amazon folks and I have discussed making a number of backwards-incompatible changes in an effort to modernize some pieces of the code. This would have to do with things like

Re: hosting release files

2016-04-05 Thread Matt Post
t; University of Southern California, Los Angeles, CA 90089 USA > WWW: http://irds.usc.edu/ > ++ > > > > > > > > > > > On 4/5/16, 1:57 PM, "Matt Post" <p...@cs.jhu.edu> wrote: > >> Does Apache provide a place to host releases

Re: Logo for Joshua

2016-04-05 Thread Matt Post
en by all means please do. We can also >>> request a powered by sticker from press@ >>> Thanks >>> >>> On Tue, Mar 29, 2016 at 9:40 AM, Lewis John Mcgibbney < >>> lewis.mcgibb...@gmail.com> wrote: >>> >>>> ACK >>>

Re: Release Cycle for Joshua

2016-04-12 Thread Matt Post
May 1 is going to be hard for me. I'd like to advocate for quarterly releases with a June 1 first release. Would that be okay? > On Apr 12, 2016, at 1:31 AM, Lewis John Mcgibbney > wrote: > > Cool. > I've got the 1st Incubating release provisionally down for 1st

Re: ApacheCon 2016 and Joshua

2016-03-19 Thread Matt Post
PM, Lewis John Mcgibbney > <lewis.mcgibb...@gmail.com> wrote: > > Hi Matt, > > On Mon, Mar 14, 2016 at 8:26 AM, Matt Post <p...@cs.jhu.edu> wrote: > >> Whoa! Lewis, can you give some more detail on this talk, what you >> proposed, and what you plan

Re: Migrating Community from Github and GoggleGroups to Apache

2016-03-24 Thread Matt Post
(offline yesterday and today, will do tomorrow) matt (from my phone) > On Mar 24, 2016, at 1:48 PM, Lewis John Mcgibbney > wrote: > > Hi Matt, > As the primary figure within the Joshua community I wonder if you can act > on the following. It will go a long way in

Re: Migrating Community from Github and GoggleGroups to Apache

2016-03-26 Thread Matt Post
and launch the existing website over on > joshua.incubator.apache.org (so that something is there), we could then do > the other tasks over the weekend. > Any thoughts on that? > > On Friday, March 25, 2016, Matt Post <p...@cs.jhu.edu> wrote: > >> Lewis -- regarding your requests 1--3,

Re: Migrating Community from Github and GoggleGroups to Apache

2016-03-25 Thread Matt Post
gmail.com> wrote: > >> Thanks Matt, enjoy the time off (hopefully you are not ill) >> Later >> >> On Thu, Mar 24, 2016 at 10:54 AM, Matt Post <p...@cs.jhu.edu> wrote: >> >>> (offline yesterday and today, will do tomorrow) >>> &

Re: [jira] [Assigned] (INFRA-11289) Load Git history for Joshua

2016-03-07 Thread Matt Post
Thanks, Tommaso. I just posted a question there about what this precisely this means. Also, do we have a Joshua source code import yet? Can someone tell me what the new model is supposed to be for development? I am unclear on exactly how the relationship between Apache and Github code will

Re: consolidating thread

2016-03-28 Thread Matt Post
l: chris.a.mattm...@nasa.gov >> WWW: http://sunset.usc.edu/~mattmann/ >> ++ >> Director, Information Retrieval and Data Science Group (IRDS) >> Adjunct Associate Professor, Computer Science Department >>

Re: Problem with git repo

2016-03-28 Thread Matt Post
Hi Daniel, No worries, it didn't take too long to track down (I only started trying to push last night). Thanks for your help! - matt > On Mar 29, 2016, at 12:28 AM, Daniel Takamori wrote: > > Hey Joshua Team, > Sorry about the recent confusion with the git repo; I was in

Re: Using Jira for Issues

2016-04-29 Thread Matt Post
Lewis, this sounds good to me. I'm in the process of moving the (hideous) Joshua web page over to Confluence, and created a Developer page, where I added this to the documentation. https://cwiki.apache.org/confluence/display/JOSHUA/Development Can you look this over and improve it

Re: [jira] [Commented] (JOSHUA-253) Enable execution of Unit tests

2016-04-27 Thread Matt Post
I am fine with you just doing this. The current setup was a something-is-better-than-nothing (which is true) hack, and I'd be happy to have better practices pushed into the project. matt (from my phone) > On Apr 27, 2016, at 2:39 PM, Kellen Sunderland (JIRA) wrote: > > >

Re: joshua_api

2016-04-27 Thread Matt Post
planned as far as changes relating to an API > would go. We have a few more commits coming but they're just performance > improvements and they don't change too much in the way of interfaces or > method signatures. > > -Kellen > > On Wed, Apr 27, 2016 at 4:47 AM, Matt Post <

Re: joshua_api

2016-04-27 Thread Matt Post
ts failing for me currently > (casing issues causing at least one). If you want to wait until we fix > these tests that's also completely fine. > > -Kellen > > On Wed, Apr 27, 2016 at 11:32 AM, Matt Post <p...@cs.jhu.edu> wrote: > >> Do you want me to fix the reca

Re: Language Pack size

2016-05-13 Thread Matt Post
Oh, yes, of course. That's in build_binary. > On May 13, 2016, at 4:39 PM, kellen sunderland <kellen.sunderl...@gmail.com> > wrote: > > Could we also use quantization with the language model to reduce the size? > KenLM supports this right? > > On Fri, May 13, 20

Re: Language Pack size

2016-05-13 Thread Matt Post
the ability for users to play with the individual weights, but I don't think that's a huge loss, since the main weight is LM vs. TM). matt > On May 13, 2016, at 4:45 PM, Matt Post <p...@cs.jhu.edu> wrote: > > Oh, yes, of course. That's in build_binary. > > >> On May 1

Re: GIZA++ Licensing

2016-05-06 Thread Matt Post
I included a bunch of tools like GIZA a while back in order to make it easier for people to build systems. I think that's the wrong approach now, since we're focusing on providing black-box systems. So we should remove tools that aren't run-time dependencies, like GIZA, and just ask people to

Re: Podling Report Reminder - May 2016

2016-05-01 Thread Matt Post
Thanks, Lewis. I'll take a look at this by Weds. > On May 1, 2016, at 4:16 PM, Lewis John Mcgibbney > wrote: > > Hi Folks, > Initial report populated as below > > JoshuaJoshua is a statistical machine translation toolkitJoshua has > been incubating since

Fwd: Blogging opportunities on Linux.com and OpenSource.com

2016-04-18 Thread Matt Post
This would be a great thing to do once we push out the first release. I'd be happy to participate if anyone wants to team up. > Begin forwarded message: > > From: Sally Khudairi > Subject: Blogging opportunities on Linux.com and OpenSource.com > Date: April 17, 2016 at

Re: http://joshua.incubator.apache.org/

2016-04-14 Thread Matt Post
Hi Igor, Yes, here's the ticket in case that is helpful to you: https://issues.apache.org/jira/browse/INFRA-11295?jql=project%20%3D%20INFRA%20AND%20text%20~%20%22Joshua%22 matt > On Apr 13, 2016, at 7:20 PM, Tom Barber wrote: > > Hi Igor > > I believe in

Re: tests are not run using latest code

2016-04-14 Thread Matt Post
"ant test" runs the regression tests under test/ (it runs test/run-all-tests.sh, which looks for all scripts underneath test of the form test.sh, and executes them). No unit tests are currently run. This is obviously broken. matt > On Apr 12, 2016, at 8:02 PM, Lewis John Mcgibbney

Re: Avoiding master failures with CI

2016-07-13 Thread Matt Post
I misread the day, here, and thought you meant today. I can't do tomorrow afternoon, but that time on Friday works for me. We could also go into next week if that's better. > On Jul 13, 2016, at 9:41 AM, Matt Post <p...@cs.jhu.edu> wrote: > > That works for me. I've watche

master pushes

2016-07-28 Thread Matt Post
Hi folks, Sorry for the continued pushes to master. We have had Travis-CI enabled, but I haven't taken the time to get it setup. Someone else should feel free to take charge, here; otherwise, I hope to have time to do this after my workshop is done, at the end of next week. matt

Re: Podling Report Reminder - August 2016

2016-08-01 Thread Matt Post
Hi folks, I just loaded this with a draft. Comments / unilateral changes will not meet resistance from me. -- Joshua Joshua is a statistical machine translation toolkit Joshua has been incubating since 2016-02-13. Three most important issues to address in the move towards graduation: 1.

Re: [GitHub] incubator-joshua issue #33: Refactored unit tests to all use TestNG, removed...

2016-07-31 Thread Matt Post
Hi Kellen, The current standard location for KenLM is $JOSHUA/lib. I'm happy to move this if there is a more conventional spot. $JOSHUA/target? matt > On Jul 29, 2016, at 3:47 AM, KellenSunderland wrote: > > Github user KellenSunderland commented on the issue: > >

Re: Podling Report Reminder - August 2016

2016-08-02 Thread Matt Post
podling > > - Henry > > On Mon, Aug 1, 2016 at 9:08 AM, kellen sunderland < > kellen.sunderl...@gmail.com> wrote: > >> Looks good to me. Should we mention that we're planning a release that >> moves our build system to maven? >> >> On Mon, Aug 1,

Re: Language Pack English-Japanese

2016-08-04 Thread Matt Post
sh language >> pack. >> And YES, I mean translation memories by TMS/XLIFF. But I may convert >> TMS to what you specified format. >> >> And also I knew English to Japanese is very difficult, but also I >> believe sample of English-Japanese language pack will

Re: Issue Building LM on master branch

2016-07-17 Thread Matt Post
Lewis — This is a good-sized dataset, and on a single desktop machine, I expect it would take at least a day to go all the way through alignment, model-building, and tuning. fast_align is a good idea, though it isn't integrated into the pipeline (shouldn't be too hard, and is on the list). You

Re: Avoiding master failures with CI

2016-07-18 Thread Matt Post
uter Science Department > University of Southern California, Los Angeles, CA 90089 USA > WWW: http://irds.usc.edu/ > ++ > > > > > > > > > > > On 7/15/16, 2:05 PM, "Matt

Re: Russian Language Model for Joshua

2016-07-15 Thread Matt Post
ttmann, Chris A (3980) > <chris.a.mattm...@jpl.nasa.gov> wrote: > > Yes please! :) > > Sent from my iPhone > >> On Jul 15, 2016, at 1:39 PM, Matt Post <p...@cs.jhu.edu> wrote: >> >> I have one built on Common Crawl. It's 25 GB uncompressed. My Ken

Re: Avoiding master failures with CI

2016-07-15 Thread Matt Post
cience Department >> University of Southern California, Los Angeles, CA 90089 USA >> WWW: http://irds.usc.edu/ >> ++ >> >> >> >> >> >> >> >> >> >> &g

Re: joshua - Build # 49 - Still Failing!

2016-07-13 Thread Matt Post
Minor point, but are there any objections to changing this to 6.1-SNAPSHOT? matt > On Jul 13, 2016, at 8:45 AM, kellen sunderland > wrote: > > Ahh, ok. I guess I'll just keep an eye on it. Thanks Tom (and thanks for > doing the work to set this up). > > On

bigtranslate

2016-07-13 Thread Matt Post
t; > > On 7/12/16, 5:12 PM, "kellen sunderland" <kellen.sunderl...@gmail.com> wrote: > >> Thanks for forwarding Matt. I think a fair number of people from my team >> will want to attend. I'll pass around the registration link. >> >> -Kell

Re: Russian Language Model for Joshua

2016-07-16 Thread Matt Post
Done: http://cs.jhu.edu/~post/tmp/ru.kenlm 4106251755 bytes, sha1sum: 5c894e24dafa42bc44a5bb6822812d6234eda791 Let me know when you have it so I can delete it. matt > On Jul 15, 2016, at 4:42 PM, Matt Post <p...@cs.jhu.edu> wrote: > > All right, started tryi

Re: Russian Language Model for Joshua

2016-07-15 Thread Matt Post
Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > WWW: http://irds.usc.edu/ > ++++++ > > > > > > > > > > >> On 7/15/16, 1:42 PM,

Re: Website Branding Issues

2016-07-09 Thread Matt Post
Hi John, I believe I just corrected this: http://joshua.incubator.apache.org However, I don't know who our TLP sponsor is, so the disclaimer is missing that portion of the notice. Can someone advise me here? matt > On Jul 9, 2016, at 11:18 AM, John D. Ament

Re: [DISCUSS] Joshua main Website redirect to wiki?

2016-07-09 Thread Matt Post
ile the website elsewhere and push the result to the asf > servers as I plan to do with a new oodt website. jekyll just compiles > static html after all. > > Tom >> On 9 Jul 2016 22:42, "Matt Post" <p...@cs.jhu.edu> wrote: >> >> I mentioned it a

Re: [DISCUSS] Joshua main Website redirect to wiki?

2016-07-09 Thread Matt Post
I mentioned it a while back and no one objected, so I did it. The issue is that the GitHub approach no longer worked because Apache does not employ Jekyll server side, so there was a major impediment to editing files. I'm open to other options but this is very convenient! matt (from my

Re: [DISCUSS] Joshua main Website redirect to wiki?

2016-07-09 Thread Matt Post
att, > > No objection from my side, I think I missed the discussion thread about it. > I sincerely apologize. > > I am looking at incubator website guide and don't see any rule about how > the HTML is created. > I asked John about the audit to see what he has to say. > &

Re: [DISCUSS] Joshua main Website redirect to wiki?

2016-07-10 Thread Matt Post
Done. > On Jul 9, 2016, at 5:56 PM, Henry Saputra <henry.sapu...@gmail.com> wrote: > > Need to add the incubator logo [1] as part of website branding > > > [1] http://incubator.apache.org/guides/branding.html > > On Sat, Jul 9, 2016 at 2:55 PM, Matt Post &l

Re: Parameters/weights

2016-06-30 Thread Matt Post
Hi Andrew, Thanks for the note. The work building all the models advertised in the paper has fallen behind, but we hope to have it all resolved by the end of the month. Hopefully that will resolve some of the problems you have pointed out here, too. I have updated the PPDB page with a note.

Re: Podling Report Reminder - February 2017

2017-02-01 Thread Matt Post
Folks, I added the Joshua report. https://wiki.apache.org/incubator/February2017 It is due today. Feel free to make comments or initiate discussion here but otherwise what's there is what will be sent. matt > On Jan 25, 2017, at 7:21

problems with BerkeleyLM

2017-02-01 Thread Matt Post
Hi folks, I've found some problems with BerkeleyLM. I haven't diagnosed it yet, and am not going to have time for a week or two at least, but thought I'd bring it to everyone's attention because this affects our no-external-dependency releases. As for the solution, in addition to trying to

Re: Podling Report Reminder - February 2017

2017-01-30 Thread Matt Post
Folks — I'll take care of this next week, after February 6. matt > On Jan 30, 2017, at 10:18 PM, johndam...@apache.org wrote: > > Dear podling, > > This email was sent by an automated system on behalf of the Apache > Incubator PMC. It is an initial reminder to give you plenty of time to >

Re: Cutting RC3

2017-02-23 Thread Matt Post
Thank you for heading this up, Tommaso! I'll be able to catch up on this after today. matt > On Feb 23, 2017, at 3:06 AM, Tommaso Teofili > wrote: > > probably because of the mentioned network issues the artifacts ended up in > two separate staging repositories in

Re: mvn assembly issues

2017-01-19 Thread Matt Post
I have never seen this error before! It seems like this must have something to do with the build environment where this is being done? Maybe there are tar options to not store the userid or to set it to something? > On Jan 18, 2017, at 9:08 PM, David Meikle wrote: > > Hey

Re: Plugging self-hosted Joshua into mailman?

2017-01-19 Thread Matt Post
Karel — On this point, I don't think you should have to use the tutorials, which tell you how to identify training data and build new translation models yourself. I imagine that you would be more interested in downloading pre-built models that don't really require you to be an expert in MT. See

Re: Plugging self-hosted Joshua into mailman?

2017-01-19 Thread Matt Post
> On Jan 17, 2017, at 11:55 AM, Karel Novotný <ka...@apc.org> wrote: > > Hello Matt, > > Thanks for responding... > > On 17.1.2017 17:31, Matt Post wrote: >> Hello, >> >> Joshua would be suitable to this. We have models built for FR→EN and ES

Re: Plugging self-hosted Joshua into mailman?

2017-01-17 Thread Matt Post
Hello, Joshua would be suitable to this. We have models built for FR→EN and ES→EN. I want to improve these because some certain data was left out. I could also build ones for the other direction. One question — What do you mean about 3rd party services being "untrustworthy"? matt > On Jan

Re: Pluggable preprocessing and OpenNLP

2017-01-18 Thread Matt Post
ou have a specific language which would be good for testing for > you? > > The tokenizer can probably trained as well, I saw a couple of tokenized > data sets. Maybe that makes sense for you too. > > Jörn > > > > On Fri, 2017-01-13 at 09:48 -0500, Matt Post wrote: >> Hi Jörn

Re: [DISCUSS] Release Apache Joshua 6.1

2016-08-15 Thread Matt Post
seem to have permission. Can you: - Change the release date of 6.1 to 9/15? - Delete the Joshua 7 milestone? - Rename Joshua 6.2 to Joshua 7? - Set its release date to March 15, 2017? Thanks, matt > On Aug 13, 2016, at 11:39 PM, Matt Post <p...@cs.jhu.edu> wrote: > >

Re: [VOTE] Release Apache Joshua 6.1 (Incubating)

2017-02-26 Thread Matt Post
Hi folks, First, Tommaso, thank you for pulling this together! I want to remind everyone that there's a checklist to go through before sending your +1. Here's from an email from Tom Barber a while back: > Hello folks, > > I see plenty of +1's going through the release vote, which is great to

Re: [jira] [Commented] (JOSHUA-304) word-align.conf alignment template file not compatible with berkeley aligner

2016-08-24 Thread Matt Post
It didn't regenerate. Try wiping out your rundir and starting over. matt (from my phone) > On Aug 24, 2016, at 4:08 PM, Lewis John McGibbney (JIRA) > wrote: > > >[ >

Re: [GitHub] incubator-joshua issue #42: Fix various issues related to resources, warning...

2016-08-30 Thread Matt Post
Rebasing changes the history, so I think you can't do that with repos that have been pushed, right? In which case merge... matt (from my phone) > On Aug 30, 2016, at 3:41 PM, maxthomas wrote: > > Github user maxthomas commented on the issue: > >

roadmap

2016-09-30 Thread Matt Post
Hi folks, Just a status update, since I / we are a bit behind: I'm in the process of putting together the first language pack, along with a script that will bundle it with the jar, a README describing its use and assembly, a CREDITS file describing the data used to build the model, and a

Re: moses2 vs. joshua

2016-10-05 Thread Matt Post
ne could test them on a machine with lots of cores, to see how things scale. matt > On Sep 22, 2016, at 9:09 AM, Matt Post <p...@cs.jhu.edu> wrote: > > Hi folks, > > I have finished the comparison. Here you can find graphs for ar-en and ru-en. > The ground-up rewrite of

Re: language pack #1

2016-10-06 Thread Matt Post
s/preparation/nonbreaking_prefixes"; >> should be: >> my $mydir = "$ENV{JOSHUA}/scripts/nonbreaking_prefixes"; >> >> When I make this modification, it works just fine for me. >> Also, tried in server mode -- seems to work without issue. >> >>

thrax problem

2016-10-07 Thread Matt Post
Hi folks, I thought I'd let you know about a problem I discovered with Thrax. Can you spot it? $ ls -lh grammar.gz -rw-r--r-- 1 mpost staff 2.2G Oct 6 13:55 grammar.gz $ gzip -cd 9/grammar.gz | cut -d\| -f4 | uniq -c | sort -n | tail 8448 las 8643 a 9440 que 9595 se 9696

Re: language pack #1

2016-10-07 Thread Matt Post
> think it shouldn't be too hard. > > On Thu, Oct 6, 2016 at 4:16 PM, Matt Post <p...@cs.jhu.edu> wrote: > >> Okay, I've fixed the nonbreaking_prefixes path issue. >> >> The installation should now ignore your value of $JOSHUA entirely, >> preferring inste

Re: moses2 vs. joshua

2016-09-22 Thread Matt Post
things to do to close this gap. I'd be much happier with 2x or even 1.5x than with 3x, and I bet we could narrow this down. But I'd like to get the 6.1 release out of the way, first, so I'm pushing this off to next month. Sound cool? matt > On Sep 19, 2016, at 6:26 AM, Matt Post <p...@cs.j

Re: Build failed in Jenkins: joshua_master #96

2016-08-24 Thread Matt Post
We are running out of space on builds... > On Aug 23, 2016, at 10:15 PM, Apache Jenkins Server > wrote: > > See > > Changes: > > [lewis.mcgibbney] Update examples README formatting and links. > >

Re: [jira] [Commented] (JOSHUA-291) Improve code quality via static analysis

2016-09-28 Thread Matt Post
ay, as you can >> see from [1]. >> >> I'd opt for waiting a few more hours, then I'd ask infra@. >> >> Regards, >> Tommaso >> >> [1] : http://status.apache.org/ >> >> >> Il giorno mar 27 set 2016 alle ore 20:38 Matt Post <p.

Re: openjdk 8 incompatibility

2016-10-25 Thread Matt Post
Hmm, inclusion of that line looks like a mistake. I've seen Eclipse add random imports because it sorts the suggestions in a very unhelpful manner. I just removed the line and pushed, try again. > On Oct 25, 2016, at 1:11 PM, John Hewitt wrote: > > Hi all, > > Has

Re: [jira] [Created] (JOSHUA-320) --joshua-mem pipeline parameter is not populated to mert processes

2016-10-27 Thread Matt Post
Hi Lewis, You are confusing two things. MERT calls Joshua, and passes it however much memory you set with --joshua-mem. It doesn't this by writing (see pipeline.pl line 1550) $tunedir/decoder_command, which is what Z-MERT calls to run Joshua. Z-MERT is itself a Java program that also gets 4g.

Re: [jira] [Closed] (JOSHUA-100) Add Shen et al. (2008) dependency LM

2016-10-27 Thread Matt Post
t;>Key: JOSHUA-100 >>URL: https://issues.apache.org/jira/browse/JOSHUA-100 >>Project: Joshua >> Issue Type: New Feature >> Reporter: Matt Post >> Assignee: Matt Post >>Fix For: 6.1 >> >> > > > > > -- > This message was sent by Atlassian JIRA > (v6.3.4#6332)

Re: Pipeline Mystery

2016-10-27 Thread Matt Post
yes mert must be dying. Can you post the contents of the tune/ directory? and tail mert.log? matt (from my phone) > Le 27 oct. 2016 à 00:49, John Hewitt a écrit : > > It seems like MERT isn't writing it's final config file (which is typical > of MERT, in my

Re: Lewis Volunteering for 6.1 Release Manager

2016-11-10 Thread Matt Post
Just landing back in the states from Berlin. This sounds great Lewis! matt (from my phone) > Le 10 nov. 2016 à 12:02, lewis john mcgibbney a écrit : > > Hi Folks, > I would like to put myself forward as release manager for 6.1. > I've got a lot of experience working with

Re: Joshua 6.1

2016-10-19 Thread Matt Post
ohn mcgibbney <lewi...@apache.org> wrote: > > Hi Matt, > I like the sound of this :) > > On Fri, Oct 14, 2016 at 9:25 AM, < > dev-digest-h...@joshua.incubator.apache.org> wrote: > >> >> From: Matt Post <p...@cs.jhu.edu> >> To: dev@joshua.incubato

Re: Joshua 6.1

2016-10-14 Thread Matt Post
AL2 license ? (as "convenience" binaries as the > official release consists of the Joshua source code). > I'm asking this because in OpenNLP we have had this long time issue of the > models licensing. > > Regards, > Tommaso > > > > Il giorno gio 13 ott

Re: Joshua Model Input Format(s) and LM Loading

2016-10-25 Thread Matt Post
Hi Lewis, Joshua supports two language model representation packages: KenLM [0] and BerkeleyLM [1]. These were both developed at about the same time, and represented huge gains in doing this task efficiently, over what had previously been the standard approach (SRILM). Ken Heafield (who has

Re: language pack #1

2016-10-25 Thread Matt Post
etty comprehensive. > > It would be great if we could update the Joshua Homebrew recipe with this > language pack and also link to the pack from the Wiki. > > Lewis > > On Mon, Oct 10, 2016 at 2:48 AM, < > dev-digest-h...@joshua.incubator.apache.org> wrote: >

Re: Thrax Error in WordLexicalProbabilityCalculator - Word id 2146928632 out of range 0 1727042

2016-10-21 Thread Matt Post
This is strange. I haven't looked into this again but don't have any insights. Thanks for the followup. > On Oct 21, 2016, at 3:35 PM, lewis john mcgibbney wrote: > > Hi Folks, > Follow up. > It seems that when I clean the .cachepipe as well as all of the existing >

Re: [VOTE] Release Apache Joshua (Incubating) 6.1

2016-11-14 Thread Matt Post
+1 Thanks for starting this off, Lewis! > On Nov 14, 2016, at 12:54 PM, Ramirez, Paul M (398M) > wrote: > > +1, let's get it released!!! > > --Paul > > == > Paul Ramirez - Group Supervisor >

Re: language packs blog post

2016-11-21 Thread Matt Post
That's better, fixed. > On Nov 21, 2016, at 3:14 PM, kellen sunderland <kellen.sunderl...@gmail.com> > wrote: > > Looks good to me, no objection to tweeting it. Nice work putting them all > together. > > On Mon, Nov 21, 2016 at 9:00 PM, Matt Post <p...@cs.

language packs blog post

2016-11-21 Thread Matt Post
Hi folks, I just drafted this; any objections to tweeting it? https://cwiki.apache.org/confluence/display/JOSHUA/2016/11/21/Apache+Joshua+Language+Packs matt

Re: Dockerhub hosted images

2016-11-23 Thread Matt Post
Kellen, can I bother you to post a few first steps? I've successfully pulled this down to my mac but now do not know how to find it, edit it, or run it. I'm porting through the documentation and will find it eventually but this would save me a bit of time. > On Nov 23, 2016, at 8:07 AM,

Re: [VOTE] Release Apache Joshua 6.1 RC#2

2016-11-23 Thread Matt Post
+1 Thanks, Lewis! > On Nov 23, 2016, at 12:15 AM, lewis john mcgibbney wrote: > > Hello user@ and dev, > Please VOTE on the Apache Joshua 6.1 Release Candidate #2. > > We solved 50 issues: https://s.apache.org/joshua6.1 > > Git source tag

test non apache account

2016-11-23 Thread Matt Post
matt (from my phone)

Re: Dockerhub hosted images

2016-11-23 Thread Matt Post
Okay, I have this with docker run -it kellens/apache-joshua-es-en-2016-10-05 bash It seems we are missing Perl (./prepare.sh fails), and we should replace the LanguageModel line with a KenLM instance and build that. I bet we'll need Python, too. > On Nov 23, 2016, at 8:15 AM, M

Re: Any symal experts?

2016-11-23 Thread Matt Post
ution. > > I'll keep the symal use on the backburner and start putting together an > atools port. > > -John > > On Wed, Nov 23, 2016 at 12:18 PM, Matt Post <p...@cs.jhu.edu> wrote: > >> John — I suggest trying to ditch those GIZA++ tools entirely. fast_align &

Re: Any symal experts?

2016-11-23 Thread Matt Post
John — I suggest trying to ditch those GIZA++ tools entirely. fast_align indeed replaced them with "atools"; how much work would it be to port that? > On Nov 23, 2016, at 12:11 PM, John Hewitt wrote: > > Hey everyone, > > I'm packaging up a Java port Fast Align for

Re: Downloading of non ASF licensed code

2016-11-28 Thread Matt Post
This would be easy to do. Maybe just a simple prompt that alerts the user? Something like echo "Warning: this script downloads many tools used in building and running" echo "Joshua. Not all of them are Apache Licensed. If you wish to continue, hit Enter". read j

★ joshua roadmap feature: dynamic phrase tables, retuning, and data sharing

2016-11-28 Thread Matt Post
One project I think could be interesting for Joshua's future is sketched here. - Dynamic phrase tables. Joshua currently lets people add custom phrases to the existing models that then get used. There is a research topic here for how to make it better (particularly, how to set the weights of

Re: Dockerhub hosted images

2016-11-22 Thread Matt Post
How do I clone this? Docker tells me there is no tag "latest", using "-a" tells me the repo is not found, and I can't seem to figure out how to tell Docker to use hub.docker.com... > Here's a link to the first image I've been playing with, es-en. >

Re: "mvn assembly" no longer works

2016-11-17 Thread Matt Post
Ah, thanks Lewis. I did update the README to mention the new package target. > On Nov 17, 2016, at 1:36 AM, lewis john mcgibbney wrote: > > Hi Matt, > Again, I am on digest and didn't receive but I'll reply here. > No need to use the Maven assembly plugin anymore... simply

Re: Updating Incubator summary

2016-11-17 Thread Matt Post
a commit I keep meaning to getting around > to working on". Random thought :) > > Hen > > On Tue, Nov 15, 2016 at 11:09 AM, Matt Post <p...@cs.jhu.edu> wrote: > >> We're still waiting on our first software release, so it seems to me a bit >> premature to gr

package-info.java

2016-11-16 Thread Matt Post
Hi Thamme, Eclipse is complaining about package-info.java files, e.g., The type package-info is already defined for org.apache.joshua.decoder.package-info.java. I see that a while ago you replaced the package-info.html files with these. Is there a particular reason for this? Is .java

Re: Updating Incubator summary

2016-11-15 Thread Matt Post
> On Tue, Nov 15, 2016 at 04:02 Matt Post <p...@cs.jhu.edu> wrote: > >> Thanks, Lewis, and Henri, for pointing this out. >> >> >>> On Nov 15, 2016, at 1:18 AM, lewis john mcgibbney <lewi...@apache.org> >> wrote: >>> >>&

  1   2   3   >