awesome sounds like critical mass is happening for Joshua meet up at ApacheCon. My own schedule is in flux - if I come it may only be for a day so I’ll keep the troops posted.
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: [email protected] WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Director, Information Retrieval and Data Science Group (IRDS) Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA WWW: http://irds.usc.edu/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ -----Original Message----- From: Tom Barber <[email protected]> Reply-To: "[email protected]" <[email protected]> Date: Monday, March 14, 2016 at 8:34 AM To: "[email protected]" <[email protected]> Subject: Re: ApacheCon 2016 and Joshua >I'm off to ApacheCon, but I'm far less interesting. I don't pretend to >understand how any of it works, but as lewis will explain, I like platform >evangalism, demoing tech to "non standard" users and other bits and >pieces. >Plus I can code a bit if someone points me in the right direction :) > >From a personal perspective I find the tech interesting which is half the >reason I threw my hat in the ring, from an employment perspective, some of >my "business" clients would find "offline" translation that doesn't rely >on >big vendor API's useful, also we have done work with the UK Armed Forces >who could do with this type of stuff for translation of intelligence >material gathered from the field and requiring processing. > >Tom > >On Mon, Mar 14, 2016 at 3:26 PM, Matt Post <[email protected]> wrote: > >> Whoa! Lewis, can you give some more detail on this talk, what you >> proposed, and what you plan to talk about? >> >> I haven't ever been to ApacheCon, but am interested in going. I don't >>have >> much of a feel for what motivates folks outside the academic research >> community, and that would be good to have in laying out projects that >>might >> interest people. >> >> Regarding those project, I have a number of them. Perhaps it would be >> useful to flesh them out with some more detail, and perhaps post them, >>for >> those who are interested. First, with respect to Tommaso's question, the >> following: >> >> - Use cases. I'd really like to push machine translation as a black box, >> where people can download and use models, not caring how they work, and >> building on top of them. I think this could be transformative. I've just >> added to Joshua the ability to add, store, and manage custom phrasal >> translation rules, which would let people take a model and add their own >> translations on top of it, perhaps correcting mistakes as they encounter >> them. There's a JSON API for it (undocumented). >> >> Building this up would also require pulling together lots of different >> test sets, evaluating changes, and so on. >> >> - Neural nets. This is a huge research area. I think the advantages are >> that it could enable releasing models that are much smaller. However, on >> the down side, it's not clear what the best way to integrate these >>models >> into Joshua is. Fully neural attention models would require >>re-architecting >> Joshua, as they are essentially a new paradigm. Adding neural >>components as >> feature functions that interact with the existing decoding algorithm >>would >> be an intermediate step. >> >> For other projects, I'd love: >> >> - Better documentation, developer and end-user (probably I need to >>write a >> lot of this; if nothing else, it would be hugely useful to me in terms >>of >> prioritizing to know that people want it) >> >> - Rewriting certain components. The tuning modules, in particular, are a >> real mess, and should be synthesized and improved. >> >> - Replacing Moses components. Joshua can call out to Moses to build >>phrase >> tables; it would be nice to get rid of this (and wouldn't be that hard) >> with our own Java implementations. It would also be good to add a >> lexicalized distortion model to the phrase-based decoder. >> >> matt >> >> >> >> >> >> >> > On Mar 14, 2016, at 9:37 AM, Lewis John Mcgibbney < >> [email protected]> wrote: >> > >> > Hi Kellen, >> > I'll be there for sure. I actually had a talk accepted which is an >>entire >> > overview of Joshua and will probably touch on what the roadmap is >>through >> > incubation and beyond. >> > In short I look forward to tagging up with you and any others are Ho >>will >> > be there/we can interest in the project. >> > A side note here is that more projects and talks at ApacheCon are >> becoming >> > increasingly relevant for science and healthcare so I hope we can do a >> real >> > justice to Joshua by undertaking a driven community building effort >> > @ApacheCon. >> > This being said, let's hash out here some objectives, must haves, >>would >> > likes, roadmap, etc if you like. >> > Thanks for brining this topic up. >> > Lewis >> > >> > On Monday, March 14, 2016, Tommaso Teofili <[email protected]> >> > wrote: >> > >> >> Hi Kellen, >> >> >> >> I won't be at ApacheCon unfortunately, however your idea sounds good >>if >> >> there's anyone else joining. >> >> Perhaps we can also discuss some of the topics you mention here on >>the >> >> list, and maybe that would be of help for f2f chats too. >> >> >> >> I'd particularly interested in discussing: >> >> - use cases >> >> - ideas around usage of neural networks for MT >> >> >> >> Regards, >> >> Tommaso >> >> >> >> >> >> >> >> >> >> Il giorno lun 14 mar 2016 alle ore 12:30 kellen sunderland < >> >> [email protected] <javascript:;>> ha scritto: >> >> >> >>> Hello Joshua fans, >> >>> >> >>> I was wondering how many Joshua/Apache folks are going to be at >> ApacheCon >> >>> in May. I was hoping if there's enough of us we could get together >>and >> >>> chat about development ideas for Joshua, discuss each others use >>cases, >> >>> etc. I know that I for one would love to sync up on how the >> development >> >>> model for the project will work in the future. Maybe ApacheCon >>would >> be >> >> a >> >>> good opportunity to chat face to face. >> >>> >> >>> -Kellen Sunderland >> >>> >> >> >> > >> > >> > -- >> > *Lewis* >> >>
