Common model for runners
Hi, When I build a data flow using the Beam SDK, can someone point me to the code that represents the underlying representation of the beam model itself? Is there an API that lets me retrieve the underlying protobuf-based graph for the data flow? Perhaps some pointers to what code in the runner retrieves this model in order to execute it in the specific engine? Thanks,Ron
Re: [Proposal] Apache Beam Event's Calendar
+1 to the calendar... Thanks,Ron On Friday, January 26, 2018, 2:27:34 PM PST, Griselda Cuevaswrote: Ok, so given that the proposal seem to have traction I'll proceed as follows: - I'll give all PMC members owner rights to the calendar, so they can edit and add events. - I'll share the calendar with the dev@ and user@ lists - I'll investigate how to add the calendar to the Website Proposed dynamic to add events to the calendar: - Anyone can announce an event in the mailing list - Anyone with editing rights in the calendar can add a new event [For people with Calendar editing rights only] How to add an event to the calendar: - I'll suggest we use [tags] to classify events in this calendar: [Deadline] -- Use this tag for reminders on deadlines for CFPs, Registrations, etc. [USA] -- Use this tag for events in the USA [Europe] -- Use this tag for events in Europe [APAC] -- Use this tag for events in APAC - We should add speakers, participants or content directly in the calendar space for details Cheers, G On 25 January 2018 at 01:35, Ismaël Mejía wrote: +1 I think it makes sense to separate it the calendar in two, one for the CFPs more interesting for dev@ and one for confirmed events where there will be presentations on Beam that concerns more the users (user@). It also probably makes sense to include this one in the website too. On Thu, Jan 25, 2018 at 9:46 AM, Etienne Chauchot wrote: > +1, great initiative! > > > Le 25/01/2018 à 01:05, Griselda Cuevas a écrit : > > Hi Beam Community, > > I've created this public calendar to curate events that the Apache Beam > community will attend or is organizing, as well as industry events we should > keep in the radar. With this calendar, I want to: > > Give visibility to the community on what conferences we want to participate > in > What events is our community organizing > > > My proposal: Share the calendar with the entire dev@ & users@ mailing list > and give access as admins to the few folks I know are organizing events to > help curate our events. > > If we get enough votes I'll share the calendar publicly and I'll give access > to the other folks. > > Thanks, > G > >
Re: Eclipse support
Cool thanks Daniel. Thanks,Ron On Friday, January 26, 2018, 1:31:17 PM PST, Daniel Kulp <dk...@apache.org> wrote: We do have some documentation at: https://beam.apache.org/contribute/eclipse/ But it doesn’t really work anymore. I submitted a PR: https://github.com/apache/beam/pull/4502 to use as a starting point to try and get things running in Eclipse again, but it’s not completely there yet. That said, I’m using Eclipse Oxygen, not Neon, which brings a completely new set of challenges. :( The PR will get the poms importing via m2e without error, apt running for the value types, javacc and it related things running for those things, and pretty much get all the generated code and the source directories and everything visible in eclipse so the compiler has a chance. However, I still have 20 errors related to generics that I may need help resolving. I’ll try and look into it more early next week, I hope. (Also need to copy ~/.m2/repository/kr/motd/maven/os-maven-plugin/1.5.0.Final/os-maven-plugin-1.5.0.Final.jar into your /Applications/Eclipse.app/Contents/Eclipse/plugins or equivalent directory) The eclipse-jdt profile uses an older version of Tyco and the jdt it pulls in (0.26). The PR does update Tyco to 1.0.0 which is closer to the newer eclipse compiler, but still not exact. The other thing that generates a TON of errors in Eclipse is Checkstyle. The latest Checkstyle plugin in Eclipse is based on Checkstyle 8.7. Beam uses Checkstyle 6.19. The configurations are not compatible. Longer term, we would need to update Beam to 8.x, but that may be a huge PR of mostly just formatting changes. Checkstyle has become EXTREMELY picky about a few things. It took quite a bit to get CXF updated to 8.x, touching most of the files. I’m not sure how much of an impact it would have on Beam. I’ll probably investigate that next week as well, but that may require some discussions around what the code SHOULD look like in the areas where the old checkstyle didn’t care but the new one does. We’ll see. Dan > On Jan 23, 2018, at 5:14 AM, Ismaël Mejía <ieme...@gmail.com> wrote: > > Hello again, > > The current compiler configuration in maven should be ok. So you can > test that the compilation works from outside of eclipse by running. > > mvn clean compile test-compile -Peclipse-jdt > > The fix is more about making the compiler happy about some use of > Java's generics and some other small code changes of this kind, for > reference you can take a look at the fixes Dan Kulp did for this > previously: > > https://github.com/apache/beam/commit/a0ae04bef40149cdf54d0ab50909f18a444f3023 > https://github.com/apache/beam/commit/957c545eaa33c861b561418b1c7dadf4c31f92f3 > > Regards, > Ismaël > > On Tue, Jan 23, 2018 at 2:56 AM, zlgonzalez <zlgonza...@yahoo.com> wrote: >> Thanks Ismael. I'll try and take a stab at it when I've read more about the >> eclipse compiler. >> >> In the meantime, do you have any pointers? Is it just about finding the >> right Eclipse JDT compiler options? >> >> Thanks, >> Ron >> >> >> >> Sent via the Samsung Galaxy S7 active, an AT 4G LTE smartphone >> >> Original message >> From: Ismaël Mejía <ieme...@gmail.com> >> Date: 1/22/18 1:29 AM (GMT-08:00) >> To: dev@beam.apache.org >> Subject: Re: Eclipse support >> >> Hello, >> >> Thanks for bringing this info, I tried to compile with the eclipse >> compiler and I can confirm that it does not wok, Eclipse's JDT is more >> annoying about generics so it could be related to this. >> >> Filled https://issues.apache.org/jira/browse/BEAM-3508 to track it. >> Feel free to contribute a fix if you feel like it. >> >> Ismaël >> >> >> On Sat, Jan 20, 2018 at 10:20 PM, Ron Gonzalez <zlgonza...@yahoo.com> wrote: >>> Hello again, >>> Sorry to keep asking about this, but I can't seem to get Eclipse working >>> for this project. >>> >>> I did mvn eclipse:clean eclipse:eclipse command and I've reduced the >>> problem now down to the Java generics issues related to autovalue types. I >>> had to run mvn generate-sources generate-test-sources in each of the >>> sub-modules to make a lot of the errors in Eclipse go away since after >>> running mvn -DskipTests clean install, the target/generated-sources and >>> target/generated-test-sources are empty for some reason. >>> >>> Interestingly enough, if I run mvn -Peclipse-jdt -DskipTests clean >>> install >>> from the command line, I am able to reproduce the same errors that I see >>> in >>> my Eclipse ins
Re: Eclipse support
Yes I forgot to mention that. I have done that.So the main problems are the following: 1. The generated code (like RunnerApi.java) seems to be deleted by the time the build is done. By just manully compiling beam/model and refreshing the project, I was able to resolve it and reduce the errors from 1200+ to 113, which correspond to the remaining issues #2 and #3 below. 2. Not sure if this is a JDK8 issue. Right now, importing sets it to JDK7.Description Resource Path Location TypeBound mismatch: The type Combine.PerKey is not a valid substitute for the bounded parameter > of the type AppliedPTransform<InputT,OutputT,TransformT> CombineTranslation.java /beam-runners-core-construction-java/src/main/java/org/apache/beam/runners/core/construction line 82 Java Problem 3. This is particularly strange, but I'm still trying to sort it out. Description Resource Path Location TypeColumn cannot be resolved to a type AutoValue_Column.java /beam-sdks-java-extensions-sql/target/generated-sources/annotations/org/apache/beam/sdk/extensions/sql/meta line 8 Java Problem Thanks,Ron On Wednesday, January 17, 2018, 10:34:48 AM PST, Ted Yu <yuzhih...@gmail.com> wrote: Have you tried running 'mvn eclipse:eclipse' and importing from the root of workspace ? On Wed, Jan 17, 2018 at 10:32 AM, Ron Gonzalez <zlgonza...@yahoo.com> wrote: Hi, I've been trying this for a couple of days now, but I can't seem to get a clean Eclipse import. I refreshed to latest master, got a clean mvn -DskipTests clean install, ran through the Eclipse setup steps for m2e-apt installation. I'm getting errors like below. Do you have any tips to get this going? Thanks,Ron Description Resource Path Location TypeACCUMULATING cannot be resolved to a variable WindowingStrategyTranslation. java /beam-runners-core- construction-java/src/main/ java/org/apache/beam/runners/ core/construction line 56 Java ProblemAFTER_ALL cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/ main/java/org/apache/beam/ runners/core/triggers line 34 Java ProblemAFTER_ALL cannot be resolved to a variable TriggerTranslation.java /beam-runners-core- construction-java/src/main/ java/org/apache/beam/runners/ core/construction line 241 Java ProblemAFTER_ANY cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/ main/java/org/apache/beam/ runners/core/triggers line 37 Java ProblemAFTER_ANY cannot be resolved to a variable TriggerTranslation.java /beam-runners-core- construction-java/src/main/ java/org/apache/beam/runners/ core/construction line 243 Java ProblemAFTER_EACH cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/ main/java/org/apache/beam/ runners/core/triggers line 59 Java ProblemAFTER_EACH cannot be resolved to a variable TriggerTranslation.java /beam-runners-core- construction-java/src/main/ java/org/apache/beam/runners/ core/construction line 245 Java ProblemAFTER_END_OF_WINDOW cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/ main/java/org/apache/beam/ runners/core/triggers line 40 Java ProblemAFTER_END_OF_WINDOW cannot be resolved to a variable TriggerTranslation.java /beam-runners-core- construction-java/src/main/ java/org/apache/beam/runners/ core/construction line 248 Java ProblemAFTER_PROCESSING_TIME cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/ main/java/org/apache/beam/ runners/core/triggers line 62 Java ProblemAFTER_PROCESSING_TIME cannot be resolved to a variable TriggerTranslation.java /beam-runners-core- construction-java/src/main/ java/org/apache/beam/runners/ core/construction line 276 Java ProblemAFTER_SYNCHRONIZED_PROCESSING_ TIME cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/ main/java/org/apache/beam/ runners/core/triggers line 45 Java ProblemAFTER_SYNCHRONIZED_PROCESSING_ TIME cannot be resolved to a variable TriggerTranslation.java /beam-runners-core- construction-java/src/main/ java/org/apache/beam/runners/ core/construction line 302 Java ProblemALIGN_TO cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/ main/java/org/apache/beam/ runners/core/triggers line 94 Java ProblemALIGN_TO cannot be resolved to a variable TriggerTranslation.java /beam-runners-core- construction-java/src/main/ java/org/apache/beam/runners/ core/construction line 281 Java ProblemALWAYS cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/ main/java/org/apache/beam/ runners/core/triggers line 51 Java ProblemALWAYS cannot be resolved to a variable TriggerTranslation.java /beam-runners-core- construction-java/src/main/ java/org/apache/beam/runners/ core/construction line 304 Java ProblemApiServiceDescriptor cannot be resolved GrpcFnServer.java /beam-runners-java-fn- execution/src/main/java/org
Eclipse support
Hi, I've been trying this for a couple of days now, but I can't seem to get a clean Eclipse import. I refreshed to latest master, got a clean mvn -DskipTests clean install, ran through the Eclipse setup steps for m2e-apt installation. I'm getting errors like below. Do you have any tips to get this going? Thanks,Ron Description Resource Path Location TypeACCUMULATING cannot be resolved to a variable WindowingStrategyTranslation.java /beam-runners-core-construction-java/src/main/java/org/apache/beam/runners/core/construction line 56 Java ProblemAFTER_ALL cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/main/java/org/apache/beam/runners/core/triggers line 34 Java ProblemAFTER_ALL cannot be resolved to a variable TriggerTranslation.java /beam-runners-core-construction-java/src/main/java/org/apache/beam/runners/core/construction line 241 Java ProblemAFTER_ANY cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/main/java/org/apache/beam/runners/core/triggers line 37 Java ProblemAFTER_ANY cannot be resolved to a variable TriggerTranslation.java /beam-runners-core-construction-java/src/main/java/org/apache/beam/runners/core/construction line 243 Java ProblemAFTER_EACH cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/main/java/org/apache/beam/runners/core/triggers line 59 Java ProblemAFTER_EACH cannot be resolved to a variable TriggerTranslation.java /beam-runners-core-construction-java/src/main/java/org/apache/beam/runners/core/construction line 245 Java ProblemAFTER_END_OF_WINDOW cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/main/java/org/apache/beam/runners/core/triggers line 40 Java ProblemAFTER_END_OF_WINDOW cannot be resolved to a variable TriggerTranslation.java /beam-runners-core-construction-java/src/main/java/org/apache/beam/runners/core/construction line 248 Java ProblemAFTER_PROCESSING_TIME cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/main/java/org/apache/beam/runners/core/triggers line 62 Java ProblemAFTER_PROCESSING_TIME cannot be resolved to a variable TriggerTranslation.java /beam-runners-core-construction-java/src/main/java/org/apache/beam/runners/core/construction line 276 Java ProblemAFTER_SYNCHRONIZED_PROCESSING_TIME cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/main/java/org/apache/beam/runners/core/triggers line 45 Java ProblemAFTER_SYNCHRONIZED_PROCESSING_TIME cannot be resolved to a variable TriggerTranslation.java /beam-runners-core-construction-java/src/main/java/org/apache/beam/runners/core/construction line 302 Java ProblemALIGN_TO cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/main/java/org/apache/beam/runners/core/triggers line 94 Java ProblemALIGN_TO cannot be resolved to a variable TriggerTranslation.java /beam-runners-core-construction-java/src/main/java/org/apache/beam/runners/core/construction line 281 Java ProblemALWAYS cannot be resolved to a variable TriggerStateMachines.java /beam-runners-core-java/src/main/java/org/apache/beam/runners/core/triggers line 51 Java ProblemALWAYS cannot be resolved to a variable TriggerTranslation.java /beam-runners-core-construction-java/src/main/java/org/apache/beam/runners/core/construction line 304 Java ProblemApiServiceDescriptor cannot be resolved GrpcFnServer.java /beam-runners-java-fn-execution/src/main/java/org/apache/beam/runners/fnexecution line 37 Java Problem Thanks,Ron
Re: Strata Conference this March 6-8
Works for me... On Tuesday, January 16, 2018, 5:45:33 PM PST, Holden Karau <hol...@pigscanfly.ca> wrote: How would folks feel about during the afternoon break (3:20-4:20) on the Wednesday (same day as Eugene's talk)? We could do the Philz which is a bit of a walk but gets us away from the big crowd and also lets folks not attending the conference but in the area join us. On Tue, Jan 16, 2018 at 5:29 PM, Ron Gonzalez <zlgonza...@yahoo.com> wrote: Cool, let me know if you guys finally schedule it. I will definitely try to make it to Eugene's talk but having an informal BoF in the area would be nice... Thanks,Ron On Tuesday, January 16, 2018, 5:06:53 PM PST, Boris Lublinsky <boris.lublin...@lightbend.com > wrote: All for it Boris Lublinsky FDP Architect boris.lublin...@lightbend.com https://www.lightbend.com/ On Jan 16, 2018, at 7:01 PM, Ted Yu <yuzhih...@gmail.com> wrote: +1 to BoF On Tue, Jan 16, 2018 at 5:00 PM, Dmitry Demeshchuk <dmi...@postmates.com> wrote: Probably won't be attending the conference, but totally down for a BoF. On Tue, Jan 16, 2018 at 4:58 PM, Holden Karau <hol...@pigscanfly.ca> wrote: Do interested folks have any timing constraints around a BoF? On Tue, Jan 16, 2018 at 4:30 PM, Jesse Anderson <je...@bigdatainstitute.io> wrote: +1 to BoF. I don't know if any Beam talks will be on the schedule. > We could do an informal BoF at the Philz nearby or similar? -- Twitter: https://twitter.com/h oldenkarau -- Best regards,Dmitry Demeshchuk. -- Twitter: https://twitter.com/holdenkarau
Some interesting use case
Hi, I was wondering if anyone has encountered or used Beam in the following manner: 1. During machine learning training, use Beam to create the event table. The flow may consist of some joins, aggregations, row-based transformations, etc... 2. Once the model is created, deploy the model to some scoring service via PMML (or some other scoring service). 3. Enable the SAME transformations used in #1 by using a separate engine but thereby guaranteeing that it will transform the data identically as the engine used in #1. I think this is a pretty interesting use case where Beam is used to guarantee portability across engines and deployment (batch to true streaming, not micro-batch). What's not clear to me is with respect to how batch joins would translate during one-by-one scoring (probably lookups) or how aggregations given that some kind of history would need to be stored (and how much is kept is configurable too). Thoughts? Thanks,Ron
Re: Strata Conference this March 6-8
Cool, let me know if you guys finally schedule it. I will definitely try to make it to Eugene's talk but having an informal BoF in the area would be nice... Thanks,Ron On Tuesday, January 16, 2018, 5:06:53 PM PST, Boris Lublinskywrote: All for it Boris Lublinsky FDP Architect boris.lublin...@lightbend.com https://www.lightbend.com/ On Jan 16, 2018, at 7:01 PM, Ted Yu wrote: +1 to BoF On Tue, Jan 16, 2018 at 5:00 PM, Dmitry Demeshchuk wrote: Probably won't be attending the conference, but totally down for a BoF. On Tue, Jan 16, 2018 at 4:58 PM, Holden Karau wrote: Do interested folks have any timing constraints around a BoF? On Tue, Jan 16, 2018 at 4:30 PM, Jesse Anderson wrote: +1 to BoF. I don't know if any Beam talks will be on the schedule. > We could do an informal BoF at the Philz nearby or similar? -- Twitter: https://twitter.com/h oldenkarau -- Best regards,Dmitry Demeshchuk.
Strata Conference this March 6-8
Hi, Will there be some talks or representation of Apache Beam at the coming Strata Conference this March 6-8? Would be great to hear someone talk about how Beam's been used at their company as their core data integration platform. Thanks,Ron