Maybe related to or not, but would we want to create a new branch "1.1.x-branch", and make "1.x-branch" target for 1.2?
I'm not clear we don't release 1.2 for moving toward to 2.0.0, so hence the question. - Jungtaek Lim (HeartSaVioR) 2017년 3월 29일 (수) 오전 1:56, Hugo Da Cruz Louro <hlo...@hortonworks.com>님이 작성: > +1 for finishing the porting to Java ahead of anything else - it will be a > significant milestone. I have a JIRA assigned concerning to the porting. I > will work on it for the 2.0 release. > > it’s a priority to guarantee no performance regressions. As part of this > endeavor, explore an automated (or easy) way to run and assert major > performance benchmarks. Ideally any contributor should be able to fairly > easily test the impact of changes under certain performance test scenarios. > > Beam Runner work should take into account the impact of incorporating new > JStorm features and Storm Worker Redesign< > https://issues.apache.org/jira/browse/STORM-2284>. Not very efficient to > start doing it, to find out that it will have to chance in face of Storm > and worker redesign. That is, it should be done after it’s building blocks > are stable. > > Thanks, > Hugo > > On Mar 24, 2017, at 12:07 AM, Arun Mahadevan <ar...@apache.org<mailto: > ar...@apache.org>> wrote: > > +1 to release with the porting completed. I think its mainly the UI server > and log viewer that’s pending. > > We can start doing the regression and performance tests for whatever is > already ported. > > If anyone is running the master branch in their pre-prod / prod > environments, it will be good to know and give us more confidence. > > The other features can be added in follow up releases. > > Regards, > Arun > > > On 3/24/17, 11:47 AM, "Satish Duggana" <satish.dugg...@gmail.com<mailto: > satish.dugg...@gmail.com>> wrote: > > +1 to have 2.0 with porting and performance(it should be at least as good > as 1.x release) issues addressed > > We can target other tasks(mentioned by Taylor and Jungtaek) for 2.x-branch. > > > Exactly-once support: > While thinking through the exactlyonce support design, it is realized > better to avoid acking tuples and implement exactly once by snapshotting > barriers. It seems JStorm folks followed similar design, they claim it > gives better performance. This feature is essential for beam runner and we > can decide on respective approaches though. > > Beam Runner > Lets hold on this for now and keep it in Storm till 2.x. We should avoid > having a minimal beam runner in haste. It is better to address STORM-2284, > exactly-once and other windowing enhancements to enable beam runner. > > JStorm > Agree with Jungtaek on looking at the latest JStorm and align/scope with > the features for 2.x. > > STORM-2284 > We may want to look at JStorm worker before working on respective > components in this epic to pull appropriate enhancements. > > YARN/MESOS > Supporting Storm on YARN/Mesos for 2.x. > > Thanks, > Satish. > > > On Fri, Mar 24, 2017 at 9:09 AM, Jungtaek Lim <kabh...@gmail.com<mailto: > kabh...@gmail.com>> wrote: > > First of all, +1 to complete only port work and do sanity check (including > performance regression), and release. > > If we can get STORM-2284 within deterministic time frame (say 2~3 months) > that should be great, but if not I'd in favor of postponing that to later > 2.x release. > > JStorm released their new versions after code donation. So there're more > things we could get ideas from, or even adopt from. > https://github.com/alibaba/jstorm/blob/master/history.md > As you noticed from release note link, we also need to update phase 2 since > they already changed what we're planning to do in phase 2. For example, > they changed backpressure to end-to-end, and changed to use snapshot rather > than acker. > May be sure, JStorm pulled many features from today's Storm, like Flux, > Windowing, more shuffle groupings, log search, log level change, and so on. > > STORM-2426 <https://issues.apache.org/jira/browse/STORM-2426> is due to > the > limitation of Spout lifecycle (all the things are done in single thread), > and STORM-1358 <https://issues.apache.org/jira/browse/STORM-1358>(JStorm's > multi-thread Spout) can remedy this (despite that Spout implementation may > need to guarantee thread-safety later). It's not a just improvement but > close to design concern so would like to address sooner than other things > in phase 2. > > For Storm SQL side, I've lost progress but major work would be adopting > group by with windowing. It was not available from Calcite but will be > available at next release (1.12.0). > I've filed this to STORM-2405 > <https://issues.apache.org/jira/browse/STORM-2405>, but windowing & micro > batch is not intuitive, so I would like to change the underlying API to > stream API in SQL. Also filed this to STORM-2406 > <https://issues.apache.org/jira/browse/STORM-2406>. > > Just 2 cents btw, hopefully I would like to see metrics V2 sooner since we > lost metrics even when doing normal operation like restarting worker, > rebalancing, and so on. Eventually we need to fight with dynamic scaling, > and then metrics will be broken often. > > Thanks, > Jungtaek Lim (HeartSaVioR) > > 2017년 3월 24일 (금) 오전 5:05, Harsha Chintalapani <st...@harsha.io>님이 작성: > > Storm 2.0 migration to java in itself is a big win and would attract > wider > community and adoption. So my vote would be to resolve the first 3 items > to > get a release out. > All the other featured mentioned are great to have but shouldn't be > blockers for 2.0 release. > > -Harsha > > On Thu, Mar 23, 2017 at 11:51 AM P. Taylor Goetz <ptgo...@gmail.com> > wrote: > > With the 1.1.0 release nearing completion, I’d like to turn our > attention > to 2.0 and develop a plan for what features, etc. to include. > > The following 3 are what I feel are the minimum for a 2.0 release. > These > could likely be resolved relatively quickly: > > * Performance — I’ve not benchmarked the master branch vs. 1.0.x or > 1.1.x > in a while, but I feel it will be important to make sure there are no > performance regressions, and would hope that we actually have a > performance > improvement over previous versions. To that end (e.g. if there is in > fact a > performance regression), the proposals that Roshan Naik put together > for > revising the threading and execution model (STORM-2307) and replacing > Disruptor with JCTools (STORM-2306) warrant review and consideration. > See > also STORM-2284 which is the parent JIRA. > > * Finish porting Storm UI to java (STORM-1311) > > * Finish porting log viewer to java (STORM-1280) > > The following are items that are nice to have in 2.0, but I don’t feel > are > absolutely necessary for an initial 2.0 release: > > * Beam Runner (I wouldn’t tie this to 2.0, mentioning it because it’s > relevant) — Initially there seemed to be a lot of interest in this, but > that seems to have trailed off. I spoke with some Beam developers and > there > seems to be interest from that community as well. Do we want to move > that > effort to the Beam community, or keep it here? Moving it to the Beam > community might lead to better collaboration between projects. > > * Bounded Spouts (needed for Beam Runner implementation) — Currently > spouts are unbounded, there no end to the stream. Beam has the concept > of > bounded sources (roughly analogous to batch processing). To support > that, > we would need to implement a similar concept in Storm. One benefit of > such > a feature would be the ability to handle both bounded and unbounded > workflows in Storm. > > * Storm-SQL — Jungtaek/Xin: You have been the primary drivers behind > this > effort. What improvements do you envision for 2.0? > > * Metrics V2 (STORM-2153: Coda Hale Metrics) — I’ve been targeting this > for 1.2.0, but it’s designed to be easily portable to master/2.0. > > * JStorm Migration — Original outline can be found here [1]. Note a lot > of > the associated JIRAs below are assigned, but there hasn’t been any > recent > activity or pull requests, we should probably consider them unassigned > and > up for grabs.: > > * Worker Classloader Isolation (STORM-1338) — Lack of this has been the > bane of a lot of Storm users almost since day one. We have largely > addressed it by shading/relocating dependencies. It would be great to > see > this addressed once and for all. > > * JStorm back pressure implementation (STORM-1324) — The current back > pressure implementation leaves a bit to be desired, and the JStorm > approach > looks promising, though it also depends on the JStorm concept of > “topology > master” (STORM-1323), which may have some implications regarding > security. > > * Dynamic Topology Updates (STORM-1335) — This would provide a command > to > update topology jars and configuration without stopping the topology, > and > is well suited to leverage the blobstore. The restart command (that can > also update the topology configuration) also looks compelling > (STORM-1334). > > * Additional Scheduler Implementations (STORM-1320) > > * Additional Grouping Implementations (STORM-1328) > > > As always I’m open to any opinions and suggestions. > > -Taylor > > [1] > > https://cwiki.apache.org/confluence/pages/viewpage. > action?pageId=61328109 > > > > > > > > > >