thanks tony!

On Jan 12, 2018 10:48 PM, "Tony Kurc" <trk...@gmail.com> wrote:

> I put some of the data I was working with on the wiki -
>
> https://cwiki.apache.org/confluence/display/NIFI/NiFi+1.5.0+nar+files
>
> On Fri, Jan 12, 2018 at 10:28 PM, Jeremy Dyer <jdy...@gmail.com> wrote:
>
> > So my favorite option is Bryan’s option number “three” of using the
> > extension registry. Now my thought is do we really need to add complexity
> > and do anything in the mean time or just focus on that? Meaning we have
> > roughly 500mb of available capacity today so why don’t we spend those man
> > hours we would spend on getting the second repo up on the extension
> > registry instead?
> >
> > @Bryan do you have thoughts about the deployment of those bars in the
> > extension registry? Since we won’t be able to build the release binary
> > anymore would we still need to create separate repos for the nars or
> no?? I
> > have used the registry a little but I’m not 100% sure on your vision for
> > the nars
> >
> > - Jeremy Dyer
> >
> > Sent from my iPhone
> >
> > > On Jan 12, 2018, at 10:18 PM, Tony Kurc <tk...@apache.org> wrote:
> > >
> > > I was looking at nar sizes, and thought some data may be helpful. I
> used
> > my recent RC1 verification as a basis for getting file sizes, and just
> got
> > the file size for each file in the assembly named "*.nar". I don't know
> > whether the images I pasted in will go through, but I made some graphs.b
> > The first is a histogram of nar file size in buckets of 10MB. The second
> > basically is similar to a cumulative distribution, the x axis is the
> "rank"
> > of the nar (smallest to largest), and the y-axis is how what fraction of
> > the all the sizes of the nars together are that rank or lower. In other
> > words, on the graph, the dot at 60 and ~27 means that the smallest 60
> nars
> > contribute only ~27% of the total. Of note, the standard and framework
> nars
> > are at 83 and 84.
> > >
> > >
> > >
> > >
> > >
> > >> On Fri, Jan 12, 2018 at 5:04 PM, Michael Moser <moser...@gmail.com>
> > wrote:
> > >> And of course, as I hit <send> I thought of one more thing.
> > >>
> > >> We could keep all of the code in 1 git repo (1 project) but the
> > >> nifi-assembly part of the build could be broken up to build core NiFi
> > >> separately from the tar/zip functional grouping of other NARs.
> > >>
> > >> On Fri, Jan 12, 2018 at 5:01 PM, Michael Moser <moser...@gmail.com>
> > wrote:
> > >>
> > >> > Long term I would also like to see #3 be the solution.  I think what
> > >> > Joseph N described could be part of the capabilities of #3.
> > >> >
> > >> > I would like to add a note of caution with respect to reorganizing
> and
> > >> > releasing extension bundles separately:
> > >> >
> > >> >    - the burden on release manager expands because many more
> projects
> > >> >    have to be released; probably not all on each release cycle but
> it
> > could
> > >> >    still be many
> > >> >    - the chance of accidentally forgetting to release a project in a
> > >> >    release cycle becomes non-zero
> > >> >    - sharing code between projects gets a bit harder because you
> have
> > to
> > >> >    manage releasing projects in a specific order
> > >> >    - it becomes harder to find all of the projects that need to
> change
> > >> >    when shared code is added
> > >> >    - the simple act of finding code becomes harder ... in which
> > project
> > >> >    is that class in? (IDEs like IntelliJ can search in 1 project,
> but
> > if they
> > >> >    search across multiple projects, then I haven't learned how)
> > >> >
> > >> > I used to maintain several nars in separate projects, and recently
> > >> > reorganized them into 1 project (following NiFi's multi-module maven
> > build)
> > >> > and life has become much easier!
> > >> >
> > >> > -- Mike
> > >> >
> > >> >
> > >> >
> > >> > On Fri, Jan 12, 2018 at 4:33 PM, Chris Herrera <
> > chris.herrer...@gmail.com>
> > >> > wrote:
> > >> >
> > >> >> I very much like the solution proposed by Bryan below. This would
> > allow
> > >> >> for a cleaner docker image as well, while still proving the
> > functionality
> > >> >> as needed. For sure, the extension registry will be great, but in
> > the mean
> > >> >> time this is an adequate mid step.
> > >> >>
> > >> >> Regards,
> > >> >> Chris
> > >> >>
> > >> >> On Jan 12, 2018, 2:52 PM -0600, Bryan Bende <bbe...@gmail.com>,
> > wrote:
> > >> >> > Long term I'd like to see the extension registry take form and
> have
> > >> >> > that be the solution (#3).
> > >> >> >
> > >> >> > In the more near term, we could separate all of the NARs, except
> > for
> > >> >> > framework and maybe standard processors & services, into a
> separate
> > >> >> > git repo.
> > >> >> >
> > >> >> > In that new git repo we could organize things like Joe N just
> > >> >> > described according to some kind of functional grouping. Each of
> > these
> > >> >> > functional bundles could produce its own tar/zip which we can
> make
> > >> >> > available for download.
> > >> >> >
> > >> >> > That would separate the release cycles between core NiFi and the
> > other
> > >> >> > NARs, and also avoid having any single binary artifact that gets
> > too
> > >> >> > large.
> > >> >> >
> > >> >> >
> > >> >> >
> > >> >> > On Fri, Jan 12, 2018 at 3:43 PM, Joseph Niemiec <
> > josephx...@gmail.com>
> > >> >> wrote:
> > >> >> > > just a random thought.
> > >> >> > >
> > >> >> > > Drop In Lib packs... All the Hadoop ones in one package for
> > example
> > >> >> that
> > >> >> > > can be added to a slim Nifi install. Another may be for Cloud,
> or
> > >> >> Database
> > >> >> > > Interactions, Integration (JMS, FTP, etc) of course defining
> > these
> > >> >> groups
> > >> >> > > would be the tricky part... Or perhaps some type of installer
> > which
> > >> >> allows
> > >> >> > > you to elect which packages to download to add to the slim
> > install?
> > >> >> > >
> > >> >> > >
> > >> >> > > On Fri, Jan 12, 2018 at 3:10 PM, Joe Witt <joe.w...@gmail.com>
> > wrote:
> > >> >> > >
> > >> >> > > > Team,
> > >> >> > > >
> > >> >> > > > The NiFi convenience binary (tar.gz/zip) size has grown to
> > 1.1GB now
> > >> >> > > > in the latest release. Apache infra expanded it to 1.6GB
> > allowance
> > >> >> > > > for us but has stated this is the last time.
> > >> >> > > > https://issues.apache.org/jira/browse/INFRA-15816
> > >> >> > > >
> > >> >> > > > We need consider:
> > >> >> > > > 1) removing old nars/less commonly used nars/or particularly
> > massive
> > >> >> > > > nars from the assembly we distribute by default. Folks can
> > still use
> > >> >> > > > these things if they want just not from our convenience
> binary
> > >> >> > > > 2) collapsing nars with highly repeating deps
> > >> >> > > > 3) Getting the extension registry baked into the Flow
> Registry
> > then
> > >> >> > > > moving to separate releases for extension bundles. The main
> > release
> > >> >> > > > then would be just the NiFi framework.
> > >> >> > > >
> > >> >> > > > Any other ideas ?
> > >> >> > > >
> > >> >> > > > I'll plan to start identifying candiates for removal soon.
> > >> >> > > >
> > >> >> > > > Thanks
> > >> >> > > > Joe
> > >> >> > > >
> > >> >> > >
> > >> >> > >
> > >> >> > >
> > >> >> > > --
> > >> >> > > Joseph
> > >> >>
> > >> >
> > >> >
> > >
> >
>

Reply via email to