I asked Venkatesh a couple of questions earlier; please see the email below.
He recommended that I move the questions to the dev mailing list, hence this
email. To follow up on his responses to my queries:
(a) Multi-tenancy: If I bring in data-sets from different customers, I need
to record, annotate or tag them and provide access to each data-set only to
its relevant owners. Is it possible for me to record and manage
data-sets for different customers in a single Atlas instance? Does Atlas
provide me with the necessary constructs to separate recording of data-sets
by tenant and tracking of metadata, etc., by tenant?
(c) Performance Numbers: I understand it is built to scale given the use of
HBase, but any performance numbers that can be shared would be helpful. For
example, is there a limit to the number of data-sets I can record on Atlas?
Are there performance numbers on the number of queries?
(d) Are there companies using Atlas in production at this stage?
Thanks in advance for your responses.
On Fri, Nov 18, 2016 at 9:10 AM, Venkatesh Seetharam <venkat...@apache.org> wrote:
> Sandeep - please use the dev mailing list for atlas for a prompt response.
> (a) How can one achieve multi-tenancy on Apache Atlas?
> Can you please elaborate? You can always have a package structure for your
> data sets.
> (b) Is Atlas ready for production usage?
> It depends. I think it is, but it needs some scripting around BCP, etc.
> (c) Are there published numbers on the volume of data-sets Atlas can
> manage?
> It's built to scale; it uses Titan and HBase as a backend store, which is
> known to scale.
> On Fri, Nov 4, 2016 at 12:02 PM Sandeep Nayak <datacacoph...@gmail.com> wrote:
>> Hi Venkatesh,
>> I apologize for the direct email; if there is a better channel to surface
>> my questions, I will be happy to go there. I am subscribed to dev@atlas
>> but thought that may not be the right forum for questions potential Atlas
>> users may have.
>> I am looking for data catalog solutions and am in early evaluation. From
>> what I have read so far, it appears Apache Atlas provides most of the
>> capabilities I am looking for, namely data-set registration, lineage
>> tracking, access control (via Ranger), and auditing, to name a few.
>> I do have a couple of questions that will help me in my evaluation:
>> (a) How can one achieve multi-tenancy on Apache Atlas?
>> (b) Is Atlas ready for production usage?
>> (c) Are there published numbers on the volume of data-sets Atlas can
>> manage? One of the requirements I pointed out above is data lineage, and
>> if I am ingesting streaming and batch data sets, the typical volumes could
>> be very high.
>> Hoping you will point me in the right direction to get answers.
>> Thanks for your time and help.