[ https://issues.apache.org/jira/browse/HADOOP-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12529297 ]
Allen Wittenauer commented on HADOOP-1298: ------------------------------------------ FWIW, I've been looking at ApacheDS in its stand-alone mode to provide the LDAP and Kerberos infrastructure for the grids at Yahoo!. While it is a very young product, it holds a lot of promise.... For (likely very) small sites, I could see this as a potential win. They probably have a namenode with enough memory. If they don't, then they are essentially in the same boat as the large site situation... For large sites, there is a very high probability they already have some sort of major naming services implementation, be it Kerberos, LDAP, or otherwise. They are going to want to integrate Hadoop into those services, which means that the DS instance embedded would need to be able to replicate data from the master source. Chances are very high that their replication technologies aren't going to work with ApacheDS and the embedded DS is going to end up being nothing but a referral server, assuming it has that functionality. Sizing-wise, we've already seen what happens in the xx million case with 0.13 on a 16GB namenode. I don't think embedding or running a DS/KDC side-by-side is viable without bigger hardware. If one has that bigger hardware, they are just as likely to run a copy of their DS/KDC bits instead of using the embedded one anyway. In the case of the places without a KDC or even a DS, it might be useful instead to recommend as part of the Hadoop documentation that they setup naming services replication local to the namenode, (some value) per data nodes, and (some value) per MR nodes. If they don't have any major naming services in place already, then update the ApacheDS section where they talk about grids ( about half way down the page http://cwiki.apache.org/DIRxINTEROP/ ) to include chat about using it with Hadoop and then point to that. [Sidenote: IIRC, the MIT (and Sun and ...) KDC keeps the entire Kerberos DB in memory. I haven't worked enough with the ApacheDS to see how it functions in this regard.] > adding user info to file > ------------------------ > > Key: HADOOP-1298 > URL: https://issues.apache.org/jira/browse/HADOOP-1298 > Project: Hadoop > Issue Type: New Feature > Components: dfs, fs > Reporter: Kurtis Heimerl > Assignee: Christophe Taton > Fix For: 0.15.0 > > Attachments: 1298_2007-09-06b.patch, 1298_2007-09-07g.patch, > hadoop-user-munncha.patch17 > > > I'm working on adding a permissions model to hadoop's DFS. The first step is > this change, which associates user info with files. Following this I'll > assoicate permissions info, then block methods based on that user info, then > authorization of the user info. > So, right now i've implemented adding user info to files. I'm looking for > feedback before I clean this up and make it offical. > I wasn't sure what release, i'm working off trunk. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.