HADOOP-4044 is scheduled to finally make it into the 0.21 release, and 0.21 is still a while away.
That said, if one imports a dataset (a set of files, or a directory) into a warehouse, isn't it safer to move that dataset into the warehouse itself rather than letting it sit outside? For one thing, the target of the symlink might not be accessible to all Hadoop slave nodes.

-dhruba

On Sat, Apr 18, 2009 at 7:41 PM, Edward Capriolo <[email protected]> wrote:
> I was looking at HADOOP-4044. It would be nice to be able to work on
> files without moving them into the warehouse. Could a SerDe handle a
> similar task?
