The closest thing that's stable may be DBInputFormat, which allows you to Map/Reduce on data that's in a database and also query the same database via the native SQL interface. In this case the DB sits under or next to hadoop.
[shameless-plug] Vertica has an optimized VerticaInput/OutputFormat based on DBInputFormat that can handle large amounts of data [/shameless-plug] -----Original Message----- From: CubicDesign [mailto:[email protected]] Sent: Monday, September 14, 2009 5:04 PM To: [email protected] Subject: HadoopDB and similar stuff Hi. Anybody has experience a DB that can handle large amounts of data on top of Hadoop? HBase and Hive is nice but they also lack of some features. HadoopDB seems to bring some equilibrium. However, it seems to be still an infant project. Any thoughts?
