Dear Wiki user, You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.
The following page has been changed by JeffHammerbacher: http://wiki.apache.org/hadoop/Hive/Roadmap ------------------------------------------------------------------------------ = 10/27/08 Roadmap Update = - 1. Integrating Dynamic SerDe with the DDL. (Zheng/Pete) - This allows the users to create typed tables along with list and map types from the DDL + # Integrating Dynamic SerDe with the DDL. (Zheng/Pete) - This allows the users to create typed tables along with list and map types from the DDL - 2. Support for Statistics. (Ashish) - These stats are needed to make optimization decisions + # Support for Statistics. (Ashish) - These stats are needed to make optimization decisions - 3. Join Optimizations. (Prasad) - Mapside joins, semi join techniques etc to do the join faster + # Join Optimizations. (Prasad) - Mapside joins, semi join techniques etc to do the join faster - 4. Predicate Pushdown Optimizations. (Namit) - pushing predicates just above the table scan for certain situations in joins as well as ensuring that only required columns are sent across map/reduce boundaries + # Predicate Pushdown Optimizations. (Namit) - pushing predicates just above the table scan for certain situations in joins as well as ensuring that only required columns are sent across map/reduce boundaries - 5. Group By Optimizations. (Joydeep) - various optimizations to make group by faster + # Group By Optimizations. (Joydeep) - various optimizations to make group by faster - 6. Optimizations to reduce the number of map files created by filter operations. (Dhrubha) - Filters with a large number of mappers produces a lot of files which slows down the following operations. This tries to address problems with that. + # Optimizations to reduce the number of map files created by filter operations. (Dhrubha) - Filters with a large number of mappers produces a lot of files which slows down the following operations. This tries to address problems with that. - 7. Transformations in LOAD. (Joydeep) - LOAD currently does not transform the input data if it is not in the format expected by the destination table. + # Transformations in LOAD. (Joydeep) - LOAD currently does not transform the input data if it is not in the format expected by the destination table. - 8. Schemaless map/reduce. (Zheng) - TRANSFORM needs schema while map/reduce is schema less. + # Schemaless map/reduce. (Zheng) - TRANSFORM needs schema while map/reduce is schema less. - 9. Improvements to TRANSFORM. (Zheng) - Make this more intuitive to map/reduce developers - evaluate some other keywords etc.. + # Improvements to TRANSFORM. (Zheng) - Make this more intuitive to map/reduce developers - evaluate some other keywords etc.. - 10. Error Reporting Improvements. (Pete) - Make error reporting for parse errors better + # Error Reporting Improvements. (Pete) - Make error reporting for parse errors better - 11. Help on CLI. (Joydeep) - add help to the CLI + # Help on CLI. (Joydeep) - add help to the CLI - 12. Explode and Collect Operators. (Zheng) - Explode and collect operators to convert collections to individual items and vice versa. + # Explode and Collect Operators. (Zheng) - Explode and collect operators to convert collections to individual items and vice versa. - 13. Propagating sort properties to destination tables. (Prasad) - If the query produces sorted we want to capture that in the destination table's metadata so that downstream optimizations can be enabled. + # Propagating sort properties to destination tables. (Prasad) - If the query produces sorted we want to capture that in the destination table's metadata so that downstream optimizations can be enabled. Other contributions from outside FB ... - 1. JDBC driver (Michi Mutsuzaki @ stanford.edu, Raghu @ stanford.edu) + # JDBC driver (Michi Mutsuzaki @ stanford.edu, Raghu @ stanford.edu) - 2. Fixes to CLI driver (Jeremy Huylebroeck) + # Fixes to CLI driver (Jeremy Huylebroeck) - 3. Web interface... + # Web interface... = Roadmap/call to add more features = The following is the list of useful features that are on the Hive Roadmap:
