[ https://issues.apache.org/jira/browse/HADOOP-1913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12529990 ]
Hadoop QA commented on HADOOP-1913: ----------------------------------- +1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12366485/build_table_index.take6.patch against trunk revision r578879. @author +1. The patch does not contain any @author tags. javadoc +1. The javadoc tool did not generate any warning messages. javac +1. The applied patch does not generate any new compiler warnings. findbugs +1. The patch does not introduce any new Findbugs warnings. core tests +1. The patch passed core unit tests. contrib tests +1. The patch passed contrib unit tests. Test results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/816/testReport/ Findbugs warnings: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/816/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html Checkstyle results: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/816/artifact/trunk/build/test/checkstyle-errors.html Console output: http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Patch/816/console This message is automatically generated. > [HBase] Build a Lucene index on an HBase table > ---------------------------------------------- > > Key: HADOOP-1913 > URL: https://issues.apache.org/jira/browse/HADOOP-1913 > Project: Hadoop > Issue Type: New Feature > Components: contrib/hbase > Reporter: Ning Li > Assignee: stack > Priority: Minor > Fix For: 0.15.0 > > Attachments: build_table_index.patch, > build_table_index.take2.again.patch, build_table_index.take2.patch, > build_table_index.take3.patch, build_table_index.take4.patch, > build_table_index.take5.patch, build_table_index.take6.patch > > > This patch provides a Reducer class and other related classes which help to > build a Lucene index on an HBase table. The index build part is similar to > that of Nutch. > - Each row is modeled as a Lucene document: row key is indexed in its > untokenized form, column name-value pairs are Lucene field name-value pairs. > - IndexConf is used to configure various Lucene parameters, specify whether > to optimize an index and which columns to index and/or store, in tokenized or > untokenized form, etc. > - The number of reduce tasks decides the number of indexes (partitions). > The index(es) is stored in the output path of job configuration. > - The index build process is done in the reduce phase. Users can use the > map phase to join rows from different tables or to pre-parse/analyze column > content, etc. > - A junit test is added to test the build of an index on an HBase table > with an identity mapper. It also serves as an example on how to use the new > classes. > - BuildTableIndex is provided to help building an index on an HBase table. > It should be moved to examples package if HBase decides to have one. > This patch requires the inclusion of the Lucene library. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.