[ 
https://issues.apache.org/jira/browse/HCATALOG-73?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13122677#comment-13122677
 ] 

[email protected] commented on HCATALOG-73:
-------------------------------------------------------



bq.  On 2011-10-05 23:03:10, David Capwell wrote:
bq.  > 
storage-drivers/hbase/src/java/org/apache/hcatalog/hbase/HBaseDirectOutputFormat.java,
 line 40
bq.  > <https://reviews.apache.org/r/2060/diff/3/?file=48147#file48147line40>
bq.  >
bq.  >     Are you not extending TableOutputformat because of future changes 
like disabling WAL and cacheing?
bq.  >     
bq.  >     http://hbase.apache.org/book/perf.writing.html#def.log.flush
bq.  >     
bq.  >     Deferred WAL writes seems to be table only and stuff like cacheing 
can be controlled via the config object itself.

Disabling WAL is what I'd like to try but deferred WAL sounds good too. In 
general having our own OutputFormat keeps us flexible. Deferred is a table 
config tho we need it per session since we will most likely be support some 
sort of point write mechanism.


bq.  On 2011-10-05 23:03:10, David Capwell wrote:
bq.  > 
storage-drivers/hbase/src/test/org/apache/hcatalog/hbase/ManyMiniCluster.java, 
line 59
bq.  > <https://reviews.apache.org/r/2060/diff/3/?file=48149#file48149line59>
bq.  >
bq.  >     Almost every place that uses this variable turns it into a File 
object.  Can we switch this to a File?

sure


bq.  On 2011-10-05 23:03:10, David Capwell wrote:
bq.  > 
storage-drivers/hbase/src/test/org/apache/hcatalog/hbase/ManyMiniCluster.java, 
line 121
bq.  > <https://reviews.apache.org/r/2060/diff/3/?file=48149#file48149line121>
bq.  >
bq.  >     Might also need to close the FileSystem.  If you look at the HBase 
test code itself, it closes it after it stops the DFS cluster.
bq.  >     
bq.  >     FileSystem fs = this.dfsCluster.getFileSystem();
bq.  >                     if (fs != null) {
bq.  >                         fs.close();
bq.  >                     }
bq.  >                     FileSystem.closeAll();

sounds good


bq.  On 2011-10-05 23:03:10, David Capwell wrote:
bq.  > 
storage-drivers/hbase/src/test/org/apache/hcatalog/hbase/ManyMiniCluster.java, 
line 205
bq.  > <https://reviews.apache.org/r/2060/diff/3/?file=48149#file48149line205>
bq.  >
bq.  >     Why not have HBase use HDFS directly?

Right now there is no need to. We don't have tests which require DFS, it would 
just add overhead on resources and running time. I think running 
LocalFileSystem disables block caching in hbase but we're not really testing 
that.One thing I'd like to have running as well is a metastore daemon since the 
fixture is running local mode right now.


bq.  On 2011-10-05 23:03:10, David Capwell wrote:
bq.  > 
storage-drivers/hbase/src/test/org/apache/hcatalog/hbase/ManyMiniCluster.java, 
line 171
bq.  > <https://reviews.apache.org/r/2060/diff/3/?file=48149#file48149line171>
bq.  >
bq.  >     Can we have this configurable via Configuration or via constructor?

Ok changed the instantiation pattern to be a builder. now you can set 
numTaskTrackers, the different Conf objects and enabled/disable daemons. This 
way you run only what you need or you can make the tests run against an actual 
cluster. Doubt our tests will pass against a real cluster tho :D.


- Francis


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/2060/#review2371
-----------------------------------------------------------


On 2011-10-05 16:55:28, Francis Liu wrote:
bq.  
bq.  -----------------------------------------------------------
bq.  This is an automatically generated e-mail. To reply, visit:
bq.  https://reviews.apache.org/r/2060/
bq.  -----------------------------------------------------------
bq.  
bq.  (Updated 2011-10-05 16:55:28)
bq.  
bq.  
bq.  Review request for hcatalog and Vandana Ayyalasomayajula.
bq.  
bq.  
bq.  Summary
bq.  -------
bq.  
bq.  hbase direct output storage driver as describe here:
bq.  
bq.  
https://cwiki.apache.org/confluence/display/HCATALOG/HBase+Output+Storage+Driver+-+Design
bq.  
bq.  
bq.  This addresses bug hcatalog-73.
bq.      https://issues.apache.org/jira/browse/hcatalog-73
bq.  
bq.  
bq.  Diffs
bq.  -----
bq.  
bq.    
storage-drivers/hbase/src/java/org/apache/hcatalog/hbase/HBaseDirectOutputFormat.java
 PRE-CREATION 
bq.    
storage-drivers/hbase/src/java/org/apache/hcatalog/hbase/HBaseDirectOutputStorageDriver.java
 PRE-CREATION 
bq.    
storage-drivers/hbase/src/test/org/apache/hcatalog/hbase/ManyMiniCluster.java 
PRE-CREATION 
bq.    
storage-drivers/hbase/src/test/org/apache/hcatalog/hbase/SkeletonHBaseTest.java 
PRE-CREATION 
bq.    
storage-drivers/hbase/src/test/org/apache/hcatalog/hbase/TestHBaseDirectOutputStorageDriver.java
 PRE-CREATION 
bq.  
bq.  Diff: https://reviews.apache.org/r/2060/diff
bq.  
bq.  
bq.  Testing
bq.  -------
bq.  
bq.  added a unit mini cluster test framework and unit tests.
bq.  
bq.  since this is a sub-module of hcatalog parent tests shouldn't be affected.
bq.  
bq.  
bq.  Thanks,
bq.  
bq.  Francis
bq.  
bq.


                
> Output Storage Driver for HBase (Direct PUTs)
> ---------------------------------------------
>
>                 Key: HCATALOG-73
>                 URL: https://issues.apache.org/jira/browse/HCATALOG-73
>             Project: HCatalog
>          Issue Type: Sub-task
>    Affects Versions: 0.3
>            Reporter: Francis Liu
>            Assignee: Francis Liu
>              Labels: HBASE
>         Attachments: hbase_storage_driver.patch, 
> hbase_storage_driver_rev2.patch, hbase_storage_driver_rev3.patch, 
> hbase_storage_driver_rev4.patch, hbase_storage_driver_rev5.patch
>
>
> This is the first variation for an HBase Output Storage Driver. Which will be 
> doing individual PUTs for each row. The next iteration will involve the use 
> of bulk imports.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to