[ 
https://issues.apache.org/jira/browse/HIVE-21671?focusedWorklogId=238379&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-238379
 ]

ASF GitHub Bot logged work on HIVE-21671:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 07/May/19 11:31
            Start Date: 07/May/19 11:31
    Worklog Time Spent: 10m 
      Work Description: maheshk114 commented on pull request #615: HIVE-21671: 
Replicate Streaming ingestion with transactional batch size as 1.
URL: https://github.com/apache/hive/pull/615#discussion_r281501969
 
 

 ##########
 File path: 
streaming/src/java/org/apache/hive/streaming/AbstractRecordWriter.java
 ##########
 @@ -494,24 +494,28 @@ protected void checkAutoFlush() throws 
StreamingIOFailure {
     return addedPartitions;
   }
 
-  protected RecordUpdater createRecordUpdater(final Path partitionPath, int 
bucketId, Long minWriteId,
-    Long maxWriteID)
-    throws IOException {
+  protected RecordUpdater createRecordUpdater(List<String> partitionValues, 
final Path partitionPath,
+                                              int bucketId, Long minWriteId, 
Long maxWriteID)
+          throws IOException {
     // Initialize table properties from the table parameters. This is required 
because the table
     // may define certain table parameters that may be required while writing. 
The table parameter
     // 'transactional_properties' is one such example.
     Properties tblProperties = new Properties();
     tblProperties.putAll(table.getParameters());
-    return acidOutputFormat.getRecordUpdater(partitionPath,
-      new AcidOutputFormat.Options(conf)
-        .filesystem(fs)
-        .inspector(outputRowObjectInspector)
-        .bucket(bucketId)
-        .tableProperties(tblProperties)
-        .minimumWriteId(minWriteId)
-        .maximumWriteId(maxWriteID)
-        .statementId(statementId)
-        .finalDestination(partitionPath));
+
+    AcidOutputFormat.Options options = new AcidOutputFormat.Options(conf)
+            .filesystem(fs)
+            .inspector(outputRowObjectInspector)
+            .bucket(bucketId)
+            .tableProperties(tblProperties)
+            .minimumWriteId(minWriteId)
+            .maximumWriteId(maxWriteID)
+            .statementId(statementId)
+            .finalDestination(partitionPath);
+
+    // Add write directory information in the connection object.
+    conn.addWriteDirectoryInfo(partitionValues, 
AcidUtils.baseOrDeltaSubdirPath(partitionPath, options));
 
 Review comment:
   This will be always delta ..using base or delta may give wrong impression 
that it can be base or delete delta in some cases
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

    Worklog Id:     (was: 238379)
    Time Spent: 50m  (was: 40m)

> Replicate Streaming ingestion with transactional batch size as 1.
> -----------------------------------------------------------------
>
>                 Key: HIVE-21671
>                 URL: https://issues.apache.org/jira/browse/HIVE-21671
>             Project: Hive
>          Issue Type: Sub-task
>          Components: repl, Streaming, Transactions
>    Affects Versions: 4.0.0
>            Reporter: Sankar Hariappan
>            Assignee: Sankar Hariappan
>            Priority: Major
>              Labels: DR, pull-request-available, replication
>         Attachments: HIVE-21671.01.patch
>
>          Time Spent: 50m
>  Remaining Estimate: 0h
>
> Replication streaming ingest HiveStreamingConnection on ACID tables with 
> transaction batch size as 1.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to