[ 
https://issues.apache.org/jira/browse/GOBBLIN-1669?focusedWorklogId=792375&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-792375
 ]

ASF GitHub Bot logged work on GOBBLIN-1669:
-------------------------------------------

                Author: ASF GitHub Bot
            Created on: 18/Jul/22 22:17
            Start Date: 18/Jul/22 22:17
    Worklog Time Spent: 10m 
      Work Description: umustafi commented on code in PR #3528:
URL: https://github.com/apache/gobblin/pull/3528#discussion_r923904554


##########
gobblin-data-management/src/test/java/org/apache/gobblin/data/management/copy/TimeAwareRecursiveCopyableDatasetTest.java:
##########
@@ -216,6 +223,40 @@ public void testGetFilesAtPath() throws IOException {
       
Assert.assertTrue(candidateFiles.contains(PathUtils.getPathWithoutSchemeAndAuthority(fileStatus.getPath()).toString()));
     }
 
+    // test ds of daily/yyyy-MM-dd-HH-mm-ss
+    datePattern = "yyyy-MM-dd-HH-mm-ss";
+    formatter = DateTimeFormat.forPattern(datePattern);
+    endDate = 
LocalDateTime.now(DateTimeZone.forID(TimeAwareRecursiveCopyableDataset.DEFAULT_DATE_PATTERN_TIMEZONE));
+
+    candidateFiles = new HashSet<>();
+    for (int i = 0; i < MAX_NUM_DAILY_DIRS; i++) {

Review Comment:
   do we not want to create # directories = min(lookback, max_num_daily_dirs)? 
there could be more directories created than we want if max_num_daily_dirs is > 
lookback time right.





Issue Time Tracking
-------------------

    Worklog Id:     (was: 792375)
    Time Spent: 0.5h  (was: 20m)

> Support seconds with TimeAwareRecursiveCopyableDataset
> ------------------------------------------------------
>
>                 Key: GOBBLIN-1669
>                 URL: https://issues.apache.org/jira/browse/GOBBLIN-1669
>             Project: Apache Gobblin
>          Issue Type: Improvement
>          Components: gobblin-service
>            Reporter: William Lo
>            Assignee: Abhishek Tiwari
>            Priority: Major
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> # Support seconds with the timeiterator
>  # Optimize non-nested timestamp representations e.g. yyyy-mm-dd-hh-mm-ss to 
> not use an iterator, and instead list the top level directory to reduce the 
> number of FS calls.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to