[jira] [Commented] (FALCON-630) late data rerun for process broken in trunk

Venkatesh Seetharam (JIRA) Fri, 22 Aug 2014 11:37:10 -0700

    [ https://issues.apache.org/jira/browse/FALCON-630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14107260#comment-14107260 ]


Venkatesh Seetharam commented on FALCON-630:
--------------------------------------------

I have a few questions:

* Why do you need this? How is this different from feedNames?
{code}
+    INPUT_NAMES("falconInputs", "name of the inputs", false),
+    INPUT_STORAGE_TYPES("falconInputFeedStorageTypes", "input storage types", false),
 {code}

* +1 for this.
 {code}
     // what outputs
-    FEED_NAMES("feedNames", "name of the feeds which are generated/replicated/deleted"),
-    FEED_INSTANCE_PATHS("feedInstancePaths", "comma separated feed instance paths"),
+    OUTPUT_FEED_NAMES("feedNames", "name of the feeds which are generated/replicated/deleted"),
+    OUTPUT_FEED_PATHS("feedInstancePaths", "comma separated feed instance paths"),
 {code}

I'd like to reduce the payload and make it easier to evolve, but FALCON-327 
does not solve this. Also, replication involves a cluster pair, while every 
other lifecycle has only one cluster. How do we make this seamless instead of 
having to add it in 100 places?

> late data rerun for process broken in trunk 
> --------------------------------------------
>
>                 Key: FALCON-630
>                 URL: https://issues.apache.org/jira/browse/FALCON-630
>             Project: Falcon
>          Issue Type: Bug
>          Components: rerun
>    Affects Versions: 0.5
>            Reporter: Samarth Gupta
>            Assignee: Shwetha G S
>            Priority: Blocker
>             Fix For: 0.4
>
>         Attachments: FALCON-630.patch
>
>
> Late data rerun for process is not working. It seems that in pre-processing, 
> the record size is stored by feed name and not by input name, due to which 
> late data is never detected. 
> {code}
>                     -falconInputFeeds
>                     FETL2-RRLog#FETL-RTBS-PRLog#FETL-RTBS-NPRLog
> {code}
> Above, even though the param in the tasktracker logs says InputFeeds, the 
> values are actually feed names. 
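
The mismatch described in the report can be illustrated with a minimal sketch. This is not Falcon's code: the class, both methods, the input name "inputA", and the sizes are hypothetical, and it assumes that a missing recorded size is treated as "nothing to compare", which is consistent with late data never being detected. Only the feed name FETL2-RRLog comes from the log excerpt above.

```java
import java.util.HashMap;
import java.util.Map;

public class LateDataSketch {

    // Hypothetical pre-processing step: remember the size seen for each key.
    public static Map<String, Long> record(String[] keys, long[] sizes) {
        Map<String, Long> recorded = new HashMap<>();
        for (int i = 0; i < keys.length; i++) {
            recorded.put(keys[i], sizes[i]);
        }
        return recorded;
    }

    // Hypothetical late-data check: data counts as late only if a size was
    // recorded under this key and the current size differs from it.
    // Assumption: a missing entry means "not late".
    public static boolean isLate(Map<String, Long> recorded, String key, long currentSize) {
        Long previous = recorded.get(key);
        return previous != null && previous != currentSize;
    }

    public static void main(String[] args) {
        // Sizes recorded under the feed name, then looked up by input name:
        // the lookup misses, so the size change goes unnoticed.
        Map<String, Long> byFeedName = record(new String[] {"FETL2-RRLog"}, new long[] {100L});
        System.out.println(isLate(byFeedName, "inputA", 150L)); // prints false

        // Sizes recorded under the input name: the same change is detected.
        Map<String, Long> byInputName = record(new String[] {"inputA"}, new long[] {100L});
        System.out.println(isLate(byInputName, "inputA", 150L)); // prints true
    }
}
```

In this sketch the only difference between the two cases is the key used when the size is recorded, which is the behavior the report attributes to the pre-processing step.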



--
This message was sent by Atlassian JIRA
(v6.2#6252)
