[ 
https://issues.apache.org/jira/browse/TAJO-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15057210#comment-15057210
 ] 

ASF GitHub Bot commented on TAJO-1997:
--------------------------------------

Github user eminency commented on a diff in the pull request:

    https://github.com/apache/tajo/pull/883#discussion_r47590084
  
    --- Diff: tajo-core/src/main/java/org/apache/tajo/engine/function/FunctionLoader.java ---
    @@ -299,4 +284,38 @@ private static StaticMethodInvocationDesc extractStaticMethodInvocation(Method m
     
         return sqlFuncs;
       }
    +
    +  public static Collection<FunctionDesc> loadFunctions(TajoConf conf) throws IOException, AmbiguousFunctionException {
    +    List<FunctionDesc> functionList = new ArrayList<>(loadBuiltinFunctions().values());
    +    List<FunctionDesc> udfs = loadUserDefinedFunctions(conf);
    +
    +    return mergeFunctionLists(functionList, udfs);
    +  }
    +
    +  @SafeVarargs
    +  static Collection<FunctionDesc> mergeFunctionLists(List<FunctionDesc> ... functionLists)
    +      throws AmbiguousFunctionException {
    +
    +    Map<Integer, FunctionDesc> funcMap = new HashMap<>();
    +    List<FunctionDesc> baseFuncList = functionLists[0];
    +
    +    // Build a map with a first list
    +    for (FunctionDesc desc: baseFuncList) {
    +      funcMap.put(desc.hashCodeWithoutType(), desc);
    +    }
    +
    +    // Check duplicates for other function lists(should be UDFs practically)
    +    for (int i=1; i<functionLists.length; i++) {
    --- End diff --
    
    I considered that.
    However, I decided against it for two reasons.
    
    First, the built-in functions are defined statically, so they rarely change. Checking them at every startup therefore seemed like needless overhead (there are more than 200 built-in functions, so this would mean more than 20K checks).
    
    Second, as you already know, the check routine does not take the function type into account. However, some built-in functions already duplicate each other apart from the function type. For example, there is sum() with and without the 'distinct' feature.
    Because the check logic would have to be different, it would need to run separately, before the current code.
    So if this check is needed at all, I think it is outside the scope of this issue.
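
    To illustrate the second point, here is a minimal, self-contained sketch. It is not the code in this pull request: Func, signatureWithoutType(), merge() and the type strings are hypothetical stand-ins for FunctionDesc, hashCodeWithoutType() and mergeFunctionLists(), and IllegalStateException stands in for AmbiguousFunctionException. It only shows why a type-blind check can be applied to UDFs but would wrongly reject the built-in sum() pair.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

class MergeSketch {
  // Hypothetical stand-in for FunctionDesc: a name, one parameter type, and a function type.
  static final class Func {
    final String name;
    final String paramType;
    final String funcType; // illustrative labels only, not Tajo's real enum

    Func(String name, String paramType, String funcType) {
      this.name = name;
      this.paramType = paramType;
      this.funcType = funcType;
    }

    // Signature that deliberately ignores the function type.
    String signatureWithoutType() {
      return name + "(" + paramType + ")";
    }
  }

  // Built-ins are trusted and only recorded; UDFs are checked against everything seen so far.
  static List<Func> merge(List<Func> builtins, List<Func> udfs) {
    Set<String> seen = new HashSet<>();
    for (Func f : builtins) {
      seen.add(f.signatureWithoutType()); // repeated signatures among built-ins are tolerated
    }
    List<Func> merged = new ArrayList<>(builtins);
    for (Func f : udfs) {
      // Set.add returns false when the signature is already present.
      if (!seen.add(f.signatureWithoutType())) {
        throw new IllegalStateException("Ambiguous function: " + f.signatureWithoutType());
      }
      merged.add(f);
    }
    return merged;
  }

  public static void main(String[] args) {
    // Two built-ins that differ only by function type, like sum() with and without 'distinct'.
    List<Func> builtins = Arrays.asList(
        new Func("sum", "int8", "AGGREGATION"),
        new Func("sum", "int8", "DISTINCT_AGGREGATION"));
    List<Func> udfs = Arrays.asList(new Func("my_len", "text", "UDF"));
    System.out.println(merge(builtins, udfs).size());
  }
}
```

    In this sketch, applying the same type-blind check to the built-in list itself would report the two sum() entries as a conflict, which is why a built-in check would need its own logic.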


> Registering UDF, it needs to check duplication
> ----------------------------------------------
>
>                 Key: TAJO-1997
>                 URL: https://issues.apache.org/jira/browse/TAJO-1997
>             Project: Tajo
>          Issue Type: Sub-task
>          Components: Function/UDF
>    Affects Versions: 0.11.0
>            Reporter: Jongyoung Park
>            Assignee: Jongyoung Park
>            Priority: Minor
>
> Currently, Tajo doesn't check whether a UDF's signature duplicates that of a
> built-in function.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
