[jira] [Created] (ARROW-7108) [C++][Gandiva] Use appropriate flag for string functions which can return errors

2019-11-10 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7108:
-

 Summary: [C++][Gandiva] Use appropriate flag for string functions 
which can return errors
 Key: ARROW-7108
 URL: https://issues.apache.org/jira/browse/ARROW-7108
 Project: Apache Arrow
  Issue Type: Bug
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda


Functions that can return errors should have the flag 
NativeFunction::kCanReturnErrors set.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-7308) [C++][Gandiva] Implement castVARCHAR(float_type) functions in Gandiva

2019-12-04 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7308:
-

 Summary: [C++][Gandiva] Implement castVARCHAR(float_type) 
functions in Gandiva
 Key: ARROW-7308
 URL: https://issues.apache.org/jira/browse/ARROW-7308
 Project: Apache Arrow
  Issue Type: Task
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda








[jira] [Created] (ARROW-7300) [C++][Gandiva] Implement functions to cast from strings to integers/floats

2019-12-02 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7300:
-

 Summary: [C++][Gandiva] Implement functions to cast from strings 
to integers/floats
 Key: ARROW-7300
 URL: https://issues.apache.org/jira/browse/ARROW-7300
 Project: Apache Arrow
  Issue Type: Task
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda








[jira] [Created] (ARROW-7359) [C++][Gandiva] Don't throw error for locate function with start position exceeding string length, return 0 instead

2019-12-09 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7359:
-

 Summary: [C++][Gandiva] Don't throw error for locate function with 
start position exceeding string length, return 0 instead
 Key: ARROW-7359
 URL: https://issues.apache.org/jira/browse/ARROW-7359
 Project: Apache Arrow
  Issue Type: Bug
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda








[jira] [Created] (ARROW-7205) [C++][Gandiva] Implement regexp_matches, regexp_like functions in Gandiva

2019-11-19 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7205:
-

 Summary: [C++][Gandiva] Implement regexp_matches, regexp_like 
functions in Gandiva
 Key: ARROW-7205
 URL: https://issues.apache.org/jira/browse/ARROW-7205
 Project: Apache Arrow
  Issue Type: Task
Reporter: Projjal Chanda








[jira] [Created] (ARROW-7215) [C++][Gandiva] Implement castVARCHAR(integer_type) functions in Gandiva

2019-11-20 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7215:
-

 Summary: [C++][Gandiva] Implement castVARCHAR(integer_type) 
functions in Gandiva
 Key: ARROW-7215
 URL: https://issues.apache.org/jira/browse/ARROW-7215
 Project: Apache Arrow
  Issue Type: Task
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda


Support the following function signature in Gandiva:
FunctionSignature{name=castVARCHAR, return type=Utf8, param types=[integer_type, 
Int(64, true)]}
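
As a rough illustration of what such a signature might compute, here is a minimal C++ sketch. It assumes the second (64-bit integer) parameter is the maximum output length and that longer results are truncated; the issue does not state this. The name cast_varchar_sketch is hypothetical and is not Gandiva's implementation.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>

// Hypothetical sketch, not Gandiva code.
// Converts an integer to its decimal text form and keeps at most `max_len`
// characters. ASSUMPTION: the second parameter is a maximum output length.
inline std::string cast_varchar_sketch(int64_t value, int64_t max_len) {
  if (max_len <= 0) {
    return std::string();  // ASSUMPTION: non-positive lengths yield an empty string
  }
  std::string text = std::to_string(value);
  if (static_cast<int64_t>(text.size()) > max_len) {
    text.resize(static_cast<std::size_t>(max_len));  // drop the trailing digits
  }
  return text;
}
```

Under these assumptions, cast_varchar_sketch(12345, 3) keeps only the first three digits, while a value whose text already fits is returned unchanged.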





[jira] [Created] (ARROW-7262) [C++][Gandiva] Implement replace function in Gandiva

2019-11-26 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7262:
-

 Summary: [C++][Gandiva] Implement replace function in Gandiva
 Key: ARROW-7262
 URL: https://issues.apache.org/jira/browse/ARROW-7262
 Project: Apache Arrow
  Issue Type: Task
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda


Add a utf8 replace(utf8, utf8, utf8) function to Gandiva.
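
A minimal C++ sketch of what a three-argument replace could do, assuming the arguments are (text, from, to) and that every non-overlapping occurrence is replaced from left to right. The argument order, the handling of an empty search string, and the name replace_sketch are assumptions for illustration, not Gandiva's implementation.

```cpp
#include <cstddef>
#include <string>
#include <string_view>

// Hypothetical sketch, not Gandiva code.
// Replaces every non-overlapping occurrence of `from` in `text` with `to`.
inline std::string replace_sketch(std::string_view text, std::string_view from,
                                  std::string_view to) {
  if (from.empty()) {
    return std::string(text);  // ASSUMPTION: an empty search string changes nothing
  }
  std::string out;
  out.reserve(text.size());
  std::size_t pos = 0;
  while (true) {
    const std::size_t hit = text.find(from, pos);
    if (hit == std::string_view::npos) {
      out.append(text.substr(pos));  // copy the unmatched tail
      break;
    }
    out.append(text.substr(pos, hit - pos));  // copy the text before the match
    out.append(to);
    pos = hit + from.size();  // continue after the matched span
  }
  return out;
}
```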





[jira] [Created] (ARROW-7263) [C++][Gandiva] Implement locate and position functions

2019-11-26 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7263:
-

 Summary: [C++][Gandiva] Implement locate and position functions
 Key: ARROW-7263
 URL: https://issues.apache.org/jira/browse/ARROW-7263
 Project: Apache Arrow
  Issue Type: Task
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda


Add int32 locate(utf8, utf8, int32) and int32 locate(utf8, utf8) functions. 
Same for position.
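
The following C++ sketch shows one plausible reading of these signatures. It assumes the argument order (substring, string, start position), 1-based byte positions, and a return value of 0 when nothing is found or when the start position lies beyond the string (the behaviour requested in ARROW-7359). The name locate_sketch and the handling of start positions below 1 are assumptions, not Gandiva's implementation.

```cpp
#include <cstddef>
#include <cstdint>
#include <string_view>

// Hypothetical sketch, not Gandiva code.
// Returns the 1-based byte position of the first occurrence of `sub` in `str`
// at or after the 1-based `start_pos`, or 0 when there is no such occurrence.
inline int32_t locate_sketch(std::string_view sub, std::string_view str,
                             int32_t start_pos = 1) {
  if (start_pos < 1) {
    return 0;  // ASSUMPTION: the real function may report an error here instead
  }
  const std::size_t offset = static_cast<std::size_t>(start_pos - 1);
  if (offset > str.size()) {
    return 0;  // start position exceeds the string length
  }
  const std::size_t found = str.find(sub, offset);
  return found == std::string_view::npos ? 0 : static_cast<int32_t>(found + 1);
}
```

The two-argument form corresponds to calling locate_sketch with the default start position of 1.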





[jira] [Created] (ARROW-7278) [C++][Gandiva] Implement Boyer-Moore string search algorithm for functions doing string matching

2019-11-28 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7278:
-

 Summary: [C++][Gandiva] Implement Boyer-Moore string search 
algorithm for functions doing string matching
 Key: ARROW-7278
 URL: https://issues.apache.org/jira/browse/ARROW-7278
 Project: Apache Arrow
  Issue Type: Task
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda


Discussed in https://github.com/apache/arrow/pull/5902#discussion_r351159392
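
For orientation only, the C++17 standard library already provides a Boyer-Moore searcher, so the technique can be tried without a hand-written implementation. The sketch below uses std::boyer_moore_searcher with std::search; the helper name find_boyer_moore is hypothetical, and nothing here is taken from the linked discussion or from Gandiva's code.

```cpp
#include <algorithm>
#include <cstddef>
#include <functional>
#include <string_view>

// Hypothetical helper, not Gandiva code.
// Returns the 0-based byte offset of the first occurrence of `needle` in
// `haystack`, or -1 if it does not occur.
inline std::ptrdiff_t find_boyer_moore(std::string_view haystack,
                                       std::string_view needle) {
  // The searcher preprocesses the needle once; it can be reused across haystacks.
  const std::boyer_moore_searcher searcher(needle.begin(), needle.end());
  const auto it = std::search(haystack.begin(), haystack.end(), searcher);
  return it == haystack.end() ? -1 : it - haystack.begin();
}
```

Because the preprocessing depends only on the pattern, this approach tends to pay off when the same pattern is matched against many input strings.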





[jira] [Created] (ARROW-7447) [Java] ComplexCopier does incorrect copy in some cases

2019-12-20 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7447:
-

 Summary: [Java] ComplexCopier does incorrect copy in some cases
 Key: ARROW-7447
 URL: https://issues.apache.org/jira/browse/ARROW-7447
 Project: Apache Arrow
  Issue Type: Bug
  Components: Java
Reporter: Projjal Chanda
Assignee: Projjal Chanda


For list union vectors, ComplexCopier does not copy the data correctly.





[jira] [Created] (ARROW-7934) [C++] Fix UriEscape for empty string

2020-02-25 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7934:
-

 Summary: [C++] Fix UriEscape for empty string
 Key: ARROW-7934
 URL: https://issues.apache.org/jira/browse/ARROW-7934
 Project: Apache Arrow
  Issue Type: Task
Reporter: Projjal Chanda
Assignee: Projjal Chanda








[jira] [Created] (ARROW-7993) [Java] Support decimal type in ComplexCopier

2020-03-02 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7993:
-

 Summary: [Java] Support decimal type in ComplexCopier
 Key: ARROW-7993
 URL: https://issues.apache.org/jira/browse/ARROW-7993
 Project: Apache Arrow
  Issue Type: Task
  Components: Java
Reporter: Projjal Chanda
Assignee: Projjal Chanda








[jira] [Created] (ARROW-7660) [C++][Gandiva] Optimise castVarchar(string, int) function for single byte characters

2020-01-23 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7660:
-

 Summary: [C++][Gandiva] Optimise castVarchar(string, int) function 
for single byte characters
 Key: ARROW-7660
 URL: https://issues.apache.org/jira/browse/ARROW-7660
 Project: Apache Arrow
  Issue Type: Task
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda


The current castVarchar function checks the string byte by byte to handle 
multibyte characters. Since most strings consist of single-byte characters, 
optimise for that case and move to the slow path only when multibyte 
characters are detected.
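
A minimal C++ sketch of the fast-path idea, assuming valid UTF-8 input and assuming the integer argument is a maximum length in characters. The helper names utf8_char_len and truncated_byte_len are hypothetical; this is not the Gandiva implementation.

```cpp
#include <cstddef>
#include <string_view>

// Hypothetical helpers, not Gandiva code. ASSUMPTION: the input is valid UTF-8.

// Byte length of a UTF-8 character, derived from its lead byte.
inline std::size_t utf8_char_len(unsigned char lead) {
  if (lead < 0x80) return 1;
  if ((lead & 0xE0) == 0xC0) return 2;
  if ((lead & 0xF0) == 0xE0) return 3;
  return 4;
}

// Returns how many bytes cover the first `max_chars` characters of `text`.
inline std::size_t truncated_byte_len(std::string_view text, std::size_t max_chars) {
  // Every character takes at least one byte, so a short string needs no scan.
  if (text.size() <= max_chars) return text.size();

  // Fast path: while bytes are ASCII, byte count equals character count.
  std::size_t i = 0;
  while (i < max_chars && static_cast<unsigned char>(text[i]) < 0x80) ++i;
  if (i == max_chars) return max_chars;

  // Slow path: a multibyte character was seen, so advance one character at a time.
  std::size_t chars = i;
  while (chars < max_chars && i < text.size()) {
    i += utf8_char_len(static_cast<unsigned char>(text[i]));
    ++chars;
  }
  return i < text.size() ? i : text.size();
}
```

For pure ASCII input the function never inspects lead-byte patterns; the per-character decoding starts only at the first non-ASCII byte.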





[jira] [Created] (ARROW-7793) [Java] If the child allocators leak memory the base allocator doesn't release its memory before throwing exception

2020-02-07 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7793:
-

 Summary: [Java] If the child allocators leak memory the base 
allocator doesn't release its memory before throwing exception
 Key: ARROW-7793
 URL: https://issues.apache.org/jira/browse/ARROW-7793
 Project: Apache Arrow
  Issue Type: Bug
Reporter: Projjal Chanda
Assignee: Projjal Chanda


The code at [1] should call super.close() before throwing the exception.

[1] 
https://github.com/apache/arrow/blob/762202418541e843923b8cae640d15b4952a0af6/java/memory/src/main/java/org/apache/arrow/memory/BaseAllocator.java#L481





[jira] [Created] (ARROW-7882) [C++][Gandiva] Optimise like function for substring pattern

2020-02-19 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7882:
-

 Summary: [C++][Gandiva] Optimise like function for substring 
pattern
 Key: ARROW-7882
 URL: https://issues.apache.org/jira/browse/ARROW-7882
 Project: Apache Arrow
  Issue Type: Task
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda








[jira] [Created] (ARROW-7529) [C++][Gandiva] Handle utf8 characters for castVARCHAR(string, int) function

2020-01-09 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7529:
-

 Summary: [C++][Gandiva] Handle utf8 characters for 
castVARCHAR(string, int) function
 Key: ARROW-7529
 URL: https://issues.apache.org/jira/browse/ARROW-7529
 Project: Apache Arrow
  Issue Type: Task
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda








[jira] [Created] (ARROW-7589) [C++][Gandiva] Calling castVarchar from Java sometimes results in a segmentation fault for input length 0

2020-01-15 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7589:
-

 Summary: [C++][Gandiva] Calling castVarchar from Java sometimes 
results in a segmentation fault for input length 0
 Key: ARROW-7589
 URL: https://issues.apache.org/jira/browse/ARROW-7589
 Project: Apache Arrow
  Issue Type: Bug
Reporter: Projjal Chanda
Assignee: Projjal Chanda








[jira] [Created] (ARROW-7615) [CI][Gandiva] Ensure that the gandiva jar has only a whitelisted set of shared dependencies as part of Travis CI job

2020-01-20 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7615:
-

 Summary: [CI][Gandiva] Ensure that the gandiva jar has only a 
whitelisted set of shared dependencies as part of Travis CI job
 Key: ARROW-7615
 URL: https://issues.apache.org/jira/browse/ARROW-7615
 Project: Apache Arrow
  Issue Type: Task
Reporter: Projjal Chanda
Assignee: Projjal Chanda








[jira] [Created] (ARROW-7627) [C++][Gandiva] Optimize string truncate function

2020-01-21 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7627:
-

 Summary: [C++][Gandiva] Optimize string truncate function
 Key: ARROW-7627
 URL: https://issues.apache.org/jira/browse/ARROW-7627
 Project: Apache Arrow
  Issue Type: Task
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda


The current string truncate function unnecessarily traverses the string 
twice. This can be done in one pass.





[jira] [Created] (ARROW-7473) [C++][Gandiva] Improve error message for locate function

2019-12-26 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7473:
-

 Summary: [C++][Gandiva] Improve error message for locate function
 Key: ARROW-7473
 URL: https://issues.apache.org/jira/browse/ARROW-7473
 Project: Apache Arrow
  Issue Type: Bug
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda


Mention the erroneous index position in the error message.





[jira] [Created] (ARROW-7466) [CI][Java] Fix gandiva-jar-osx nightly build failure

2019-12-22 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7466:
-

 Summary: [CI][Java] Fix gandiva-jar-osx nightly build failure
 Key: ARROW-7466
 URL: https://issues.apache.org/jira/browse/ARROW-7466
 Project: Apache Arrow
  Issue Type: Bug
  Components: Continuous Integration
Reporter: Projjal Chanda
Assignee: Projjal Chanda


The gandiva-jar-osx nightly build has been failing for the past few days. 
According to [https://github.com/google/error-prone/issues/1441], the cause 
seems to be that error-prone version 2.3.3, which is currently used, is 
incompatible with the Java 13 used in the nightly build. Updating it to 2.3.4 
should fix this.





[jira] [Created] (ARROW-7496) [CI][Gandiva] Use static protobuf dependency while building Gandiva jar

2020-01-06 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-7496:
-

 Summary: [CI][Gandiva] Use static protobuf dependency while 
building Gandiva jar
 Key: ARROW-7496
 URL: https://issues.apache.org/jira/browse/ARROW-7496
 Project: Apache Arrow
  Issue Type: Task
Reporter: Projjal Chanda
Assignee: Projjal Chanda








[jira] [Created] (ARROW-8086) [Java] Support writing decimal from big endian byte array in UnionListWriter

2020-03-12 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-8086:
-

 Summary: [Java] Support writing decimal from big endian byte array 
in UnionListWriter
 Key: ARROW-8086
 URL: https://issues.apache.org/jira/browse/ARROW-8086
 Project: Apache Arrow
  Issue Type: Task
Reporter: Projjal Chanda
Assignee: Projjal Chanda








[jira] [Created] (ARROW-8393) [C++][Gandiva] Make gandiva function registry case-insensitive

2020-04-10 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-8393:
-

 Summary: [C++][Gandiva] Make gandiva function registry 
case-insensitive
 Key: ARROW-8393
 URL: https://issues.apache.org/jira/browse/ARROW-8393
 Project: Apache Arrow
  Issue Type: Task
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda








[jira] [Created] (ARROW-8412) [C++][Gandiva] Fix gandiva date_diff function definitions

2020-04-12 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-8412:
-

 Summary: [C++][Gandiva] Fix gandiva date_diff function definitions
 Key: ARROW-8412
 URL: https://issues.apache.org/jira/browse/ARROW-8412
 Project: Apache Arrow
  Issue Type: Task
  Components: C++ - Gandiva
Reporter: Projjal Chanda
Assignee: Projjal Chanda


The current definitions of the Gandiva date functions date_diff and date_sub 
take an integer as the first argument and a date as the second argument:

date_diff(10, d) = d - 10, which seems unintuitive.





[jira] [Created] (ARROW-8161) [C++][Gandiva] Consolidate the data generation code for benchmark tests in gandiva into arrow/testing

2020-03-19 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-8161:
-

 Summary: [C++][Gandiva] Consolidate the data generation code for 
benchmark tests in gandiva into arrow/testing
 Key: ARROW-8161
 URL: https://issues.apache.org/jira/browse/ARROW-8161
 Project: Apache Arrow
  Issue Type: Task
  Components: C++
Reporter: Projjal Chanda
Assignee: Projjal Chanda








[jira] [Created] (ARROW-8887) [Java] Buffer size for complex vectors increases rapidly in case of clear/write loop

2020-05-21 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-8887:
-

 Summary: [Java] Buffer size for complex vectors increases rapidly 
in case of clear/write loop
 Key: ARROW-8887
 URL: https://issues.apache.org/jira/browse/ARROW-8887
 Project: Apache Arrow
  Issue Type: Task
  Components: Java
Reporter: Projjal Chanda
Assignee: Projjal Chanda


Similar to https://issues.apache.org/jira/browse/ARROW-5232


