Hi Julian, I already logged. Please take a look at the following JIRAs and
PR.

   - https://issues.apache.org/jira/browse/CALCITE-2220 (PR:
   https://github.com/apache/calcite/pull/651)
   - https://issues.apache.org/jira/browse/CALCITE-2221 (Appreciate your
   input on how to get it fixed)

Thanks.
Shuyi

On Thu, Mar 22, 2018 at 10:55 AM, Julian Hyde <[email protected]> wrote:

> Shuyi,
>
> Many thanks for your help with this issue.
>
> Can you make sure that a JIRA case gets logged, if necessary?
>
> Julian
>
>
> > On Mar 22, 2018, at 10:25 AM, Anton Kedin <[email protected]>
> wrote:
> >
> > Shuyi,
> >
> > The fix works for me. Thanks!
> >
> > Regards,
> > Anton
> >
> >
> > On Thu, Mar 22, 2018 at 9:46 AM Anton Kedin <[email protected]> wrote:
> >
> >> I think we're talking about the same thing. In my case the index is also
> >> not adjusted for the flattened row. And I also see the wrong field
> ordinal
> >> which causes the same kind of mismatch, but it happens higher up in the
> >> user land for me because of how we wrap Calcite code (I'm not working
> with
> >> Calcite source at the moment).
> >>
> >> I will build the Calcite with your fix and will report if it fixes my
> >> issue. Thanks for the help!
> >>
> >> Regards,
> >> Anton
> >>
> >>
> >>
> >> On Thu, Mar 22, 2018 at 1:18 AM Shuyi Chen <[email protected]> wrote:
> >>
> >>> Also, you can try to patch in this PR to see if that fixes your issue,
> >>> https://github.com/apache/calcite/pull/651.
> >>>
> >>> On Thu, Mar 22, 2018 at 12:14 AM, Shuyi Chen <[email protected]>
> wrote:
> >>>
> >>>> I think the following is what happened:
> >>>>
> >>>> Calcite is trying to remove all structured type in the plan right
> below,
> >>>> so optimizer and codegen rules never have to deal with structured
> types.
> >>>>
> >>>> LogicalProject(EXPR$0=[ITEM($3, 1)])
> >>>>  LogicalTableScan(table=[[CATALOG, SALES, DEPT_NESTED]])
> >>>>
> >>>> First, it flatten the LogicalTableScan, and generate the following:
> >>>>
> >>>> LogicalProject(DEPTNO=[$0], NAME=[$1], TYPE=[$2.TYPE], DESC=[$2.DESC],
> >>>> EMPLOYEES=[$3])
> >>>>  LogicalTableScan(table=[[CATALOG, SALES, DEPT_NESTED]])
> >>>>
> >>>> Then it tries to flatten "LogicalProject(EXPR$0=[ITEM($3, 1)])", and
> >>>> generate the following:
> >>>>
> >>>> LogicalProject(EXPR$0$0=[ITEM($3, 1).EMPNO], EXPR$0$1=[ITEM($3,
> >>>> 1).ENAME], EXPR$0$2=[ITEM($3, 1).SKILLS])
> >>>>
> >>>> However, when it combines the 2 flattening results, it did not
> correctly
> >>>> adjust the ordinal post-flattening, which should be $4 now, not $3. So
> >>> this
> >>>> cause the exception since it is a type mismatch.
> >>>>
> >>>> I think I've already developed a fix for this. Will create a PR to
> >>> address
> >>>> both issues.
> >>>>
> >>>> @Anton, although my test error and your issue look similar, I still
> >>> can't
> >>>> reproduce your case (mine throws an error). Can you create a test for
> >>> it?
> >>>> Thanks a lot.
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>> On Wed, Mar 21, 2018 at 9:31 PM, Anton Kedin <[email protected]
> >
> >>>> wrote:
> >>>>
> >>>>> Shuyi,
> >>>>>
> >>>>> Thank you for looking into this. Can this error in your case be
> caused
> >>> by
> >>>>> a
> >>>>> similar problem? E.g. SKILLRECORD gets flattened, then when you try
> to
> >>>>> select employees[1] you get SKILLRECORD.DESC field instead of actual
> >>>>> employees[1] because input ref index is not adjusted for the
> flattened
> >>>>> SKILLRECORD?
> >>>>>
> >>>>> Thank you,
> >>>>> Anton
> >>>>>
> >>>>>
> >>>>> On Wed, Mar 21, 2018 at 8:53 PM Shuyi Chen <[email protected]>
> wrote:
> >>>>>
> >>>>>> Actually, the cause for my previous findings is: for the first case,
> >>>>>> SqlToRelConverterTest introduce another LogicalProject
> >>> (RelRoot.project)
> >>>>>> after applying the SqlToRelConverter to remove fields that are not
> >>>>> needed.
> >>>>>> But this function does not work with Record type and flattened
> >>> fields.
> >>>>> It
> >>>>>> simply projects the first several fields from input index-wise, and
> >>> does
> >>>>>> not take into account the flattening behavior. The second case does
> >>> not
> >>>>>> trigger the extra project because it's trivial.
> >>>>>>
> >>>>>> For your case, I tried below:
> >>>>>>
> >>>>>> MockTable deptNestedTable =
> >>>>>>    MockTable.create(this, salesSchema, "DEPT_NESTED", false, 4);
> >>>>>> deptNestedTable.addColumn("DEPTNO", f.intType, true);
> >>>>>> deptNestedTable.addColumn("NAME", f.varchar10Type);
> >>>>>> deptNestedTable.addColumn("SKILLRECORD", f.skillRecordType);
> >>>>>> deptNestedTable.addColumn("EMPLOYEES", f.empListType);
> >>>>>> registerTable(deptNestedTable);
> >>>>>>
> >>>>>> Run the following test:
> >>>>>>
> >>>>>> @Test public void testArrayOfRecord() {
> >>>>>>  sql("select employees[1] from dept_nested").ok();
> >>>>>> }
> >>>>>>
> >>>>>> I am actually getting the following error when run:
> >>>>>>
> >>>>>> java.lang.AssertionError: type mismatch:
> >>>>>> ref:
> >>>>>> RecordType(INTEGER NOT NULL EMPNO, VARCHAR(10) CHARACTER SET
> >>>>> "ISO-8859-1"
> >>>>>> COLLATE "ISO-8859-1$en_US$primary" NOT NULL ENAME,
> >>>>> RecordType(VARCHAR(10)
> >>>>>> CHARACTER SET "ISO-8859-1" COLLATE "ISO-8859-1$en_US$primary" NOT
> >>> NULL
> >>>>>> TYPE, VARCHAR(20) CHARACTER SET "ISO-8859-1" COLLATE
> >>>>>> "ISO-8859-1$en_US$primary" NOT NULL DESC) NOT NULL ARRAY NOT NULL
> >>>>> SKILLS)
> >>>>>> NOT NULL ARRAY NOT NULL
> >>>>>> input:
> >>>>>> VARCHAR(20) CHARACTER SET "ISO-8859-1" COLLATE
> >>>>> "ISO-8859-1$en_US$primary"
> >>>>>> NOT NULL
> >>>>>>
> >>>>>> at org.apache.calcite.util.Litmus$1.fail(Litmus.java:31)
> >>>>>> at org.apache.calcite.plan.RelOptUtil.eq(RelOptUtil.java:1838)
> >>>>>> at
> >>> org.apache.calcite.rex.RexChecker.visitInputRef(RexChecker.java:125)
> >>>>>> at
> >>> org.apache.calcite.rex.RexChecker.visitInputRef(RexChecker.java:57)
> >>>>>> at org.apache.calcite.rex.RexInputRef.accept(RexInputRef.java:112)
> >>>>>> at org.apache.calcite.rex.RexChecker.visitCall(RexChecker.java:140)
> >>>>>> at org.apache.calcite.rex.RexChecker.visitCall(RexChecker.java:57)
> >>>>>> at org.apache.calcite.rex.RexCall.accept(RexCall.java:107)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.rex.RexVisitorImpl.visitFieldAccess(RexVi
> >>>>> sitorImpl.java:98)
> >>>>>> at org.apache.calcite.rex.RexChecker.visitFieldAccess(RexChecke
> >>>>> r.java:149)
> >>>>>> at org.apache.calcite.rex.RexChecker.visitFieldAccess(RexChecke
> >>>>> r.java:57)
> >>>>>> at
> >>> org.apache.calcite.rex.RexFieldAccess.accept(RexFieldAccess.java:81)
> >>>>>> at org.apache.calcite.rel.core.Project.isValid(Project.java:187)
> >>>>>> at org.apache.calcite.rel.core.Project.<init>(Project.java:84)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.rel.logical.LogicalProject.<init>(Logical
> >>>>> Project.java:65)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.rel.logical.LogicalProject.create(Logical
> >>>>> Project.java:120)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.rel.logical.LogicalProject.create(Logical
> >>>>> Project.java:103)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.rel.core.RelFactories$ProjectFactoryImpl.
> >>>>> createProject(RelFactories.java:127)
> >>>>>> at org.apache.calcite.tools.RelBuilder.project(RelBuilder.
> java:1064)
> >>>>>> at org.apache.calcite.plan.RelOptUtil.createProject(RelOptUtil.
> >>>>> java:2956)
> >>>>>> at org.apache.calcite.plan.RelOptUtil.createProject(RelOptUtil.
> >>>>> java:2873)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.sql2rel.RelStructuredTypeFlattener.rewrit
> >>>>> eRel(RelStructuredTypeFlattener.java:477)
> >>>>>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> >>>>>> at
> >>>>>>
> >>>>>> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAcce
> >>>>> ssorImpl.java:62)
> >>>>>> at
> >>>>>>
> >>>>>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMe
> >>>>> thodAccessorImpl.java:43)
> >>>>>> at java.lang.reflect.Method.invoke(Method.java:498)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.util.ReflectUtil.invokeVisitorInternal(Re
> >>>>> flectUtil.java:257)
> >>>>>> at org.apache.calcite.util.ReflectUtil.invokeVisitor(ReflectUti
> >>>>> l.java:214)
> >>>>>> at
> >>>>>> org.apache.calcite.util.ReflectUtil$1.invokeVisitor(ReflectU
> >>>>> til.java:464)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.sql2rel.RelStructuredTypeFlattener$Rewrit
> >>>>> eRelVisitor.visit(RelStructuredTypeFlattener.java:721)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.sql2rel.RelStructuredTypeFlattener.rewrit
> >>>>> e(RelStructuredTypeFlattener.java:177)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.sql2rel.SqlToRelConverter.flattenTypes(Sq
> >>>>> lToRelConverter.java:462)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.test.SqlToRelTestBase$TesterImpl.convertS
> >>>>> qlToRel(SqlToRelTestBase.java:585)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.test.SqlToRelTestBase$TesterImpl.assertCo
> >>>>> nvertsTo(SqlToRelTestBase.java:690)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.test.SqlToRelConverterTest$Sql.convertsTo
> >>>>> (SqlToRelConverterTest.java:2784)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.test.SqlToRelConverterTest$Sql.ok(SqlToRe
> >>>>> lConverterTest.java:2776)
> >>>>>> at
> >>>>>>
> >>>>>> org.apache.calcite.test.SqlToRelConverterTest.testArrayOfRec
> >>>>> ord(SqlToRelConverterTest.java:1059)
> >>>>>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> >>>>>> at
> >>>>>>
> >>>>>> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAcce
> >>>>> ssorImpl.java:62)
> >>>>>> at
> >>>>>>
> >>>>>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMe
> >>>>> thodAccessorImpl.java:43)
> >>>>>> at java.lang.reflect.Method.invoke(Method.java:498)
> >>>>>> at
> >>>>>>
> >>>>>> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(
> >>>>> FrameworkMethod.java:50)
> >>>>>> at
> >>>>>>
> >>>>>> org.junit.internal.runners.model.ReflectiveCallable.run(Refl
> >>>>> ectiveCallable.java:12)
> >>>>>> at
> >>>>>>
> >>>>>> org.junit.runners.model.FrameworkMethod.invokeExplosively(Fr
> >>>>> ameworkMethod.java:47)
> >>>>>> at
> >>>>>>
> >>>>>> org.junit.internal.runners.statements.InvokeMethod.evaluate(
> >>>>> InvokeMethod.java:17)
> >>>>>> at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
> >>>>>> at
> >>>>>>
> >>>>>> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit
> >>>>> 4ClassRunner.java:78)
> >>>>>> at
> >>>>>>
> >>>>>> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit
> >>>>> 4ClassRunner.java:57)
> >>>>>> at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
> >>>>>> at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
> >>>>>> at org.junit.runners.ParentRunner.runChildren(
> ParentRunner.java:288)
> >>>>>> at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
> >>>>>> at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
> >>>>>> at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
> >>>>>> at org.junit.runner.JUnitCore.run(JUnitCore.java:137)
> >>>>>> at
> >>>>>>
> >>>>>> com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs
> >>>>> (JUnit4IdeaTestRunner.java:117)
> >>>>>> at
> >>>>>>
> >>>>>> com.intellij.junit4.JUnit4IdeaTestRunner.startRunnerWithArgs
> >>>>> (JUnit4IdeaTestRunner.java:42)
> >>>>>> at
> >>>>>>
> >>>>>> com.intellij.rt.execution.junit.JUnitStarter.prepareStreamsA
> >>>>> ndStart(JUnitStarter.java:262)
> >>>>>> at com.intellij.rt.execution.junit.JUnitStarter.main(JUnitStart
> >>>>> er.java:84)
> >>>>>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> >>>>>> at
> >>>>>>
> >>>>>> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAcce
> >>>>> ssorImpl.java:62)
> >>>>>> at
> >>>>>>
> >>>>>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMe
> >>>>> thodAccessorImpl.java:43)
> >>>>>> at java.lang.reflect.Method.invoke(Method.java:498)
> >>>>>> at
> >>> com.intellij.rt.execution.application.AppMain.main(AppMain.java:147)
> >>>>>>
> >>>>>> Shuyi
> >>>>>>
> >>>>>>
> >>>>>> On Wed, Mar 21, 2018 at 6:09 PM, Shuyi Chen <[email protected]>
> >>> wrote:
> >>>>>>
> >>>>>>> Thanks a lot, Anton. This seems to be a bug in Calcite. When the
> >>>>>> statement
> >>>>>>> involving record types, sql validation seems to work, but the rel
> >>> plan
> >>>>>>> generated might be wrong.  I can also reproduce your case:
> >>>>>>>
> >>>>>>> MockTable deptNestedTable =
> >>>>>>>    MockTable.create(this, salesSchema, "DEPT_NESTED", false, 4);
> >>>>>>> deptNestedTable.addColumn("DEPTNO", f.intType, true);
> >>>>>>> deptNestedTable.addColumn("NAME", f.varchar10Type);
> >>>>>>> deptNestedTable.addColumn("SKILLRECORD", f.skillRecordType);
> >>>>>>> deptNestedTable.addColumn("EMPLOYEES", f.empListType);
> >>>>>>> registerTable(deptNestedTable);
> >>>>>>>
> >>>>>>> Run the following test:
> >>>>>>>
> >>>>>>> @Test public void testArrayOfRecord() {
> >>>>>>>  sql("select skillrecord, employees from dept_nested").ok();
> >>>>>>> }
> >>>>>>>
> >>>>>>> yield:
> >>>>>>> LogicalProject(SKILLRECORD=[$0], EMPLOYEES=[$1])
> >>>>>>>  LogicalProject(SKILLRECORD=[$2], SKILLRECORD1=[$3],
> >>> EMPLOYEES=[$4])
> >>>>>>>    LogicalProject(DEPTNO=[$0], NAME=[$1], TYPE=[$2.TYPE],
> >>>>>> DESC=[$2.DESC],
> >>>>>>> EMPLOYEES=[$3])
> >>>>>>>      LogicalTableScan(table=[[CATALOG, SALES, DEPT_NESTED]])
> >>>>>>>
> >>>>>>> Sometimes, it works:
> >>>>>>>
> >>>>>>> @Test public void testArrayOfRecord() {
> >>>>>>> sql("select name, employees from dept_nested").ok();
> >>>>>>> }
> >>>>>>>
> >>>>>>> yield:
> >>>>>>>
> >>>>>>> LogicalProject(NAME=[$1], EMPLOYEES=[$4])
> >>>>>>>  LogicalProject(DEPTNO=[$0], NAME=[$1], TYPE=[$2.TYPE],
> >>>>> DESC=[$2.DESC],
> >>>>>>> EMPLOYEES=[$3])
> >>>>>>>    LogicalTableScan(table=[[CATALOG, SALES, DEPT_NESTED]])
> >>>>>>>
> >>>>>>> I can take a deeper look.
> >>>>>>>
> >>>>>>> Shuyi
> >>>>>>>
> >>>>>>> On Wed, Mar 21, 2018 at 11:06 AM, Anton Kedin
> >>>>> <[email protected]>
> >>>>>>> wrote:
> >>>>>>>
> >>>>>>>> Hi,
> >>>>>>>>
> >>>>>>>> I have an issue I am not sure how to handle, would appreciate any
> >>>>>>>> pointers.
> >>>>>>>>
> >>>>>>>> I have a table with row type:
> >>>>>>>> RecordType(
> >>>>>>>>    INTEGER orderId,
> >>>>>>>>    RecordType(VARCHAR name, INTEGER personId)
> >>>>>>>>        person,
> >>>>>>>>    RecordType(VARCHAR sku, INTEGER price, VARCHAR currency,
> >>> VARCHAR
> >>>>>> ARRAY
> >>>>>>>> tags)
> >>>>>>>>        ARRAY items
> >>>>>>>> )
> >>>>>>>>
> >>>>>>>> With this row type I am trying to model a JSON object which looks
> >>>>> like
> >>>>>>>> this:
> >>>>>>>> { "orderId" : 1,
> >>>>>>>>  "person" : { "name" : "john", "personId" : 12, },
> >>>>>>>>  "items": [
> >>>>>>>>    { "sku" : "aaa01", "price" : 12, "currency" : "USD", "tags" :
> >>>>>> ["blue",
> >>>>>>>> "book"] }
> >>>>>>>>  ]}
> >>>>>>>>
> >>>>>>>> When selecting the whole items array I get the following plan:
> >>>>>>>> SELECT items FROM PCOLLECTION
> >>>>>>>>
> >>>>>>>> LogicalProject(items=[$3])
> >>>>>>>>  LogicalProject(orderId=[$0], name=[$1.name],
> >>>>> personId=[$1.personId],
> >>>>>>>> items
> >>>>>>>> =[$2])
> >>>>>>>>    LogicalTableScan(table=[[PCOLLECTION]])
> >>>>>>>>
> >>>>>>>> Which looks correct and it works. One thing to note here is that
> >>>>> Calcite
> >>>>>>>> flattens the person row, and makes the input ref for the items
> >>> field
> >>>>> as
> >>>>>>>> $3,
> >>>>>>>> as expected.
> >>>>>>>>
> >>>>>>>> But when I want to get a specific element from that array I get
> >>> the
> >>>>>>>> following:
> >>>>>>>> SELECT items[0] FROM PCOLLECTION
> >>>>>>>>
> >>>>>>>> LogicalProject(EXPR$0$0=[ITEM($2, 0).sku], EXPR$0$1=[ITEM($2,
> >>>>> 0).price],
> >>>>>>>> EXPR$0$2=[ITEM($2, 0).currency], EXPR$0$3=[ITEM($2, 0).tags])
> >>>>>>>>  LogicalProject(orderId=[$0], name=[$1.name],
> >>>>> personId=[$1.personId],
> >>>>>>>> items
> >>>>>>>> =[$2])
> >>>>>>>>    LogicalTableScan(table=[[PCOLLECTION]])
> >>>>>>>>
> >>>>>>>> The first project looks the same. Flattened person row, items
> >>> array,
> >>>>> all
> >>>>>>>> looks similar to the above.
> >>>>>>>> But the outer project calls ITEM($2, i). I would expect it to be
> >>>>>>>> ITEM($3, i) instead,
> >>>>>>>> to adjust for the flattened person row, but it keeps the index as
> >>> $2,
> >>>>>>>> which
> >>>>>>>> would have been the correct index if the row was not flattened,
> >>> but
> >>>>> it
> >>>>>>>> should be $3 for flattened row, similar to the previous example.
> >>>>>>>>
> >>>>>>>> Is there something I am missing or is it a bug and Calcite should
> >>>>> adjust
> >>>>>>>> the input ref index to account for flattened rows in this case as
> >>>>> well?
> >>>>>>>>
> >>>>>>>> Thank you,
> >>>>>>>> Anton
> >>>>>>>>
> >>>>>>>
> >>>>>>>
> >>>>>>>
> >>>>>>> --
> >>>>>>> "So you have to trust that the dots will somehow connect in your
> >>>>> future."
> >>>>>>>
> >>>>>>
> >>>>>>
> >>>>>>
> >>>>>> --
> >>>>>> "So you have to trust that the dots will somehow connect in your
> >>>>> future."
> >>>>>>
> >>>>>
> >>>>
> >>>>
> >>>>
> >>>> --
> >>>> "So you have to trust that the dots will somehow connect in your
> >>> future."
> >>>>
> >>>
> >>>
> >>>
> >>> --
> >>> "So you have to trust that the dots will somehow connect in your
> future."
> >>>
> >>
>
>


-- 
"So you have to trust that the dots will somehow connect in your future."

Reply via email to