Re: Issue with Flink - unrelated executeSql causes other executeSqls to fail.

Austin Cawley-Edwards Tue, 06 Oct 2020 15:33:12 -0700

Unless it's related to this issue[1], which was w/ my JOIN and time
characteristics, though not sure that applies for batch.


Best,
Austin

[1]:
apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Flink-SQL-Streaming-Join-Creates-Duplicates-td37764.html


On Tue, Oct 6, 2020 at 6:20 PM Austin Cawley-Edwards <
austin.caw...@gmail.com> wrote:

> Hey Dan,
>
> We use Junit5 and Bazel to run Flink SQL tests on a mini cluster and
> haven’t had issues, though we’re only testing on streaming jobs.
>
> Happy to help setting up logging with that if you’d like.
>
> Best,
> Austin
>
> On Tue, Oct 6, 2020 at 6:02 PM Dan Hill <quietgol...@gmail.com> wrote:
>
>> I don't think any of the gotchas apply to me (at the bottom of this link).
>>
>> https://ci.apache.org/projects/flink/flink-docs-stable/dev/stream/testing.html#junit-rule-miniclusterwithclientresource
>>
>> I'm assuming for a batch job that I don't have to do anything for: "You
>> can implement a custom parallel source function for emitting watermarks if
>> your job uses event time timers."
>>
>> On Tue, Oct 6, 2020 at 2:42 PM Dan Hill <quietgol...@gmail.com> wrote:
>>
>>> I've tried to enable additional logging for a few hours today.  I think
>>> something with junit5 is swallowing the logs.  I'm using Bazel and junit5.
>>> I setup MiniClusterResourceConfiguration using a custom extension.  Are
>>> there any known issues with Flink and junit5?  I can try switching to
>>> junit4.
>>>
>>> When I've binary searched this issue, this failure happens if my query
>>> in step 3 has a join it.  If I remove the join, I can remove step 4 and the
>>> code still works.  I've renamed a bunch of my tables too and the problem
>>> still exists.
>>>
>>>
>>>
>>>
>>>
>>> On Tue, Oct 6, 2020, 00:42 Aljoscha Krettek <aljos...@apache.org> wrote:
>>>
>>>> Hi Dan,
>>>>
>>>> there were some bugs and quirks in the MiniCluster that we recently
>>>> fixed:
>>>>
>>>>   - https://issues.apache.org/jira/browse/FLINK-19123
>>>>   - https://issues.apache.org/jira/browse/FLINK-19264
>>>>
>>>> But I think they are probably unrelated to your case. Could you enable
>>>> logging and see from the logs whether the 2) and 3) jobs execute
>>>> correctly on the MiniCluster?
>>>>
>>>> Best,
>>>> Aljoscha
>>>>
>>>> On 06.10.20 08:08, Dan Hill wrote:
>>>> > I'm writing a test for a batch job using
>>>> MiniClusterResourceConfiguration.
>>>> >
>>>> > Here's a simple description of my working test case:
>>>> > 1) I use TableEnvironment.executeSql(...) to create a source and sink
>>>> table
>>>> > using tmp filesystem directory.
>>>> > 2) I use executeSql to insert some test data into the source tabel.
>>>> > 3) I use executeSql to select from source and insert into sink.
>>>> > 4) I use executeSql from the same source to a different sink.
>>>> >
>>>> > When I do these steps, it works.  If I remove step 4, no data gets
>>>> written
>>>> > to the sink.  My actual code is more complex than this (has create
>>>> view,
>>>> > join and more tables).  This is a simplified description but
>>>> highlights the
>>>> > weird error.
>>>> >
>>>> > Has anyone hit issues like this?  I'm assuming I have a small code
>>>> bug in
>>>> > my queries that's causing issues.  These queries appear to work in
>>>> > production so I'm confused.  Are there ways of viewing failed jobs or
>>>> > queries with MiniClusterResourceConfiguration?
>>>> >
>>>> > Thanks!
>>>> > - Dan
>>>> >
>>>>
>>>>

Re: Issue with Flink - unrelated executeSql causes other executeSqls to fail.

Reply via email to