Oops, this is actually the JOIN issue thread [1]. Guess I should revise my
previous "haven't had issues" statement hah. Sorry for the spam!

[1]:
apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Streaming-SQL-Job-Switches-to-FINISHED-before-all-records-processed-td38382.html

On Tue, Oct 6, 2020 at 6:32 PM Austin Cawley-Edwards <
austin.caw...@gmail.com> wrote:

> Unless it's related to this issue[1], which was w/ my JOIN and time
> characteristics, though not sure that applies for batch.
>
> Best,
> Austin
>
> [1]:
> apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Flink-SQL-Streaming-Join-Creates-Duplicates-td37764.html
>
>
> On Tue, Oct 6, 2020 at 6:20 PM Austin Cawley-Edwards <
> austin.caw...@gmail.com> wrote:
>
>> Hey Dan,
>>
>> We use Junit5 and Bazel to run Flink SQL tests on a mini cluster and
>> haven’t had issues, though we’re only testing on streaming jobs.
>>
>> Happy to help setting up logging with that if you’d like.
>>
>> Best,
>> Austin
>>
>> On Tue, Oct 6, 2020 at 6:02 PM Dan Hill <quietgol...@gmail.com> wrote:
>>
>>> I don't think any of the gotchas apply to me (at the bottom of this
>>> link).
>>>
>>> https://ci.apache.org/projects/flink/flink-docs-stable/dev/stream/testing.html#junit-rule-miniclusterwithclientresource
>>>
>>> I'm assuming for a batch job that I don't have to do anything for: "You
>>> can implement a custom parallel source function for emitting watermarks if
>>> your job uses event time timers."
>>>
>>> On Tue, Oct 6, 2020 at 2:42 PM Dan Hill <quietgol...@gmail.com> wrote:
>>>
>>>> I've tried to enable additional logging for a few hours today.  I think
>>>> something with junit5 is swallowing the logs.  I'm using Bazel and junit5.
>>>> I setup MiniClusterResourceConfiguration using a custom extension.  Are
>>>> there any known issues with Flink and junit5?  I can try switching to
>>>> junit4.
>>>>
>>>> When I've binary searched this issue, this failure happens if my query
>>>> in step 3 has a join it.  If I remove the join, I can remove step 4 and the
>>>> code still works.  I've renamed a bunch of my tables too and the problem
>>>> still exists.
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> On Tue, Oct 6, 2020, 00:42 Aljoscha Krettek <aljos...@apache.org>
>>>> wrote:
>>>>
>>>>> Hi Dan,
>>>>>
>>>>> there were some bugs and quirks in the MiniCluster that we recently
>>>>> fixed:
>>>>>
>>>>>   - https://issues.apache.org/jira/browse/FLINK-19123
>>>>>   - https://issues.apache.org/jira/browse/FLINK-19264
>>>>>
>>>>> But I think they are probably unrelated to your case. Could you enable
>>>>> logging and see from the logs whether the 2) and 3) jobs execute
>>>>> correctly on the MiniCluster?
>>>>>
>>>>> Best,
>>>>> Aljoscha
>>>>>
>>>>> On 06.10.20 08:08, Dan Hill wrote:
>>>>> > I'm writing a test for a batch job using
>>>>> MiniClusterResourceConfiguration.
>>>>> >
>>>>> > Here's a simple description of my working test case:
>>>>> > 1) I use TableEnvironment.executeSql(...) to create a source and
>>>>> sink table
>>>>> > using tmp filesystem directory.
>>>>> > 2) I use executeSql to insert some test data into the source tabel.
>>>>> > 3) I use executeSql to select from source and insert into sink.
>>>>> > 4) I use executeSql from the same source to a different sink.
>>>>> >
>>>>> > When I do these steps, it works.  If I remove step 4, no data gets
>>>>> written
>>>>> > to the sink.  My actual code is more complex than this (has create
>>>>> view,
>>>>> > join and more tables).  This is a simplified description but
>>>>> highlights the
>>>>> > weird error.
>>>>> >
>>>>> > Has anyone hit issues like this?  I'm assuming I have a small code
>>>>> bug in
>>>>> > my queries that's causing issues.  These queries appear to work in
>>>>> > production so I'm confused.  Are there ways of viewing failed jobs or
>>>>> > queries with MiniClusterResourceConfiguration?
>>>>> >
>>>>> > Thanks!
>>>>> > - Dan
>>>>> >
>>>>>
>>>>>

Reply via email to