[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-02-07 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15136495#comment-15136495
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user mjsax closed the pull request at:

https://github.com/apache/flink/pull/1483


> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-29 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15123651#comment-15123651
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user aljoscha commented on the pull request:

https://github.com/apache/flink/pull/1483#issuecomment-176839873
  
@mjsax What is the status on this PR? Can it be closed since the feature in 
Storm will be implemented another way? 


> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-29 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15124736#comment-15124736
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user mjsax commented on the pull request:

https://github.com/apache/flink/pull/1483#issuecomment-177069344
  
I would like to keep this open, until 
[FLINK-2721](https://issues.apache.org/jira/browse/FLINK-2721) is merged. To 
make sure, we do not need this -- for this case, we can also close the 
according JIRA of course.


> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-15 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15101687#comment-15101687
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user mjsax commented on the pull request:

https://github.com/apache/flink/pull/1483#issuecomment-171944496
  
With embedded mode, I mean to use a spout/bolt within a regular Flink 
streaming program. For this, you could even have a "plain/raw" input stream 
(eg, `DataStream`) or a POJO. Both can be translated into StormTuple 
(which is only used for input tuples). Output tuples are of type `Values` and 
also translated back to `TupleX` type.

Extending the tuples with an additional field would work, too. But I did 
not like this, because is increases the data that need to be shipped over the 
network. For embedded mode, we would need to think of a nice "work around" for 
"plain/raw" and POJO input streams  -- but this should not be a problem. As 
there are no globally unique task-ids anyway in embedded mode, it might be even 
ok not to support this feature...

I personally do not see it as a big (very dangerous) problem in exposing 
"input channel index" to the operators. But if you are strictly against it, I 
will go with the "additional field" approach to provide this information.


> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-15 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15101584#comment-15101584
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user aljoscha commented on the pull request:

https://github.com/apache/flink/pull/1483#issuecomment-171926900
  
Hi,
I would be strongly against exposing the input channel index in any way to 
operators/user functions. How the index of a channel maps to an upstream 
operator is internal to the network layer and dependent on several things, some 
of which are not even deterministic IMHO. Off the top of my head it should 
depend on the number of upstream tasks, the parallelism of these upstream tasks 
and the order in which the system choses to attach the channels.

I think your goal can be achieved by adding a task-id field in the message 
wrapper and setting this when emitting elements. Setting this for every 
processed element (if using the Storm combat layer or not) seems quite wasteful 
to me.


> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-15 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15101611#comment-15101611
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user aljoscha commented on the pull request:

https://github.com/apache/flink/pull/1483#issuecomment-171931389
  
I think that the Storm compatibility layer is very important. I also think, 
however, that when making decisions about whether to enhance to core API to 
serve other APIs or making that other API jump through some hoops then the 
cleanliness of the core API should be favored if possible.



> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-15 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15101592#comment-15101592
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user mjsax commented on the pull request:

https://github.com/apache/flink/pull/1483#issuecomment-171928147
  
We do not have any message wrappers on the wire. Data is transfered as 
plain Flink `TupleX` types and are translated into Storm format before handing 
them over to the bolt. This allows for embedded usage of bolt! Thus, I do not 
want to change this.


> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-15 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15101604#comment-15101604
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user aljoscha commented on the pull request:

https://github.com/apache/flink/pull/1483#issuecomment-171930800
  
What do you mean by embedded usage? Also, isn't there the `StormTuple` that 
is used as output of some operations. Then, even if plain Tuples are used, they 
could have an additional field with the TaskId. I still think it could be made 
to work without putting potentially misleading/dangerous stuff into the general 
API.


> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-14 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15098905#comment-15098905
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/1483#issuecomment-171785899
  
Right, you cannot secure users from writing bad code, of course.

But a good way to guide then to writing good code is to expose only 
information they can actually use in the abstractions provided to them. 
Exposing certain information and declaring it as "don't use" is a bit like 
setting them up to make mistakes, and feels like such a strong sight that the 
abstractions are not picked correctly.

If this information is needed, I would really take the custom operator 
variant. We could also think if there is a good way of circumventing to depend 
on that channel index information in the first place.


> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-14 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15098934#comment-15098934
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user mjsax commented on the pull request:

https://github.com/apache/flink/pull/1483#issuecomment-171790057
  
Ok. I will try to rework this to the custom operator approach...

About your last statement: We could decide to omit this information in the 
Storm compatibility layer (and drop this PR completely) -- of course, this 
would limits the "compatibility level" we can achieve.

Btw: Storm does not give the "channel index" itself to the user, but the 
task-id of the parallel producer task attached to each input tuple (in Storm, 
task-ids are unique over all tasks over all operators of the topology). The 
compatibility will translate the "channel index" into this task-id (using the 
"channel index" is the only way I see to get this done in the compatibility 
layer).


> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-08 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15089649#comment-15089649
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/1483#issuecomment-170083592
  
I am big time skeptic when it comes to putting this into the 
`RuntimeContext`.

If we really need to expose that beyond the `StreamInputProcessor`, it 
would probably be best to pass this into the operator, as part of the 
"processRecord()" method.

Since StormBolts are implemented as custom operators, they should be able 
to pick it up from there.


> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-08 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15089600#comment-15089600
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user mjsax commented on the pull request:

https://github.com/apache/flink/pull/1483#issuecomment-170073172
  
If you implement a Bolt you can retrieve this information from Storm. Thus, 
the compatibility layer should provide this information, too.


> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-08 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15089432#comment-15089432
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/1483#issuecomment-170044112
  
The user code should not deal with what channel an element came from. It 
wires assumptions about the parallelism of the predecessor into the user code. 
Adjustments of the parallelism become impossible that way.

Why does the storm compatibility need this?


> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-04 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15080875#comment-15080875
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user StephanEwen commented on the pull request:

https://github.com/apache/flink/pull/1483#issuecomment-168620418
  
Can you give us a bit of context why this is needed?

Also: The `RuntimeContext` object is what is exposed to user-defined 
functions. Exposing the number of the channel there breaks the abstraction 
layer.


> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-04 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15081442#comment-15081442
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

Github user mjsax commented on the pull request:

https://github.com/apache/flink/pull/1483#issuecomment-168748526
  
Needed for Storm compatibility. This information is made available in Storm 
to the user. Not sure why adding it to `RuntimeContext` breaks abstraction 
layer (it's only meta information)? Can you explain what you mean by that?


> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2016-01-02 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15076672#comment-15076672
 ] 

ASF GitHub Bot commented on FLINK-1870:
---

GitHub user mjsax opened a pull request:

https://github.com/apache/flink/pull/1483

[FLINK-1870] Reintroduce indexed reader functionality to streaming



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/mjsax/flink flink-1870-inputChannelIndex

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/flink/pull/1483.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1483


commit 5a8ef6cc8424dbf718e245a1e57845cc9b820059
Author: mjsax 
Date:   2015-12-30T11:40:06Z

[FLINK-1870] Reintroduce indexed reader functionality to streaming




> Reintroduce Indexed reader functionality to streaming
> -
>
> Key: FLINK-1870
> URL: https://issues.apache.org/jira/browse/FLINK-1870
> Project: Flink
>  Issue Type: Task
>  Components: Streaming
>Reporter: Gyula Fora
>Assignee: Matthias J. Sax
>Priority: Minor
>
> The Indexed record reader classes (IndexedReaderIterator, 
> IndexedMutableReader) were introduced to allow the streaming operators to 
> access the index of the last read channel from the input gate. This was a 
> necessary step toward future input sorting operators.
> Unfortunately this untested feature was patched away by the following commit:
> https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
> At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (FLINK-1870) Reintroduce Indexed reader functionality to streaming

2015-06-15 Thread Matthias J. Sax (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14587101#comment-14587101
 ] 

Matthias J. Sax commented on FLINK-1870:


I had a look into the code and think this feature might not be necessary any 
more. Currently, processed `BufferOrEvent` object do have method 
`.getChannelIndex()` and thus, the necessary information is available already. 
See `StreamingAbstractRecordReader.getNextRecord(T target)`. I exploit this 
information for in-memory record ts-ordering in my experimental branch already: 
https://github.com/mjsax/flink/tree/orderedStreams



 Reintroduce Indexed reader functionality to streaming
 -

 Key: FLINK-1870
 URL: https://issues.apache.org/jira/browse/FLINK-1870
 Project: Flink
  Issue Type: Task
  Components: Streaming
Reporter: Gyula Fora
Assignee: Matthias J. Sax
Priority: Minor

 The Indexed record reader classes (IndexedReaderIterator, 
 IndexedMutableReader) were introduced to allow the streaming operators to 
 access the index of the last read channel from the input gate. This was a 
 necessary step toward future input sorting operators.
 Unfortunately this untested feature was patched away by the following commit:
 https://github.com/apache/flink/commit/5232c56b13b7e13e2cf10dbe818c221f5557426d
 At some point we need to reimplement these features with proper tests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)