[GitHub] flink pull request: [FLINK-1944][gelly] Added GSA-PageRank example

2015-04-26 Thread andralungu
GitHub user andralungu opened a pull request:

https://github.com/apache/flink/pull/626

[FLINK-1944][gelly] Added GSA-PageRank example

This PR also adds the getSuperStepNumber() method to the Gather, Sum and 
Apply Functions.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/andralungu/flink gsaPageRank

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/flink/pull/626.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #626


commit da73b7c74cabedafbc31fdf2403a4f01d77d46d0
Author: andralungu 
Date:   2015-04-26T22:26:41Z

[FLINK-1944][gelly] Added GSA-PageRank example




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (FLINK-1944) Add a Gelly-GSA PageRank example

2015-04-26 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14513308#comment-14513308
 ] 

ASF GitHub Bot commented on FLINK-1944:
---

GitHub user andralungu opened a pull request:

https://github.com/apache/flink/pull/626

[FLINK-1944][gelly] Added GSA-PageRank example

This PR also adds the getSuperStepNumber() method to the Gather, Sum and 
Apply Functions.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/andralungu/flink gsaPageRank

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/flink/pull/626.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #626


commit da73b7c74cabedafbc31fdf2403a4f01d77d46d0
Author: andralungu 
Date:   2015-04-26T22:26:41Z

[FLINK-1944][gelly] Added GSA-PageRank example




> Add a Gelly-GSA PageRank example
> 
>
> Key: FLINK-1944
> URL: https://issues.apache.org/jira/browse/FLINK-1944
> Project: Flink
>  Issue Type: Task
>Affects Versions: 0.9
>Reporter: Vasia Kalavri
>Assignee: Andra Lungu
>Priority: Minor
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] flink pull request: [FLINK-1615] [java api] SimpleTweetInputFormat

2015-04-26 Thread Elbehery
Github user Elbehery commented on the pull request:

https://github.com/apache/flink/pull/621#issuecomment-96418152
  
@aljoscha  I have checked the Travis log. The problem is in Flink 
Streaming, so why did my branch fail?




[jira] [Commented] (FLINK-1615) Introduces a new InputFormat for Tweets

2015-04-26 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14513171#comment-14513171
 ] 

ASF GitHub Bot commented on FLINK-1615:
---

Github user Elbehery commented on the pull request:

https://github.com/apache/flink/pull/621#issuecomment-96418152
  
@aljoscha  I have checked the Travis log. The problem is in Flink 
Streaming, so why did my branch fail?


> Introduces a new InputFormat for Tweets
> ---
>
> Key: FLINK-1615
> URL: https://issues.apache.org/jira/browse/FLINK-1615
> Project: Flink
>  Issue Type: New Feature
>  Components: flink-contrib
>Affects Versions: 0.8.1
>Reporter: mustafa elbehery
>Priority: Minor
>
> An event-driven parser that turns Tweets into Java POJOs. 
> It parses all the important parts of the tweet into Java objects. 
> Tested on a cluster, and the performance is pretty good. 





[GitHub] flink pull request: [gelly][Refactoring] Removed example string

2015-04-26 Thread andralungu
GitHub user andralungu opened a pull request:

https://github.com/apache/flink/pull/625

[gelly][Refactoring] Removed example string

@vasia,

Just FYI :), in example/utils we have a class EdgeWithLongIdNullValueParser 
that is never used.
 
The examples that correspond to the library methods, e.g. PageRankExample 
cannot be renamed because that will cause a collision with the method name, in 
this case, PageRank. I guess that's why we added "Example" to the class name in 
the first place. 

To me the examples look good as they are in this commit. If you have 
another idea for the library method examples, let me know, but in my view 
Example is better than Usage, or whatever different word we may come up with. 

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/andralungu/flink examples

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/flink/pull/625.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #625


commit d6414eae54a2bafdba4457ffb2aa0bd1d34ad57d
Author: andralungu 
Date:   2015-04-26T18:09:04Z

[gelly][Refactoring] Removed example string






[jira] [Commented] (FLINK-1886) how can handles left join ,right join , FULL OUTER JOIN in flink

2015-04-26 Thread hagersaleh (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14513089#comment-14513089
 ] 

hagersaleh commented on FLINK-1886:
---

How can I handle a left outer join for any two datasets, where each dataset 
may include any number of fields?
Example:

Dataset one:
ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();
DataSet<Tuple4<Integer, String, String, String>> customer =
    env.readCsvFile("/home/hadoop/Desktop/Dataset/customer.csv")
        .fieldDelimiter('|')
        .includeFields("").ignoreFirstLine()
        .types(Integer.class, String.class, String.class, String.class);

Dataset two:
// Reuses env from above; the result is named orders so it does not clash with customer.
DataSet<Tuple3<Integer, String, String>> orders =
    env.readCsvFile("/home/hadoop/Desktop/Dataset/order.csv")
        .fieldDelimiter('|')
        .includeFields("").ignoreFirstLine()
        .types(Integer.class, String.class, String.class);
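
The semantics asked about here can be illustrated without any Flink classes. The following is a minimal plain-Java sketch, not Flink API: the class LeftOuterJoinSketch and its row layout (a key followed by one value) are hypothetical and chosen only for illustration. It groups the right side by key, much as a coGroup-style operator would, and pairs every unmatched left row with null.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Plain-Java illustration of left outer join semantics; no Flink classes are used. */
class LeftOuterJoinSketch {

    /**
     * Joins rows of the form {key, value}. Every left row appears at least once
     * in the output; left rows without a match carry null in the third column.
     */
    static List<String[]> leftOuterJoin(List<String[]> left, List<String[]> right) {
        // Group the right-side values by key.
        Map<String, List<String>> rightByKey = new HashMap<>();
        for (String[] r : right) {
            rightByKey.computeIfAbsent(r[0], k -> new ArrayList<>()).add(r[1]);
        }

        List<String[]> out = new ArrayList<>();
        for (String[] l : left) {
            List<String> matches = rightByKey.get(l[0]);
            if (matches == null) {
                // No partner on the right: keep the left row, pad with null.
                out.add(new String[] {l[0], l[1], null});
            } else {
                for (String m : matches) {
                    out.add(new String[] {l[0], l[1], m});
                }
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String[]> customers = List.of(new String[] {"1", "alice"}, new String[] {"2", "bob"});
        List<String[]> orders = List.of(new String[] {"1", "order-a"}, new String[] {"1", "order-b"});
        for (String[] row : leftOuterJoin(customers, orders)) {
            System.out.println(String.join("|", row[0], row[1], String.valueOf(row[2])));
        }
    }
}
```

A right outer join follows by swapping the two inputs, and a full outer join additionally emits the right-side keys that never matched a left row.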

> how can handles left join ,right join , FULL OUTER JOIN in flink
> 
>
> Key: FLINK-1886
> URL: https://issues.apache.org/jira/browse/FLINK-1886
> Project: Flink
>  Issue Type: Bug
>Reporter: hagersaleh
>
> How can left join, right join, and FULL OUTER JOIN be handled in Flink?
> How can I write code to handle this?
> Example of an inner join:
> DataSet<ShippingPriorityItem> result =
>     customerWithOrders.join(lineitems).where(0).equalTo(0)
>         .with(
>             new JoinFunction<ShippingPriorityItem, Lineitem, ShippingPriorityItem>() {
>                 @Override
>                 public ShippingPriorityItem join(ShippingPriorityItem i, Lineitem l) {
>                     i.setRevenue(l.getExtendedprice() * (1 - l.getDiscount()));
>                     return i;
>                 }
>             });





[jira] [Assigned] (FLINK-1944) Add a Gelly-GSA PageRank example

2015-04-26 Thread Andra Lungu (JIRA)

 [ 
https://issues.apache.org/jira/browse/FLINK-1944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andra Lungu reassigned FLINK-1944:
--

Assignee: Andra Lungu

> Add a Gelly-GSA PageRank example
> 
>
> Key: FLINK-1944
> URL: https://issues.apache.org/jira/browse/FLINK-1944
> Project: Flink
>  Issue Type: Task
>Affects Versions: 0.9
>Reporter: Vasia Kalavri
>Assignee: Andra Lungu
>Priority: Minor
>






[jira] [Commented] (FLINK-1670) Collect method for streaming

2015-04-26 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14513019#comment-14513019
 ] 

ASF GitHub Bot commented on FLINK-1670:
---

Github user ggevay commented on the pull request:

https://github.com/apache/flink/pull/581#issuecomment-96373660
  
I did the small change suggested by Gyula.


> Collect method for streaming
> 
>
> Key: FLINK-1670
> URL: https://issues.apache.org/jira/browse/FLINK-1670
> Project: Flink
>  Issue Type: New Feature
>  Components: Streaming
>Affects Versions: 0.9
>Reporter: Márton Balassi
>Assignee: Gabor Gevay
>Priority: Minor
>
> A convenience method for streaming back the results of a job to the client.
> As the client itself is a bottleneck anyway, an easy solution would be to 
> provide a socket sink with degree of parallelism 1, from which a client 
> utility can read.





[jira] [Created] (FLINK-1944) Add a Gelly-GSA PageRank example

2015-04-26 Thread Vasia Kalavri (JIRA)
Vasia Kalavri created FLINK-1944:


 Summary: Add a Gelly-GSA PageRank example
 Key: FLINK-1944
 URL: https://issues.apache.org/jira/browse/FLINK-1944
 Project: Flink
  Issue Type: Task
Affects Versions: 0.9
Reporter: Vasia Kalavri
Priority: Minor








[GitHub] flink pull request: [FLINK-1670] Made DataStream iterable

2015-04-26 Thread ggevay
Github user ggevay commented on the pull request:

https://github.com/apache/flink/pull/581#issuecomment-96373660
  
I did the small change suggested by Gyula.




[jira] [Resolved] (FLINK-1758) Extend Gelly's neighborhood methods

2015-04-26 Thread Vasia Kalavri (JIRA)

 [ 
https://issues.apache.org/jira/browse/FLINK-1758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vasia Kalavri resolved FLINK-1758.
--
   Resolution: Implemented
Fix Version/s: 0.9

> Extend Gelly's neighborhood methods
> ---
>
> Key: FLINK-1758
> URL: https://issues.apache.org/jira/browse/FLINK-1758
> Project: Flink
>  Issue Type: Improvement
>  Components: Gelly
>Affects Versions: 0.9
>Reporter: Vasia Kalavri
>Assignee: Andra Lungu
> Fix For: 0.9
>
>
> Currently, the neighborhood methods only allow returning a single value per 
> vertex. In many cases, it is desirable to return several values or no value 
> per vertex. This is the case in clustering coefficient computation, 
> vertex-centric Jaccard, algorithms where a vertex computes a value per edge, 
> or when a vertex computes a value only for some of its neighbors.
> This issue proposes to 
> - change the current reduceOnEdges/reduceOnNeighbors methods to use 
> combinable reduce operations where possible
> - provide groupReduce-versions, which will use a Collector and allow 
> returning zero or more values per vertex.
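
The difference between the two flavors proposed above can be sketched in plain Java. This is an illustration under stated assumptions, not Gelly's API: the class NeighborhoodSketch, the NeighborsFunction interface, and both method signatures are hypothetical, and a java.util.function.Consumer stands in for Flink's Collector.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.BinaryOperator;
import java.util.function.Consumer;

/** Plain-Java illustration of reduce-style vs. groupReduce-style neighborhood methods. */
class NeighborhoodSketch {

    /** Hypothetical callback: sees all neighbor values of one vertex and may emit any number of results. */
    interface NeighborsFunction<T, O> {
        void iterateNeighbors(int vertexId, List<T> neighborValues, Consumer<O> out);
    }

    /**
     * Reduce flavor: values are folded pairwise, so the operation is combinable
     * and yields exactly one value per vertex that has at least one neighbor.
     */
    static <T> Map<Integer, T> reduceOnNeighbors(Map<Integer, List<T>> neighborValues,
                                                 BinaryOperator<T> reducer) {
        Map<Integer, T> result = new HashMap<>();
        neighborValues.forEach((vertex, values) ->
                values.stream().reduce(reducer).ifPresent(v -> result.put(vertex, v)));
        return result;
    }

    /**
     * Group-reduce flavor: the function receives a collector-like callback and
     * may emit zero or more values per vertex.
     */
    static <T, O> List<O> groupReduceOnNeighbors(Map<Integer, List<T>> neighborValues,
                                                 NeighborsFunction<T, O> fn) {
        List<O> out = new ArrayList<>();
        neighborValues.forEach((vertex, values) -> fn.iterateNeighbors(vertex, values, out::add));
        return out;
    }

    public static void main(String[] args) {
        Map<Integer, List<Integer>> neighbors = new HashMap<>();
        neighbors.put(1, List.of(2, 3));
        neighbors.put(2, List.of(5));
        neighbors.put(3, List.of());

        // One sum per vertex with neighbors.
        System.out.println(reduceOnNeighbors(neighbors, Integer::sum));

        // Zero or more outputs per vertex: only neighbor values above 2 are emitted.
        List<Integer> large = groupReduceOnNeighbors(neighbors, (vertex, values, out) -> {
            for (Integer v : values) {
                if (v > 2) {
                    out.accept(v);
                }
            }
        });
        System.out.println(large);
    }
}
```

The pairwise signature is what makes the reduce flavor combinable: partial results can be merged before all neighbor values have been seen, which the group-reduce flavor gives up in exchange for flexible output.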





[GitHub] flink pull request: [FLINK-1758][gelly] Neighborhood Methods Exten...

2015-04-26 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/flink/pull/576




[jira] [Commented] (FLINK-1758) Extend Gelly's neighborhood methods

2015-04-26 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14513018#comment-14513018
 ] 

ASF GitHub Bot commented on FLINK-1758:
---

Github user asfgit closed the pull request at:

https://github.com/apache/flink/pull/576


> Extend Gelly's neighborhood methods
> ---
>
> Key: FLINK-1758
> URL: https://issues.apache.org/jira/browse/FLINK-1758
> Project: Flink
>  Issue Type: Improvement
>  Components: Gelly
>Affects Versions: 0.9
>Reporter: Vasia Kalavri
>Assignee: Andra Lungu
>
> Currently, the neighborhood methods only allow returning a single value per 
> vertex. In many cases, it is desirable to return several values or no value 
> per vertex. This is the case in clustering coefficient computation, 
> vertex-centric Jaccard, algorithms where a vertex computes a value per edge, 
> or when a vertex computes a value only for some of its neighbors.
> This issue proposes to 
> - change the current reduceOnEdges/reduceOnNeighbors methods to use 
> combinable reduce operations where possible
> - provide groupReduce-versions, which will use a Collector and allow 
> returning zero or more values per vertex.





[jira] [Assigned] (FLINK-1942) Add configuration options to Gelly-GSA

2015-04-26 Thread Andra Lungu (JIRA)

 [ 
https://issues.apache.org/jira/browse/FLINK-1942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andra Lungu reassigned FLINK-1942:
--

Assignee: Andra Lungu

> Add configuration options to Gelly-GSA
> --
>
> Key: FLINK-1942
> URL: https://issues.apache.org/jira/browse/FLINK-1942
> Project: Flink
>  Issue Type: Improvement
>  Components: Gelly
>Affects Versions: 0.9
>Reporter: Vasia Kalavri
>Assignee: Andra Lungu
>
> Currently, it is not possible to configure a GSA iteration. Similarly to 
> vertex-centric, we should allow setting the iteration name and degree of 
> parallelism, aggregators, broadcast variables and whether the solution set is 
> kept in unmanaged memory.
> The docs should be updated accordingly.





[jira] [Created] (FLINK-1943) Add Gelly-GSA compiler and translation tests

2015-04-26 Thread Vasia Kalavri (JIRA)
Vasia Kalavri created FLINK-1943:


 Summary: Add Gelly-GSA compiler and translation tests
 Key: FLINK-1943
 URL: https://issues.apache.org/jira/browse/FLINK-1943
 Project: Flink
  Issue Type: Test
  Components: Gelly
Affects Versions: 0.9
Reporter: Vasia Kalavri


These should be similar to the corresponding Spargel tests.





[jira] [Created] (FLINK-1942) Add configuration options to Gelly-GSA

2015-04-26 Thread Vasia Kalavri (JIRA)
Vasia Kalavri created FLINK-1942:


 Summary: Add configuration options to Gelly-GSA
 Key: FLINK-1942
 URL: https://issues.apache.org/jira/browse/FLINK-1942
 Project: Flink
  Issue Type: Improvement
  Components: Gelly
Affects Versions: 0.9
Reporter: Vasia Kalavri


Currently, it is not possible to configure a GSA iteration. Similarly to 
vertex-centric, we should allow setting the iteration name and degree of 
parallelism, aggregators, broadcast variables and whether the solution set is 
kept in unmanaged memory.
The docs should be updated accordingly.





[jira] [Created] (FLINK-1941) Add documentation for Gelly-GSA

2015-04-26 Thread Vasia Kalavri (JIRA)
Vasia Kalavri created FLINK-1941:


 Summary: Add documentation for Gelly-GSA
 Key: FLINK-1941
 URL: https://issues.apache.org/jira/browse/FLINK-1941
 Project: Flink
  Issue Type: Task
  Components: Gelly
Affects Versions: 0.9
Reporter: Vasia Kalavri


Add a section in the Gelly guide to describe the newly introduced 
Gather-Sum-Apply iteration method. Show how GSA uses delta iterations 
internally and explain the differences of this model as compared to 
vertex-centric.





[jira] [Resolved] (FLINK-1514) [Gelly] Add a Gather-Sum-Apply iteration method

2015-04-26 Thread Vasia Kalavri (JIRA)

 [ 
https://issues.apache.org/jira/browse/FLINK-1514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vasia Kalavri resolved FLINK-1514.
--
   Resolution: Implemented
Fix Version/s: 0.9

> [Gelly] Add a Gather-Sum-Apply iteration method
> ---
>
> Key: FLINK-1514
> URL: https://issues.apache.org/jira/browse/FLINK-1514
> Project: Flink
>  Issue Type: New Feature
>  Components: Gelly
>Affects Versions: 0.9
>Reporter: Vasia Kalavri
>Assignee: Daniel Bali
> Fix For: 0.9
>
>
> This will be a method that implements the GAS computation model, but without 
> the "scatter" step. The phases can be mapped into the following steps inside 
> a delta iteration:
> gather: a map on each < srcVertex, edge, trgVertex > that produces a partial 
> value
> sum: a reduce that combines the partial values
> apply: join with vertex set to update the vertex values using the results of 
> sum and the previous state.
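
The three phases described above can be sketched outside of Flink. The following is a minimal plain-Java illustration, not Gelly's implementation: the class GsaSketch, its Edge type, and the maxIterations parameter are hypothetical names chosen here, and single-source shortest paths is used only as a convenient example in which "sum" is a minimum. It assumes non-negative edge weights and that every edge endpoint appears in the vertex list.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Plain-Java sketch of a Gather-Sum-Apply iteration; no Flink classes are used. */
class GsaSketch {

    /** A directed, weighted edge (hypothetical helper type). */
    static final class Edge {
        final int src;
        final int trg;
        final double weight;

        Edge(int src, int trg, double weight) {
            this.src = src;
            this.trg = trg;
            this.weight = weight;
        }
    }

    /** Runs GSA-style single-source shortest paths until no value changes or maxIterations is reached. */
    static Map<Integer, Double> shortestPaths(List<Integer> vertices, List<Edge> edges,
                                              int source, int maxIterations) {
        Map<Integer, Double> values = new HashMap<>();
        for (int v : vertices) {
            values.put(v, v == source ? 0.0 : Double.POSITIVE_INFINITY);
        }

        boolean changed = true;
        for (int i = 0; i < maxIterations && changed; i++) {
            // gather: one partial value per <srcVertex, edge, trgVertex> triplet.
            // sum: partial values for the same target vertex are combined (here with min).
            Map<Integer, Double> summed = new HashMap<>();
            for (Edge e : edges) {
                double partial = values.get(e.src) + e.weight;
                summed.merge(e.trg, partial, Math::min);
            }

            // apply: join the combined values with the vertex set and update a
            // vertex only when the new value improves on its previous state.
            changed = false;
            for (Map.Entry<Integer, Double> s : summed.entrySet()) {
                if (s.getValue() < values.get(s.getKey())) {
                    values.put(s.getKey(), s.getValue());
                    changed = true;
                }
            }
        }
        return values;
    }

    public static void main(String[] args) {
        List<Edge> edges = List.of(new Edge(1, 2, 1.0), new Edge(2, 3, 2.0), new Edge(1, 3, 5.0));
        System.out.println(shortestPaths(List.of(1, 2, 3), edges, 1, 10));
    }
}
```

Stopping as soon as no vertex changes mirrors the idea behind a delta iteration, where only updated vertices drive further work; this sketch still scans every edge in each round, which a real delta iteration would avoid.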





[jira] [Commented] (FLINK-1514) [Gelly] Add a Gather-Sum-Apply iteration method

2015-04-26 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/FLINK-1514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14513014#comment-14513014
 ] 

ASF GitHub Bot commented on FLINK-1514:
---

Github user asfgit closed the pull request at:

https://github.com/apache/flink/pull/408


> [Gelly] Add a Gather-Sum-Apply iteration method
> ---
>
> Key: FLINK-1514
> URL: https://issues.apache.org/jira/browse/FLINK-1514
> Project: Flink
>  Issue Type: New Feature
>  Components: Gelly
>Affects Versions: 0.9
>Reporter: Vasia Kalavri
>Assignee: Daniel Bali
>
> This will be a method that implements the GAS computation model, but without 
> the "scatter" step. The phases can be mapped into the following steps inside 
> a delta iteration:
> gather: a map on each < srcVertex, edge, trgVertex > that produces a partial 
> value
> sum: a reduce that combines the partial values
> apply: join with vertex set to update the vertex values using the results of 
> sum and the previous state.





[GitHub] flink pull request: [FLINK-1514][Gelly] Add a Gather-Sum-Apply ite...

2015-04-26 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/flink/pull/408




[jira] [Created] (FLINK-1940) StockPrice example cannot be visualized

2015-04-26 Thread Gyula Fora (JIRA)
Gyula Fora created FLINK-1940:
-

 Summary: StockPrice example cannot be visualized
 Key: FLINK-1940
 URL: https://issues.apache.org/jira/browse/FLINK-1940
 Project: Flink
  Issue Type: Bug
  Components: Streaming
Reporter: Gyula Fora


The planvisualizer fails on the JSON generated by the StockPrice example





[jira] [Created] (FLINK-1939) Add Parquet Documentation to Wiki

2015-04-26 Thread Felix Neutatz (JIRA)
Felix Neutatz created FLINK-1939:


 Summary: Add Parquet Documentation to Wiki
 Key: FLINK-1939
 URL: https://issues.apache.org/jira/browse/FLINK-1939
 Project: Flink
  Issue Type: Task
  Components: Documentation
Reporter: Felix Neutatz
Assignee: Felix Neutatz
Priority: Trivial


Add documentation on how to read and write Parquet :)


