Apache's commons-math implementation offers various strategies for handling
this scenario:
http://commons.apache.org/proper/commons-math/jacoco/org.apache.commons.math3.stat.clustering/KMeansPlusPlusClusterer.java.html
(take a look at the EmptyClusterStrategy enum options)
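To make the idea concrete, here is a rough sketch of one such strategy: re-seeding an empty cluster with the point that lies farthest from its own centroid, which is roughly what the FARTHEST_POINT option describes. This is not the commons-math code; the function names `assign` and `reseed_empty_clusters` are made up for illustration, and points are assumed to be plain tuples of floats.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def assign(points, centroids):
    """Group points by the index of their nearest centroid."""
    clusters = [[] for _ in centroids]
    for p in points:
        nearest = min(range(len(centroids)),
                      key=lambda i: squared_distance(p, centroids[i]))
        clusters[nearest].append(p)
    return clusters


def reseed_empty_clusters(clusters, centroids):
    """Hypothetical helper: give every empty cluster the point that is
    farthest from the centroid of the cluster it currently belongs to."""
    for i, members in enumerate(clusters):
        if members:
            continue
        best = None  # (distance, donor cluster index, point)
        for j, donor in enumerate(clusters):
            if len(donor) < 2:
                continue  # do not empty another cluster while fixing this one
            for p in donor:
                d = squared_distance(p, centroids[j])
                if best is None or d > best[0]:
                    best = (d, j, p)
        if best is None:
            raise ValueError("no cluster can donate a point")
        _, j, p = best
        clusters[j].remove(p)
        members.append(p)
        centroids[i] = p  # the moved point becomes the new centroid
    return clusters, centroids
```

The donor cluster's centroid is left untouched here; the next regular k-means update step would recompute it. Raising an error instead of re-seeding would correspond to the other behaviour asked for in this thread.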
2015-02-24 23:28
Hey Flinksters and IntelliJers, ;-)
the test resources directory of each Maven module contains a
log4j-test.properties file, which gets picked up via the classpath by JUnit
tests, but not by ScalaTest. Instead, ScalaTest picks up log4j.properties, but
JUnit doesn't.
It works when I specify the
Hi Amit,
Extensions of Flink should go in the flink-contrib project. Try
implementing a simple algorithm for your first task. If you extend it
and it becomes useful for other people besides you, make a pull
request via GitHub, so other people can check it out.
Your general understanding of the
Hi Matthias!
Can you try mvn clean package? The package goal is generally preferable to
compile, in my opinion.
There may be a dependency through a test jar that is not properly handled
when you do not execute the package goal.
Stephan
On 25.02.2015 at 16:54, Max Michels m...@apache.org wrote:
Hi
Hi,
Sorry for the delay in replying to this issue.
The jobmanager.rpc.address is already set to “localhost” in conf.yaml.
This can’t be the issue, because the job manager web interface, which also
runs on localhost, works fine.
bin/flink run jar doesn’t seem to work either. Let me send you my command
Henry Saputra created FLINK-1611:
Summary: Rename classes and packages with Nephele
Key: FLINK-1611
URL: https://issues.apache.org/jira/browse/FLINK-1611
Project: Flink
Issue Type:
Thanks for the replies guys!
@Aljoscha: I get your point, but I would actually expect either an error
message or the lonely centroid to move.
k-means is supposed to cluster data into k clusters. If you end up with fewer
than k, something must have gone wrong.. :s
@Alex: very helpful resource, thanks. I will
Addition: To check whether a port is reachable, I think the easiest thing
is to try and connect with a telnet client and see if the connection is
refused.
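If no telnet client is at hand, the same check can be sketched in a few lines of Python. The helper name `is_port_reachable` is made up for this example; it only tells you whether a TCP connection can be opened, nothing more.

```python
import socket


def is_port_reachable(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port can be opened.

    A refused connection, a timeout, or a failed name lookup all count
    as 'not reachable'.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

For example, `is_port_reachable("localhost", 6123)` would probe the JobManager, assuming it listens on port 6123; substitute whatever port your configuration actually uses.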
On Wed, Feb 25, 2015 at 8:15 PM, Stephan Ewen se...@apache.org wrote:
Okay, the problem seems to be that even though both the client and the
Okay, the problem seems to be that even though both the client and the
jobmanager use localhost as the host name, they resolve this to different
IP addresses: in one case 127.0.0.1, in the other 10.216.177.146.
Also, the 127.0.0.1 address apparently cannot communicate with 10.216.177.146.
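One way to see which addresses a host name resolves to on a given machine is a small script like the following. It is only a sketch: `resolve_addresses` is a made-up helper, and it reports what the operating system's resolver returns, which may differ from what a JVM process ends up using.

```python
import socket


def resolve_addresses(hostname):
    """Return the sorted, de-duplicated IP addresses for a host name."""
    infos = socket.getaddrinfo(hostname, None)
    # Each entry is (family, type, proto, canonname, sockaddr);
    # sockaddr[0] is the address string for both IPv4 and IPv6.
    return sorted({info[4][0] for info in infos})
```

Running `resolve_addresses("localhost")` on both the client machine and the JobManager machine and comparing the results should show whether they disagree.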
Can
Henry Saputra created FLINK-1612:
Summary: Add guidelines to avoid duplicate class names and more
JavaDoc for new addition
Key: FLINK-1612
URL: https://issues.apache.org/jira/browse/FLINK-1612
Hey,
since you've already read the documentation, I can recommend checking out
some slides about Flink on Slideshare as well.
Here is our How to Contribute guide:
http://flink.apache.org/how-to-contribute.html
Best,
Robert
On Wed, Feb 25, 2015 at 11:09 AM, amit pal amit5...@gmail.com wrote:
Hi Amit,
Nice to hear you're interested in Flink. The first thing you could do
is run the provided examples. Next, you could try to implement a
simple Flink job yourself. Besides the How to contribute guide, take
a look at the material page [1], the Flink blog [2], and the Data
Artisans blog [3].
Hej,
Very interesting discussion.
I hadn't heard of the SSP model before, looks like something I want to look
into.
I wonder if any of the algorithms that would work in that model would not
work in an asynchronous model, since asynchronous is basically an SSP model
with infinite slack. Iterative
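The "infinite slack" remark can be illustrated with the usual SSP rule: a worker may start its next iteration only while it is at most `slack` iterations ahead of the slowest worker. The function below is a toy sketch of that rule under this assumption; the name `may_proceed` and the clock representation (one integer iteration counter per worker) are invented for the example.

```python
import math


def may_proceed(worker_clock, all_clocks, slack):
    """SSP check: the worker may run ahead of the slowest worker
    by at most `slack` iterations."""
    return worker_clock - min(all_clocks) <= slack


clocks = [5, 3, 4]  # iteration counters of three hypothetical workers

# slack = 0 behaves like fully synchronous (BSP) execution:
# only workers level with the slowest one may continue.
bsp_fastest = may_proceed(5, clocks, slack=0)

# slack = infinity never blocks anyone, i.e. fully asynchronous execution.
async_fastest = may_proceed(5, clocks, slack=math.inf)
```

With these clocks, the fastest worker is blocked for any slack below 2 and is never blocked when the slack is infinite.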
Hi Guys,
I have been reading the Flink documentation and the mailing list discussions
for the last two days, but haven't yet fully grasped everything that Flink
offers.
My next step is to start with the examples on GitHub.
Can someone give me a gist of what Flink is all about and some
Hi Max,
thanks for the detailed answer.
That was exactly what I was looking for.
I switched the serialization from Kryo to the Value interface instead;
keeping everything else constant, this basically halved the execution time. Nice.
One note - iterating over the array in serial fashion works,
Hey all,
I will start with some starter JIRA tasks, along with building an
application on Flink, particularly in the domain of NLP/text processing.
Any good NLP ideas to implement on Flink? I am thinking of picking this
JIRA issue, https://issues.apache.org/jira/browse/FLINK-1450, to start on the
16 matches