=org.apache.hadoop.yarn.util.resource.DominantResourceCalculator
And on the Samza side, I'm setting:
cluster-manager.container.cpu.cores=2
However, YARN is still telling me that the running task has 1 vcore
assigned. Do you have any other suggestions for options to tweak?
Cheers,
Malcolm
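For reference, a minimal sketch of the settings that usually have to agree before YARN reports more than one vcore per container. It uses key=value shorthand; the YARN entries are assumptions about a typical Capacity Scheduler setup and would live in capacity-scheduler.xml and yarn-site.xml as property elements, and the numeric values are illustrative only.

```properties
# capacity-scheduler.xml (YARN): count vcores as well as memory when scheduling.
# The default calculator considers memory only.
yarn.scheduler.capacity.resource-calculator=org.apache.hadoop.yarn.util.resource.DominantResourceCalculator

# yarn-site.xml (YARN): per-node and per-request vcore limits (illustrative values).
yarn.nodemanager.resource.cpu-vcores=4
yarn.scheduler.maximum-allocation-vcores=4

# Samza job config: vcores requested for each container.
cluster-manager.container.cpu.cores=2
```

If the ResourceManager still reports a maximum of 1 vcore after a restart, it may not have loaded the files that were edited.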
On Mon, Apr 1, 2019 at 5:28 PM Malcolm McFarland
wrote:
>
supports container requests with max-mem:
> > 14336, max-cpu: 1"
> >
> > On Tue, Apr 2, 2019 at 12:09 AM Malcolm McFarland
> > wrote:
> >
> >> Hey Prateek,
> >>
> >> The upgrade to Hadoop 2.7.6 went fine; everything seems to be working, a
on shown on-demand, as
opposed to preemptive?
Cheers,
Malcolm
On Tue, Apr 2, 2019 at 12:54 PM Malcolm McFarland
wrote:
> Hi Prateek,
>
> I'm not getting an error now, just an unyielding vcore allotment of 1.
> I just verified that we're settin
from the v2.6.1 docs (which I was initially using because of
its inclusion in the hello-samza project) to mean that this was a
per-container setting.
Thanks again for the help, and for the tip on upgrading to Yarn 2.7.6!
Cheers,
Malcolm
On Tue, Apr 2, 2019 at 1:47 PM Malcolm McFarland
wrote:
One more thing -- fwiw, I actually also came across the possibility that I
would need to use the DominantResourceCalculator, but as you point out,
this doesn't seem to be available in Hadoop 2.6.
On Mon, Apr 1, 2019 at 5:27 PM Malcolm McFarland
wrote:
> That's quite helpful! I actua
d DominantResourceCalculator to account for vcore allocations in
> scheduling.
>
> - Prateek
>
> On Mon, Apr 1, 2019 at 3:00 PM Malcolm McFarland
> wrote:
>
> > Hi Prateek,
> >
> > This still seems to be manifesting with the same problem. Since this seems
> > to be s
about how the container is requesting more
resources than it can allocate. With 1 core, everything is fine. Is
there another Samza option I need to set?
Cheers,
Malcolm
--
Malcolm McFarland
Cavulus
cient. We haven't seen this
> issue before. What Samza/YARN versions are you using? Can you also include
> the logs from where you get the error and your yarn configuration?
>
> - Prateek
>
> On Mon, Apr 1, 2019 at 2:33 AM Malcolm McFarland
> wrote:
>
> > Hey Folks,
s? Looking at the Samza source on Github,
it appears to be information that's passed back to the AM when it
starts up.
Cheers,
Malcolm
On Mon, Apr 1, 2019 at 10:44 AM Malcolm McFarland
wrote:
>
> Hi Prateek,
>
> Sorry, meant to include these versions with my email; I'm running
> Samza 0
to default to 1? Can you
> try setting maximum-allocation-vcores lower?
>
> - Prateek
>
> On Mon, Apr 1, 2019 at 11:59 AM Malcolm McFarland
> wrote:
>
> > One other detail: I'm running YARN on ECS in AWS. Has anybody seen
> > issues with core allocation in this enviro
h this, or maybe
just isn't white-listing a port correctly, and if I could identify where
the application is stalling, it'd probably help to narrow down the
possibilities.
Cheers,
Malcolm McFarland
Cavulus
This correspondence is from HealthPlanCRM, LLC, d/b/a Cavulus. Any
unauthorized or improper
guessing it uses gethostname() on
Ubuntu, could be wrong).
Any ideas?
Cheers,
Malcolm McFarland
Cavulus
of storage (this can be extended a small amount
via an ephemeral mounted volume but seemingly not enough to satisfy YARN's
VM requirements).
Hth, and thanks for everybody's patience,
Malcolm McFarland
Cavulus
No problem -- I'm happy that we finally figured this out and could share
our results. ECS could actually be a good choice for Node Managers; it's
easy in ECS to scale node counts up and down and to cycle out unhealthy
servers.
Malcolm McFarland
Cavulus
https://confluence.atlassian.com/doc/generating-a-thread-dump-externally-182158040.html#GeneratingaThreadDumpExternally-GeneratingthreaddumpsonLinux
java.net.ConnectException
Since this is a ConnectException, can you rule out network issues? Can the
AM host an
were looking for?
>
> Also, by "don't fully start up" do you mean that
> applications are missing some containers (but the ApplicationMaster is
> running)?
> Or the application is missing entirely.
>
> --
> thanks
> rayman
> [image: Samza Job Launch Sequence.png]
Hey folks,
Are there any guidelines for how to provision an Application Master in
relation to the number of StreamTask instances it will be managing? I.e., are
there different memory, CPU, and thread-count figures for 100 StreamTasks
vs. 1000 vs. 1?
Cheers,
Malcolm McFarland
Cavulus
,
yarn.am.container.memory.mb=1536, yarn.am.opts=-Xmx1024m); does that sound
reasonable?
Cheers,
Malcolm McFarland
Cavulus
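As a rough sanity check on the two values quoted above, the sketch below restates them with the headroom they imply. The split between heap and non-heap memory is an assumption about a typical JVM, not a Samza recommendation.

```properties
# Container size requested from YARN for the ApplicationMaster, in MB.
yarn.am.container.memory.mb=1536
# JVM options for the AM; a 1024 MB heap leaves 1536 - 1024 = 512 MB
# for metaspace, thread stacks, and other off-heap use.
yarn.am.opts=-Xmx1024m
```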
on a YARN cluster, from Accepted status, to
localization, to the application master startup, to the actual
application's startup?
Cheers,
Malcolm
--
Malcolm McFarland
Cavulus
>
> Can you roll-back to a known-good version to better isolate the issue?
>
> Best,
> Jagadish
>
> On Tue, May 7, 2019 at 3:54 PM Malcolm McFarland
> wrote:
>
> > As a followup to this, here's what I see when the Samza app tries to start;
> > it act
and
deduce what's happening?
Cheers,
Malcolm
--
Malcolm McFarland
Cavulus
Malcolm McFarland
wrote:
> Hey folks,
>
> We're having some trouble running Samza under YARN. The YARN
> containers are launching fully into the RUNNING state, and I can see
> in the node manager logs that the containers are running, but my logs
> are showing that Samza is not actual
:
> >
> >
> https://stackoverflow.com/questions/55737123/samza-1-1-0-run-app-sh-does-not-work-during-deployment-of-hello-samza
> >
> > Can someone on your team please help?
> >
> > Many thanks,
> >
> > Majd
> >
> >
>
--
Malcolm
or not?
Cheers,
Malcolm McFarland
Cavulus
s* the task?
Cheers,
Malcolm McFarland
Cavulus
Hey folks,
We have cleanup.policy=compact set on our checkpoint topics. Even with
this, we have almost 3 billion messages in some of these topics, and this
is causing huge startup times. Are there any other settings we should set
to optimize our startup times?
Cheers,
Malcolm McFarland
Cavulus
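One place to look is the set of topic-level Kafka settings that control how aggressively compaction runs. The sketch below lists real Kafka topic configs, but every value is an illustrative assumption rather than a tested recommendation for Samza checkpoint topics.

```properties
# Topic-level Kafka settings for a Samza checkpoint topic (illustrative values).
cleanup.policy=compact
# Roll segments sooner; the active segment is never compacted.
segment.ms=3600000
segment.bytes=26214400
# Clean once 10% of the log is uncompacted (the Kafka default is 0.5).
min.cleanable.dirty.ratio=0.1
# How long delete tombstones are retained after compaction.
delete.retention.ms=86400000
```

Compaction also depends on the broker's log cleaner being enabled, so a topic that keeps growing despite cleanup.policy=compact may point to a broker-side issue rather than a topic setting.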
=compact?
Cheers,
Malcolm McFarland
Cavulus
Also, is there a way to produce this error, i.e., if we added extra messages
to the __checkpoint topics?
Malcolm McFarland
Cavulus
. Are there any best practices or gotchas
surrounding restarting Samza applications on YARN that could help here?
Cheers,
Malcolm McFarland
Cavulus
Actually, do you have an example of some appropriate settings for Kafka to
ensure that compaction is behaving correctly for the Samza checkpoint
topics?
Cheers,
Malcolm McFarland
Cavulus
and
yarn.nodemanager.process-kill-wait.ms YARN values. Would this give Samza
more time to shutdown, perhaps allowing unpersisted checkpoints to be
written out?
Cheers,
Malcolm McFarland
Cavulus
* topics to be removed.
I have two questions:
1) Where within these queues is the grouper configuration stored?
2) Would a Kafka topic cleanup.policy of "compact" cause trouble here?
Cheers,
Malcolm McFarland
Cavulus
he
message for processing. Are there any ideas out there about how to do this?
Cheers,
Malcolm McFarland
Cavulus
deploy (we've had no other code-related issues transitioning to YARN).
I'll take a look through the metrics and see if any of those could fill
this role. Right now we're looking at per-partition consumption, and maybe
"process-calls" will help with that.
Cheers,
Malcolm McFarla
rule-of-thumb values for each of these parameters.
Cheers,
Malcolm McFarland
Cavulus
the effect be on a container that was only allowed one CPU core?
Would it be ok to trade that off for more containers?
Cheers,
Malcolm McFarland
Cavulus
recommend scaling up in box YARN node processing capability,
or out in YARN node count?
Thanks,
Malcolm McFarland
Cavulus
Samza 0.14.1 and using AWS MSK, which is running Kafka
2.2.1.
Thanks so much,
Malcolm McFarland
Cavulus
. Fwiw, we're using Kafka 0.11.0.2 with Samza 0.14.1; my
understanding is that there should be version compatibility between Kafka
0.11.0.x and 2.x.
If you have any other ideas, I'd be interested in hearing them.
Cheers,
Malcolm McFarland
Cavulus
On Fri, Apr 10, 2020 at 8:36 PM Yi Pan wrote:
>
using coordination service) one must use
org.apache.samza.container.grouper.task.GroupByContainerIdsFactory". I
added that in and everything is starting up smoothly.
Cheers,
Malcolm McFarland
Cavulus
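For anyone hitting the same thing, a minimal sketch of how that grouper might sit alongside the ZooKeeper-based coordinator settings. The ZooKeeper address is a hypothetical placeholder, and the assumption that the job uses the ZK job coordinator is inferred from the log excerpts in this thread.

```properties
# ZooKeeper-based (standalone) job coordination.
job.coordinator.factory=org.apache.samza.zk.ZkJobCoordinatorFactory
# Hypothetical ZooKeeper address; replace with your own ensemble.
job.coordinator.zk.connect=localhost:2181
# Task grouper named in the message above.
task.name.grouper.factory=org.apache.samza.container.grouper.task.GroupByContainerIdsFactory
```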
for session: 0x18fae2900a0
2020-12-17 17:04:32.309 [Samza Debounce
Thread-81ac6a4e-3d5e-479c-9a6c-2f2d9b4372d3] StreamProcessor [INFO]
Shutting down the executor service of the stream processor:
81ac6a4e-3d5e-479c-9a6c-2f2d9b4372d3.
Does this help?
Cheers,
Malcolm McFarland
Cavulus
-40c9-8063-1efd588cec56. Stopping this processor. New
JobModel: JobModel [..]
At this point the ThreadJob shuts down cleanly. Afaict, the legacy
configuration is set up correctly, and mirrors our functional build under
0.14.1. Any thoughts?
Cheers,
Malcolm McFarland
Cavulus
:
HADOOP_YARN_HOME
YARN_HOME
HADOOP_COMMON_HOME
HADOOP_HOME
HADOOP_PREFIX
HADOOP_CONF_DIR
My yarn-site.xml is at $YARN_HOME/etc/hadoop/yarn-site.xml, and contains
the following configuration:
<property>
  <name>yarn.resourcemanager.address</name>
  <value>${RM_IP_ADDRESS}</value>
</property>
This worked fine in 0.14.1.
Cheers,
Malcolm McFarland
the local yarn-site.xml
accurately. When it starts on yarn, though, it seems to be resolving the
resource manager to localhost. The final exception information is at the
end of this email. Any ideas?
Cheers,
Malcolm McFarland
Cavulus
Failed to connect to server: localhost/127.0.0.1:8030: retries get failed
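Port 8030 is the default for the ResourceManager scheduler address, which the ApplicationMaster talks to, rather than for yarn.resourcemanager.address. One thing worth checking, sketched below in key=value shorthand for yarn-site.xml property elements, is whether the scheduler address is also set in the configuration that the AM container actually sees. This is an assumption about the cause, not a confirmed diagnosis.

```properties
# Setting the hostname lets the other ResourceManager addresses derive from it.
yarn.resourcemanager.hostname=${RM_IP_ADDRESS}
# Or set the scheduler endpoint explicitly (8030 is its default port).
yarn.resourcemanager.scheduler.address=${RM_IP_ADDRESS}:8030
```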
Hey all,
Has Samza 1.7.0 been officially released? I've been following the
discussion here, and it seems like it was cleared in March, but I haven't
seen any announcements or updates to the docs. Not trying to be pushy here,
just curious about the status of release 1.7.0.
Cheers,
Malcolm
if we can integrate it into a local Java 11-based build. Thanks
Jamie!
Cheers,
Malcolm McFarland
Cavulus
On Wed, Aug 24, 2022 at 1:47 PM James DeMichele
wrote:
> Hi, I'm not sure if my previous email went through so thought that I would
> try again.
>
> I know that Samza is not
-coordinationData/jobModelGeneration/jobModelVersion
At which point the application silently exits.
Thanks in advance for any advice, ideas, things to check, etc.
Cheers,
Malcolm McFarland
Cavulus
://hadoop.apache.org/docs/r3.3.0/index.html). Are there any unit tests
in Samza that verify compatibility against a YARN cluster? If so, that
could be a place to validate YARN v2.10/v3.3 cross-compatibility.
Just throwing my 2 cents out there,
Malcolm McFarland
Cavulus
On Fri, Sep 2, 2022 at 6:27 PM
,
Malcolm McFarland
Cavulus
[0]
https://github.com/apache/samza/blob/1.7.0/samza-kafka/src/main/scala/org/apache/samza/system/kafka/KafkaSystemProducer.scala#L97-L101
[1]
https://github.com/apache/kafka/blob/2.4/clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java#L895
[2]
https
key-to-partition
hashing algorithm in Samza, or b) if there is a reliable general-purpose
algorithm that can create the same results as Samza's algorithm?
Cheers,
Malcolm McFarland
Cavulus
[0]
https://github.com/apache/samza/blob/1.7.0/samza-kafka/src/main/java/org/apache/samza/util
Hey all,
I just noticed that, in the blog post for Apache Samza 1.8.0, there is mention of
following instructions for the 1.7.0 upgrade. However, there is no blog
post about 1.7.0, nor is there a version of the documentation for 1.7.0.
Did that get accidentally dropped?
Cheers,
Malcolm McFarland
Just found this under the "Releases" tab. A little confusing, but that
works!
Cheers,
Malcolm McFarland
Cavulus
On Thu, Apr 27, 2023 at 4:51 PM Malcolm McFarland
wrote:
> Hey all,
>
> I just noticed that, in the blog post for Apache 1.8.0, there is mention
> of
autoscaling module
(removed in version 1.4.0), but no actual documentation or examples.
Cheers,
Malcolm McFarland
Cavulus
way to run a
single instance of a StreamTask locally? I'm using Samza 1.6.0, Kafka
2.2.2, and ZooKeeper 3.4.14.
Thanks in advance for the help!
Cheers,
Malcolm McFarland
Cavulus
Here's a sample of the last few log messages, log level set to TRACE:
[INFO] Metadata Cluster ID: _6MEL_AQSBW7DpOjoOD