Slack digest for #general - 2020-03-05

Apache Pulsar Slack Thu, 05 Mar 2020 01:11:24 -0800

2020-03-04 09:57:05 UTC - Vincent LE MAITRE: Hi, I have added this parameter on 
my bookkeeper instances. But I am still not able to use function state from my 
broker instances. I am using a Pulsar 2.5.0 deployed on a Kubernetes cluster 
using the provided helm chart. Please could you tell more about your setup ?
----
2020-03-04 10:02:07 UTC - Vincent LE MAITRE: Did you configure 
"stateStorageServiceUrl" into the functions_worker.yml conf file ? If yes, 
which value ? Thanks
----
2020-03-04 10:18:48 UTC - Viji: Hi, are there any restrictions on topic names 
in pulsar like the maximum length, allows characters etc ?
----
2020-03-04 10:22:40 UTC - Viji: When we create a partitioned topic with name 
which is a substring of an already existing partitioned topic why do we see 
following error, is it a defect in pulsar ?
----
2020-03-04 10:23:06 UTC - Viji: [3:50 PM] Sakthi Prasad K
./pulsar-admin topics create-partitioned-topic 
<persistent://public/default/create_substring_topic> -p 1

[3:50 PM] Sakthi Prasad K
./pulsar-admin topics create-partitioned-topic
<persistent://public/default/substring_topic> -p 1
10:14:38.692 [AsyncHttpClient-5-1] WARN
org.apache.pulsar.client.admin.internal.BaseResource -
[<https://itomdipulsar-proxy:8443/admin/v2/persistent/public/default/substring_topic/partitions>]
Failed to perform http put request: javax.ws.rs.ClientErrorException: HTTP 412
Precondition Failed Already have topic
<persistent://public/default/create_substring_topic-partition-0> which contains
partition suffix '-partition-' and end with numeric value, Creation of
partitioned topic substring_topic could cause conflict.
Reason: Already have topic
<persistent://public/default/create_substring_topic-partition-0> which contains
partition suffix '-partition-' and end with numeric value, Creation of
partitioned topic substring_topic could cause conflict.
----
2020-03-04 11:12:51 UTC - Viji: We raised an issue on git (
<https://github.com/apache/pulsar/issues/6468>)
----
2020-03-04 12:04:29 UTC - eilonk: Did anyone try to use tls authentication and
recieved this error?
`sun.misc.Launcher$AppClassLoader@18769467] error Uncaught exception in thread
main: Failed to load an authentication provider.`
I set authenticationProviders as:
`[conf/proxy.conf] Applying config authenticationProviders =
org.apache.pulsar.broker.authentication.AuthenticationProviderTls,org.apache.pulsar.broker.authentication.AuthenticationProviderToken`
(this is just an example from proxy, this is the same value I used everywhere)
did anyone encounter this?
----
2020-03-04 13:09:25 UTC - Yosi Attias: @Yosi Attias has joined the channel
----
2020-03-04 13:32:06 UTC - Ryan Slominski: I'm using standalone mode of Pulsar
(entire cluster on a single machine). I am not familiar with getting state
working in containers, but I vaguely remember reading somewhere you need to set
that URL in a YAML file. I think the parameter for standalone mode is
something like:

```--state_storage_serviceurl <bk://localhost:4181>```

----
2020-03-04 13:45:15 UTC - Ryan Slominski: You might also need something like:

`functionsWorkerEnabled=true`

in your broker.conf or whatever is the equivalent in the containers.
----
2020-03-04 13:49:57 UTC - Vincent LE MAITRE: OK. Thanks. Why are you using the
port 4181 for bookkeeper ? The default port should be 3181 isn't it ?
----
2020-03-04 13:51:14 UTC - Vincent LE MAITRE: functionsWorker is enabled. My
functions without state are working fine. But functions with state dores not
work
----
2020-03-04 15:35:24 UTC - Luis Muniz: @Luis Muniz has joined the channel
----
2020-03-04 16:08:57 UTC - Chris: @Chris has joined the channel
----
2020-03-04 16:27:07 UTC - Vincent LE MAITRE: Hi, bookkeeper is configured by
default on port 3181. In order to configure function state, I found this
example into the conf : stateStorageServiceUrl: <bk://localhost:4181>. What is
this 4181 port ? Another port of bookkeeper ? Thanks
----
2020-03-04 16:27:43 UTC - Pavel Tishkevich: @Sijie Guo @Devin G. Bost Could you
provide more details about tuning zookeeper to handle increased load?
----
2020-03-04 16:28:29 UTC - Devin G. Bost: It’s all in here:
<https://zookeeper.apache.org/doc/r3.4.13/zookeeperAdmin.pdf>
----
2020-03-04 16:45:31 UTC - Ryan Slominski: I might have port wrong for
bookkeeper. I can't find it now, but some sources on the Internet reference
that alternate port number. The standalone.conf that ships with version 2.5.0
of Pulsar doesn't specify and the bookkeeper.conf says 3181 like you say.
----
2020-03-04 16:50:47 UTC - Chris: How would I start diagnosing slow consumption
from large backlogs? When trying to replay old data my consumers run at
1mb/s/partition instead of the 200 or so I see when consuming with no/very
small backlog. It's equally slow when attaching `pulsar-perf` to that
subscription name. I've checked my configs for rate limits on backlogged
consumers, and they're all set to 0 which I assume is unlimited, so I'm not
quite sure what's going on.
----
2020-03-04 17:21:52 UTC - Sijie Guo: Ah I see. that seems to be a regression in
2.5.0. /cc @Penghui Li
----
2020-03-04 17:25:54 UTC - Sijie Guo: If you have a huge backlog, try to check
if it is because `dbStorage_rocksDB_blockCacheSize` is too small.
----
2020-03-04 17:30:55 UTC - Mathieu Druart: @Vincent LE MAITRE we tried the same
thing and ran into the same issue. Never succeeded to activate State into
Pulsar Functions with the Helm deployment.
----
2020-03-04 18:32:32 UTC - Chris: Is there a way to check the size of the index
db? It looks like I missed a zero or two on that config though as it was set
really small.
----
2020-03-04 19:14:51 UTC - Chris: Changing that value didn't seem to help at
all. Any other ideas?
----
2020-03-04 19:21:08 UTC - Kevin Xu: @Kevin Xu has joined the channel
----
2020-03-04 20:37:14 UTC - Sergii Zhevzhyk: Hi, I cannot find the release 2.5.1
among the milestones. When do you plan to release it?
----
2020-03-04 20:43:34 UTC - Nick Poore: @Nick Poore has joined the channel
----
2020-03-04 21:48:22 UTC - Sijie Guo: @Sergii Zhevzhyk we are not using
milestone for minor releases any more. we are using label `release/2.5.1`
instead. because a change can go with a major release (like 2.6.0) and a minor
release like (2.5.1). Github doesn’t allow us tagging multiple milestones.

We are in the process of cleaning up last batch of 2.5.1 fixes. expect to start
releasing 2.5.1 early next week.

You can subscribe to dev@ mailing list for these updates.
----
2020-03-04 22:14:05 UTC - Sergii Zhevzhyk: @Sijie Guo thank you for the info!
----
2020-03-04 23:32:09 UTC - Eric Simon: If I wasnt to override the
```instanceLivenessCheckFreqMs```
For a pulsar function. Is there a way to do this without updating the
function_worker.yml? Can I pass it in as runtime flag?
----
2020-03-05 01:42:26 UTC - Penghui Li: Ok, I will take a look.
----
2020-03-05 02:19:29 UTC - Eugen: Thanks merlimat, `blockIfQueueFull=true` did
the trick
----
2020-03-05 03:31:52 UTC - Ken Huang: Hi, how do I understand processing
guarantees of pulsar function?
"At-most-once" means regardless of whether the consumer received the message
"At-least-once" means promises consumer received the message
"Effectively-once" means promises consumer received the "correct" message
The above is my understanding, is it correct?
----
2020-03-05 03:43:56 UTC - Prasad Reddy: I Managed to set up pulsar 2.5.0-x on
kubernetes and doing some benchmark tests for producer and while doing tests,
noticed that "Journal Queue Length(The total number of requests pending in the
journal queue)" size is increasing gradually which I'm suspecting abnormal. Can
anyone explain that why the Journal Queue Size is increasing and what might be
the cause for this behavior?
----
2020-03-05 05:02:11 UTC - Sijie Guo: Currently the setting is inherited from
worker. Updating the function_worker.yml is the only way. Can you create a
github issue for requesting a feature? We can improve the behavior.
----
2020-03-05 05:03:06 UTC - Sijie Guo: I think these two metrics are gauge in
bookie side. but they are probably not handled properly.
----
2020-03-05 05:04:30 UTC - Sijie Guo: processing guarantees for functions
reflect whether a function writes the result to the output topic.
----
2020-03-05 05:05:06 UTC - Sijie Guo: ‘at-most-once’ means the function process
the message “at-most-once” but there is no guarantee the result is written to
the output topic.
----
2020-03-05 05:05:57 UTC - Sijie Guo: ‘at-least-once’ guanrantee the the result
is written to the output topic. but the function might be processed multiple
times and multiple results might be written to the output topic.
----
2020-03-05 05:06:27 UTC - Sijie Guo: ‘Effectively-once’ means the result is
written exactly once to the output topic. but the function might still run
multiple times.
----
2020-03-05 05:17:45 UTC - Prasad Reddy: @Sijie Guo Thank you for the info.
Ok. What might be the reason for not handling properly? How do resolve this
issue? Is there any specific parameters at bookie side to play around to handle
more data?
----
2020-03-05 05:21:34 UTC - Ken Huang: thank you very much for reply
----
2020-03-05 05:22:33 UTC - Sijie Guo: I think we need to fix the bookie code.
Can you create a giithub issue?
----
2020-03-05 05:23:58 UTC - Prasad Reddy: @Sijie Guo Sure.. thank you !
----
2020-03-05 06:01:47 UTC - Prasad Reddy: @Sijie Guo We raised an issue on git
(<https://github.com/apache/pulsar/issues/6479>)
*Describe the bug*
Bookie Journal Queue Length is increasing gradually during performance
executions
*To Reproduce*
Steps to reproduce the behavior:
1. Install Pulsar 2.5.0-08 on kubernetes cluster environment ( 3 Bookies, 3
brokers,3 proxy).
2. Create a topic with three partitions.
3. Run longevity test (48 hours) by passing 1 KB message size as input to send
data to created topic in step 2
4. After completion of the few hours execution, Check size of "Journal Queue
Length" bookie metric.
*Expected behavior*
Should not increase queue length of journal and bookies should be able to
handle properly.
*Screenshots*
Attached screenshot of "Journal Queue Size Length" metric.
*Desktop (please complete the following information):*
• OS: CentOS 7.7
• Master Node: 1
• Worker Nodes : 3
*H/w Details*:
• Master node: 12x16x200 (Not running any Pulsar pods on master node)
• Worker nodes : 16x32x200 ( each worker node has additional 4 virtual disks
for Pulsar pods)

----
2020-03-05 06:39:57 UTC - Eric Simon: Sure thing. I will create that ticket in
the morning.
----
2020-03-05 08:29:44 UTC - Vincent LE MAITRE: Hi @Mathieu Druart, did you open
an issue on Pulsar related to this ? Thanks for your answer. Do you find
another solution to implement kind of state into your services / functions ?
----

Slack digest for #general - 2020-03-05

Reply via email to