identified. Perhaps the problem could be simplified, though, by considering the context and purpose of Kafka. I would use a persistent message queue because I want to guarantee that data/messages don't get lost. But since Kafka is not meant to be a long-term storage solution (other products can be used for that), I would narrow that guarantee to apply only to the most recent messages, up to a configured threshold (e.g., max 24 hrs, max 500GB). Once a threshold is reached, the oldest messages are deleted first.
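In Kafka 0.7 terms, those thresholds map roughly onto the retention settings discussed later in this thread (values here are illustrative, and per Jun's note below, log.retention.size applies per partition log, not per broker):

# delete segments older than 24 hours
log.retention.hours=24
# cap each partition's log at ~500GB; oldest segments are deleted first
log.retention.size=536870912000
# roll a new segment file every 512MB
log.file.size=536870912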
To ensure no message loss (up to a limit), I must ensure Kafka is highly available. There's a small chance that the deletion rate ends up matching the receive rate, for example when the incoming volume is so high that the size threshold is reached before the time threshold. But I may be OK with that, because if Kafka goes down, it can cause upstream applications to fail. That can result in higher losses overall, and particularly of the most *recent* messages.
In other words, in a persistent but ephemeral message queue, I would give higher precedence to recent messages over older ones. On the flip side, by allowing Kafka to go down when a disk is full, applications are forced to deal with the issue. This adds complexity to apps, but perhaps that's not a bad thing. After all, at scale, all apps should be designed to handle failure.
Having said that, the next step is to decide which messages to delete first. I believe that's a separate issue, and it has its own complexities, too.
The main idea, though, is that a global knob would provide flexibility, even if it goes unused. From an operational perspective, if we can't ensure HA for all applications/components, it would be good if we could for at least some of the core ones, like Kafka. This is much easier said than done, though.
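To make the proposal concrete, a hypothetical broker-wide knob might look like the following. To be clear, no such property exists in Kafka 0.7; the name and semantics here are invented purely for illustration:

# HYPOTHETICAL, not a real Kafka property: cap the aggregate size of all
# partition logs on this broker; when exceeded, delete the oldest
# segments across all topics first
log.retention.total.size=1073741824000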
On May 5, 2014, at 9:16 AM, Jun Rao <jun...@gmail.com> wrote:
Yes, your understanding is correct. A global knob that controls aggregate log size may make sense. What would be the expected behavior when that limit is reached? Would you reduce the retention uniformly across all topics? Then it just means that some of the logs may not be retained as long as you want. Also, we need to think through what happens when every log has only 1 segment left and yet the total size still exceeds the limit. Do we roll log segments early?
Thanks,
Jun
On Sun, May 4, 2014 at 4:31 AM, vinh <v...@loggly.com> wrote:
Thanks Jun. So if I understand this correctly, there really is no master property to control the total aggregate size of all Kafka data files on a broker.
log.retention.size and log.file.size are great for managing data at the application level. In our case, application needs change frequently, and performance itself is an ever-evolving feature. This means various configs are constantly changing, like topics, # of partitions, etc.

What rarely changes, though, is provisioned hardware resources. So a setting to control the total aggregate size of Kafka logs (or persisted data, for better clarity) would definitely simplify things at an operational level, regardless of what happens at the application level.
On May 2, 2014, at 7:49 AM, Jun Rao <jun...@gmail.com> wrote:
log.retention.size controls the total size in a log dir (per partition). log.file.size controls the size of each log segment in the log dir.
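(For illustration, using the numbers from this thread: if log.retention.size=107374182400 (~100GB) applies to each partition's log dir, then a broker hosting two partition logs can legitimately hold ~200GB on disk, which would explain the observation further down. The two-partition count is an assumption, not something stated in the thread.)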
Thanks,
Jun
On Thu, May 1, 2014 at 9:31 PM, vinh <v...@loggly.com> wrote:
In the 0.7 docs, the descriptions for log.retention.size and log.file.size sound very much the same. In particular, both seem to apply to a single log file (or log segment file).

http://kafka.apache.org/07/configuration.html
I'm beginning to think there is no setting to control the max aggregate size of all logs. If this is correct, what would be a good approach to enforce this requirement? In my particular scenario, I have a lot of data being written to Kafka at a very high rate, so a 1TB disk can easily be filled up in 24 hrs or so. One option is to add more Kafka brokers to add more disk space to the pool, but I'd like to avoid that and see if I can simply configure Kafka to not write more than 1TB aggregate. Otherwise, Kafka will OOM and kill itself, and possibly crash the node itself because the disk is full.
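(For a sense of scale: filling a 1TB disk in 24 hrs implies a sustained ingest of roughly 1,000,000MB / 86,400s, i.e. about 12MB/s at the broker. These are illustrative round numbers, not measurements.)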
On May 1, 2014, at 9:21 PM, vinh <v...@loggly.com> wrote:
Using Kafka 0.7.2, I have the following in server.properties:
log.retention.hours=48
log.retention.size=107374182400
log.file.size=536870912
My interpretation of this is:
a) a single log segment file over 48 hrs old will be deleted
b) the total combined size of *all* logs is capped at 100GB
c) a single log segment file is limited to 512MB in size before a new segment file is spawned
d) a "log file" can be composed of many "log segment files"
But, even after setting the above, I find that the total combined size of all Kafka logs on disk is 200GB right now. Isn't log.retention.size supposed to limit it to 100GB? Am I missing something? The docs are not really clear, especially when it comes to distinguishing between a "log file" and a "log segment file".
I have disk monitoring. But like anything else in software, even monitoring can fail. Via configuration, I'd like to make sure that Kafka does not write more than the available disk space. Or something like log4j, where I can set a max number of log files and the max size per file, which essentially allows me to set a max aggregate size limit across all logs.
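For reference, the log4j pattern being described looks roughly like this (the appender name, path, and values are illustrative). With a RollingFileAppender, aggregate disk usage is bounded at about (MaxBackupIndex + 1) x MaxFileSize, here ~5.5GB:

log4j.appender.R=org.apache.log4j.RollingFileAppender
log4j.appender.R.File=/var/log/myapp.log
log4j.appender.R.MaxFileSize=500MB
log4j.appender.R.MaxBackupIndex=10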
Thanks,
-Vinh