Hi everyone,
[Keeping this on the -users list for now. Let me know if I should
cross-post to -devel.]
I've been asked to help out on a Dumpling cluster (a system
bequeathed by one admin to the next, currently on 0.67.10, was
originally installed with 0.67.5 and subsequently updated a few
times),
Hi Greg,
just picked up this one from the archive while researching a different
issue and thought I'd follow up.
On Tue, Aug 19, 2014 at 6:24 PM, Gregory Farnum g...@inktank.com wrote:
The sst files are files used by leveldb to store its data; you cannot
remove them. Are you running on a very
On Tue, Sep 16, 2014 at 6:15 PM, Joao Eduardo Luis
joao.l...@inktank.com wrote:
Forcing the monitor to compact on start and restarting the mon is the
current workaround for overgrown ssts. This happens on a regular basis with
some clusters and I've not been able to track down the source. It
...@inktank.com wrote:
Not sure, but have you checked the clocks on their nodes? Extreme
clock drift often results in strange cephx errors.
-Greg
Software Engineer #42 @ http://inktank.com | http://ceph.com
On Sun, Sep 14, 2014 at 11:03 PM, Florian Haas flor...@hastexo.com wrote:
Hi everyone,
[Keeping
On Wed, Sep 17, 2014 at 1:58 PM, James Eckersall
james.eckers...@gmail.com wrote:
Hi,
I have a ceph cluster running 0.80.1 on Ubuntu 14.04. I have 3 monitors and
4 OSD nodes currently.
Everything has been running great up until today where I've got an issue
with the monitors.
I moved
Hi Craig,
just dug this up in the list archives.
On Fri, Mar 28, 2014 at 2:04 AM, Craig Lewis cle...@centraldesktop.com wrote:
In the interest of removing variables, I removed all snapshots on all pools,
then restarted all ceph daemons at the same time. This brought up osd.8 as
well.
So
On Wed, Sep 17, 2014 at 5:24 PM, Dan Van Der Ster
daniel.vanders...@cern.ch wrote:
Hi Florian,
On 17 Sep 2014, at 17:09, Florian Haas flor...@hastexo.com wrote:
Hi Craig,
just dug this up in the list archives.
On Fri, Mar 28, 2014 at 2:04 AM, Craig Lewis cle...@centraldesktop.com
wrote
On Wed, Sep 17, 2014 at 5:42 PM, Dan Van Der Ster
daniel.vanders...@cern.ch wrote:
From: Florian Haas flor...@hastexo.com
Sent: Sep 17, 2014 5:33 PM
To: Dan Van Der Ster
Cc: Craig Lewis cle...@centraldesktop.com;ceph-users@lists.ceph.com
Subject: Re: [ceph-users] RGW hung, 2 OSDs using 100
On Wed, Sep 17, 2014 at 5:21 PM, James Eckersall
james.eckers...@gmail.com wrote:
Hi,
Thanks for the advice.
I feel pretty dumb as it does indeed look like a simple networking issue.
You know how you check things 5 times and miss the most obvious one...
J
No worries at all .:)
Cheers,
Hello everyone,
Just thought I'd circle back on some discussions I've had with people
earlier in the year:
Shortly before firefly, snapshot support for CephFS clients was
effectively disabled by default at the MDS level, and can only be
enabled after accepting a scary warning that your
Hi Craig,
On Fri, Sep 19, 2014 at 2:49 AM, Craig Lewis cle...@centraldesktop.com wrote:
No, removing the snapshots didn't solve my problem. I eventually traced
this problem to XFS deadlocks caused by
[osd]
osd mkfs options xfs: -l size=1024m -n size=64k -i size=2048 -s
size=4096
On Mon, Sep 22, 2014 at 10:21 AM, Christian Balzer ch...@gol.com wrote:
The linux scheduler usually is quite decent in keeping processes where the
action is, thus you see for example a clear preference of DRBD or KVM vnet
processes to be near or on the CPU(s) where the IRQs are.
Since you're
On Fri, Sep 19, 2014 at 5:25 PM, Sage Weil sw...@redhat.com wrote:
On Fri, 19 Sep 2014, Florian Haas wrote:
Hello everyone,
Just thought I'd circle back on some discussions I've had with people
earlier in the year:
Shortly before firefly, snapshot support for CephFS clients was
effectively
On Fri, Sep 19, 2014 at 5:25 PM, Sage Weil sw...@redhat.com wrote:
On Fri, 19 Sep 2014, Florian Haas wrote:
Hello everyone,
Just thought I'd circle back on some discussions I've had with people
earlier in the year:
Shortly before firefly, snapshot support for CephFS clients was
effectively
Hi Sage,
On Tue, Oct 7, 2014 at 9:20 PM, Sage Weil s...@inktank.com wrote:
This is a release candidate for Giant, which will hopefully be out in
another week or two (s v0.86). We did a feature freeze about a month ago
and since then have been doing only stabilization and bug fixing (and a
Hi Sage,
sorry to be late to this thread; I just caught this one as I was
reviewing the Giant release notes. A few questions below:
On Mon, Oct 13, 2014 at 8:16 PM, Sage Weil s...@newdream.net wrote:
[...]
* ACLs: implemented, tested for kernel client. not implemented for
ceph-fuse.
[...]
Hi everyone,
been trying to get to the bottom of this for a few days; thought I'd
take this to the list to see if someone had insight to share.
Situation: Ceph 0.87 (Giant) cluster with approx. 250 OSDs. One set of
OSD nodes with just spinners put into one CRUSH ruleset assigned to a
spinner
Hi everyone,
I'd like to come back to a discussion from 2012 (thread at
http://marc.info/?l=ceph-develm=134808745719233) to estimate the
expected MDS memory consumption from file metadata caching. I am certain
the following is full of untested assumptions, some of which are
probably inaccurate,
On Fri, Nov 28, 2014 at 3:14 PM, Wido den Hollander w...@42on.com wrote:
On 11/28/2014 01:04 PM, Florian Haas wrote:
Hi everyone,
I'd like to come back to a discussion from 2012 (thread at
http://marc.info/?l=ceph-develm=134808745719233) to estimate the
expected MDS memory consumption from
On Fri, Nov 28, 2014 at 3:29 PM, Wido den Hollander w...@42on.com wrote:
On 11/28/2014 03:22 PM, Florian Haas wrote:
On Fri, Nov 28, 2014 at 3:14 PM, Wido den Hollander w...@42on.com wrote:
On 11/28/2014 01:04 PM, Florian Haas wrote:
Hi everyone,
I'd like to come back to a discussion from
On Tue, Feb 17, 2015 at 11:19 PM, Gregory Farnum g...@gregs42.com wrote:
On Tue, Feb 17, 2015 at 12:09 PM, Florian Haas flor...@hastexo.com wrote:
Hello everyone,
I'm seeing some OSD behavior that I consider unexpected; perhaps
someone can shed some insight.
Ceph giant (0.87.0), osd max
On Wed, Feb 18, 2015 at 9:09 PM, Brian Rak b...@gameservers.com wrote:
What does your crushmap look like (ceph osd getcrushmap -o
/tmp/crushmap; crushtool -d /tmp/crushmap)? Does your placement logic
prevent Ceph from selecting an OSD for the third replica?
Cheers,
Florian
I have 5 hosts,
On Wed, Feb 18, 2015 at 9:32 PM, Mark Nelson mnel...@redhat.com wrote:
On 02/18/2015 02:19 PM, Florian Haas wrote:
Hey everyone,
I must confess I'm still not fully understanding this problem and
don't exactly know where to start digging deeper, but perhaps other
users have seen
On Wed, Feb 18, 2015 at 10:28 PM, Oliver Schulz osch...@mpp.mpg.de wrote:
Dear Ceph Experts,
is it possible to define a Ceph user/key with privileges
that allow for read-only CephFS access but do not allow
write or other modifications to the Ceph cluster?
Warning, read this to the end, don't
On Wed, Feb 18, 2015 at 7:53 PM, Brian Rak b...@gameservers.com wrote:
We're running ceph version 0.87 (c51c8f9d80fa4e0168aa52685b8de40e42758578),
and seeing this:
HEALTH_WARN 1 pgs degraded; 1 pgs stuck degraded; 1 pgs stuck unclean; 1 pgs
stuck undersized; 1 pgs undersized
pg 4.2af is
Hello everyone,
I'm seeing some OSD behavior that I consider unexpected; perhaps
someone can shed some insight.
Ceph giant (0.87.0), osd max backfills and osd recovery max active
both set to 1.
Please take a moment to look at the following ceph health detail screen dump:
HEALTH_WARN 14 pgs
On Thu, Feb 19, 2015 at 12:50 AM, Gregory Farnum g...@gregs42.com wrote:
On Wed, Feb 18, 2015 at 3:30 PM, Florian Haas flor...@hastexo.com wrote:
On Wed, Feb 18, 2015 at 11:41 PM, Gregory Farnum g...@gregs42.com wrote:
On Wed, Feb 18, 2015 at 1:58 PM, Florian Haas flor...@hastexo.com wrote
On Wed, Feb 18, 2015 at 6:56 AM, Gregory Farnum g...@gregs42.com wrote:
On Tue, Feb 17, 2015 at 9:48 PM, Florian Haas flor...@hastexo.com wrote:
On Tue, Feb 17, 2015 at 11:19 PM, Gregory Farnum g...@gregs42.com wrote:
On Tue, Feb 17, 2015 at 12:09 PM, Florian Haas flor...@hastexo.com wrote
On Wed, Feb 18, 2015 at 10:27 PM, Florian Haas flor...@hastexo.com wrote:
On Wed, Feb 18, 2015 at 9:32 PM, Mark Nelson mnel...@redhat.com wrote:
On 02/18/2015 02:19 PM, Florian Haas wrote:
Hey everyone,
I must confess I'm still not fully understanding this problem and
don't exactly know
On Wed, Feb 18, 2015 at 11:41 PM, Gregory Farnum g...@gregs42.com wrote:
On Wed, Feb 18, 2015 at 1:58 PM, Florian Haas flor...@hastexo.com wrote:
On Wed, Feb 18, 2015 at 10:28 PM, Oliver Schulz osch...@mpp.mpg.de wrote:
Dear Ceph Experts,
is it possible to define a Ceph user/key
Hi everyone,
I always have a bit of trouble wrapping my head around how libvirt seems
to ignore ceph.conf option while qemu/kvm does not, so I thought I'd
ask. Maybe Josh, Wido or someone else can clarify the following.
http://ceph.com/docs/master/rbd/qemu-rbd/ says:
Important: If you set
On 02/27/2015 11:37 AM, Blair Bethwaite wrote:
Sorry if this is actually documented somewhere,
It is. :)
but is it possible to
create and use multiple filesystems on the data data and metadata
pools? I'm guessing yes, but requires multiple MDSs?
Nope. Every fs needs one data and one
On 02/27/2015 01:56 PM, Alexandre DERUMIER wrote:
Hi,
from qemu rbd.c
if (flags BDRV_O_NOCACHE) {
rados_conf_set(s-cluster, rbd_cache, false);
} else {
rados_conf_set(s-cluster, rbd_cache, true);
}
and
block.c
int bdrv_parse_cache_flags(const char
so directsync or none should be safe if guest does not send flush.
- Mail original -
De: Florian Haas flor...@hastexo.com mailto:flor...@hastexo.com
À: ceph-users ceph-users@lists.ceph.com
mailto:ceph-users@lists.ceph.com
Envoyé: Vendredi 27 Février
On Wed, Feb 18, 2015 at 9:19 PM, Florian Haas flor...@hastexo.com wrote:
Hey everyone,
I must confess I'm still not fully understanding this problem and
don't exactly know where to start digging deeper, but perhaps other
users have seen this and/or it rings a bell.
System info: Ceph giant
On 04/22/2015 03:38 PM, Wido den Hollander wrote:
On 04/22/2015 03:20 PM, Florian Haas wrote:
I'm not entirely sure, though, why virStorageBackendRBDCreateImage()
enables striping unconditionally; could you explain the reasoning
behind that?
When working on this with Josh some time ago we
On Wed, Apr 22, 2015 at 10:55 AM, Daniel Swarbrick
daniel.swarbr...@profitbricks.com wrote:
Hi all,
With Debian jessie scheduled to be released in a few days on April 25,
many of us will be thinking of upgrading wheezy based systems to jessie.
The Ceph packages in upstream Debian repos are
Hi everyone,
I don't think this has been posted to this list before, so just
writing it up so it ends up in the archives.
tl;dr: Using RBD storage pools with libvirt is currently broken on
Ubuntu trusty (LTS), and any other platform using libvirt 1.2.2.
In libvirt 1.2.2, the rbd_create3
Hi Ben & everyone,
just following up on this one from July, as I don't think there's been
a reply here then.
On Wed, Jul 8, 2015 at 7:37 AM, Ben Hines wrote:
> Anyone have any data on optimal # of shards for a radosgw bucket index?
>
> We've had issues with bucket index
Hey Wido,
On Dec 17, 2015 09:52, "Wido den Hollander" <w...@42on.com> wrote:
>
> On 12/17/2015 06:29 AM, Ben Hines wrote:
> >
> >
> > On Wed, Dec 16, 2015 at 11:05 AM, Florian Haas <flor...@hastexo.com
> > <mailto:flor...@hastexo.com>>
Hey everyone,
I recently got my hands on a cluster that has been underperforming in
terms of radosgw throughput, averaging about 60 PUTs/s with 70K
objects where a freshly-installed cluster with near-identical
configuration would do about 250 PUTs/s. (Neither of these values are
what I'd consider
On Mon, Dec 21, 2015 at 4:15 PM, Haomai Wang <haomaiw...@gmail.com> wrote:
>
>
> On Mon, Dec 21, 2015 at 10:55 PM, Florian Haas <flor...@hastexo.com> wrote:
>>
>> On Mon, Dec 21, 2015 at 3:35 PM, Haomai Wang <hao...@xsky.com> wrote:
>> >
>>
On Mon, Dec 21, 2015 at 9:15 PM, Ben Hines wrote:
> I'd be curious to compare benchmarks. What size objects are you putting?
As stated earlier, I ran rest-bench with 70KB objects which is a good
approximation of the average object size in the underperforming
system.
> 10gig
On Thu, Dec 17, 2015 at 6:16 PM, Florian Haas <flor...@hastexo.com> wrote:
> Hey everyone,
>
> I recently got my hands on a cluster that has been underperforming in
> terms of radosgw throughput, averaging about 60 PUTs/s with 70K
> objects where a freshly-installed cluste
On Mon, Dec 21, 2015 at 10:15 AM, Florent B wrote:
> Hi all,
>
> It seems I had an MDS crash being in standby-replay.
>
> Version is Infernalis, running on Debian Jessie (packaged version).
>
> Log is here (2.5MB) : http://paste.ubuntu.com/14126366/
>
> Has someone
On Tue, Nov 24, 2015 at 8:48 PM, Somnath Roy wrote:
> Hi Yehuda/RGW experts,
>
> I have one cluster with RGW up and running in the customer site.
>
> I did some heavy performance testing on that with CosBench and as a result
> written significant amount of data to
On Mon, Dec 21, 2015 at 10:20 AM, Wido den Hollander wrote:
>>> > Oh, and to answer this part. I didn't do that much experimentation
>>> > unfortunately. I actually am using about 24 index shards per bucket
>>> > currently and we delete each bucket once it hits about a million
On Tue, Dec 22, 2015 at 3:10 AM, Haomai Wang wrote:
>> >> >> Hey everyone,
>> >> >>
>> >> >> I recently got my hands on a cluster that has been underperforming
>> >> >> in
>> >> >> terms of radosgw throughput, averaging about 60 PUTs/s with 70K
>> >> >> objects where a
On Mon, Dec 21, 2015 at 12:36 PM, Wido den Hollander <w...@42on.com> wrote:
>
>
> On 21-12-15 10:34, Florian Haas wrote:
>> On Mon, Dec 21, 2015 at 10:20 AM, Wido den Hollander <w...@42on.com> wrote:
>>>>>> Oh, and to answer this part. I didn't do t
On Mon, Dec 21, 2015 at 3:35 PM, Haomai Wang <hao...@xsky.com> wrote:
>
>
> On Fri, Dec 18, 2015 at 1:16 AM, Florian Haas <flor...@hastexo.com> wrote:
>>
>> Hey everyone,
>>
>> I recently got my hands on a cluster that has been underperforming in
>&
Hi Yoann,
On Tue, Jun 21, 2016 at 3:11 PM, Yoann Moulin wrote:
> Hello,
>
> I found a performance drop between kernel 3.13.0-88 (default kernel on Ubuntu
> Trusty 14.04) and kernel 4.4.0.24.14 (default kernel on Ubuntu Xenial 16.04)
>
> ceph version is Jewel (10.2.2).
> All
On Wed, Jun 22, 2016 at 10:56 AM, Yoann Moulin wrote:
> Hello Florian,
>
>> On Tue, Jun 21, 2016 at 3:11 PM, Yoann Moulin wrote:
>>> Hello,
>>>
>>> I found a performance drop between kernel 3.13.0-88 (default kernel on
>>> Ubuntu
>>> Trusty 14.04) and
On Thu, Jun 23, 2016 at 9:01 AM, Yoann Moulin <yoann.mou...@epfl.ch> wrote:
> Le 23/06/2016 08:25, Sarni Sofiane a écrit :
>> Hi Florian,
>>
>
>
>> On 23.06.16 06:25, "ceph-users on behalf of Florian Haas"
>> <ceph-users-boun...@li
Hi everyone,
we recently worked a bit on running a full static website just on
radosgw (akin to
http://docs.aws.amazon.com/AmazonS3/latest/dev/WebsiteHosting.html),
and didn't find a good how-to writeup out there. So we did a bit of
fiddling with radosgw and HAproxy, and wrote one:
On Tue, Jan 26, 2016 at 11:46 PM, Yehuda Sadeh-Weinraub
<yeh...@redhat.com> wrote:
> On Tue, Jan 26, 2016 at 2:37 PM, Florian Haas <flor...@hastexo.com> wrote:
>> On Tue, Jan 26, 2016 at 8:56 PM, Wido den Hollander <w...@42on.com> wrote:
>>> On 01/26/2016 0
On Wed, Jan 27, 2016 at 12:00 AM, Robin H. Johnson <robb...@gentoo.org> wrote:
> On Tue, Jan 26, 2016 at 11:51:51PM +0100, Florian Haas wrote:
>> Hey, slick. Thanks! Out of curiosity, does the wip branch correctly
>> handle Accept-Encoding: gzip?
> No, Accept-Encoding is N
On Thu, Apr 7, 2016 at 10:09 PM, German Anders wrote:
> also jewel does not supposed to get more 'performance', since it used
> bluestore in order to store metadata. Or do I need to specify during install
> to use bluestore?
Do the words "enable experimental unrecoverable
Hi everyone,
I wonder what others think about the following suggestion: running an
even number of mons almost never makes sense, and specifically two
mons never does at all. Wouldn't it make sense to just flag a
HEALTH_WARN state if the monmap contained an even number of mons, or
maybe only if
On Tue, Apr 12, 2016 at 11:53 AM, Simon Ferber
wrote:
> Thank you! That's it. I have installed the Kernel from the Jessie
> backport. Now the crashes are gone.
> How often do these things happen? It would be a worst case scenario, if
> a system update breaks a
On Fri, Apr 1, 2016 at 2:48 PM, Wido den Hollander wrote:
> Somehow the PG got corrupted on one of the OSDs and it kept crashing on a
> single
> object.
Vaguely reminds me of the E2BIG from that one issue way-back-when in
Dumpling
On Sun, May 8, 2016 at 11:57 PM, Robert Sander
wrote:
> Am 29.04.2016 um 17:11 schrieb Robert Sander:
>
>> As the backfilling with the full weight of the new OSDs would have run
>> for more than 28h and no VM was usable we re-weighted the new OSDs to
>> 0.1. The
On Wed, Mar 15, 2017 at 2:41 AM, Alex Gorbachev <a...@iss-integration.com>
wrote:
> On Mon, Mar 13, 2017 at 6:09 AM, Florian Haas <flor...@hastexo.com> wrote:
>> On Mon, Mar 13, 2017 at 11:00 AM, Dan van der Ster <d...@vanderster.com>
>> wrote:
>>>>
On Sat, Mar 11, 2017 at 12:21 PM, wrote:
> The upgrade of our biggest cluster, nr 4, did not go without
> problems. Since we where expecting a lot of "failed to encode map
> e with expected crc" messages, we disabled clog to monitors
> with 'ceph tell osd.* injectargs
On Sat, Mar 11, 2017 at 4:24 PM, Laszlo Budai wrote:
>>> Can someone explain the meaning of osd_disk_thread_ioprio_priority. I'm
>>> [...]
>>>
>>> Now I am confused :(
>>>
>>> Can somebody bring some light here?
>>
>>
>> Only to confuse you some more. If you are
On Mon, Mar 13, 2017 at 11:00 AM, Dan van der Ster wrote:
>> I'm sorry, I may have worded that in a manner that's easy to
>> misunderstand. I generally *never* suggest that people use CFQ on
>> reasonably decent I/O hardware, and thus have never come across any
>> need to set
On Sun, Mar 12, 2017 at 9:07 PM, Laszlo Budai wrote:
> Hi Florian,
>
> thank you for your answer.
>
> We have already set the IO scheduler to cfq in order to be able to lower the
> priority of the scrub operations.
> My problem is that I've found different values set for
Hi everyone,
so this will be a long email — it's a summary of several off-list
conversations I've had over the last couple of weeks, but the TL;DR
version is this question:
How can a Ceph cluster maintain near-constant performance
characteristics while supporting a steady intake of a large
Hello everyone,
I'm trying to get a handle on the current state of the async messenger's
RDMA transport in Luminous, and I've noticed that the information
available is a little bit sparse (I've found
https://community.mellanox.com/docs/DOC-2693 and
https://community.mellanox.com/docs/DOC-2721,
On Thu, Sep 14, 2017 at 2:47 AM, Josh Durgin <jdur...@redhat.com> wrote:
> On 09/13/2017 03:40 AM, Florian Haas wrote:
>>
>> So we have a client that is talking to OSD 30. OSD 30 was never down;
>> OSD 17 was. OSD 30 is also the preferred primary for this PG (via
>&g
On Thu, Sep 14, 2017 at 3:15 AM, Brad Hubbard <bhubb...@redhat.com> wrote:
> On Wed, Sep 13, 2017 at 8:40 PM, Florian Haas <flor...@hastexo.com> wrote:
>> Hi everyone,
>>
>>
>> disclaimer upfront: this was seen in the wild on Hammer, and on 0.94.7
On Fri, Sep 15, 2017 at 8:58 AM, Josh Durgin wrote:
>> OK, maybe the "also" can be removed to reduce potential confusion?
>
>
> Sure
That'd be great. :)
>> - We have a bunch of objects that need to be recovered onto the
>> just-returned OSD(s).
>> - Clients access some of
On Mon, Sep 18, 2017 at 8:48 AM, Christian Theune wrote:
> Hi Josh,
>
>> On Sep 16, 2017, at 3:13 AM, Josh Durgin wrote:
>>
>> (Sorry for top posting, this email client isn't great at editing)
>
> Thanks for taking the time to respond. :)
>
>> The
On Mon, Sep 18, 2017 at 2:02 PM, Christian Theune wrote:
> Hi,
>
> and here’s another update which others might find quite interesting.
>
> Florian and I spend some time discussing the issue further, face to face. I
> had one switch that I brought up again
On Fri, Sep 15, 2017 at 10:37 PM, Josh Durgin wrote:
>> So this affects just writes. Then I'm really not following the
>> reasoning behind the current behavior. Why would you want to wait for
>> the recovery of an object that you're about to clobber anyway? Naïvely
>> thinking
>> > Okay, in that case I've no idea. What was the timeline for the recovery
>> > versus the rados bench and cleanup versus the degraded object counts,
>> > then?
>>
>> 1. Jewel deployment with filestore.
>> 2. Upgrade to Luminous (including mgr deployment and "ceph osd
>> require-osd-release
On Thu, Oct 12, 2017 at 7:56 PM, Gregory Farnum <gfar...@redhat.com> wrote:
>
>
> On Thu, Oct 12, 2017 at 10:52 AM Florian Haas <flor...@hastexo.com> wrote:
>>
>> On Thu, Oct 12, 2017 at 7:22 PM, Gregory Farnum <gfar...@redhat.com>
>> wrote:
>>
On Mon, Sep 11, 2017 at 8:13 PM, Andreas Herrmann wrote:
> Hi,
>
> how could this happen:
>
> pgs: 197528/1524 objects degraded (12961.155%)
>
> I did some heavy failover tests, but a value higher than 100% looks strange
> (ceph version 12.2.0). Recovery is quite slow.
>
> In our use case, we are severly hampered by the size of removed_snaps
> (50k+) in the OSDMap to the point were ~80% of ALL cpu time is spent in
> PGPool::update and its interval calculation code. We have a cluster of
> around 100k RBDs with each RBD having upto 25 snapshots and only a small
>
Hi everyone,
with the Luminous release out the door and the Labor Day weekend over,
I hope I can kick off a discussion on another issue that has irked me
a bit for quite a while. There doesn't seem to be a good documented
answer to this: what are Ceph's real limits when it comes to RBD
snapshots?
Hi everyone,
disclaimer upfront: this was seen in the wild on Hammer, and on 0.94.7
no less. Reproducing this on 0.94.10 is a pending process, and we'll
update here with findings, but my goal with this post is really to
establish whether the behavior as seen is expected, and if so, what
the
Hi Greg,
thanks for your insight! I do have a few follow-up questions.
On 09/05/2017 11:39 PM, Gregory Farnum wrote:
>> It seems to me that there still isn't a good recommendation along the
>> lines of "try not to have more than X snapshots per RBD image" or "try
>> not to have more than Y
On Mon, Sep 11, 2017 at 8:27 PM, Mclean, Patrick
wrote:
>
> On 2017-09-08 06:06 PM, Gregory Farnum wrote:
> > On Fri, Sep 8, 2017 at 5:47 PM, Mclean, Patrick
> > wrote:
> >
> >> On a related note, we are very curious why the snapshot id is
> >>
On Mon, Aug 28, 2017 at 4:21 PM, Haomai Wang <hao...@xsky.com> wrote:
> On Wed, Aug 23, 2017 at 1:26 AM, Florian Haas <flor...@hastexo.com> wrote:
>> Hello everyone,
>>
>> I'm trying to get a handle on the current state of the async messenger's
>> RDMA
Sorry, I worded my questions poorly in the last email, so I'm asking
for clarification here:
On Mon, Aug 28, 2017 at 6:04 PM, Haomai Wang <hao...@xsky.com> wrote:
> On Mon, Aug 28, 2017 at 7:54 AM, Florian Haas <flor...@hastexo.com> wrote:
>> On Mon, Aug 28, 2017 at 4:21
On Thu, Oct 12, 2017 at 7:22 PM, Gregory Farnum <gfar...@redhat.com> wrote:
>
>
> On Thu, Oct 12, 2017 at 3:50 AM Florian Haas <flor...@hastexo.com> wrote:
>>
>> On Mon, Sep 11, 2017 at 8:13 PM, Andreas Herrmann <andr...@mx20.org>
>> wr
On 13/12/2018 15:10, Mark Nelson wrote:
> Hi Florian,
>
> On 12/13/18 7:52 AM, Florian Haas wrote:
>> On 02/12/2018 19:48, Florian Haas wrote:
>>> Hi Mark,
>>>
>>> just taking the liberty to follow up on this one, as I'd really like to
>>> get
On 02/12/2018 19:48, Florian Haas wrote:
> Hi Mark,
>
> just taking the liberty to follow up on this one, as I'd really like to
> get to the bottom of this.
>
> On 28/11/2018 16:53, Florian Haas wrote:
>> On 28/11/2018 15:52, Mark Nelson wrote:
>>> Option
Hi everyone,
We have a Luminous cluster (12.2.10) on Ubuntu Xenial, though we have
also observed the same behavior on 12.2.7 on Bionic (download.ceph.com
doesn't build Luminous packages for Bionic, and 12.2.7 is the latest
distro build).
The primary use case for this cluster is radosgw. 6 OSD
On 05/12/2018 17:35, Maxime Guyot wrote:
> Hi Florian,
>
> Thanks for the help. I did further testing and narrowed it down to
> objects that have been uploaded when the bucket has versioning enabled.
> Objects created before that are not affected: all metadata operations
> are still possible.
>
Hi Mark,
On 04/12/2018 04:41, Mark Kirkwood wrote:
> Hi,
>
> I've set up a Luminous RGW with Keystone integration, and subsequently set
>
> rgw keystone implicit tenants = true
>
> So now all newly created users/tenants (or old ones that never accessed
> RGW) get their own namespaces. However
Hi Mark,
just taking the liberty to follow up on this one, as I'd really like to
get to the bottom of this.
On 28/11/2018 16:53, Florian Haas wrote:
> On 28/11/2018 15:52, Mark Nelson wrote:
>> Option("bluestore_default_buffered_read", Option::TYPE_BOOL,
>&g
On 28/11/2018 19:06, Maxime Guyot wrote:
> Hi Florian,
>
> You assumed correctly, the "test" container (private) was created with
> the "openstack container create test", then I am using the S3 API to
> enable/disable object versioning on it.
> I use the following Python snippet to enable/disable
On 05/12/2018 23:08, Mark Kirkwood wrote:
> Hi, another question relating to multi tenanted RGW.
>
> Let's do the working case 1st. For a user that still uses the global
> namespace, if I set a bucket as world readable (header
> "X-Container-Read: .r:*") then I can fetch objects from the bucket
On 19/11/2018 16:23, Florian Haas wrote:
> Hi everyone,
>
> I've recently started a documentation patch to better explain Swift
> compatibility and OpenStack integration for radosgw; a WIP PR is at
> https://github.com/ceph/ceph/pull/25056/. I have, however, run into an
>
On 27/11/2018 20:28, Maxime Guyot wrote:
> Hi,
>
> I'm running into an issue with the RadosGW Swift API when the S3 bucket
> versioning is enabled. It looks like it silently drops any metadata sent
> with the "X-Object-Meta-foo" header (see example below).
> This is observed on a Luminous 12.2.8
On 28/11/2018 15:52, Mark Nelson wrote:
>> Shifting over a discussion from IRC and taking the liberty to resurrect
>> an old thread, as I just ran into the same (?) issue. I see
>> *significantly* reduced performance on RBD reads, compared to writes
>> with the same parameters. "rbd bench
On 14/08/2018 15:57, Emmanuel Lacour wrote:
> Le 13/08/2018 à 16:58, Jason Dillaman a écrit :
>>
>> See [1] for ways to tweak the bluestore cache sizes. I believe that by
>> default, bluestore will not cache any data but instead will only
>> attempt to cache its key/value store and metadata.
>
>
On 18/11/2018 22:08, Dilip Renkila wrote:
> Hi all,
>
> We are provisioning openstack swift api though ceph rgw (mimic). We have
> problems when trying to create two containers in two projects of same
> name. After scraping web, i came to know that i have to enable
>
> *
Hi everyone,
I've recently started a documentation patch to better explain Swift
compatibility and OpenStack integration for radosgw; a WIP PR is at
https://github.com/ceph/ceph/pull/25056/. I have, however, run into an
issue that I would really *like* to document, except I don't know
whether
Hi Mohamad!
On 31/12/2018 19:30, Mohamad Gebai wrote:
> On 12/31/18 4:51 AM, Marcus Murwall wrote:
>> What you say does make sense though as I also get the feeling that the
>> osds are just waiting for something. Something that never happens and
>> the request finally timeout...
>
> So the OSDs
1 - 100 of 109 matches
Mail list logo