RE: Notes from a discussion a design to allow EC overwrites

2015-11-13 Thread Allen Samuels
I suspect could be punted to a subsequent release (i.e., I'm out of large blocks, but there's plenty of fragmented available space -- This can happen, but's a pretty pathological case which becomes rare-er and rare-er as you scale-out) Allen Samuels Software Architect, Emerging

RE: Question about how rebuild works.

2015-11-06 Thread Allen Samuels
rebuild. The average case would be 1.5x and this is inverse with the MTTDL, i.e., this behavior cuts the MTTDL in half. Allen Samuels Software Architect, Fellow, Systems and Software Solutions 2880 Junction Avenue, San Jose, CA 95134 T: +1 408 801 7030| M: +1 408 780 6416 allen.samu

Question about how rebuild works.

2015-11-05 Thread Allen Samuels
I have a question about rebuild in the following situation: I have a pool with 3x replication. For one particular PG we'll designate the active OSD set as [1,2,3] with 1 as the primary. Assume 2 and 3 crash with a TOTAL loss of local data. 2 restarts, fiddles about and then start the backfill pro

RE: newstore direction

2015-10-22 Thread Allen Samuels
How would this kind of split affect small transactions? Will each split be separately transactionally consistent or is there some kind of meta-transaction that synchronizes each of the splits? Allen Samuels Software Architect, Fellow, Systems and Software Solutions 2880 Junction Avenue, San

RE: newstore direction

2015-10-21 Thread Allen Samuels
trend toward thread-per-core software development will also tend to support the "do it in user-space" trend. That's because most of the kernel and file-system interface is architected around the blocking "thread-per-IOP" model and is unlikely to change in the future.

RE: newstore direction

2015-10-21 Thread Allen Samuels
I am pushing internally to open-source ZetaScale. Recent events may or may not affect that trajectory -- stay tuned. Allen Samuels Software Architect, Fellow, Systems and Software Solutions 2880 Junction Avenue, San Jose, CA 95134 T: +1 408 801 7030| M: +1 408 780 6416 allen.samu

RE: newstore direction

2015-10-21 Thread Allen Samuels
o the equation the "on top of an FS" path doesn't look like such a clear winner. Allen Samuels Software Architect, Fellow, Systems and Software Solutions 2880 Junction Avenue, San Jose, CA 95134 T: +1 408 801 7030| M: +1 408 780 6416 allen.samu...@sandisk.com -Original Me

RE: newstore direction

2015-10-21 Thread Allen Samuels
IOPS OR huge amounts of DRAM. Regardless of the choice, you'll see a significant degradation of performance while the scrub is ongoing -- which is one of the biggest problems with clustered systems (expensive and extensive maintenance operations). Allen Samuels Software Architect

RE: newstore direction

2015-10-21 Thread Allen Samuels
plete roadblock. Just my experience. YMMV. Allen Samuels Software Architect, Fellow, Systems and Software Solutions 2880 Junction Avenue, San Jose, CA 95134 T: +1 408 801 7030| M: +1 408 780 6416 allen.samu...@sandisk.com -Original Message- From: Ric Wheeler [mailto:rwhee...@re

RE: newstore direction

2015-10-21 Thread Allen Samuels
y decreases. You can't avoid (2) as long as you're using a file system. Yes an LSM tree performs better on HDD than does a B-tree, which is a good argument for keeping the KV module pluggable. Allen Samuels Software Architect, Fellow, Systems and Software Solutions 2880 Junction A

RE: newstore direction

2015-10-21 Thread Allen Samuels
retty much required for deep scrubbing. Allen Samuels Software Architect, Fellow, Systems and Software Solutions 2880 Junction Avenue, San Jose, CA 95134 T: +1 408 801 7030| M: +1 408 780 6416 allen.samu...@sandisk.com -Original Message- From: ceph-devel-ow...@vger.kernel.org [mailto:cep

RE: loadable objectstore

2015-09-14 Thread Allen Samuels
Yes, I'm referring to the C++ vtable. Allen Samuels Software Architect, Emerging Storage Solutions 2880 Junction Avenue, Milpitas, CA 95134 T: +1 408 801 7030| M: +1 408 780 6416 allen.samu...@sandisk.com -Original Message- From: James (Fei) Liu-SSI [mailto:james@ssi.samsun

RE: loadable objectstore

2015-09-12 Thread Allen Samuels
h the vtbl which is loaded from a known constant offset in the object). Allen Samuels Chief Software Architect, Emerging Storage Solutions 951 SanDisk Drive, Milpitas, CA 95035 T: +1 408 801 7030| M: +1 408 780 6416 allen.samu...@sandisk.com -Original Message- From: cep

RE: Inline dedup/compression

2015-08-20 Thread Allen Samuels
I was referring strictly to compression. Dedupe is a whole 'nother issue. I agree that dedupe on a per-OSD basis isn't interesting. It needs to be done at the pool level (or higher). Allen Samuels Software Architect, Systems and Software Solutions 2880 Junction Avenue, San Jose,

RE: Inline dedup/compression

2015-08-20 Thread Allen Samuels
and chunksize. That would also provide backward compatibility and allow per-object compression diversity. Then you'd want to add verbiage to the individual access schemes to allow/disallow compression. For file systems you'd want that on a per-directory basis or perhaps even better a set o

RE: Ceph Hackathon: More Memory Allocator Testing

2015-08-19 Thread Allen Samuels
/jemalloc -- oops it uses more memory discussion will go away. Allen Samuels Software Architect, Systems and Software Solutions 2880 Junction Avenue, San Jose, CA 95134 T: +1 408 801 7030| M: +1 408 780 6416 allen.samu...@sandisk.com -Original Message- From: ceph-devel-ow...@vger.ker

RE: The design of the eviction improvement

2015-07-22 Thread Allen Samuels
u get high counts of small objects. I agree that paying $ for RAM that translates into actual performance isn't really a problem. It really boils down to your workload and access pattern. Allen Samuels Software Architect, Systems and Software Solutions 2880 Junction Avenue, San Jose, CA 9

RE: The design of the eviction improvement

2015-07-22 Thread Allen Samuels
Don't we need to double-index the data structure? We need it indexed by atime for the purposes of eviction, but we need it indexed by object name for the purposes of updating the list upon a usage. Allen Samuels Software Architect, Systems and Software Solutions 2880 Junction Avenue

RE: The design of the eviction improvement

2015-07-22 Thread Allen Samuels
ely to be used is flushed to storage with some mechanism that allows batched updates. Allen Samuels Software Architect, Systems and Software Solutions 2880 Junction Avenue, San Jose, CA 95134 T: +1 408 801 7030| M: +1 408 780 6416 allen.samu...@sandisk.com -Original Message- From: ceph-de

RE: The design of the eviction improvement

2015-07-20 Thread Allen Samuels
This seems much better than the current mechanism. Do you have an estimate of the memory consumption of the two lists? (In terms of bytes/object?) Allen Samuels Software Architect, Systems and Software Solutions 2880 Junction Avenue, San Jose, CA 95134 T: +1 408 801 7030| M: +1 408 780 6416

RE: Inline dedup/compression

2015-07-02 Thread Allen Samuels
ince small objects are replicated rather than ECed). This will have a massive impact on backend storage I/O as the basic data/metadata ratio is complete skewed (both for static storage and dynamic I/O count). Allen Samuels Software Architect, Emerging Storage Solutions 2880 Junction Avenue, Mi

RE: Inline dedup/compression

2015-06-30 Thread Allen Samuels
This covers the read and write, what about the delete? One of the major issues with Dedupe, whether global or local is to address the inherent ref-counting associated with sharing of pieces of storage. Allen Samuels Software Architect, Emerging Storage Solutions 2880 Junction Avenue, Milpitas

RE: Ceph tier’ing enhancements blue print for jewel

2015-06-16 Thread Allen Samuels
> -Original Message- > From: Gregory Farnum [mailto:g...@gregs42.com] > Sent: Wednesday, June 10, 2015 12:17 PM > To: Shishir Gowda > Cc: ceph-devel@vger.kernel.org > Subject: Re: Ceph tier’ing enhancements blue print for jewel > > On Tue, Jun 9, 2015 at 7:52 PM, Shishir Gowda > wrote: >

RE: Regarding key/value interface

2014-10-03 Thread Allen Samuels
good thing. Allen Samuels Chief Software Architect, Emerging Storage Solutions 951 SanDisk Drive, Milpitas, CA 95035 T: +1 408 801 7030| M: +1 408 780 6416 allen.samu...@sandisk.com -Original Message- From: ceph-devel-ow...@vger.kernel.org [mailto:ceph-devel-ow...@vger.kernel.org] On

RE: Regarding key/value interface

2014-09-11 Thread Allen Samuels
Another thing we're looking into is compression. The intersection of compression and object striping (fracturing) is interesting. Is the striping variable on a per-object basis? Allen Samuels Chief Software Architect, Emerging Storage Solutions 951 SanDisk Drive, Milpitas, CA 95035 T: +

RE: [Feature]Proposal for adding a new flag named shared to support performance and statistic purpose

2014-06-05 Thread Allen Samuels
You talk about restting the object map on a restart after a crash -- I assume you mean rebuilding, how long will this take? --- The true mystery of the world is the visible, not the invisible. Oscar Wilde (1854 - 1900) Allen Samuels

RE: RBD thoughts

2014-05-07 Thread Allen Samuels
move. --- Now I know what a statesman is; he's a dead politician. We need more statesmen. Bob Edwards Allen Samuels Chief Software Architect, Emerging Storage Solutions 951 SanDisk Drive, Milpitas, CA 95035 T: +1 408 801 7030| M: +1 408 780 6416 allen.samu...@sandisk.com ---

RE: RBD thoughts

2014-05-07 Thread Allen Samuels
a simpler implementation task. --- Never put off until tomorrow what you can do the day after tomorrow. Mark Twain Allen Samuels Chief Software Architect, Emerging Storage Solutions 951 SanDisk Drive, Milpitas, CA 95035 T: +1 408 801 7030| M: +1 40