Sorry about that; used -1/+1 to show preference, not binding action

> On Oct 28, 2021, at 5:50 AM, bened...@apache.org wrote:
> 
>> I am -1 here, for the reasons listed above; the problem (in my eye) is not 
>> reader/writer but higher level at the actual SSTable.  If we plug out 
>> read/write but still allow direct file access, then these abstractions fail 
>> to provide the goals of the CEP.
> 
> Be careful dropping -1s, as your -1s here are binding. I realise this isn’t a 
> vote thread, but the effect is the same. IMO we should try to express our 
> preferences and defer to the collective opinion where possible. True -1s 
> should very rarely appear.
> 
> 
> From: David Capwell <dcapw...@apple.com.INVALID>
> Date: Wednesday, 27 October 2021 at 15:33
> To: dev@cassandra.apache.org <dev@cassandra.apache.org>
> Subject: Re: [DISCUSS] CEP-17: SSTable format API (CASSANDRA-17056)
> Reading the CEP I don’t see any mention to the systems which access SSTables; 
> such as streaming (small callout to zero-copy-streaming with 
> ZeroCopyBigTableWriter) and repair.  If you are abstracting out 
> BigTableReader then you are not dealing with the implementation assumptions 
> that users of SSTables have (such as direct mutation to auxiliary files 
> outside of -Data.db).
> 
>> Audience
>>       • Cassandra developers who wish to see SSTableReader and SSTableWriter 
>> more modular than they are today,
> 
> This statement relates to the above comment, many parts of the code do not 
> use Reader/Writer but instead use direct format knowledge to apply changes to 
> the file format (normally outside of -Data.db); to me the interfaces has to 
> be at the SSTable level, which then expose readers/writers, but also has to 
> expose the other things we do outside of those paths.
> 
>>       • move the metrics related to sstable format out from TableMetrics 
>> class and make them tied to certain sstable implementation
> 
> I am curious about this comment, are you removing exposing this information?
> 
>>       • have a single factory for creating both readers and writers for 
>> particular implementation of sstable and use it consistently - no direct 
>> creation of any reader / writer
> 
> I am -1 here, for the reasons listed above; the problem (in my eye) is not 
> reader/writer but higher level at the actual SSTable.  If we plug out 
> read/write but still allow direct file access, then these abstractions fail 
> to provide the goals of the CEP.
> 
> I am +1 to the intent of the CEP.
> 
> And last comment, which I have also done in the other modularity thread… 
> backwards compatibility and maintenance. It is not clear right now what java 
> interfaces may not break and how we can maintain and extend such interfaces 
> in the future.  If the goal is to allow 3rd parties to plugin and offer new 
> SSTable formats, are we as a project ok with having a minor release do a 
> binary or source non-compatible change?  If not how do we detect this?  Until 
> this problem is solved, I do not think we should add any such interfaces.
> 
>> On Oct 22, 2021, at 7:23 AM, Jeremiah Jordan <jeremiah.jor...@gmail.com> 
>> wrote:
>> 
>> Hi Stefan,
>> That idea is not related to this CEP which is about the file formats of the
>> sstables, not file system access.  But you should take a look at the work
>> recently committed in https://issues.apache.org/jira/browse/CASSANDRA-16926
>> to switch to using java.nio.file.Path for file access.  This should allow
>> the use of a file system provider to access files which could be the basis
>> for work to load the files from S3.
>> 
>> -Jeremiah
>> 
>> On Fri, Oct 22, 2021 at 4:07 AM Stefan Miklosovic <
>> stefan.mikloso...@instaclustr.com> wrote:
>> 
>>> One point I would like to add to this; I was already looking into how
>>> to extend this but what I saw in SSTableReader was that it is very
>>> much "file system oriented". There was not any possibility to actually
>>> hook something like that there. I think what importing does is that it
>>> will use SSTableReader / Writer stuff so I think that the modification
>>> of these classes to accommodate this idea would be necessary.
>>> 
>>> On Fri, 22 Oct 2021 at 11:02, Stefan Miklosovic
>>> <stefan.mikloso...@instaclustr.com> wrote:
>>>> 
>>>> Hi Jacek,
>>>> 
>>>> Thanks for taking the lead on this.
>>>> 
>>>> There was importing of SSTables introduced in 4.0 via
>>>> StorageService#importNewSSTables. The "problem" with this is that
>>>> SSTables need to be physically located at disk so Cassandra can read
>>>> them. If a backup is taken and SSTables are uploaded to, for example,
>>>> S3 bucket, then upon restore, all these SSTables need to be downloaded
>>>> first and then imported. What about downloading them / importing them
>>>> directly from S3? Or any custom source for that matter? Importing of
>>>> SSTables is a very nice feature in 4.0, we do not need to copy / hard
>>>> link / refresh, it is all handled internally.
>>>> 
>>>> I am not sure if your work is related to this idea but I would
>>>> appreciate it if this is pluggable as well for the sake of simplicity
>>>> and effectiveness as we would not have to download all sstables before
>>>> importing them.
>>>> 
>>>> If it is not related, feel free to skip that completely and I guess I
>>>> would have to try to push that forward myself.
>>>> 
>>>> Regards
>>>> 
>>>> 
>>>> On Fri, 22 Oct 2021 at 10:24, Jacek Lewandowski
>>>> <lewandowski.ja...@gmail.com> wrote:
>>>>> 
>>>>> I'd like to start a discussion about SSTable format API proposal
>>> (CEP-17)
>>>>> 
>>>>> Jira: https://issues.apache.org/jira/browse/CASSANDRA-17056
>>>>> CEP:
>>> https://cwiki.apache.org/confluence/display/CASSANDRA/CEP-17%3A+SSTable+format+API
>>>>> 
>>>>> Thanks,
>>>>> Jacek
>>>>> 
>>>>> ---------------------------------------------------------------------
>>>>> To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
>>>>> For additional commands, e-mail: dev-h...@cassandra.apache.org
>>>>> 
>>> 
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
>>> For additional commands, e-mail: dev-h...@cassandra.apache.org
>>> 
>>> 
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
> For additional commands, e-mail: dev-h...@cassandra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
For additional commands, e-mail: dev-h...@cassandra.apache.org

Reply via email to