Re: isCached
Thanks for the info

On Fri, Sep 1, 2017 at 12:06 PM, Nick Pentreath wrote:

> No, unfortunately not - as I recall storageLevel accesses some private
> methods to get the result.
>
> On Fri, 1 Sep 2017 at 17:55, Nathan Kronenfeld wrote:
>
>> Ah, in 2.1.0.
>>
>> I'm in 2.0.1 at the moment... is there any way that works that far back?
>>
>> On Fri, Sep 1, 2017 at 11:46 AM, Nick Pentreath wrote:
>>
>>> Dataset does have storageLevel. So you can use isCached = (storageLevel
>>> != StorageLevel.NONE) as a test.
>>>
>>> Arguably isCached could be added to dataset too, shouldn't be a
>>> controversial change.
>>>
>>> On Fri, 1 Sep 2017 at 17:31, Nathan Kronenfeld wrote:
>>>
>>>> I'm currently porting some of our code from RDDs to Datasets.
>>>>
>>>> With RDDs it's pretty easy to figure out if they are cached or not.
>>>>
>>>> I notice that the catalog has a function for determining this on
>>>> Datasets too, but it's private[sql]. Is there any reason for it not to be
>>>> public? Is there any way at the moment to determine if a dataset is cached
>>>> or not?
>>>>
>>>> Thanks in advance
>>>> -Nathan Kronenfeld
Re: isCached
No, unfortunately not - as I recall storageLevel accesses some private methods to get the result.

On Fri, 1 Sep 2017 at 17:55, Nathan Kronenfeld wrote:

> Ah, in 2.1.0.
>
> I'm in 2.0.1 at the moment... is there any way that works that far back?
>
> On Fri, Sep 1, 2017 at 11:46 AM, Nick Pentreath wrote:
>
>> Dataset does have storageLevel. So you can use isCached = (storageLevel
>> != StorageLevel.NONE) as a test.
>>
>> Arguably isCached could be added to dataset too, shouldn't be a
>> controversial change.
>>
>> On Fri, 1 Sep 2017 at 17:31, Nathan Kronenfeld wrote:
>>
>>> I'm currently porting some of our code from RDDs to Datasets.
>>>
>>> With RDDs it's pretty easy to figure out if they are cached or not.
>>>
>>> I notice that the catalog has a function for determining this on
>>> Datasets too, but it's private[sql]. Is there any reason for it not to be
>>> public? Is there any way at the moment to determine if a dataset is cached
>>> or not?
>>>
>>> Thanks in advance
>>> -Nathan Kronenfeld
Re: isCached
Ah, in 2.1.0.

I'm in 2.0.1 at the moment... is there any way that works that far back?

On Fri, Sep 1, 2017 at 11:46 AM, Nick Pentreath wrote:

> Dataset does have storageLevel. So you can use isCached = (storageLevel !=
> StorageLevel.NONE) as a test.
>
> Arguably isCached could be added to dataset too, shouldn't be a
> controversial change.
>
> On Fri, 1 Sep 2017 at 17:31, Nathan Kronenfeld wrote:
>
>> I'm currently porting some of our code from RDDs to Datasets.
>>
>> With RDDs it's pretty easy to figure out if they are cached or not.
>>
>> I notice that the catalog has a function for determining this on Datasets
>> too, but it's private[sql]. Is there any reason for it not to be public?
>> Is there any way at the moment to determine if a dataset is cached or not?
>>
>> Thanks in advance
>> -Nathan Kronenfeld
Re: isCached
Dataset does have storageLevel. So you can use isCached = (storageLevel != StorageLevel.NONE) as a test.

Arguably isCached could be added to dataset too, shouldn't be a controversial change.

On Fri, 1 Sep 2017 at 17:31, Nathan Kronenfeld wrote:

> I'm currently porting some of our code from RDDs to Datasets.
>
> With RDDs it's pretty easy to figure out if they are cached or not.
>
> I notice that the catalog has a function for determining this on Datasets
> too, but it's private[sql]. Is there any reason for it not to be public?
> Is there any way at the moment to determine if a dataset is cached or not?
>
> Thanks in advance
> -Nathan Kronenfeld
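
For reference, the storageLevel test suggested above might look roughly like the following in Scala. This is only a sketch, assuming Spark 2.1.0 or later (where Dataset.storageLevel is public) and a local SparkSession. The helper name isCached, the object name and the sample data are made up for illustration; none of them are part of the Spark API.

```scala
import org.apache.spark.sql.{Dataset, SparkSession}
import org.apache.spark.storage.StorageLevel

object IsCachedSketch {
  // Hypothetical helper, not a Spark method: a Dataset is treated as cached
  // when its storage level is anything other than StorageLevel.NONE.
  // Relies on Dataset.storageLevel, which is only public from Spark 2.1.0 on.
  def isCached(ds: Dataset[_]): Boolean =
    ds.storageLevel != StorageLevel.NONE

  def main(args: Array[String]): Unit = {
    // Local session purely for illustration.
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("isCached-sketch")
      .getOrCreate()
    import spark.implicits._

    val ds = Seq(1, 2, 3).toDS()

    println(s"before cache():    ${isCached(ds)}")

    // cache() only marks the Dataset for caching; the data is materialized
    // lazily, on the first action that touches it.
    ds.cache()
    println(s"after cache():     ${isCached(ds)}")

    ds.unpersist()
    println(s"after unpersist(): ${isCached(ds)}")

    spark.stop()
  }
}
```

Note that this checks whether caching has been requested for the Dataset, not whether any blocks have actually been materialized yet. It does not help on 2.0.x, where storageLevel is not available on Dataset.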