I am reading a parquet file using Spark dataframe as df

I want to write some part of this data to ignite cache

Assume I want to write df2 to the cache

I used df2.write.format('ignite')
Is there a better way to do this or this is the only way??

Regards
Arunima

On Mon, 3 Jul, 2023, 1:19 pm Stephen Darlington, <
stephen.darling...@gridgain.com> wrote:

> Commercial options are available, but otherwise help would generally be
> limited to email lists and Stack Overflow.
>
> On 1 Jul 2023, at 06:59, Arunima Barik <arunimabari...@gmail.com> wrote:
>
> Are there any provisions wherein I can discuss about my project
> implementation with someone from the Ignite team to clarify some doubts?
>
> Preferably through a small online meet?
>
> Regards
> Arunima
>
> On Sat, 1 Jul, 2023, 12:03 am Jeremy McMillan, <
> jeremy.mcmil...@gridgain.com> wrote:
>
>> Python doesn't at this time go anywhere near Ignite CacheStore. You would
>> need to implement the CacheStore in Java or some other language which
>> compiles to JVM runtime/jar. There's a talk from the most recent summit on
>> using Groovy, if you want a higher level language than Java, but
>> theoretically you could use Jython (if you are willing to experiment and
>> can find a compatible JVM that runs both Ignite and Jython).
>>
>> Ignite can operate like a federated query proxy if different caches are
>> implemented with different external persistence for each cache. CacheStore
>> is the interface Ignite would use to send a cache miss to a backend
>> database. In your original question you intended to use Parquet files as a
>> backend database, but Ignite does not (yet) provide one for Parquet. If
>> someone were to donate a supportable Java implementation, I suspect the
>> community would adopt and support it. Since Parquet is columnar, I also
>> suspect it would need to target Ignite 3 to adopt conventions around
>> columnar data, and then might be backported to Ignite 2.
>>
>>
>> On Fri, Jun 30, 2023 at 12:13 PM Arunima Barik <arunimabari...@gmail.com>
>> wrote:
>>
>>> Which do you think would be a better option?
>>>
>>> Federated queries or CacheStore
>>>
>>> And is CacheStore supported in Python?
>>>
>>> On Fri, 30 Jun, 2023, 1:50 pm Stephen Darlington, <
>>> stephen.darling...@gridgain.com> wrote:
>>>
>>>> You’d need to implement your own Cache Store.
>>>> https://ignite.apache.org/docs/latest/persistence/custom-cache-store
>>>>
>>>> On 30 Jun 2023, at 06:46, Arunima Barik <arunimabari...@gmail.com>
>>>> wrote:
>>>>
>>>>
>>>> ---------- Forwarded message ---------
>>>> From: Arunima Barik <arunimabari...@gmail.com>
>>>> Date: Fri, 30 Jun, 2023, 10:52 am
>>>> Subject: Ignite for Parquet files
>>>> To: <user@ignite.apache.org>
>>>>
>>>>
>>>> Hello Team
>>>>
>>>> I have my data stored as parquet files. I want a caching layer on top
>>>> of this existing file system. I am going to use Ignite for that but I do
>>>> not need native persistence for that.
>>>>
>>>> I want that any changes to database should be reflected in both cache
>>>> and file.
>>>> And same for read queries. It should automatically read from disk if
>>>> data is not present in cache.
>>>>
>>>> I want to do all this is python. Please let me know how the same can be
>>>> done.
>>>> Resources if any as well.
>>>>
>>>> Thank you and looking forward to hearing from you.
>>>>
>>>> Regard,
>>>> Arunima Barik
>>>>
>>>>
>>>>
>

Reply via email to