chineking created ARROW-2879:
--------------------------------

             Summary: Arrow plasma can only use a small part of specified 
shared memory
                 Key: ARROW-2879
                 URL: https://issues.apache.org/jira/browse/ARROW-2879
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
            Reporter: chineking


Hi, thanks for the great job of arrow, it helps us a lot.

However, we encounter a problem when we were using plasma.

The sample code:
{code:python}
import numpy as np
import pyarrow as pa
import pyarrow.plasma as plasma

client = plasma.connect("/tmp/plasma", "", 0)

puts = []
nbytes = 0
while True:
    a = np.ones((1000, 1000))
    try:
        oid = client.put(a)
        puts.append(client.get(oid))
        nbytes += a.nbytes
    except pa.lib.PlasmaStoreFull:
        print('use nbytes', nbytes)
        break
{code}

We start a plasma store with 1G memory, but the nbytes output above is only 
496000000, which cannot even reach half of the memory we specified.

I cannot figure out why plasma can only use such a small part of shared memory. 
Could anybody help me? Thanks a lot.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to