Re: [Numpy-discussion] fromstring, tostring slow?

Christopher Barker Tue, 13 Feb 2007 13:37:57 -0800

Mark Janikas wrote:

I am finding that directly packing numpy arrays into binary using thetostring and fromstring methods

For starters, use fromfile and tofile, to save the overhead of creatingan entire extra string.


fromfile is a function (as it is an alternate constructor for arrays):

numpy.fromfile()

ndarray.tofile() is an array method.

Enclosed is your test, including a test for tofile(), I needed to makethe arrays much larger, and use time.time() rather than time.clock() toget enough time resolution to see anything, though if you really want tobe accurate, you need to use the timeit module.


My results:
Using lists 0.457561016083
Using tostring 0.00922703742981
Using tofile 0.00431108474731

Another note: where is the data coming from -- there may be ways tooptimize this whole process if we saw that.


-Chris



--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

[EMAIL PROTECTED]

#!/usr/bin/env python

from numpy import *
import numpy.random as RAND
import time as TIME

x = RAND.random(100000)
xl = x.tolist()

t1 = TIME.time()
xStringOut = [ str(i) for i in xl ]
xStringOut = " ".join(xStringOut)
f = file('blah.dat','w'); f.write(xStringOut)
t2 = TIME.time()
total = t2 - t1
print "Using lists", total

t1 = TIME.time()
f = file('blah.bwt','wb')
xBinaryOut = x.tostring()
f.write(xBinaryOut)
t2 = TIME.time()
total1 = t2 - t1
print "Using tostring", total1

t1 = TIME.time()
f = file('blah.bwt','wb')
x.tofile(f)
t2 = TIME.time()
total1 = t2 - t1
print "Using tofile", total1

_______________________________________________
Numpy-discussion mailing list
Numpy-discussion@scipy.org
http://projects.scipy.org/mailman/listinfo/numpy-discussion

Re: [Numpy-discussion] fromstring, tostring slow?

Reply via email to