Thanks Antonio and Anthony. I will give this a try.

-Pushkar


On Wed, Jul 17, 2013 at 2:59 PM, <
pytables-users-requ...@lists.sourceforge.net> wrote:

> Date: Wed, 17 Jul 2013 16:59:16 -0500
> From: Anthony Scopatz <scop...@gmail.com>
> Subject: Re: [Pytables-users] Pytables bulk loading data
> To: Discussion list for PyTables
>         <pytables-users@lists.sourceforge.net>
> Message-ID:
>         <
> capk-6t4ht9+ncdd_1oojrbn4u_6+ouekobklmokeufjojjk...@mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> Hi Pushkar,
>
> I agree with Antonio.  You should load your data with NumPy functions and
> then write back out to PyTables.  This is the fastest way to do things.
>
> Be Well
> Anthony
>
>
> On Wed, Jul 17, 2013 at 2:12 PM, Antonio Valentino <
> antonio.valent...@tiscali.it> wrote:
>
> > Hi Pushkar,
> >
> > Il 17/07/2013 19:28, Pushkar Raj Pande ha scritto:
> > > Hi all,
> > >
> > > I am trying to figure out the best way to bulk load data into pytables.
> > > This question may have been already answered but I couldn't find what I
> > was
> > > looking for.
> > >
> > > The source data is in form of csv which may require parsing, type
> > checking
> > > and setting default values if it doesn't conform to the type of the
> > column.
> > > There are over 100 columns in a record. Doing this in a loop in python
> > for
> > > each row of the record is very slow compared to just fetching the rows
> > from
> > > one pytable file and writing it to another. Difference is almost a
> factor
> > > of ~50.
> > >
> > > I believe if I load the data using a C procedure that does the parsing
> > and
> > > builds the records to write in pytables I can get close to the speed of
> > > just copying and writing the rows from 1 pytable to another. But may be
> > > there is something simple and better that already exists. Can someone
> > > please advise? But if it is a C procedure that I should write can
> someone
> > > point me to some examples or snippets that I can refer to put this
> > together.
> > >
> > > Thanks,
> > > Pushkar
> > >
> >
> > numpy has some tools for loading data from csv files like loadtxt [1],
> > genfromtxt [2] and other variants.
> >
> > Non of them is OK for you?
> >
> > [1]
> >
> >
> http://docs.scipy.org/doc/numpy/reference/generated/numpy.loadtxt.html#numpy.loadtxt
> > [2]
> >
> >
> http://docs.scipy.org/doc/numpy/reference/generated/numpy.genfromtxt.html#numpy.genfromtxt
> >
> >
> > cheers
> >
> > --
> > Antonio Valentino
> >
> >
> >
> ------------------------------------------------------------------------------
> > See everything from the browser to the database with AppDynamics
> > Get end-to-end visibility with application monitoring from AppDynamics
> > Isolate bottlenecks and diagnose root cause in seconds.
> > Start your free trial of AppDynamics Pro today!
> >
> http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk
> > _______________________________________________
> > Pytables-users mailing list
> > Pytables-users@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/pytables-users
> >
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
>
> ------------------------------------------------------------------------------
> See everything from the browser to the database with AppDynamics
> Get end-to-end visibility with application monitoring from AppDynamics
> Isolate bottlenecks and diagnose root cause in seconds.
> Start your free trial of AppDynamics Pro today!
> http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk
>
> ------------------------------
>
> _______________________________________________
> Pytables-users mailing list
> Pytables-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/pytables-users
>
>
> End of Pytables-users Digest, Vol 86, Issue 8
> *********************************************
>
------------------------------------------------------------------------------
See everything from the browser to the database with AppDynamics
Get end-to-end visibility with application monitoring from AppDynamics
Isolate bottlenecks and diagnose root cause in seconds.
Start your free trial of AppDynamics Pro today!
http://pubads.g.doubleclick.net/gampad/clk?id=48808831&iu=/4140/ostg.clktrk
_______________________________________________
Pytables-users mailing list
Pytables-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/pytables-users

Reply via email to