Re: [Pytables-users] [pytables-dev] SciPy 2012 Tutorial

2012-07-17 Thread Alvaro Tejero Cantero
It is a very nice presentation. Makes me wonder if using the terminology 'in memory' for 'in-core' and 'in disk' for 'out of core' would not be more straightforward! -á. On 17 July 2012 06:46, Anthony Scopatz wrote: > Hello PyTables, > > I'd like to present the tutorial I gave at SciPy 2012 t

Re: [Pytables-users] [pytables-dev] SciPy 2012 Tutorial

2012-07-17 Thread Francesc Alted
Hey Anthony, I was not there, but judging by the slices, this should have been a very nice tutorial. Some remarks: - In slide 19, you state that, if data comes from datasets with the 'numpy' flavor, they can be accessed in a numpy-like fashion. In fact, you should be able to access data thi

[Pytables-users] Faster Performance: A set of nodes vs A new column that ranges within a set?

2012-07-17 Thread Jacob Bennett
Hello PyTables Users & Contributors, Just a quick question, let's say that I have certain identifiers that link to a set of data. Would it generally be faster for lookup to have each set a data as a separate table with an id as the tables name or to add this id as another column to a universal tab

Re: [Pytables-users] Faster Performance: A set of nodes vs A new column that ranges within a set?

2012-07-17 Thread Anthony Scopatz
On Tue, Jul 17, 2012 at 3:30 PM, Jacob Bennett wrote: > Hello PyTables Users & Contributors, > > Just a quick question, let's say that I have certain identifiers that link > to a set of data. Would it generally be faster for lookup to have each set > a data as a separate table with an id as the ta

Re: [Pytables-users] [pytables-dev] SciPy 2012 Tutorial

2012-07-17 Thread Anthony Scopatz
On Tue, Jul 17, 2012 at 2:43 AM, Alvaro Tejero Cantero wrote: > It is a very nice presentation. > > Makes me wonder if using the terminology > > 'in memory' for 'in-core' and 'in disk' for 'out of core' would not be > more straightforward! > Thanks Alvaro! I agree the existing terminology here i

Re: [Pytables-users] Faster Performance: A set of nodes vs A new column that ranges within a set?

2012-07-17 Thread Jacob Bennett
Thanks for the input Anthony! -Jake On Tue, Jul 17, 2012 at 4:20 PM, Anthony Scopatz wrote: > On Tue, Jul 17, 2012 at 3:30 PM, Jacob Bennett > wrote: > >> Hello PyTables Users & Contributors, >> >> Just a quick question, let's say that I have certain identifiers that >> link to a set of data.

Re: [Pytables-users] Pytables file structure

2012-07-17 Thread Juan Manuel Vázquez Tovar
Thank you very much Anthony. Do I have to sign up to store a ticket? 2012/7/15 Anthony Scopatz > Ahh I see, tricky. > > So I think what is killing you is that you are pulling each row of the > table individually over the network. Ideally you should be able to do > something like the following:

Re: [Pytables-users] Pytables file structure

2012-07-17 Thread Anthony Scopatz
Hello Juan, Just make an account at github [1] and then go to the PyTables issues page. Be Well Anthony 1. https://github.com/ 2. https://github.com/PyTables/PyTables/issues On Tue, Jul 17, 2012 at 6:27 PM, Juan Manuel Vázquez Tovar < jmv.to...@gmail.com> wrote: > Thank you very much Anthony.

Re: [Pytables-users] Faster Performance: A set of nodes vs A new column that ranges within a set?

2012-07-17 Thread Ümit Seren
Just to add what Anthony said: In the end it also depends how unrelated your data is and how you want to access it. If the access scenaria is that you usually only search or select within a specific dataset then splitting up the datasets and putting them into separate tables is the way to go. In RB