Re: [Pytables-users] [Pytables-announce] ANN: PyTables 2.4.0 beta1

2012-07-11 Thread Francesc Alted
Hey Antonio, this looks great.

BTW, which is the status of the 3.x support?  I vaguely remember you 
asking me for some help on this, but I don't remember well. Not that I 
have a lot of time to spend on it, but perhaps I can use some hours in 
the next days.

Thanks,

Francesc

On 7/7/12 8:47 PM, Antonio Valentino wrote:
 -BEGIN PGP SIGNED MESSAGE-
 Hash: SHA1

 ===
 Announcing PyTables 2.4.0b1
 ===

 We are happy to announce PyTables 2.4.0b1.

 This is an incremental release which includes many changes to prepare
 for future Python 3 support.


 What's new
 ==

 This release includes support for the float16 data type and read-only
 support for variable length string attributes.

 The handling of HDF5 errors has been improved.  The user will no
 longer see HDF5 error stacks dumped to the console.  All HDF5 error
 messages are trapped and attached to a proper Python exception.

 Now PyTables only supports HDF5 v1.8.4+. All the code has been updated
 to the new HDF5 API.  Supporting only HDF5 1.8 series is beneficial
 for future development.

 As always, a large amount of bugs have been addressed and squashed as
 well.

 In case you want to know more in detail what has changed in this
 version, please refer to:
 http://pytables.github.com/release_notes.html

 You can download a source package with generated PDF and HTML docs, as
 well as binaries for Windows, from:
 http://sourceforge.net/projects/pytables/files/pytables/2.4.0b1

 For an online version of the manual, visit:
 http://pytables.github.com/usersguide/index.html


 What it is?
 ===

 PyTables is a library for managing hierarchical datasets and
 designed to efficiently cope with extremely large amounts of data with
 support for full 64-bit file addressing.  PyTables runs on top of
 the HDF5 library and NumPy package for achieving maximum throughput and
 convenient use.  PyTables includes OPSI, a new indexing technology,
 allowing to perform data lookups in tables exceeding 10 gigarows
 (10**10 rows) in less than a tenth of a second.


 Resources
 =

 About PyTables: http://www.pytables.org

 About the HDF5 library: http://hdfgroup.org/HDF5/

 About NumPy: http://numpy.scipy.org/


 Acknowledgments
 ===

 Thanks to many users who provided feature improvements, patches, bug
 reports, support and suggestions.  See the ``THANKS`` file in the
 distribution package for a (incomplete) list of contributors.  Most
 specially, a lot of kudos go to the HDF5 and NumPy (and numarray!)
 makers.  Without them, PyTables simply would not exist.


 Share your experience
 =

 Let us know of any bugs, suggestions, gripes, kudos, etc. you may have.


 - 

**Enjoy data!**


 - --
 The PyTables Team
 -BEGIN PGP SIGNATURE-
 Version: GnuPG v1.4.11 (GNU/Linux)
 Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

 iEYEARECAAYFAk/4hDwACgkQ1JUs2CS3bP7TUwCfcobS3KI7L/6k3Bbbt2VBOz5B
 TqAAn0DhrSdtd7XTPOj0RR/mpr2FtseE
 =T5iQ
 -END PGP SIGNATURE-

 --
 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond. Discussions
 will include endpoint security, mobile security and the latest in malware
 threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
 ___
 Pytables-announce mailing list
 pytables-annou...@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/pytables-announce


-- 
Francesc Alted


--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Pytables-users mailing list
Pytables-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/pytables-users


Re: [Pytables-users] [Pytables-announce] ANN: PyTables 2.4.0 beta1

2012-07-11 Thread Antonio Valentino
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Hi Francesc,

Il 11/07/2012 11:33, Francesc Alted ha scritto:
 Hey Antonio, this looks great.
 

Thanks :)

 BTW, which is the status of the 3.x support?  I vaguely remember
 you asking me for some help on this, but I don't remember well. Not
 that I have a lot of time to spend on it, but perhaps I can use
 some hours in the next days.

well, in PyTables 2.4 we made some job in preparation of the porting
to python3, but the porting itself is still not started.

One of the main issues is that numexpr still not supports python3 so
we have a missing dependency.

I started a porting of numexpr to python3 (see [1]) but it is still
incomplete.
I hope it is good enough to let us start working on the porting of
PyTables.

Of course if you would like to give a look to numexpr for python3 it
would be of great help.

After the PyTables 2.4 final I plan to publish a wiki page with my
roadmap proposal. IMHO main points are:

* open a new branch in the repo
* remove al deprecated code (Numeric, numarray, netcdf3, etc). This
breaks the API and, IMHO, we will also need to bump the format version
* ensure that all the required SW work (enough) on python3
* handle str/unicode issues
* full support to unicode HDF5 object names
* start working an a good setup for 2to3 (needs some investigation)
* ...

Please let me know if you think there are other point that are
important for python3 support


[1] https://groups.google.com/forum/?fromgroups#!topic/numexpr/M2MVjXsBR0c


cheers

 
 Thanks,
 
 Francesc
 
 On 7/7/12 8:47 PM, Antonio Valentino wrote: 
 === Announcing PyTables 2.4.0b1 
 ===
 
 We are happy to announce PyTables 2.4.0b1.
 

[CUT]

- -- 
Antonio Valentino


-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.11 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iEYEARECAAYFAk/9soUACgkQ1JUs2CS3bP4u1ACeJKnMQRFF1hATXFMG3lPH2xyU
9DwAoJNPp6L8gHf+s5hA2Jhj4JLyl3jr
=AYqd
-END PGP SIGNATURE-

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
Pytables-users mailing list
Pytables-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/pytables-users


Re: [Pytables-users] advice on using PyTables

2012-07-11 Thread Anthony Scopatz
Hello Benjamin,

Not knowing to much about the ASTERIX format, other than what you said and
what is in the links, I would say that this is a good fit for HDF5 and
PyTables.  PyTables will certainly help you read in the data and manipulate
it.

However, before you abandon hachoir completely, I will say it is a lot
easier to write hdf5 files in PyTables than to use the HDF5 C API.   If
hachoir is too slow, have you tried profiling the code to see what is
taking up the most time?  Maybe you could just rewrite these parts in C?
 Have you looked into Cythonizing it?  Also, you don't seem to be using
numpy to read in the data... (there are some tricks given ASTERIX here, but
not insurmountable).

I ask the above, just so you don't have to completely rewrite everything.
 You are correct though that pure python is probably not sufficient.  Feel
free to ask more questions here.

Be Well
Anthony

On Wed, Jul 11, 2012 at 6:52 AM, benjamin.bertr...@lfv.se wrote:

 Hi,

 I'm working with Air Traffic Management and would like to perform checks /
 compute statistics on ASTERIX data.
 ASTERIX is an ATM Surveillance Data Binary Messaging Format (
 http://www.eurocontrol.int/asterix/public/standard_page/overview.html)

 The data consist of a concatenation of consecutive data blocks.
 Each data block consists of data category + length + records.
 Each record is of variable length and consists of several data items (that
 are well defined for each category).
 Some data items might be present or not depending on a field specification
 (bitfield).

 I started to write a parser using hachoir (
 https://bitbucket.org/haypo/hachoir/overview) a pure python library.
 But the parsing was really too slow and taking a lot of memory.
 That's not really useable.

 From what I read, PyTables could really help to manipulate and analyze
 the data.
 So I've been thinking about writing a tool (probably in C) to convert my
 ASTERIX format to HDF5.

 Before I start, I'd like confirmation that this seems like a suitable
 application for PyTables.
 Is there another approach than writing a conversion tool to HDF5?

 Thanks in advance

 Benjamin


 --
 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond. Discussions
 will include endpoint security, mobile security and the latest in malware
 threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
 ___
 Pytables-users mailing list
 Pytables-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/pytables-users

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___
Pytables-users mailing list
Pytables-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/pytables-users