On Sun, 2011-07-31 at 19:57 +0530, Anand Balachandran Pillai wrote:
xml parsing in the case when all that you need from the string is a
simple
numeric value(not a string), then good luck; unlike esr i will not
use
adjectives; but i would not use your code either.
To be fair here, I
On Mon, Aug 1, 2011 at 12:43 AM, Noufal Ibrahim nou...@gmail.com wrote:
Dhananjay Nene dhananjay.n...@gmail.com writes:
[...]
re.search(distance\s*(\d+)\s*/distance,data).group(1)
would appear to be the most succinct and quite fast. Adjust for
whitespace
as and if necessary.
On 31-Jul-2011, at 11:33 PM, Venkatraman S wrote:
A regex is the simplest IMHO, because you need not know the syntax of the
minidom parser.
But, again i have seen this quiet often that lack of knowledge of regexp has
led people to other solutions (the grapes are sour!)
In the eternal words
On Mon, Aug 1, 2011 at 1:25 PM, Kiran Jonnalagadda j...@pobox.com wrote:
On 31-Jul-2011, at 11:33 PM, Venkatraman S wrote:
A regex is the simplest IMHO, because you need not know the syntax of the
minidom parser.
But, again i have seen this quiet often that lack of knowledge of regexp
by using lxml...for example-:
from lxml import etree
content = etree.iterparse( *name of the xml file*, events=('start', 'end'))
for event, elem in content:
if elem.tag == 'distance':
print elem.text
Hope it will work..
On Mon, Aug 1, 2011 at 1:43 PM,
On Mon, Aug 1, 2011 at 12:46 AM, Noufal Ibrahim nou...@gmail.com wrote:
Venkatraman S venka...@gmail.com writes:
Hang around in #django or #python. The most elegant code that you
*should* write would invariably be pretty fast (am not ref to asm).
I agree with you here. Pythonicity is best
Anand Balachandran Pillai abpil...@gmail.com writes:
On Mon, Aug 1, 2011 at 6:08 AM, Anand Chitipothu anandol...@gmail.comwrote:
[...]
It is more subtler than that.
List comprehensions are faster than map functions when
the latter needs to invoke a user-defined function call or a lambda.
On Mon, Aug 1, 2011 at 7:51 PM, Noufal Ibrahim nou...@gmail.com wrote:
Anand Balachandran Pillai abpil...@gmail.com writes:
On Mon, Aug 1, 2011 at 6:08 AM, Anand Chitipothu anandol...@gmail.comwrote:
[...]
It is more subtler than that.
List comprehensions are faster than map functions
On Fri, Jul 29, 2011 at 4:41 PM, Venkatraman S venka...@gmail.com wrote:
Noufal,
I have nothing more to say than this(as i see some tangential replies which
i am not interested in substantiating - for eg, i never suggested to use a
regexp based parser - a regexp based xml parser is different
Anand Balachandran Pillai abpil...@gmail.com writes:
On Fri, Jul 29, 2011 at 4:41 PM, Venkatraman S venka...@gmail.com wrote:
[...]
To be fair here, I think what he is saying is that Kenneth's problem
(getting at the particular value) can be solved by using an aptly
written regular
On Thu, Jul 28, 2011 at 3:18 PM, Kenneth Gonsalves law...@gmail.com wrote:
hi,
here is a simplified version of an xml file:
?xml version=1.0 encoding=UTF-8?
gpx
metadata
author
nameCloudMade/name
email id=support
On Sun, Jul 31, 2011 at 10:58 PM, Dhananjay Nene
dhananjay.n...@gmail.comwrote:
a. the minidom performance is acceptable - no further optimisation required
b. minidom performance is not acceptable - try the regex one
c. python library performance is not acceptable - switch to 'c'
I can
Dhananjay Nene dhananjay.n...@gmail.com writes:
[...]
re.search(distance\s*(\d+)\s*/distance,data).group(1)
would appear to be the most succinct and quite fast. Adjust for whitespace
as and if necessary.
Whitespace (including newlines), mixed cases etc.
[...]
As far as optimisation
Venkatraman S venka...@gmail.com writes:
[...]
A regex is the simplest IMHO, because you need not know the syntax of the
minidom parser.
Oh come on. This sounds like doing it the wrong way because you're not
going to spend time reading the docs and then using performance as a
cover for the
On Mon, Aug 1, 2011 at 6:08 AM, Anand Chitipothu anandol...@gmail.comwrote:
Hang around in #django or #python. The most elegant code that you
*should*
write would invariably be pretty fast (am not ref to asm).
That doesn't mean that any code that is faster is elegant.
IIRC, in python,
Venkatraman S venka...@gmail.com writes:
[...]
Well, i have clearly mentioned my assumptions - i.e, when you treat
the XML as a 'string' and do not want to retrieve anything else in a
'structured manner'.
If the data is structured, it makes sense to exploit that structure and
use a proper
On Fri, Jul 29, 2011 at 11:15 AM, Anand Chitipothu anandol...@gmail.comwrote:
2011/7/29 Venkatraman S venka...@gmail.com:
On Fri, Jul 29, 2011 at 10:47 AM, Anand Chitipothu anandol...@gmail.com
wrote:
2011/7/28 Venkatraman S venka...@gmail.com:
parsing using minidom is one of the
On Fri, Jul 29, 2011 at 11:31 AM, Noufal Ibrahim nou...@gmail.com wrote:
Well, i have clearly mentioned my assumptions - i.e, when you treat
the XML as a 'string' and do not want to retrieve anything else in a
'structured manner'.
If the data is structured, it makes sense to exploit
Venkatraman S venka...@gmail.com writes:
[...]
Sigh! Again, guys, i am referring to regexp when all you need is some
number within a tag! If the content of that tag was text, i would
have never suggested this solution.
[...]
And I'm telling you that even a slight change to the tag - an
On Fri, Jul 29, 2011 at 11:44 AM, Noufal Ibrahim nou...@gmail.com wrote:
And I'm telling you that even a slight change to the tag - an extra
space, a newline, a new attribute, a change in case or any such thing
which doesn't modify it's meaning as far as the XML snippet is concerned
will
On Fri, Jul 29, 2011 at 11:31 AM, Noufal Ibrahim nou...@gmail.com wrote:
I am a speed-maniac and crave for speed; so if the assumption is
valid, i can vouch for the fact that regexp would be faster and neater
solution. I have done some speed experiments in past on this (results
of which i
Venkatraman S venka...@gmail.com writes:
On Fri, Jul 29, 2011 at 11:31 AM, Noufal Ibrahim nou...@gmail.com wrote:
I am a speed-maniac and crave for speed; so if the assumption is
valid, i can vouch for the fact that regexp would be faster and neater
solution. I have done some speed
+1.
On Fri, Jul 29, 2011 at 12:20 PM, Noufal Ibrahim nou...@gmail.com wrote:
Venkatraman S venka...@gmail.com writes:
On Fri, Jul 29, 2011 at 11:31 AM, Noufal Ibrahim nou...@gmail.com wrote:
I am a speed-maniac and crave for speed; so if the assumption is
valid, i can vouch for the fact
+1
On Fri, Jul 29, 2011 at 12:50 PM, Sidu Ponnappa lorddae...@gmail.comwrote:
+1.
On Fri, Jul 29, 2011 at 12:20 PM, Noufal Ibrahim nou...@gmail.com wrote:
Venkatraman S venka...@gmail.com writes:
On Fri, Jul 29, 2011 at 11:31 AM, Noufal Ibrahim nou...@gmail.com
wrote:
I am a
n Fri, Jul 29, 2011 at 12:20 PM, Noufal Ibrahim nou...@gmail.com wrote:
I agree and I try my best to do the same thing. However, I differentiate
between micro optimsations like rewriting parts in C and XML and top
level optimisations like good design and the right data structures.
Using
Venkatraman S venka...@gmail.com writes:
n Fri, Jul 29, 2011 at 12:20 PM, Noufal Ibrahim nou...@gmail.com wrote:
I agree and I try my best to do the same thing. However, I differentiate
between micro optimsations like rewriting parts in C and XML and top
level optimisations like good design
On Fri, Jul 29, 2011 at 1:09 PM, Venkatraman S venka...@gmail.com wrote:
IMHO, regexps are much more powerful and fault tolerant than XML parsing.
XMLs are brittle.
Did you mean parsing XML using Regular Expressions is more powerful
and fault tolerant than using a XML parser?
Regards,
BG
--
On Fri, Jul 29, 2011 at 10:55 AM, Baishampayan Ghose b.gh...@gmail.com wrote:
minidom is the fastest solution if you consider the programmer time
instead of developer time. Minidom is available in standard library,
you don't have to add another dependency and worry about PyPI
downtimes and
Noufal,
I have nothing more to say than this(as i see some tangential replies which
i am not interested in substantiating - for eg, i never suggested to use a
regexp based parser - a regexp based xml parser is different from using 'a'
regexp on a string!) :
Read my replies properly. Read my
Venkatraman S venka...@gmail.com writes:
[...]
Read my replies properly. Read my assumptions properly w.r.t the xml
structure and the requested value in the xml. Read the link that you
have pasted again. If possible, read the comments in the link
shared(from esr) again. Once done, think
grep or regexp?
-V
___
BangPypers mailing list
BangPypers@python.org
http://mail.python.org/mailman/listinfo/bangpypers
Using xpath such as:
/gpx/extensions/distance(:text)
?
On Thu, Jul 28, 2011 at 3:20 PM, Venkatraman S venka...@gmail.com wrote:
grep or regexp?
-V
___
BangPypers mailing list
BangPypers@python.org
On Thu, Jul 28, 2011 at 3:20 PM, Venkatraman S venka...@gmail.com wrote:
grep or regexp?
-V
___
BangPypers mailing list
BangPypers@python.org
http://mail.python.org/mailman/listinfo/bangpypers
can write an Xml parsing query
--
Ramdas S
+91
2011/7/28 Kenneth Gonsalves law...@gmail.com:
hi,
here is a simplified version of an xml file:
?xml version=1.0 encoding=UTF-8?
gpx
metadata
author
nameCloudMade/name
email id=support domain=cloudmade.com /
link
here is a simplified version of an xml file:
?xml version=1.0 encoding=UTF-8?
gpx
metadata
author
nameCloudMade/name
email id=support domain=cloudmade.com /
link href=http://maps.cloudmade.com;/link
On Thu, 2011-07-28 at 15:33 +0530, Anand Chitipothu wrote:
I want to get the value of the distance element - 1489. What is the
simplest way of doing this?
from xml.dom import minidom
dom = minidom.parseString(x)
dom.getElementsByTagName(distance)[0].childNodes[0].nodeValue
u'1489'
You can try beautifulsoup, recommended for python/XML Parsing.
___
BangPypers mailing list
BangPypers@python.org
http://mail.python.org/mailman/listinfo/bangpypers
If you're doing this repeatedly, you may want to just delegate to a
native XPath implementation. I haven't done much Python, so I can't
comment on your choices, but in Ruby I'd simply hand off to libXML
using Nokogiri. This approach should be a whole lot faster, but I'd
advise benchmarking first
parsing using minidom is one of the slowest. if you just want to extract the
distance and assuming that it(the tag) will always be consistent, then i
would always suggest regexp. xml parsing is a pain.
___
BangPypers mailing list
BangPypers@python.org
On Thu, Jul 28, 2011 at 10:37 PM, Venkatraman S venka...@gmail.com wrote:
parsing using minidom is one of the slowest. if you just want to extract the
distance and assuming that it(the tag) will always be consistent, then i
would always suggest regexp. xml parsing is a pain.
[...]
Strongly
On Fri, Jul 29, 2011 at 1:23 AM, Gora Mohanty g...@mimirtech.com wrote:
On Thu, Jul 28, 2011 at 10:37 PM, Venkatraman S venka...@gmail.com
wrote:
parsing using minidom is one of the slowest. if you just want to extract
the
distance and assuming that it(the tag) will always be consistent,
Hi,
check and try pyparsing module... U could do it so simple:)
regards,
joseph
On 7/29/11, Ramdas S ram...@gmail.com wrote:
On Fri, Jul 29, 2011 at 1:23 AM, Gora Mohanty g...@mimirtech.com wrote:
On Thu, Jul 28, 2011 at 10:37 PM, Venkatraman S venka...@gmail.com
wrote:
parsing using
2011/7/28 Venkatraman S venka...@gmail.com:
parsing using minidom is one of the slowest. if you just want to extract the
distance and assuming that it(the tag) will always be consistent, then i
would always suggest regexp. xml parsing is a pain.
regexp is a bad solution to parse xml.
minidom
minidom is the fastest solution if you consider the programmer time
instead of developer time. Minidom is available in standard library,
you don't have to add another dependency and worry about PyPI
downtimes and lxml compilations failures.
FWIW, ElementTree is a part of the standard library
On Fri, Jul 29, 2011 at 10:47 AM, Anand Chitipothu anandol...@gmail.comwrote:
2011/7/28 Venkatraman S venka...@gmail.com:
parsing using minidom is one of the slowest. if you just want to extract
the
distance and assuming that it(the tag) will always be consistent, then i
would always
2011/7/29 Venkatraman S venka...@gmail.com:
On Fri, Jul 29, 2011 at 10:47 AM, Anand Chitipothu
anandol...@gmail.comwrote:
2011/7/28 Venkatraman S venka...@gmail.com:
parsing using minidom is one of the slowest. if you just want to extract
the
distance and assuming that it(the tag) will
2011/7/29 Baishampayan Ghose b.gh...@gmail.com:
minidom is the fastest solution if you consider the programmer time
instead of developer time. Minidom is available in standard library,
you don't have to add another dependency and worry about PyPI
downtimes and lxml compilations failures.
47 matches
Mail list logo