from:"Yingjie Lan"

Re: string interpolation for python

2012-04-02 Thread Yingjie Lan

Hi Adrian, see my comments below.

 From: Adrian Hunt cybor...@hotmail.com
...
It could break old code... okay you may say you should’nt allow 
certain characters but if they're printable and used in a controlled
environment those characters can dramatically increase the security
of a username and password.


What you said makes lots of sense to me. 
if strings are interpolated *automatically*.

But it won't and shouldn't.

They are called Dynamic strings.
Dynamic strings can achieve formatting, 
but the mechanism under the hood differ
from common strings dramatically.

Many here didn't realize that this is not
another formatting proposal, it is a new
kind of *expression*. 

To have it in Python, we will need 
a new kind of syntax to distinguish it
from other strings, such as raw strings 
and the like. A raw string looks like:

 r'my\\ raw str'
'my raw str'

A dynamic string may look like this:

 name = Peter #common string
 dThank you, $name$! #dynamic string!
'Thank you, Peter!'


The following example would make it feel 
a lot more safe (suppose a = raw_input()):

 a = 'dAre you $name$?'
 print(a)
'dAre you $name$?'

 eval('dAre you $name$?')

'Are you Peter?'
 dIt contains $len(_.split())$ words!
'It contains 3 words!'

An interesting question might be:
what if a dynamic string is referring
to another dynamic string, which
in turn refers back to the former?

The answer is: no variable can hold
a dynamic string itself, only its result,
which can only be a common string.

However, a infinite recursion may 
occur if the eval function is used inside:

 a = 'd$eval(a)$'
 eval(a)

This is just to show a dynamic string
is really an expression in disguise.
Like evaluating any expression containing
function calls, there is risk of getting into
infinite recursion.

Cheers, 

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: string interpolation for python

2012-04-02 Thread Yingjie Lan

 You can already do essentially that without adding a special-case string 

 formatting method to the general methods we already have.
 
  balls = 5
  people = 3
  'The {people} people have {balls} 
 balls.'.format(**locals())
 'The 3 people have 5 balls.'


Clearly dynamic strings are much more powerful,
allowing arbitrary expressions inside. It is also
more terse and readable, since we need no dictionary.

I would probably rather liken dynamic expressions
as a little brother of computable documents in 
Mathematica. It is a new kind of expression,
rather than formatting -- though it has formatting
connections. 

Dynamic strings are mainly useful at time of
writing readable code before compilation. 
The compiler can choose to convert it into
a string formatting expression, of course.
To efficiently format strings at runtime, 
the best choice (especially
for safty reasons) is string formatting, 
not evaluating a dynamic string.

On the implementation, I would suppose new 
syntax is needed (though very small).

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: string interpolation for python

2012-04-02 Thread Yingjie Lan

 Python already has *3* different built-in string

 formatting/interpolation systems:
...
 I would surmise that your key implicitly grab variable values from
 the enclosing scope feature has previously been rejected for being
 too magical.

It grabs because it is an expression in disguise (not a literal). 
If we could make that clear in the way we write it, seems the 
magic level should reduce.to a tolerable (or even likable) level.

Cheers,
Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: string interpolation for python

2012-04-02 Thread Yingjie Lan

- Original Message -
 From: Steven D'Aprano steve+comp.lang.pyt...@pearwood.info
 To: python-list@python.org
 Cc: 
 Sent: Monday, April 2, 2012 4:26 PM
 Subject: Re: string interpolation for python

 On Mon, 02 Apr 2012 00:39:42 -0700, Yingjie Lan wrote:

  You can already do essentially that without adding a special-case
  string

  formatting method to the general methods we already have.

   balls = 5
   people = 3
   'The {people} people have {balls}
  balls.'.format(**locals())
  'The 3 people have 5 balls.'

  Clearly dynamic strings are much more powerful, allowing arbitrary
  expressions inside.

 And so it may be a security risk, if user-input somehow ends up treated 
 as a dynamic string.

 We already have three ways to evaluate arbitrary expressions:

 * Python code
 * eval
 * exec

 Why do we need yet another one?

  It is also more terse and readable, since we need no
  dictionary.

 I think you mean terse and unreadable, since we need no dictionary. That 
 means that variables will be evaluated by magic from... where? Globals? 
 Local scope? Non-local scope? All of the above?

 We already have one way of evaluating implicit variables using implicit 
 rules, namely regular Python code. Why do we need a second one?

  I would probably rather liken dynamic expressions as a little brother of
  computable documents in Mathematica. It is a new kind of expression,
  rather than formatting -- though it has formatting connections.

 Why do we need a new kind of expression?

  Dynamic strings are mainly useful at time of writing readable code
  before compilation.

 What does that mean?

  The compiler can choose to convert it into a string
  formatting expression, of course. To efficiently format strings at
  runtime, the best choice (especially
  for safty reasons) is string formatting, not evaluating a dynamic
  string.

 So you're suggesting that we should have dynamic strings, but not 
 actually use dynamic strings. The compiler should just convert them to 
 regular string formatting.

 Why not cut out the middle-man and just use regular string formatting?

I believe non of the other three alternatives are as terse and readable.
We've got template based, formatting with dict, formatting with tuple.
They all require the coder extra effort:

Both template based and dict-based formatting require writing the
identifier three times:

 name = 'Peter'
 Are you %(name)s%{'name':name}

If dynamic string is used:
 Are you $name$?

Template:
 Template(Are you $name?).substitute(name=name)

It is three to one in compactness, what a magic 3!

Of course, the old C style way:

 Are you %s?%name

Almost as terse, but not as readable, especially
when there are many parts to substitute --
the coder and reader need to be careful 
to make sure the sequence is correct.

Why the Python community is so
hostile to new things now? 
Python has merits,
but it is far from being perfect.

Cheers,
Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: string interpolation for python

2012-04-02 Thread Yingjie Lan

 Actually, this sounds like a job for a precompiler/preprocessor. Do

 whatever translations you want on your code, then turn it into a .py
 file for execution. But hardly necessary, as there are already two -
 err, three, I stand corrected - perfectly good ways to do it.


Agree and disagree. 

The other ways are not perfectly good.
They stinks in Python. 
This new way is the most natural way.
Effortless, natural. That's my Python.

Cheers,
Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: string interpolation for python

2012-04-02 Thread Yingjie Lan

 

  Are you +name+?
 
 That allows arbitrary expressions and everything.
 

To make that work for any type, you need:

 Are you + str(name) + ?

Another concern is performance.

You are absolutely right, they are 
equivalent in that both are expressions.
As long as people start to realize that
dynamic strings are expressions,
there is no magic in it any more.

And allowing expressions in those
dynamic strings would make sense 
since they are of the same sort.

 dsin($x$) = $ sin(x):0.3f $

is equivalent to the expression of

 sin(%s%x + )= %0.3f%sin(x)

Comparing th e two, I would say the latter
is more computer friendly while 
the former, more human friendly.

If the computed result is only to be
used in formatting the string, it would
be nice to save an assignment stmt.


 
  Almost as terse, but not as readable, especially
  when there are many parts to substitute --
  the coder and reader need to be careful
  to make sure the sequence is correct.
 
 I quite like this notation, personally. It's convenient, and is
 supported (with variants) in quite a few C-derived languages (and, in
 spite of the massive syntactic differences, Python does have C
 heritage).

Sure, once you get used to it, it would be harder to stop it
 the harder it is :). That's part of human nature, anyway.


  Why the Python community is so
  hostile to new things now?
  Python has merits,
  but it is far from being perfect.
 
 Hey now, no need to get defensive :) Thing is, it's up to you to
 demonstrate that your proposal justifies itself. You're proposing to
 create a massive backward-compatibility issue, so you need to prove
 that your new way of formatting strings is sufficiently awesome to be
 able to say Well, you need Python 3.4+ to use this.
 


OK. I have put it out as is. I trust people knows good things.

I would simply say: this new way is much more simple 
and much more powerful. And there is no security issues
as long as you don't use the evil eval to evaluate expressions,
which is always a security issue.

It is new, and has no compatibility issues with old ways at all.
In syntax, all you need is to allow d..., which clearly won't
affect any old ways of business.

Cheers,

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: string interpolation for python

2012-04-02 Thread Yingjie Lan

 Are you %(name)s % locals() # or vars()



This partly solves the problem, however, you 
can't work with expressions inside, like:

 dsin($x$) = $sin(x)$


Also, what if locals() or vars() does not contain
the variable x? (x could be nonlocal or global).


 It's more conservative than hostile. Here's some insight:
 http://www.boredomandlaziness.org/2011/02/status-quo-wins-stalemate.html


You are probably right...Harmless enhancement is still probable?


 
 Personally, in isolation, the only part of your proposal I find
 /truly/ objectionable is the support for arbitrary expressions, since
 it would tend towards encouraging suboptimal factoring. But we also

I don't quite see that as a problem. The compiler (or translator, as you 
mentioned earlier) could easily make dsin($x$) = $sin(x)$ into
something like: ''.join([sin(, str(x), ) = , str(sin(x))]
which would far more efficient than calling the format() method.

Cheers,
Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: string interpolation for python

2012-04-02 Thread Yingjie Lan

...
 That, by the way, is perhaps the biggest problem with this idea of 

 dynamic strings: not that it is too powerful, but that it is TOO WEAK. 
...
 and similar for both format() and Template.


Seems you miss understood my notion of dynamic string.
Dynamic strings are expressions in disguise: the things
in between $...$ are plain old expressions (with optional 
formatting specifications). They are evaluated
as if they were outside the dynamic string. We put them
in there to to kill two birds with one stone: 
   1) ease of reading;
   2) place holding.

Cheers,
Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: string interpolation for python

2012-04-02 Thread Yingjie Lan

 like this one ?

 
 b = dict(name=Sue, job=SAS sharp-shooter)
 print $b['name']$ works as b['job']
 
 Is it really easier to read that the following ?
 {0} works as {1}.format(b['name'],b['job'])
 
 In the case in which b is an object having job and name 
 attribute, the dynamic string will write
 
 $b.name$ works as $b.job$
 instead of
 {0}.name works as {0}.job.format(b)
 


When you already have a dict, the dict-based
formatting would be nice.

 %(name)s works as %(job)s%b

If it you need to create a dict just for string

formatting, dynamic string would be nice.
Say your object has methods/properties
that fetch things from your database.

 class staff:
...@property
...def name(): return 'Peter'
...
 t = staff()
 vars(t)
{}
 t.name
'Peter'
 dStaff name: $t.name$ #note the d... format
'Staff name: Peter'

Because of the d... format, it won't 
affect old ways of doing things one bit.
Allowing dynamic string wouldn't hurt 
a bit to anything that is already there.
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: string interpolation for python

2012-04-02 Thread Yingjie Lan

 Right, meaning that both have the same issues 
 of performance, need for

 str(), etc. There's absolutely no difference.


OK, performance. Here is a new solution:
 
Suppose we have a new string method
    str.format_join([...])
taking a list of strings and objects,
with even-indexed ones being strings,
odd-indexed ones being objects.
Each even-indexed string *ends* with a formatting
specification for the next object in the list.
Then we can have:
 dsin($x$) = $ sin(x):0.3f $
get translated to:
 ''.format_join([sin(%s,x,) = %0.3f, sin(x)])
This seems to be at least as good in performance.


 And no benefit. You lose out on syntax highlighting 
 in your editor and gain nothing.

Gain: readability, terseness, hassle-free, and possibly
better performance if done right.

Syntax highlighting: can be done more creatively.
For dynamic strings, string parts are like normal 
strings, but the embedded expressions are like
normal expressions :)

 
 sprintf(UPDATE tablename SET modified=now()%{,%s=:%[0]s%} WHERE
 key=%d,array_of_field_names,primary_key_value)
 -- UPDATE tablename SET modified=now(),foo=:foo,bar=:bar,quux=:quux
 WHERE key=1234
 
 You're still paying for no complexity you aren't actually using. 
 It's clear and readable.

You are really good at that. Maybe not everybody is as
experience as you, and I suppose the learning curve is 
kind of hard to climb.

 It's powerful only if you use eval to allow full expression syntax.
 Otherwise, what does it have above str.format()?


Those expressions are embedded, you don't need eval()
to have the result though. Are we on the same page?

 You may well be able to get past the compatibility issues. I'm not yet
 convinced that the new syntax is worth it, but it may be possible.
 
 Here's a recommendation: Write a parser for your notation that turns
 it into executable Python code (that is, executable in Python 3.3
 without any d... support). 

You mean a translator?

The syntax is essential for compatibility.
We must distinguish dynamic strings from common strings.
They will live peacefully together. 
(escaping the '$' in normal strings breaks compatibility, 
and the consequence of forgetting to escape could be
disastrous, so definitely not an option). 

May be d is too tiny, $... is easier to pick out.



Cheers,
Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: string interpolation for python

2012-04-02 Thread Yingjie Lan

   In that case you should re-think the delimiters, so that you have

   something
  that can be nested.  An example (example only, I'm not in love with it 
 as
  a final form):

Haven't really thought about that. Nesting is a big 
issue if in the embedded expression, there is another 
dynamic string expression. For example (the first
'%' immediately preceding the first quote denotes
the start of a dynamic string expression)::

 first_name, family_name = Peter, Johns
 %Hi $ first_name+%, $family_name$ $!

'Hi Peter, Johns!'

However, nesting is really cluttering things up and
we should forbid it without loosing power:
The above example can be done without nesting:
  %Hi $ first_name$, $family_name$!
'Hi Peter, Johns!'

Can you offer an example where nesting is 
better or necessary?

Cheers,
Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: string interpolation for python

2012-04-02 Thread Yingjie Lan

  I can't find a way to use an argument more than once,

  without switching to dictionary mode and using keys for 
 everything.


Even in dictionary mode, the key is spelled more than

once. The tuple mode below seems to save some typing. 

However, when there are more and more items to format, 
the readability deteriorates quickly in the tuple mode.

Use an argument more than once is, after all, probably a
not-so-often use case.


The dictionary mode greatly enhances readability, but 
you have to provide a dict object and make sure the name in 
your formatting string matches the keys in the dictionary.
And this matching requirement is because of information
redundancy in the arrangement. And information redundancy
is often the root of evil for various kinds of trouble.

Dynamic string seems to have the good without redundancy.
And there is no need to build a dict for it. 
However, when there is already the dict for use, 
clearly the dict format is the winner.
  Ack.
 
  In this case, you can use format:
 
  Hello {0}, what's up with {1}? Hey, {0} I'm 
 speaking to you!.format
  ('George', 'Melissa')
  Hello George, what's up with Melissa? Hey, George I'm 
 speaking to you!
 
 Or don't bother with the initial string, and simply pass everything as
 arguments:
 
 def yingjie_format(*args):
     it=iter(args)
     return ''.join(s%next(it,None) for s in it)
 yingjie_format(sin(%s,x,) = %0.3f, sin(x))
 

That's very nice, thanks!

 So if they're exactly like normal expressions, why not simply use

 normal expressions?


I think use dynamic strings can have these benefits:
1) you less keys (punch less keys).
2) better readability (less clutters)
3) you don't have to explicit convert/format expressions into strings
4) better performance too (adding strings together is generally slow).
 

  sprintf(UPDATE tablename SET modified=now()%{,%s=:%[0]s%} WHERE
  key=%d,array_of_field_names,primary_key_value)
  -- UPDATE tablename SET 
 modified=now(),foo=:foo,bar=:bar,quux=:quux
  WHERE key=1234
 
  You're still paying for no complexity you aren't actually 
 using.
  It's clear and readable.
 
  You are really good at that. Maybe not everybody is as
  experience as you, and I suppose the learning curve is
  kind of hard to climb.
 
 Yes, it takes some learning to use it. But that's true of everything,
 no less of your magical string interpolation. My point is that simple
 examples should be (and are, with printf formatting) simple, such that
 you only get those more complicated format strings when you're
 actually doing a complicated job (in that case, taking each element of
 an array and using it twice - actually, it was taking the indices of a
 mapping that would end up being passed to the DB query function, thus
 providing values to the :foo :bar variables).
 

Sure. But if one thing does well on both simple and complex 
situations, why not that thing?

  Those expressions are embedded, you don't need eval()
  to have the result though. Are we on the same page?
 
 I can see three implementation paths:
 
 1) Language feature. It really *is* just an expression. There's no way
 that a user can provide them, so there's actually no similarity to
 eval. But this requires that Python itself handle things.
 
 2) Precompiler. It *becomes* an expression. Again, perfectly safe,
 although I don't know how useful this really is.
 
 3) Functoin. As several have suggested, you could do some magic and
 use d(this is a $dollar$ $interpolated$ string) to implement. For
 this, you *will* need eval (or something like it).
 

Sure. for 1), things can be done most conveniently and efficiently.
for 2), yeah, not sure how useful it is.
for 3), maybe can let str class have a property like: dy.
which can do all the dirty processing. Then we may do:
 x = 0
 sin($x$) = $sin(x)$.dy
'sin(0) = 0.0'

Not much burden to use except for the CPU, I suppose.

  You mean a translator?
 
 Yes. It translates your dollar-strings into something that's legal
 Python 3.3 syntax - either calls to a function like I provided above,
 or actual embedded expressions.
 
  The syntax is essential for compatibility.
  We must distinguish dynamic strings from common strings.
  They will live peacefully together.
  (escaping the '$' in normal strings breaks compatibility,
  and the consequence of forgetting to escape could be
  disastrous, so definitely not an option).
 
  May be d is too tiny, $... is easier to pick out.
 
 I don't like the use of symbols like that; can someone, glancing at
 your code, tell whether $ is an operator, a name, or something else?
 The original d is probably better for that.
 


Yeah, the d is probably better. 

Cheers,
Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

string interpolation for python

2012-03-31 Thread Yingjie Lan

Hi all, 

I'd really like to share this idea of string interpolation for formatting.
Let's start with some code:

 name = Shrek
 print( Hi, $name$!)
Hi, Shrek!
 balls = 30
 print( We have $balls$ balls.)
We have 30 balls

 persons = 5

 print (And $persons$ persons.)
And 5 persons.
 print( Each may get exactly $balls//persons$ balls.)
Each may get exactly 6 balls.
 fraction = 0.12345
 print( The fraction is $fraction : 0.2f$!)
The fraction is 0.12!
 print(It costs about $$3, I think.)
It costs about $3, I think.

I think the rule is quite self explanatory.
An expression is always enclosed between two '$'s,
with an optional ':' followed by additional formatting specification.
Double '$$' is like double '%%'.

Of course, '$' can be replaced by anything else.
For compatibility reasons, we might just use '%' as well.



Implementation: the compiler might need to do something,
say, to replace a string interpolation with an equivalent 
expression.

Cheers,

Yingjie-- 
http://mail.python.org/mailman/listinfo/python-list

the deceptive continuous assignments

2011-12-06 Thread Yingjie Lan

Hi, I just figured out this with Python3.2 IDLE:

 class k: pass
 x=k()
 x.thing = 1
 x.thing
1
 x = x.thing = 1
Traceback (most recent call last):
  File pyshell#7, line 1, in module
    x = x.thing = 1
AttributeError: 'int' object has no attribute 'thing'
 x
1



when I do x=x.thing=1, I thought it would
be like in C, 1 is first assigned to x.thing,
then it is further assigned to x.

But what seems to be going on here is
that 1 is first assigned to x, then
to x.thing (which causes an error).

Any reason why would Python deviate
from C in this regard?

Thanks!

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: revive a generator

2011-10-21 Thread Yingjie Lan

 Here's an example of an explicit request to revive the generator:

 
  g = (x*x for x in range(3))
  for x in g: print x
 0
 1
 4
  g = (x*x for x in range(3)) # revive the generator
  for x in g: print x #now this will work
 0
 1
 4
 
 ChrisA


What if the generator is passed in as an argument 
when you are writing a function? That is, the expression
is not available? 

Secondly, it would be nice to automatically revive it.
For example, when another for-statement or other
equivalent is applied to it.

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: revive a generator

2011-10-21 Thread Yingjie Lan

- Original Message -

 From: Paul Rudin paul.nos...@rudin.co.uk
 The language has no explicit notion of a request to revive a
 generator. You could use the same syntax to make a new generator that
 yeilds the same values as the one you started with if that's what you
 want. 
 
 As we've already discussed if you want to iterate several times over the
 same values then it probably makes sense to compute them and store them
 in e.g. a list (although there are always trade-offs between storage use
 and the cost of computing things again).
 

Oops, my former reply has the code indentation messed up 
by the mail system. Here is a reformatted one:


What if the generator involves a variable from another scope,
and before re-generating, the variable changed its value.
Also, the generator could be passed in as an argument,
so that we don't know its exact expression.

 vo = 34
 g = (vo*x for x in range(3))
 def myfun(g):
            for i in g: print(i)
            vo  += 3
            revive(g) #best if revived automatically
            for i in g: print(i)
 myfun(g)


Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: revive a generator

2011-10-21 Thread Yingjie Lan

- Original Message -

 From: Paul Rudin paul.nos...@rudin.co.uk
 To: python-list@python.org
 Cc: 
 Sent: Friday, October 21, 2011 3:27 PM
 Subject: Re: revive a generator

 The language has no explicit notion of a request to revive a
 generator. You could use the same syntax to make a new generator that
 yeilds the same values as the one you started with if that's what you
 want. 

 As we've already discussed if you want to iterate several times over the
 same values then it probably makes sense to compute them and store them
 in e.g. a list (although there are always trade-offs between storage use
 and the cost of computing things again).

What if the generator involves a variable from another scope,
and before re-generating, the variable changed its value.
Also, the generator could be passed in as an argument,
so that we don't know its exact expression.

 vo = 34
 g = (vo*x for x in range(3))
 def myfun(g):
for i in g: print(i)
vo  += 3
revive(g) #best if revived automatically
for i in g: print(i)
myfun(g)

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: revive a generator

2011-10-21 Thread Yingjie Lan

- Original Message -

 From: Paul Rudin paul.nos...@rudin.co.uk
 
 I'm not really sure whether you intend g to yield the original values
 after your revive or new values based on the new value of vo.  But
 still you can make a class that supports the iterator protocol and does
 whatever you want (but you can't use the generator expression syntax).
 
 If you want something along these lines you should probably read up on
 the .send() part of the generator protocol.
 
 As an aside you shouldn't really write code that uses a global in that
 way.. it'll end up biting you eventually.
 
 Anyway... we can speculate endlessly about non-existent language
 constructs, but I think we've probably done this one to death.
 -- 


Maybe no new language construct is needed:
just define that x.send() revives a generator.

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: revive a generator

2011-10-21 Thread Yingjie Lan

- Original Message -
 From: Chris Angelico ros...@gmail.com
 To: python-list@python.org
 Cc: 
 Sent: Friday, October 21, 2011 4:27 PM
 Subject: Re: revive a generator

 On Fri, Oct 21, 2011 at 7:02 PM, Yingjie Lan lany...@yahoo.com wrote:
  What if the generator involves a variable from another scope,
  and before re-generating, the variable changed its value.
  Also, the generator could be passed in as an argument,
  so that we don't know its exact expression.

 There's actually no way to know that the generator's even
 deterministic. Try this, for instance:

  g=(input(Enter value %d or blank to stop: %n) for n in 
 range(1,11))
  for s in g:
     if not s: break
     print(Processing input: +s)

 It may not be particularly useful, but it's certainly legal. And this
 generator cannot viably be restarted. 

Depends on what you want. If you want ten more inputs from user,
reviving this generator is certainly a good thing to do.

 The only way is to cast it to
 list first, but that doesn't work when you have to stop reading
 expressions from the generator part way.

 What you could perhaps do is wrap the generator in something that
 saves its values:

  class restartable(object):
     def __init__(self,gen):
         self.gen=gen
         self.yielded=[]
         self.iter=iter(self.yielded)
     def restart(self):
         self.iter=iter(self.yielded)
     def __iter__(self):
         return self
     def __next__(self): # Remove the underscores for Python 2
         try:
             return self.iter.__next__()
         except StopIteration:
             pass
         ret=self.gen.__next__()
         self.yielded.append(ret)
         return ret

  h=restartable(g)
  for i in h:
     if not i: break
     print(Using: ,i)
  h.restart()
  for i in h:
     if not i: break
     print(Using: ,i)

 Complicated, but what this does is returns a value from its saved list
 if there is one, otherwise returns a value from the original
 generator. It can be restarted as many times as necessary, and any
 time you read past the end of where you've read so far, the 
 original
 generator will be called upon.

 Actually, this model might be useful for a repeatable random-number
 generator. But those are more efficiently restarted by means of
 reseeding the PRNG.

Sure. Or you would like to have the next few random numbers with 
the same PRNG. 

These two cases seem to be strong use cases for reviving a generator.

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

revive a generator

2011-10-20 Thread Yingjie Lan

Hi,

it seems a generator expression can be used only once:

 g = (x*x for x in range(3))
 for x in g: print x
0
1
4
 for x in g: print x #nothing printed


Is there any way to revive g here?

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

compare range objects

2011-10-20 Thread Yingjie Lan

Hi, 

Is it possible to test if two range objects contain the same sequence of 
integers by the following algorithm in Python 3.2?

1. standardize the ending bound by letting it be the first excluded integer for 
the given step size.
2. compare the standardized starting bound, ending bound and step size: two 
ranges equal if and only if this triplet is the same.

If that's correct, it would be good to have equality comparison on two ranges. 

Further, it might also be good to have sub-sequence test on ranges without 
enumerating it.

Cheers,

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: revive a generator

2011-10-20 Thread Yingjie Lan

- Original Message -
 From: Paul Rudin paul.nos...@rudin.co.uk
 To: python-list@python.org
 Cc: 
 Sent: Thursday, October 20, 2011 10:28 PM
 Subject: Re: revive a generator

 Yingjie Lan lany...@yahoo.com writes:

  Hi,

  it seems a generator expression can be used only once:

  g = (x*x for x in range(3))
  for x in g: print x
  0
  1
  4
  for x in g: print x #nothing printed

  Is there any way to revive g here?

 Generators are like that - you consume them until they run out of
 values. You could have done [x*x for x in range(3)] and then iterated
 over that list as many times as you wanted.

 A generator doesn't have to remember all the values it generates so it
 can be more memory efficient that a list. Also it can, for example,
 generate an infinite sequence.

Thanks a lot to all who answered my question. 
I am still not sure why should we enforce that 
a generator can not be reused after an explicit 
request to revive it?
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: compare range objects

2011-10-20 Thread Yingjie Lan





- Original Message -
 From: Westley Martínez aniko...@gmail.com
 To: python-list@python.org
 Cc: 
 Sent: Friday, October 21, 2011 12:22 AM
 Subject: Re: compare range objects
 
 There's already a discussion about this on python-ideas.  But somebody
 please tell me, why would you ever need to compare ranges?


In simulation, one can use range objects to denote a discrete domain,
and domain comparison could be very useful. Not just equality, but also
things like if one domain is contained in another.
-- 
http://mail.python.org/mailman/listinfo/python-list

strange comparison result with 'is'

2011-10-17 Thread Yingjie Lan

Hi all, 

This is quite strange when I used the Python shell with IDLE:

 x = []
 id(getattr(x, 'pop')) == id(x.pop)

True
 getattr(x, 'pop') is x.pop
False
 

I suppose since the two things have the same id, the 'is'-test 
should give a True value, but I get a False value. 

Any particular reason for breaking this test?
I am quite confused as I show this before a large 
audience only to find the result denies my prediction.

The python version is 3.2.2; I am not sure about other versions.

Regards,

Yingjie-- 
http://mail.python.org/mailman/listinfo/python-list

Re: How to test if object is an integer?

2011-10-17 Thread Yingjie Lan





- Original Message -
 From: Noah Hall enali...@gmail.com
 To: MrPink tdsimp...@gmail.com
 Cc: python-list@python.org
 Sent: Tuesday, October 18, 2011 4:44 AM
 Subject: Re: How to test if object is an integer?

 There's the isdigit method, for example -
 
  str = 1324325
  str.isdigit()
 True
  str = 1232.34
  str.isdigit()
 False
  str = I am a string, not an int!
  str.isdigit()
 False


There are some corner cases to be considered with this approach:
1. negative integers: '-3'
2. strings starting with '0': '03'
3. strings starting with one '+': '+3'
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: [Python-ideas] allow line break at operators

2011-09-04 Thread Yingjie Lan



Thanks, I think that's the rule described in its full glory. Currently I am not 
 quite sure of the use case for continuation nested in continuation -- it seems 
to be still a single continuation, but it allows for some additional freedom in 
formatting the continued line. Do you have other use cases for that?


For the case of mis-indentation, as demonstrated in your scenario, I think it 
is better that the rule is not applied to a comment continued onto the next 
line. The effect of a '#' only carries to the end of a line, if one would like 
the next line to be a comment, just use another '#'. It remains to consider a 
mis-indentation that only involves code lines. However, that is not a new 
problem, so we should not worry (it is like a sunken cost). 

Sorry for top posting. It seems yahoo web email is not designed with such a use 
case in mind.



From: MRAB pyt...@mrabarnett.plus.com
To: python-list@python.org
Sent: Sunday, September 4, 2011 10:04 AM
Subject: Re: [Python-ideas] allow line break at operators

On 04/09/2011 00:22, Yingjie Lan wrote:
  Every language with blocks needs some mechanism to indicate the
 beginning and ending of blocks and of statements within blocks. If
 visible fences ('begin/end' or '{}') and statement terminators (';') are
 used, then '\n' can be treated as merely a space, as it is in C, for
 instance.
  and it uses unescaped '\n' (with two escapement options) to terminate
 statements. This is fundamental to Python's design and goes along with
 significant indents.

 Agreed. Currently indentation in Python starts a new block, but if you
 view it from the perspective of line breaking, it also functions as if
 the line is continued. The line of code below

 if condition: do_a(); do_b()

 can be written as:

 if condition: #line breaks
 do_a(); # ';' is optional here
 do_b() # continue

 That indentation can be also employed for line breaking is quite evident
 to me. During the open email correspondence with Stephen, it seems to be
 a tenable point.

   There would then be three ways to escape newline, with one doing
 double duty. And for what? Merely to avoid using either of the two
 methods already available.

 I believe the other two ways are not as good as this new way. As the
 proposal is fully backward compatible, people may choose whatever way
 they prefer.

I think that the rules would be:

If a line ends with a colon and the next line is indented, then it's
the start of a block, and the following lines which belong to that
block have the same indent.

If a line doesn't end with a colon but the next line is indented, then
it's the start of a continuation, and the following lines which belong
to that continuation have the same indent.

In both cases there could be blocks nested in blocks and possibly
continuations nested in continuations, as well as blocks nested in
continuations and continuations nested in blocks.

I'm not sure what the effect would be if you had mis-indented lines.
For example, if a line was accidentally indented after a comment, then
it would be treated as part of the comment. It's in cases like those
that syntax colouring would be helpful. It would be a good idea to use
an editor which could indicate in some way when a line is a
continuation.

     
     *From:* Terry Reedy tjre...@udel.edu
     *To:* python-list@python.org
     *Cc:* python-id...@python.org
     *Sent:* Sunday, September 4, 2011 3:01 AM
     *Subject:* Re: [Python-ideas] allow line break at operators

     On 9/3/2011 3:51 AM, Yingjie Lan wrote:
       I agree that long lines of code are not very common in many projects,
       though it might be the case with some heavily involved in math.
     For some
       reason, when the feature of free line breaking came about in computer
       languages, it is welcomed and generally well accepted.

     Every language with blocks needs some mechanism to indicate the
     beginning and ending of blocks and of statements within blocks. If
     visible fences ('begin/end' or '{}') and statement terminators (';')
     are used, then '\n' can be treated as merely a space, as it is in C,
     for instance.

       Python uses indentation for blocks,

     and it uses unescaped '\n' (with two escapement options) to
     terminate statements. This is fundamental to Python's design and
     goes along with significant indents.

       and by the same mechanism, line breaking can be
       accommodated without requiring parenthesis or ending backslashes.

     You need proof for your claim that indentation can be used for both
     jobs in the form of a grammar that works with Python's parser. I am
     dubious that you can do that with an indents *after* the newline.

     Even if you could, it would be confusing for human readers. There
     would then be three ways to escape newline, with one doing double
     duty. And for what? Merely

Re: [Python-ideas] allow line break at operators

2011-09-03 Thread Yingjie Lan

I agree that long lines of code are not very common in many projects, though it 
might be the case with some heavily involved in math. For some reason, when the 
feature of free line breaking came about in computer languages, it is welcomed 
and generally well accepted. Python uses indentation for blocks, and by the 
same mechanism, line breaking can be accommodated without requiring parenthesis 
or ending backslashes.

For the tuning, yes, people would disagree on how to split expressions/code. 
The continue-by-indentation would allow people to break a line in whatever way 
that pleases their aesthetic taste, as long as there is an indentation. Most 
people seems to like an indentation on the continuing lines, probably for a 
visual indication of a continuation line. Some general guidelines may be 
provided, but there is no need for other hard rules on breaking lines, except 
that an identifier should never be split apart.

For the implementation, I don't have much clue. At least on the parser, it 
needs to look beyond the linefeed to determine if a line is completed. If the 
indentation is defined as a single symbol, then it would only require a 
one-step look-ahead, and that should not be hard.

Again, my apology for top posting.




From: Stephen J. Turnbull step...@xemacs.org
To: Yingjie Lan lany...@yahoo.com
Cc: Gabriel AHTUNE gaht...@gmail.com; Matt Joiner anacro...@gmail.com; 
python-list@python.org python-list@python.org; python-ideas 
python-id...@python.org
Sent: Saturday, September 3, 2011 2:10 PM
Subject: Re: [Python-ideas] allow line break at operators

Yingjie Lan writes:

 Have you considered line continuation by indentation? It seems to
 meet the design principle. I think it is the most natural way to
 allow free line breaking in Python.

Briefly, yes, and I think it would need a lot of tuning and probably
complex rules.  Unlike statements, where everybody (except the judges
of the Obfuscated C Contest) agrees on a simple rule: In a control
structure, the controlled suite should be uniformly indented one
level, line breaking and indentation of long expressions is an art,
and people have different opinions on readability and beauty.
Achieving a compromise that is workable even for a few major styles is
likely to be annoying and bug-prone.

Pretty much every program I write seems to have a continued list of
data or a multi-line dictionary display as data.  It's not unusual for
me to comment the formal arguments in a function definition, or the
parent classes of a class definition.  The exception for parenthesized
objects is something I depend on for what I consider good style.  Of
course I could use explicit continuation, but in a long table that's
ugly and error-prone.

Long expressions that need to be broken across lines, on the other
hand, often indication that I haven't thought carefully enough about
that component of the program, and an extra pair of parentheses or a
terminal backslash just isn't that heavy or ugly in the context of
such long expressions.  For me, they're also pretty rare; many
programs I write have no explicit continuations in them at all.

YMMV, of course, but I find the compromise that Python arrived at to
be very useful, and I must suppose that it was substantially easier to
implement than fully free line breaking (whatever that means to you).


-- 
http://mail.python.org/mailman/listinfo/python-list

Re: [Python-ideas] allow line break at operators

2011-09-03 Thread Yingjie Lan

Ambiguity: yes, when the last line of a suite is a continued line, it would 
require double dedentations to end the line and the suite. I noticed a similar 
case in current Python language as well:

==
#BEGIN CODE 1
if condition:
for i in range(5):
triangulate(i)
else: #double dedentations
for body in space:
triangulate(body)
#double dedentations again
log('triangulation done')
#END CODE 1
==



If lines can be continued by indentation, similar situation would rise:

==
#BEGIN CODE 2
if condition:
result = [sin(i) for i in range(5)]
+ [cos(i) for i in range(5)]
else:

result = [cos(i) for i in range(5)]
+ [sin(i) for i in range(5)]

log('triangulation done')
#END CODE 2
==


Generating text example: right, this is a case that can't be handled by 
standard indentation, unless we only consider full dedentation (dedentation to 
the exact level of the initial indentation) as the signal of ending the line. 
Whether to accommodate for such a case might be an issue of debate, but at 
least we can have such 'freedom' :)





From: Stephen J. Turnbull step...@xemacs.org
To: Yingjie Lan lany...@yahoo.com
Cc: Gabriel AHTUNE gaht...@gmail.com; Matt Joiner anacro...@gmail.com; 
python-ideas python-id...@python.org
Sent: Saturday, September 3, 2011 5:29 PM
Subject: Re: [Python-ideas] allow line break at operators

Yingjie Lan writes:

 Python uses indentation for blocks, and by the same mechanism, line
 breaking can be accommodated without requiring parenthesis or
 ending backslashes.

Possibly, but now you have a problem that a dedent has ambiguous
meaning.  It might mean that you're ending a suite, or it might mean
you're ending a continued expression.  This probably can be
disambiguated, but I don't know how easy that will be to do perfectly,
including in reporting ill-formed programs.

 Most people seems to like an indentation on the continuing lines,

Most of the time, yes, but sometimes not.  For example, in generating
text, it's often useful to dedent substantially so you can have a
nearly normal length line in the literal strings being concatenated.
Or you might have a pattern like this:

    x = long_named_variable_a
            - long_named_variable_a_base
        + long_named_variable_b
            - long_named_variable_b_base

which your parser would raise an error on, I presume.  That's not
freedom!wink /



-- 
http://mail.python.org/mailman/listinfo/python-list

Re: [Python-ideas] allow line break at operators

2011-09-03 Thread Yingjie Lan

Oops, the generating text part of my reply is referring to your last code 
example. For literal texts, it is is not governed by this proposal, nor are 
expressions within brackets and backslash continued lines. In a word, this 
proposal is fully backward compatible.

From: Yingjie Lan lany...@yahoo.com
To: Stephen J. Turnbull step...@xemacs.org
Cc: python list python-list@python.org; Gabriel AHTUNE gaht...@gmail.com; 
python-ideas python-id...@python.org; Matt Joiner anacro...@gmail.com
Sent: Saturday, September 3, 2011 6:33 PM
Subject: Re: [Python-ideas] allow line break at operators

Ambiguity: yes, when the last line of a suite is a continued line, it would 
require double dedentations to end the line and the suite. I noticed a similar 
case in current Python language as well:

==
#BEGIN CODE 1
if condition:
for i in range(5):
triangulate(i)
else: #double dedentations
for body in space:
triangulate(body)
#double dedentations again
log('triangulation done')
#END CODE 1
==

If lines can be continued by indentation, similar situation would rise:

==
#BEGIN CODE 2
if condition:
result = [sin(i) for i in range(5)]
+ [cos(i) for i in range(5)]
else:

result = [cos(i) for i in range(5)]
+ [sin(i) for i in range(5)]

log('triangulation done')
#END CODE 2
==

Generating text example: right, this is a case that can't be handled by 
standard indentation, unless we only consider full dedentation (dedentation to 
the exact level of the initial indentation) as the signal of ending the line. 
Whether to accommodate for such a case might be an issue of debate, but at 
least we can have such 'freedom' :)

From: Stephen J. Turnbull step...@xemacs.org
To: Yingjie Lan lany...@yahoo.com
Cc: Gabriel AHTUNE gaht...@gmail.com; Matt Joiner anacro...@gmail.com; 
python-ideas python-id...@python.org
Sent: Saturday, September 3, 2011 5:29 PM
Subject: Re: [Python-ideas] allow line break at operators

Yingjie Lan writes:

 Python uses indentation for blocks, and by the same mechanism, line
 breaking can be accommodated without requiring parenthesis or
 ending backslashes.

Possibly, but now you have a problem that a dedent has ambiguous
meaning.  It might mean that you're ending a suite, or it might mean
you're ending a continued expression.  This probably can be
disambiguated, but I don't know how easy that will be to do
 perfectly,
including in reporting ill-formed programs.

 Most people seems to like an indentation on the continuing lines,

Most of the time, yes, but sometimes not.  For example, in generating
text, it's often useful to dedent substantially so you can have a
nearly normal length line in the literal strings being concatenated.
Or you might have a pattern like this:

    x = long_named_variable_a
            - long_named_variable_a_base
        + long_named_variable_b
            - long_named_variable_b_base

which your parser would raise an error on, I presume.  That's not
freedom!wink /

-- 
http://mail.python.org/mailman/listinfo/python-list

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: [Python-ideas] allow line break at operators

2011-09-03 Thread Yingjie Lan

 Every language with blocks needs some mechanism to indicate the beginning and 
ending of blocks and of statements within blocks. If visible fences 
('begin/end' or '{}') and statement terminators (';') are used, then '\n' can 
be treated as merely a space, as it is in C, for instance. 

 and it uses unescaped '\n' (with two escapement options) to terminate 
statements. This is fundamental to Python's design and goes along with 
significant indents.

Agreed. Currently indentation in Python starts a new block, but if you view it 
from the perspective of line breaking, it also functions as if the line is 
continued. The line of code below

if condition: do_a(); do_b()

can be  written as:

if condition: #line breaks
do_a(); # ';' is optional here 
do_b() # continue

That indentation can be also employed for line breaking is quite evident to me. 
During the open email correspondence with Stephen, it seems to be a tenable 
point. 

 There would then be three ways to escape newline, with one doing double duty. 
And for what? Merely to avoid using either of the two methods already 
available.

I believe the other two ways are not as good as this new way. As the proposal 
is fully backward compatible, people may choose whatever way they prefer. 



From: Terry Reedy tjre...@udel.edu
To: python-list@python.org
Cc: python-id...@python.org
Sent: Sunday, September 4, 2011 3:01 AM
Subject: Re: [Python-ideas] allow line break at operators

On 9/3/2011 3:51 AM, Yingjie Lan wrote:
 I agree that long lines of code are not very common in many projects,
 though it might be the case with some heavily involved in math. For some
 reason, when the feature of free line breaking came about in computer
 languages, it is welcomed and generally well accepted.

Every language with blocks needs some mechanism to indicate the beginning and 
ending of blocks and of statements within blocks. If visible fences 
('begin/end' or '{}') and statement terminators (';') are used, then '\n' can 
be treated as merely a space, as it is in C, for instance.

 Python uses indentation for blocks,

and it uses unescaped '\n' (with two escapement options) to terminate 
statements. This is fundamental to Python's design and goes along with 
significant indents.

 and by the same mechanism, line breaking can be
 accommodated without requiring parenthesis or ending backslashes.

You need proof for your claim that indentation can be used for both jobs in 
the form of a grammar that works with Python's parser. I am dubious that you 
can do that with an indents *after* the newline.

Even if you could, it would be confusing for human readers. There would then 
be three ways to escape newline, with one doing double duty. And for what? 
Merely to avoid using either of the two methods already available.

-- Terry Jan Reedy

-- http://mail.python.org/mailman/listinfo/python-list


-- 
http://mail.python.org/mailman/listinfo/python-list

Re: [Python-ideas] allow line break at operators

2011-09-02 Thread Yingjie Lan

Hi Gabriel,

==
From: Gabriel AHTUNE gaht...@gmail.com
Subject: Re: [Python-ideas] allow line break at operators

So can be done with this syntax:

 x = firstpart * secondpart  +  #line breaks here
 anotherpart + #continue 
 stillanother #continue on.

after a + operator the line is clearly not finished yet.

Gabriel AHTUNE

==

That's good to me too, which I proposed early in this thread.
Then somebody would like to have the operator in the
beginning of the next line so that it would stand out.Then still another one 
said that indentation is good here.
So I finally proposed line continuation with indentation.
Since this is Python, we will live with indentation.
Currently indentation in Python starts a new block,
but if you view it from the perspective of line breaking,
it also function as if the line is continued. The line

if condition: do_a(); do_b()

can be  written as:

if condition: #line breaks
do_a(); # ';' is optional here 
do_b() # continue

Sounds a pretty natural way to allow free line breaking.

Yingjie-- 
http://mail.python.org/mailman/listinfo/python-list

Re: [Python-ideas] allow line break at operators

2011-09-02 Thread Yingjie Lan

Yeah, that might be a challenge for the Python interpreter, for it has to check 
if the next line is indented or not. But it might be worthwhile to take this 
trouble, so that the coder has more freedom, and the code is hopefully better 
to read.



From: Matt Joiner anacro...@gmail.com
To: Yingjie Lan lany...@yahoo.com
Cc: python-list@python.org python-list@python.org; python-ideas 
python-id...@python.org
Sent: Friday, September 2, 2011 1:33 PM
Subject: Re: [Python-ideas] allow line break at operators

I guess the issue here is that you can't tell if an expression is
complete without checking the indent of the following line. This is
likely not desirable.

On Thu, Sep 1, 2011 at 11:43 PM, Yingjie Lan lany...@yahoo.com wrote:
 Hi Matt,
 ===
 From: Matt Joiner anacro...@gmail.com

 The trailing \ workaround is nonobvious. Wrapping in () is noisy and
 already heavily used by other syntactical structures.
 ===
 How about only require indentation
 to freely break lines? Here is an example:
 x = firstpart * secondpart #line breaks here
 + anotherpart #continue by indentation
 + stillanother #continue on.
 #until here, another line starts by dedentation
 y = some_expression - another_one
 All this would be completely compatible with former code, while
 having almost free line breaking! Plus, indentation makes it pretty.
 Really hope Python can have freedom in breaking lines.
 Yingjie-- 
http://mail.python.org/mailman/listinfo/python-list

Re: [Python-ideas] allow line break at operators

2011-09-02 Thread Yingjie Lan

Have you considered line continuation by indentation? It seems to meet the 
design principle. I think it is the most natural way to allow free line 
breaking in Python.

(Sorry, the yahoo web email interface is so weird that I don't know how to 
format comments between the quoted message below.)





From: Stephen J. Turnbull step...@xemacs.org
To: Gabriel AHTUNE gaht...@gmail.com
Cc: Matt Joiner anacro...@gmail.com; python-list@python.org 
python-list@python.org; python-ideas python-id...@python.org; Yingjie Lan 
lany...@yahoo.com
Sent: Friday, September 2, 2011 3:28 PM
Subject: Re: [Python-ideas] allow line break at operators

Gabriel AHTUNE writes:
 So can be done with this syntax:
 
  x = firstpart * secondpart  +  #line breaks here
  anotherpart + #continue
  stillanother #continue on.
 
 after a + operator the line is clearly not finished yet.

Sure, but IIRC one design principle of Python is that the keyword that
denotes the syntax should be the first thing on the line, making it
easy to scan down the left side of the code to see the syntactic
structure.  The required indentation of the controlled suite also
helps emphasize that keyword.

Analogously, if operators are going to denote continuation, they
should come first on the line.



I just don't think this idea is going anywhere.  Explicit continuation
with backslash or implicit continuation of parenthesized expressions
is just not that heavy a price to pay.  Perhaps historically some of
these ideas could have been implemented, but now they're just going to
confuse a host of editors and code analysis tools.

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: [Python-ideas] allow line break at operators

2011-09-01 Thread Yingjie Lan

Hi Matt,

===
From: Matt Joiner anacro...@gmail.com


The trailing \ workaround is nonobvious. Wrapping in () is noisy and
already heavily used by other syntactical structures. 

===

How about only require indentation
to freely break lines? Here is an example:

x = firstpart * secondpart #line breaks here
+ anotherpart #continue by indentation
+ stillanother #continue on.
#until here, another line starts by dedentation 
y = some_expression - another_one

All this would be completely compatible with former code, while
having almost free line breaking! Plus, indentation makes it pretty.

Really hope Python can have freedom in breaking lines.

Yingjie-- 
http://mail.python.org/mailman/listinfo/python-list

Re: allow line break at operators

2011-08-12 Thread Yingjie Lan

 :The trouble of dealing with long lines can be avoided by a smart

 :editor. It's called line wrap.
 
 Yeah, usually they just wrap it pretty arbitrarily,
 and you don't have any control, isn't it?

:umm... besides notepad pretty much any other serious programmer editor 
:program try to do its best to deal with line wrap: the minimal I found is 
:the wrapped line is indented at the same level of the flow, but I found 
:editors where you can specify what to do (generally something like indent 
:the wrapped part 2 levels or something like that)

Thanks for sharing that, which I am not quite aware of . BTW, do you think
things like eclipse, emacs and vim also has such kind of functionality? 
Best of all, would certainly like to have IDLE have it, as I am teaching 
Python and would like to get them to start with a simple environment.

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: allow line break at operators

2011-08-12 Thread Yingjie Lan






From: Vito 'ZeD' De Tullio zak.mc.kra...@libero.it

:umm... besides notepad pretty much any other serious programmer editor 
:program try to do its best to deal with line wrap: the minimal I found is 
:the wrapped line is indented at the same level of the flow, but I found 
:editors where you can specify what to do (generally something like indent 
:the wrapped part 2 levels or something like that)

Well, even if one editor can do perfect line wrapping, breaking 
the line at places perfectly pleasing to the eye (put aside the
fact that sometimes the line breaking could be based on the
meaning of the code), it is unlikely to be cross-editor consistent. 

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: allow line break at operators

2011-08-11 Thread Yingjie Lan

From: Michael Trausch fd0...@gmail.com
To: Yingjie Lan lany...@yahoo.com
Cc: Chris Angelico ros...@gmail.com; python-list@python.org 
python-list@python.org
Sent: Thursday, August 11, 2011 12:51 PM
Subject: Re: allow line break at operators

 Perhaps it could be made an optional thing to enable; for example, some 
 languages by default do dynamic typing, but with an option contained as the 
 first statement of the file can enforce static typing.

That is a brilliant idea! Python code can specify encoding in the beginning, we 
might use another similar line to opt in for that kind of language features.

Once in that ';'-required mode, the trouble of typing a ';' at end of almost 
every line can be easily avoided by a smart editor: 

1) when a single 'Return' key is hit, and the line is not ending in ':' or ';' 
(white spaces and comments discarded), automatically append a ';' to the end of 
the line.
2) to continue the line to the next line, hit Shift+Enter, then no ';' will 
be appended to the line.

Cheers,

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: allow line break at operators

2011-08-11 Thread Yingjie Lan

From: Steven D'Aprano steve+comp.lang.pyt...@pearwood.info
To: python-list@python.org
Sent: Thursday, August 11, 2011 12:18 PM
Subject: Re: allow line break at operators

On Thu, 11 Aug 2011 12:52 pm Yingjie Lan wrote:

 :And if we require {} then truly free indentation should be OK too! But

 :it wouldn't be Python any more.

 Of course, but not the case with ';'. Currently ';' is optional in Python,
 But '{' is used for dicts. Clearly, ';' and '{' are different in
 magnitude.

 So the decision is: shall we change ';' from optional to mandatory
 to allow free line splitting?

:Why on earth would you want to break backwards compatibility of millions of
:Python scripts and programs, and require extra, unnecessary line-noise on
:every single line of Python code, just so that you can occasionally avoid a
:writing a pair of parentheses?

I think allowing free line splitting (without parentheses -- that's artifitial 
and 
requires the coder to serve the machine rather than the other way around)
with proper indentation will produce truely ergonomic code layout (well,
assuming you also like properly indented code). 

And this can be done almost hassle-free for the coder. 

The compatibility problem can be solved by something like a preprocessor 
indicator to opt in this new language feature, or, if we are determined to favor
this new feature, all old code can be easily converted. The worst case is
that the coder has to opt in.

The trouble of adding a ';' to most of the lines can also be 
avoided by a smart editor (see my other reply).

Python serves the coder, not the other way around. 

Cheers,

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: allow line break at operators

2011-08-11 Thread Yingjie Lan

From: Chris Rebert c...@rebertia.com
To: Yingjie Lan lany...@yahoo.com
Cc: python-list@python.org python-list@python.org
Sent: Thursday, August 11, 2011 3:50 PM
Subject: Re: allow line break at operators

On Thu, Aug 11, 2011 at 12:24 AM, Yingjie Lan lany...@yahoo.com wrote:
 From: Steven D'Aprano steve+comp.lang.pyt...@pearwood.info
 On Thu, 11 Aug 2011 12:52 pm Yingjie Lan wrote:

 :And if we require {} then truly free indentation should be OK too! But

 :it wouldn't be Python any more.

 Of course, but not the case with ';'. Currently ';' is optional in Python,
 But '{' is used for dicts. Clearly, ';' and '{' are different in
 magnitude.

 So the decision is: shall we change ';' from optional to mandatory
 to allow free line splitting?

 :Why on earth would you want to break backwards compatibility of millions of
 :Python scripts and programs, and require extra, unnecessary line-noise on
 :every single line of Python code, just so that you can occasionally avoid a
 :writing a pair of parentheses?

 I think allowing free line splitting (without parentheses -- that's
 artifitial and
 requires the coder to serve the machine rather than the other way around)
 with proper indentation will produce truely ergonomic code layout (well,
 assuming you also like properly indented code).

 And this can be done almost hassle-free for the coder.
snip
 The trouble of adding a ';' to most of the lines can also be
 avoided by a smart editor (see my other reply).

:The trouble of dealing with long lines can be avoided by a smart
:editor. It's called line wrap.

Yeah, usually they just wrap it pretty arbitrarily, 
and you don't have any control, isn't it?

Cheers,

Yingjie-- 
http://mail.python.org/mailman/listinfo/python-list

Re: allow line break at operators

2011-08-10 Thread Yingjie Lan

On Tue, Aug 9, 2011 at 9:42 PM, Yingjie Lan lany...@yahoo.com wrote:

 Hi all,

 When writing a long expresion, one usually would like to break it into 
 multiple lines. Currently, you may use a '\' to do so, but it looks a little 
 awkward (more like machine-oriented thing). Therefore I start wondering why 
 not allow line breaking at an operator, which is the standard way of breaking 
 a long expression in publication? Here is an example:

 #the old way

 x = 1+2+3+4+\
   1+2+3+4

 #the new way
 x = 1+2+3+4+ #line continues as it is clearly unfinished

   1+2+3+4

:# the currently allowed way
:x = (1+2+3+4+
:    1+2+3+4)
:# note the parentheses
:
:I think this is sufficient.

That works, but not in the most natural way--the way people are customed 
to...why require a pair of parenthis when we can do without them? Also, the new 
way does not affect the old ways of doing things at all, it is fully backward 
compatible. So this just offers a new choice.

 Of course, the dot operator is also included, which may facilitate method 
 chaining:

 x = svg.append( 'circle' ).
   r(2).cx(1).xy(1).
   foreground('black').bkground('white')

:Also, I dislike this for the dot operator especially, as it can
:obscure whether a method call or a function call is taking place.

Again, this only offers a new choice, and does not force anybody to do it this 
way.

cheers,

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: allow line break at operators

2011-08-10 Thread Yingjie Lan

 On Wed, Aug 10, 2011 at 10:56 AM, Dan Sommers d...@tombstonezero.net

 wrote: 
 In terms of easier to read, I find code easier to read when the
 operators are at the beginnings of the lines (PEP 8 notwithstanding):

    x = (someobject.somemethod(object3, thing)
         + longfunctionname(object2)
         + otherfunction(value1, value2, value3))

 
 Without the parentheses, this is legal but (probably) useless; it
 applies the unary + operator to the return value of those functions.

:No, in some other languages it might be legal, but this is Python.
:without the parentheses it is a syntax error.

This discussion leads me to this question:

Is it possible for python to allow free splitting of single-line statements
without the backslashes, if we impose that expressions can only be split
when it is not yet a finished expression? Note: splitting before closing 
parenthis, brace, or bracket can be viewed as special case of this 
more general rule.


Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: allow line break at operators

2011-08-10 Thread Yingjie Lan

 In terms of easier to read, I find code easier to read when the

 operators are at the beginnings of the lines (PEP 8 notwithstanding):

    x = (someobject.somemethod(object3, thing)
         + longfunctionname(object2)
         + otherfunction(value1, value2, value3))

 
 Without the parentheses, this is legal but (probably) useless; it
 applies the unary + operator to the return value of those functions.

If ';' are employed (required), truely free line-splitting should be OK, 
the operators may appear at the beginnings of the lines as you wish.

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: allow line break at operators

2011-08-10 Thread Yingjie Lan

:And if we require {} then truly free indentation should be OK too! But

:it wouldn't be Python any more.

Of course, but not the case with ';'. Currently ';' is optional in Python,
But '{' is used for dicts. Clearly, ';' and '{' are different in magnitude.

So the decision is: shall we change ';' from optional to mandatory
to allow free line splitting?

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: allow line break at operators

2011-08-10 Thread Yingjie Lan

On Wed, Aug 10, 2011 at 1:58 PM, Yingjie Lan lany...@yahoo.com wrote:

 Is it possible for python to allow free splitting of single-line statements
 without the backslashes, if we impose that expressions can only be split
 when it is not yet a finished expression?

:The trouble is that in a lot of cases, the next statement after an
:unfinished expression could conceivably be a continuation of it. If
:this were permitted, it would have to also require that the
:continuation lines be indented, to avoid the problem described above.
:It'd still have the potential to mis-diagnose errors, though.

Indentation is a good idea to reduce the likelihood of such troubles.
and also produce code that is easier to read.

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

allow line break at operators

2011-08-09 Thread Yingjie Lan

Hi all,

When writing a long expresion, one usually would like to break it into multiple 
lines. Currently, you may use a '\' to do so, but it looks a little awkward 
(more like machine-oriented thing). Therefore I start wondering why not allow 
line breaking at an operator, which is the standard way of breaking a long 
expression in publication? Here is an example:

#the old way

x = 1+2+3+4+\
  1+2+3+4

#the new way
x = 1+2+3+4+ #line continues as it is clearly unfinished

  1+2+3+4

Of course, the dot operator is also included, which may facilitate method 
chaining:

x = svg.append( 'circle' ).
  r(2).cx(1).xy(1).
  foreground('black').bkground('white')

Thoughts?

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

anonymous function with multiple statements

2011-07-10 Thread Yingjie Lan

Hi all, 

I wonder if Python provides a way to define anonymous functions containing 
multiple statements? With lambda form, we can only define a function of a 
single 
expression. In Javascript, it is possible to define 
a full-fledged anonymous functions, which suggests it is useful to have it. In 
Python, it might be like this:


#==

def test( fn, K ): 
return sum(fn(i) for i in range(K))/K

test( def (i): #indent from start of this line, not from 'def'
from math import sin
#the last statement ends with a ',' or ')'.
return sin(i)+i*i;, 100) #';' to mark end of statement 

#another way:
test( def (i): #indent from start of this line, not from 'def'
from math import sin
return sin(i)+i*i
  , 100) #freely place ',' anywhere

#===

Any thoughts?

Yingjie
-- 
http://mail.python.org/mailman/listinfo/python-list

having both dynamic and static variables

2011-03-02 Thread Yingjie Lan

Hi everyone,

Variables in Python are resolved dynamically at runtime, which comes at a 
performance cost. However, a lot of times we don't need that feature. Variables 
can be determined at compile time, which should boost up speed. Therefore, I 
wonder if it is a good idea to have static variables as well. So at compile 
time, a variable is determined to be either static  or dynamic (the reference 
of 
a static varialbe is determined at compile time -- the namespace implementation 
will consist of two parts, a tuple for static variables and a dict for dynamic 
ones). The resolution can be done at the second pass of compilation. By 
default, 
variables are considered static. A variables is determined dynamic when: 1. it 
is declared dynamic; 2. it is not defined locally and the nearest namespace has 
it declared dynamic. A static variable can't be deleted, so a deleted variable 
must be a dynamic one: we can either enforce that the variable must be 
explicitly declared or allow a del statement to implicitly declare a dynamic 
variable.

Any thoughts?

Yingjie



  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: having both dynamic and static variables

2011-03-02 Thread Yingjie Lan

- Original Message 

From: Steven D'Aprano steve+comp.lang.pyt...@pearwood.info
To: python-list@python.org
Sent: Thu, March 3, 2011 1:27:01 PM
Subject: Re: having both dynamic and static variables

On Wed, 02 Mar 2011 19:45:16 -0800, Yingjie Lan wrote:

 Hi everyone,

 Variables in Python are resolved dynamically at runtime, which comes at
 a performance cost. However, a lot of times we don't need that feature.
 Variables can be determined at compile time, which should boost up
 speed.

:This is a very promising approach taken by a number of projects.

Thanks, that's good to know -- so people are doing this already!

:Finally, Python 3 introduced type annotations, which are currently a 
:feature looking for a reason.

I have a use for that feature -- I have a project that help build the 
scaffold for people to extend CPython. See http://expy.sf.net/
It is only good for Python 2 at this moment.

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

the C header file when extending CPython

2011-01-11 Thread Yingjie Lan

Hi, 

I am wondering when extending Python (CPython), what should be put into the C 
header file? Any guidelines?

Thanks,

Yingjie



  
-- 
http://mail.python.org/mailman/listinfo/python-list

group 0 in the re module

2010-12-07 Thread Yingjie Lan

Hi, 

According to the doc, group(0) is the entire match.

 m = re.match(r(\w+) (\w+), Isaac Newton, physicist) 
 m.group(0) # The entire match 'Isaac Newton' 

But if you do this:
 import re
 re.sub(r'(\d{3})(\d{3})', r'\0 to \1-\2', '757234')
'\x00 to 757-234'

where I expected
'757234 to 757-234'

Then I found that in python re '\0' is considered an octal number.
So, is there anyway to refer to the entire match by an escaped
notation?

Thanks,

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: group 0 in the re module

2010-12-07 Thread Yingjie Lan

: Use \g0.


Thanks! 

Though I wish all \1, \2, ..., should also be forbidden.
Such a mixture of things looks like a patch work.

No offense meant.

Yingjie



  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: regular expression help

2010-11-29 Thread Yingjie Lan

--- On Tue, 11/30/10, goldtech goldt...@worldpost.com wrote:

 From: goldtech goldt...@worldpost.com
 Subject: regular expression help
 To: python-list@python.org
 Date: Tuesday, November 30, 2010, 9:17 AM
 The regex is eating up too much. What I want is every
 non-overlapping
 occurrence I think.

 so rtt would be:

 '||flfllff||ooo'

Hi, I'll just let Python do most of the talk here.

 import re
 m=cccvlvlvlvnnnflfllffccclfnnnooo
 p=re.compile(r'ccc.*?nnn')
 p.sub(||, m)
'||flfllff||ooo'

Cheers,

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Python 3 encoding question: Read a filename from stdin, subsequently open that filename

2010-11-29 Thread Yingjie Lan

--- On Tue, 11/30/10, Dan Stromberg drsali...@gmail.com wrote:
 In Python 3, I'm finding that I have encoding issues with
 characters
 with their high bit set.  Things are fine with strictly
 ASCII
 filenames.  With high-bit-set characters, even if I
 change stdin's
 encoding with:

Co-ask. I have also had problems with file names in
Chinese characters with Python 3. I unzipped the 
turtle demo files into the desktop folder (of
course, the word 'desktop' is in Chinese, it is
a windows XP system, localization is Chinese), then
all in a sudden some of the demos won't work
anymore. But if I move it to a folder whose 
path contains only english characters, everything
comes back to normal.

Another related issue on the same platform is this: 
if you have your source file in the chinese 'desktop'
folder, and when you run the code and got some 
exceptions, the path information printed out
by the tracer wrongly encoded the chinese name 
for 'desktop'.

Yingjie



  
-- 
http://mail.python.org/mailman/listinfo/python-list

what a cheap rule

2010-11-25 Thread Yingjie Lan

Sometimes the golden rule in Python of
explicit is better than implicit is
so cheap that it can be thrown away
for the trouble of typing an empty tuple.

Today when I am explaining that in Python 3,
there are two ways to raise exceptions:

raise Exception

raise Exception()

and that the first one is the same
as the second one, as Python will add the
missing pair of parenthesis. 

I felt their pain as they gasped.
Before that, I have already explained 
to them this piece of code:

try: raise SomeException()
except SomeException:
   print('Got an exception here')

by saying that the except-clause 
will match anything 
that belong to the SomeException class.

Without knowing this secrete
piece of information (that a
pair of parenthesis is automatically
provided), the following code
would be hard to understand:

try: raise SomeException
except SomeException:
   print('Got an exception here')

because the class object SomeException
is not an instance of itself, so 
a not-so-crooked coder will not
consider a match here.

So, the explicit is better than
implicit rule is thrown out of
the window so cheaply, 
that it literally worth less 
than an empty tuple!

Regards,

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: a regexp riddle: re.search(r'

2010-11-25 Thread Yingjie Lan

--- On Thu, 11/25/10, Phlip phlip2...@gmail.com wrote:
 From: Phlip phlip2...@gmail.com
 Subject: a regexp riddle: re.search(r'
 To: python-list@python.org
 Date: Thursday, November 25, 2010, 8:46 AM
 HypoNt:

 I need to turn a human-readable list into a list():

    print re.search(r'(?:(\w+), |and
 (\w+))+', 'whatever a, bbb, and
 c').groups()

 That currently returns ('c',). I'm trying to match any
 word \w+
 followed by a comma, or a final word preceded by and.

 The match returns 'a, bbb, and c', but the groups return
 ('bbb', 'c').
 What do I type for .groups() to also get the 'a'?

First of all, the 'bbb' coresponds to the first capturing
group and 'c' the second. But 'a' is forgotten be cause
it was the first match of the first group, but there
is a second match 'bbb'.

Generally, a capturing group only remembers the last match.

It also seems that your re may match this: 'and c',
which does not seem to be your intention.
So it may be more intuitively written as:

r'(?:(\w+), )+and (\w+)'

I'm not sure how to get it done in one step,
but it would be easy to first get the whole 
match, then process it with:

re.findall(r'(\w+)(?:,|$)', the_whole_match)

cheers,

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

tilted text in the turtle module

2010-11-25 Thread Yingjie Lan

First of all, I'd like to express my deep gratidute to the author of this 
module, it is such a fun module to work with and to teach python as a first 
programming language.

Secondly, I would like to request a feature if it is not too hard to achieve. 
Currently, you can only write texts horizontally, no matter what is the current 
orientation of the turtle pen. I wonder if it is possible to write text in any 
direction when we control the heading of the turtle? For example, the following 
code would write a vertically oriented text:

setheading(90) #turtle facing up
write(vertical text!) 

Thanks a lot!

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

the buggy regex in Python

2010-11-25 Thread Yingjie Lan

I know many experts will say I don't have understanding...but let me pay this 
up front as my tuition.

Here are some puzzling results I have got (I am using Python 3, I suppose 
similar results for python 2).

When I do the following, I got an exception:
 re.findall('(d*)*', 'adb')
 re.findall('((d)*)*', 'adb')

When I do this, I am fine but the result is wrong:
 re.findall('((.d.)*)*', 'adb')
[('', 'adb'), ('', '')]

Why is it wrong?

The first mactch of groups: 
('', 'adb') 
indicates the outer group ((.d.)*) captured
the empty string, while the inner group (.d.) 
captured 'adb', so the outer group must have
captured the empty string at the end of the 
provided string 'adb'.

Once we have matched the final empty string '',
there should be no more matches, but we got
another match ('', '')!!!

So, findall matched the empty string in
the end of the string twice!!!

Isn't this a bug?

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: what a cheap rule

2010-11-25 Thread Yingjie Lan

--- On Thu, 11/25/10, Steve Holden st...@holdenweb.com wrote:
  Sometimes the golden rule in Python of
  explicit is better than implicit is
  so cheap that it can be thrown away
  for the trouble of typing an empty tuple.
 
 I'm not sure that there *are* any golden rules. The Zen of
 Python is
 intended to be guidelines, not rigid rules intended to
 constrain your
 behavior but advice to help you write better code.


 Surely an exaggeration. In fact current best practice
 (which you should
 inform yourself of as best you can to help you in your
 teaching work -
 so you are to be congratulated for bringing this question
 to the list)
 is to always use explicit calls, with arguments specifying
 a tailored
 message.

 regards
  Steve

A very cogent message -- the end echos the start. :)
I must say that I learned from you a new angle to
think about this issue. On the other hand, I still
feel that when allowing both ways colliding into
the simpleness and bueaty of the language, we
should consider to make a decision.

Sure, this introduced quite a lot of complexity
when the doc has to give a very long explanation of
what is happening in order to justify it.

As I am thinking about it, it seems two
conflicting intuition of code comprehension
are at work here:

Intuition #1: as if you raise an exception
type, and then match that type.
It seems that no instances
are involved here (Intuitively).
See an example code here:

try: raise KeyError
except KeyError: pass


Intuition #2: you raise an exception
instance, and then match an instance by
its type. See an example code here:

try: raise KeyError()
except KeyError as ke: pass

Those two comprehensions are not compatible,
and thus the one that promotes correct
understanding should be encouraged,
while the other should be discouraged,
and maybe even be made iliegal.

Regards,

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: the buggy regex in Python

2010-11-25 Thread Yingjie Lan

--- On Thu, 11/25/10, MRAB pyt...@mrabarnett.plus.com wrote:
 re.findall performs multiple searches, each starting where
 the previous
 one finished. The first match started at the start of the
 string and
 finished at its end. The second match started at that point
 (the end of
 the string) and found another match, ending at the end of
 the string.
 It tried to match a third time, but that failed because it
 would have
 matched an empty string again (it's not allowed to return 2
 contiguous
 empty strings).
 
  Isn't this a bug?
 
 No, but it can be confusing at times! :-)
 -- 

But the last empty string is matched twice -- so it is 
an overlapping. But findall is supposed not to return
overlapping matches. So I think this does not live up
to the documentation -- thus I still consider it a bug.

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: tilted text in the turtle module

2010-11-25 Thread Yingjie Lan

--- On Thu, 11/25/10, Steve Holden st...@holdenweb.com wrote:
 From: Steve Holden st...@holdenweb.com
 Subject: Re: tilted text in the turtle module
 To: python-list@python.org
 Date: Thursday, November 25, 2010, 7:00 PM
 On 11/25/2010 5:06 AM, Yingjie Lan
 wrote:
 This sounds like a good idea. To request a feature you
 should create an
 account (if you do not already have one) on bugs.python.org
 and create a
 new issue (assuming a search reveals that there is not
 already such an
 issue).

Thanks I just did that.

 You may find if you look at the module's code that you can
 imagine how
 to  make the change. If not, the request will wait
 until some maintainer
 sees it and has time.

I don't know much about tkinter, 
not sure if I can contribute.
And even if I made a patch, 
then how to publish it?

Regards,

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: tilted text in the turtle module

2010-11-25 Thread Yingjie Lan

--- On Thu, 11/25/10, Steve Holden st...@holdenweb.com wrote:
  And even if I made a patch, 
  then how to publish it?
  
 Once you have a patch, attach it to the issue as a file and
 try and get
 it reviewed by a developer for incorporation into a future
 release.
 
 Note that no Python 2.8 release is planned, so you would
 best
 concentrate your effort on the 3.x series.
 

I see. I suppose one could post a message somewhere
to get the attention? 

Thanks for the pointer.

Regards,

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: the buggy regex in Python

2010-11-25 Thread Yingjie Lan

--- On Thu, 11/25/10, MRAB pyt...@mrabarnett.plus.com wrote:
 
 Look at the spans:
 
   for m in re.finditer('((.d.)*)*', 'adb'):
     print(m.span())
 
     
 (0, 3)
 (3, 3)
 
 There's an non-empty match followed by an empty match.

If you read my first post, it should be apparent that
that the empty string in the end of the string is 
used  twice -- thus an overlap.

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: tilted text in the turtle module

2010-11-25 Thread Yingjie Lan

--- On Fri, 11/26/10, Steve Holden st...@holdenweb.com wrote:

 From: Steve Holden st...@holdenweb.com
 Subject: Re: tilted text in the turtle module
 To: python-list@python.org
 Date: Friday, November 26, 2010, 4:16 AM
 On 11/25/2010 5:58 PM, Yingjie Lan
 wrote:
  --- On Thu, 11/25/10, Steve Holden st...@holdenweb.com
 wrote:
  And even if I made a patch, 
  then how to publish it?

  Once you have a patch, attach it to the issue as a
 file and
  try and get
  it reviewed by a developer for incorporation into
 a future
  release.

  Note that no Python 2.8 release is planned, so you
 would
  best
  concentrate your effort on the 3.x series.

  I see. I suppose one could post a message somewhere
  to get the attention? 

  Thanks for the pointer.

 One would normally make a post on the python-dev list to
 get the
 attention of developers. If you want to *guarantee* that
 your issue gets
 attention then you can review five existing issues and
 advance them a
 little to help the development team.

Sound advices. Many thanks!

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: what a cheap rule

2010-11-25 Thread Yingjie Lan

--- On Fri, 11/26/10, Steven D'Aprano steve+comp.lang.pyt...@pearwood.info 
wrote:

 From: Steven D'Aprano steve+comp.lang.pyt...@pearwood.info
 Subject: Re: what a cheap rule
 To: python-list@python.org
 Date: Friday, November 26, 2010, 5:10 AM
 On Thu, 25 Nov 2010 08:15:21 -0800,
 Yingjie Lan wrote:

 You seem to have misunderstood both forms of the raise
 statement. Should 
 we make exceptions illegal because you can't correctly
 guess what they do?

Though what you said about Python is right,
I think somehow you missed my point a bit.

Ideally, the language could be so 'natural' 
that it means what meets the eyes.

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Must be a bug in the re module [was: Why this result with the re module]

2010-11-04 Thread Yingjie Lan

--- On Wed, 11/3/10, MRAB pyt...@mrabarnett.plus.com wrote:
 [snip]
 The outer group is repeated, so it can match again, but the
 inner group
 can't match again because it captured all it could the
 previous time.
 
 Therefore the outer group matches and captures an empty
 string and the
 inner group remembers its last capture.

Thanks, I got it. Basically, '(.a.)*' matched an empty string
in the last outer group match, but not '(.a.)'.

Now what remains hard for me to figure out is the number
of matches: why is it 6 times with '((.a.)*)*' when 
matched to 'Mary has a lamb'?
I think this is probably cuased by the limit of the
matchobject: this object does not say anything
about if an empty string is appended to the
matched pattern or not. Hence some of the empty 
strings are repeated/overlapped by re.findall().

Regards,

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Must be a bug in the re module [was: Why this result with the re module]

2010-11-03 Thread Yingjie Lan

--- On Wed, 11/3/10, John Bond li...@asd-group.com wrote:

3) then said there must be =0 occurrences of what's inside it,
which of course there is, so that has no effect.

((.a.)*)*

Hi, 

I think there should be a difference: unlike before,
now what's inside the outer group can match an empty 
string. And so by reason of the greediness of the 
quantifier * of the outer group (that is, the last *), 
it should take up the empty string after each 
non-empty match.

So, the first match in 'Mary has a lamb' must be:

'' + 'Mar' + '' (the empty string before the 'y')

(note the first '' is before the 'M')
Then, after skipping the 'y' (remember, the empty 
string before 'y' is already taken), comes a second:

'' (the one between 'y' and ' ')

Then after skipping the space ' ', comes a third:

'has' + ' a ' + 'lam' + '' (the empty string before the 'b')

And finally, it matches the empty string after 'b':

''

So there should be total of four matches -- isn't it?

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Must be a bug in the re module [was: Why this result with the re module]

2010-11-03 Thread Yingjie Lan



--- On Wed, 11/3/10, John Bond li...@asd-group.com wrote:

 I just explained that (I think!)! The outer capturing group
 uses 
 repetition, so it returns the last thing that was matched
 by the inner 
 group, which was an empty string. 
 

According to yourself, the last match of the 
inner group is also empty!

Generally speaking, 
as a match for the outer group always 
contains some matches for the inner group,
it must be the case that the last match
for the inner group must be contained
inside the last match of the outer group.
So if the last match of the 
outer group is empty, then the last
match for the inner group must
also be empty.

Regards,

Yingjie


--- On Wed, 11/3/10, John Bond li...@asd-group.com wrote:

 From: John Bond li...@asd-group.com
 Subject: Re: Must be a bug in the re module [was: Why this result with the re 
 module]
 To: python-list@python.org
 Date: Wednesday, November 3, 2010, 8:43 AM
 On 3/11/2010 4:23 AM, Yingjie Lan
 wrote:
  --- On Wed, 11/3/10, MRABpyt...@mrabarnett.plus.com 
 wrote:
 
  From: MRABpyt...@mrabarnett.plus.com
  Subject: Re: Must be a bug in the re module [was:
 Why this result with the re module]
  To: python-list@python.org
  Date: Wednesday, November 3, 2010, 8:02 AM
  On 03/11/2010 03:42, Yingjie Lan
  wrote:
  Therefore the outer (first) group is always an
 empty string
  and the
  inner (second) group is the same as the previous
 example
  (the last
  capture or '' if no capture).
  Now I am confused also:
 
  If the outer group \1 is empty, how could the inner
  group \2 actually have something?
 
  Yingjie
 
 
 
 I just explained that (I think!)! The outer capturing group
 uses 
 repetition, so it returns the last thing that was matched
 by the inner 
 group, which was an empty string. I
 
 If you took away the outer groups repetition:
 
 re.findall('((.a.)*)', 'Mary has a lamb')
 
 then, for each of the six matches, it returns the full
 thing that was 
 matched:
 
 ('Mar', 'Mar'), ('', ''), ('', ''), ('has a lam', 'lam'),
 ('', ''), ('', 
 '')]
 
 Cheers, JB
 -- 
 http://mail.python.org/mailman/listinfo/python-list
 


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Why this result with the re module

2010-11-02 Thread Yingjie Lan

 From: John Bond li...@asd-group.com
  re.findall('(.a.)+', 'Mary has a lamb')
  ['Mar', 'lam']

 It's because you're using capturing groups, and because of
 how they work - specifically they only return the LAST match
 if used with repetition (and multiple matches occur).

It seems capturing groups is assumed by default,
but this is somehow against my intuition...
Ituitively, it should be what matches the 
whole regex '(.a.)+', shouldn't it?

Regards,

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Why this result with the re module

2010-11-02 Thread Yingjie Lan

 From: John Bond li...@asd-group.com
 Subject: Re: Why this result with the re module
  re.findall('(.a.)*', 'Mary has a lamb')
  ['Mar', '', '', 'lam', '', '']

 So - see if you can explain the first problematic result
 now.

Thanks a lot for explaining to me the second problematic result!
But the first one is even more puzzling...mainly because the
pattern matches any empty string. Here are more examples:

 re.findall('(.a.)*','')
['']
 re.findall('(.a.)*',' ') #one space
['', '']
 re.findall('(.a.)*','  ') #two spaces
['', '', '']
 len(re.findall('(.a.)*',' '*4)) #four
5
 len(re.findall('(.a.)*',' '*8)) #eight
9

I must need more details of the matching algorithm to explain this?

Regards,

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Why this result with the re module

2010-11-02 Thread Yingjie Lan

 From: John Bond li...@asd-group.com

 You might wonder why something that can match no input
 text, doesn't return an infinite number of those matches at
 every possible position, but they would be overlapping, and
 findall explicitly says matches have to be non-overlapping.

That scrabbed my itches, though the notion of overlapping
empty strings is quite interesting in itself. Obviously 
we have to assume there is one and only one empty string
between two consecutive characters.

Now I slightly modified my regex, and it suddenly looks
self-explanatory:
 
 re.findall('((.a.)+)', 'Mary has a lamb')
[('Mar', 'Mar'), ('has a lam', 'lam')]
 re.findall('((.a.)*)', 'Mary has a lamb')
[('Mar', 'Mar'), ('', ''), ('', ''), ('has a lam', 'lam'), ('', ''), ('', '')]

BUT, but.

1. I expected findall to find matches of the whole
regex '(.a.)+', not just the subgroup (.a.) from 
 re.findall('(.a.)+', 'Mary has a lamb')
Thus it is probably a misunderstanding/bug??

2. Here is an statement from the documentation on 
   non-capturing groups:
   see http://docs.python.org/dev/howto/regex.html

Except for the fact that you can’t retrieve the 
contents of what the group matched, a non-capturing 
group behaves exactly the same as a capturing group; 

   Thus, I'm again confused, despite of your 
   previous explanation. This might be a better
   explanation: when a subgroup is repeated, it
   only captures the last repetition.

3. It would be convenient to have '(*...)' for 
   non-capturing groups -- but of course, that's
   only a remote suggestion.

4. By reason of greediness of '*', and the concept 
of non-overlapping, it should go like this for
   re.findall('((.a.)*)', 'Mary has a lamb')

step 1: Match 'Mar' + '' (gready!)
step 2: skip 'y'
step 3: Match ''
step 4: skip ' '
step 5: Match ''+'has'+' a '+'lam'+'' (greedy!)
step 7: skip 'b'
step 8: Match ''

So there should be 4 matches in total:

'Mar', '', 'has a lam', ''

Also, if a repeated subgroup only captures 
the last repetition, the repeated 
subgroup (.a.)* should always be ''.

Yet the execution in Python results in 6 matches.

Here is the documentation of re.findall:


findall(pattern, string, flags=0)
Return a list of all non-overlapping matches in the string.

If one or more groups are present in the pattern, return a
list of groups; this will be a list of tuples if the pattern
has more than one group.

Empty matches are included in the result.


Thus from
 re.findall('(.a.)*', 'Mary has a lamb')
I should get this result 
[('',), ('',), ('',), ('',)]


Finally, The name findall implies all matches 
should be returned, whether there are subgroups in 
the pattern or not. It might be best to return all
the match objects (like a re.match call) instead 
of the matched strings. Then there is no need
to return tuples of subgroups. Even if tuples 
of subgroups were to be returned, group(0) must
also be included in the returned tuple.

Regards,

Yingjie 


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Why this result with the re module

2010-11-02 Thread Yingjie Lan

 From: John Bond li...@asd-group.com
 Subject: Re: Why this result with the re module

Firstly, thanks a lot for your patient explanation.
this time I have understood all your points perfectly.

Secondly, I'd like to clarify some of my points, which 
did not get through because of my poor presentation.

I suggested findall return a tuple of re.MatchObject(s),
with each MatchObject instance representing a match.
This is consistent with the re.match() function anyway.
And it will eliminate the need of returning tuples,
and it is much more precise and information rich.

If that's not possible, and a tuple must be returned,
then the whole match (not just subgroups) should
always be included as the first element in the tuple,
as that's group(0) or '\0'. Less surprise would arise.

Finally, it seems to me the algo for findall is WRONG.

To re.findall('(.a.)*', 'Mary has a lamb'), 
by reason of greediness of '*', and the requirement 
of non-overlapping, it should go like this
(suppose an '' is at the beginning and at the end,
and between two consecutive characters there is
one and only one empty string ''. To show the
match of empty strings clearly,
I am concatenating each repeated match below):

Steps for re.findall('(.a.)*', 'Mary has a lamb'):

step 1: Match '' + 'Mar' + '' (gready!)
step 2: skip 'y'
step 3: Match ''
step 4: skip ' '
step 5: Match ''+'has'+' a '+'lam'+'' (greedy!)
step 6: skip 'b'
step 7: Match ''

So there should be exactly 4 matches in total:

'Mar', '', 'has a lam', ''

Also, the matches above shows
that if a repeated subgroup only captures 
the last match, the subgroup (.a.)* 
should always capture '' here (see steps 
1, 3, 5, 7) above.

Yet the execution in Python results in 6 matches!
And, the capturing subgroup with repetition 
sometimes got the wrong guy.

So I believe the algorithm for findall must be WRONG.

Regards,

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Why this result with the re module

2010-11-02 Thread Yingjie Lan

 From: Vlastimil Brom vlastimil.b...@gmail.com
 Subject: Re: Why this result with the re module

 in that case you may use re.finditer(...)

Thanks for pointing this out.

Still I'd love to see re.findall never
discards the whole match, even if 
a tuple is returned.

Yingjie 

-- 
http://mail.python.org/mailman/listinfo/python-list

Must be a bug in the re module [was: Why this result with the re module]

2010-11-02 Thread Yingjie Lan

 From: John Bond li...@asd-group.com
 Subject: Re: Why this result with the re module
 To: Yingjie Lan lany...@yahoo.com
 Cc: python-list@python.org
 Date: Tuesday, November 2, 2010, 8:09 PM
 On 2/11/2010 12:19 PM, Yingjie Lan
 wrote:
  From: John Bondli...@asd-group.com
  Subject: Re: Why this result with the re module
  Firstly, thanks a lot for your patient explanation.
  this time I have understood all your points
 perfectly.

  Secondly, I'd like to clarify some of my points,
 which
  did not get through because of my poor presentation.

  I suggested findall return a tuple of
 re.MatchObject(s),
  with each MatchObject instance representing a match.
  This is consistent with the re.match() function
 anyway.
  And it will eliminate the need of returning tuples,
  and it is much more precise and information rich.

  If that's not possible, and a tuple must be returned,
  then the whole match (not just subgroups) should
  always be included as the first element in the tuple,
  as that's group(0) or '\0'. Less surprise would
 arise.

  Finally, it seems to me the algo for findall is
 WRONG.

  To re.findall('(.a.)*', 'Mary has a lamb'),
  by reason of greediness of '*', and the requirement
  of non-overlapping, it should go like this
  (suppose an '' is at the beginning and at the end,
  and between two consecutive characters there is
  one and only one empty string ''. To show the
  match of empty strings clearly,
  I am concatenating each repeated match below):

  Steps for re.findall('(.a.)*', 'Mary has a lamb'):

  step 1: Match '' + 'Mar' + '' (gready!)
  step 2: skip 'y'
  step 3: Match ''
  step 4: skip ' '
  step 5: Match ''+'has'+' a '+'lam'+'' (greedy!)
  step 6: skip 'b'
  step 7: Match ''

  So there should be exactly 4 matches in total:

  'Mar', '', 'has a lam', ''

  Also, the matches above shows
  that if a repeated subgroup only captures
  the last match, the subgroup (.a.)*
  should always capture '' here (see steps
  1, 3, 5, 7) above.

  Yet the execution in Python results in 6 matches!
  And, the capturing subgroup with repetition
  sometimes got the wrong guy.

  So I believe the algorithm for findall must be WRONG.

  Regards,

  Yingjie
 At a guess, I'd say what is happening is something like
 this:

 Steps for re.findall('(.a.)*', 'Mary has a lamb'):

 step 1: Match 'Mar' at string index 0
 step 2: Match '' at string index 3 (before 'y')
 step 3: skip 'y'
 step 4: Match '' at string index 4 (before ' ')
 step 5: skip ' '
 step 6: Match 'has a lam' at string index 5
 step 7: Match '' at string index 14 (before 'b')
 step 8: skip 'b'
 step 9: Match '' at string index 15 (before EOS)
 EOS reached

 matches: ('Mar', '', '', 'has a lam', '', '')
 returns: ['Mar', '', '', 'lam', '', ''] (*)

 (*) has a  lost due to not being last repetition at that
 match point

 Which seems about right to me! Greediness has nothing to do
 with it, except that it causes 'has a lam' to be matched in
 one match, instead of as three separate matches (of 'has', '
 a ' and 'lam').

Disagree in this case, where the whole regex 
matches an empty string. Greadiness will match
as much as possible. So it will also match
the empty strings between consecutive 
characters as much as possible, once
we have properly defined all the unique
empty strings. Because of greadiness, 
fewer matches should be found. In this
case, it should find only 4 matches 
(shown in my steps) instead of 6 matches
(shown in your steps).

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Must be a bug in the re module [was: Why this result with the re module]

2010-11-02 Thread Yingjie Lan

 Your regex says Zero or more consecutive occurrences of
 something, always returning the most possible.  That's
 what it does, at every position - only matching emptyness
 where it couldn't match anything (findall then skips a
 character to avoid overlapping/infinite empty
 matches),  and at all other times matching the most
 possible (eg. has a lam not has,  a , lam).

You are about to convince me now. 
You are correct for the regex '(.a.)*'.

What I thought was for this regex: '((.a.)*)*', 
I confused myself when I added an enclosing ().

Could you please reconsider how would you
work with this new one and see if my steps 
are correct? If you agree with my 7-step
execution for the new regex, then:

We finally found a real bug for re.findall:

 re.findall('((.a.)*)*', 'Mary has a lamb')
[('', 'Mar'), ('', ''), ('', ''), ('', 'lam'), ('', ''), ('', '')]


Cheers,

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Must be a bug in the re module [was: Why this result with the re module]

2010-11-02 Thread Yingjie Lan

 Matches an empty string, returns ''
 
 The result is therefore ['Mar', '', '', 'lam', '', '']

Thanks, now I see it through with clarity.
Both you and JB are right about this case.
However, what if the regex is ((.a.)*)* ?


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Must be a bug in the re module [was: Why this result with the re module]

2010-11-02 Thread Yingjie Lan

--- On Wed, 11/3/10, John Bond li...@asd-group.com wrote:


 Just to clarify - findall is returning:
 
 [ (only match in outer group, 1st match in inner group)
 , (only match in outer group, 2nd match in inner group)
 , (only match in outer group, 3rd match in inner group)
 , (only match in outer group, 4th match in inner group)
 , (only match in outer group, 5th match in inner group)
 , (only match in outer group, 6th match in inner group)
 ]
 
 Where only match in outer group = 6th match in inner
 group owing to the way that capturing groups with
 repetition only return the last thing they matched.
 

---On Wed, 11/3/10, MRAB wrote---

 Therefore the outer (first) group is always an empty string and the
 inner (second) group is the same as the previous example (the last
 capture or '' if no capture).

OK, I've got that, and I have no problem with the capturing part.
My real problem is with the number of total matches.
I think it should be 4 matches in total but findall 
gives 6 matches, for the new regex '((.a.)*)*'.
I'd love to know what you think about this.

Many thanks!
Yingjie



  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Must be a bug in the re module [was: Why this result with the re module]

2010-11-02 Thread Yingjie Lan

--- On Wed, 11/3/10, MRAB pyt...@mrabarnett.plus.com wrote:

 From: MRAB pyt...@mrabarnett.plus.com
 Subject: Re: Must be a bug in the re module [was: Why this result with the re 
 module]
 To: python-list@python.org
 Date: Wednesday, November 3, 2010, 8:02 AM
 On 03/11/2010 03:42, Yingjie Lan
 wrote:

 Therefore the outer (first) group is always an empty string
 and the
 inner (second) group is the same as the previous example
 (the last
 capture or '' if no capture).

Now I am confused also:

If the outer group \1 is empty, how could the inner
group \2 actually have something?

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

Allow multiline conditions and the like

2010-11-01 Thread Yingjie Lan

Hi,

This is a mini-proposal I piggy-tailed in the other topic:

Allow the conditions in the if-, elif-, while-, for-, and
with-clauses to span multiple lines without using a backlalsh
at the end of a line, 
just like when you specify literal lists, tuples, dicts, etc.
across multiple lines (similar to comprehensions too).

My reasons:

   because they all must end with a required colon ':',
so nobody will mistake it. 

   also, if we don't allow it, people just have to use
parenthesis around the expressions to make that happen.


Just a half-baked idea, appreciate all comments.

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: with block for multiple files

2010-11-01 Thread Yingjie Lan

 Guido's time machine strikes again! It's already in Python
 3; your
 example would be spelled:
 
 with open('scores.csv') as f, open('grades.csv', wt) as g:
     g.write(f.read())
 

Indeed! Thanks, Chris and James.

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Allowing comments after the line continuation backslash

2010-11-01 Thread Yingjie Lan

Hi,

Sorry if I am baking too many ideas today.
I am just having trouble with the backslashes

I would like to have comments after the line continuation backslash.

 if a  0 \ #comments for this condition
  and b  0:
#do something here

This is currently not OK, but this might be a good thing to have.

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Why this result with the re module

2010-11-01 Thread Yingjie Lan

Hi, I am rather confused by these results below. 
I am not a re expert at all. the module version 
of re is 2.2.1 with python 3.1.2

 import re
 re.findall('.a.', 'Mary has a lamb') #OK
['Mar', 'has', ' a ', 'lam'] 
 re.findall('(.a.)*', 'Mary has a lamb') #??
['Mar', '', '', 'lam', '', '']
 re.findall('(.a.)+', 'Mary has a lamb') #??
['Mar', 'lam']


Thanks in advance for any comments.

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

A bug for raw string literals in Py3k?

2010-10-31 Thread Yingjie Lan

Hi,

I tried this in the IDLE (version 3.1.2) shell:

 r'\'
SyntaxError: EOL while scanning string literal

But according to the py3k docs 
(http://docs.python.org/release/3.0.1/whatsnew/3.0.html):

All backslashes in raw string literals are interpreted literally.

So I suppose this is a bug?

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: A bug for raw string literals in Py3k?

2010-10-31 Thread Yingjie Lan

  So I suppose this is a bug?
 
 It's not, see
 
 http://docs.python.org/py3k/reference/lexical_analysis.html#literals
 
 # Specifically, a raw string cannot end in a single backslash

Thanks! That looks weird to me ... doesn't this contradict with:

All backslashes in raw string literals are interpreted literally.
(see http://docs.python.org/release/3.0.1/whatsnew/3.0.html):

Best,

Yingjie



  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: A bug for raw string literals in Py3k?

2010-10-31 Thread Yingjie Lan

   All backslashes in raw string literals are
 interpreted literally.
   (seehttp://docs.python.org/release/3.0.1/whatsnew/3.0.html):
 
  All backslashes in syntactically-correct raw string
 literals are interpreted literally.
 
 That's a good way of putting it.
 

Syntactical correctness obviously depends on the syntax specification.
To cancle the special meaning of ALL backlashes in a raw string literal 
makes a lot of sense to me. Currently, the behavior of backslashes
in a raw string literal is rather complicated I think.
In fact, the backlashes can still escape quotes in a raw string,
and one the other hand, it also remains in the string -- I'm 
wondering what kind of use case is there to justify such a behavior?
Surely, my experience is way too limited to make a solid judgement,
I Hope others would shed light on this issue.

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: A bug for raw string literals in Py3k?

2010-10-31 Thread Yingjie Lan

 According to msg56377, the behaviour is optimal for regular
 expressions. Well, I use regular expressions a lot, and I
 still think it's a nuisance!

Thanks for bringing that up.

Using an otherwise 'dead' backlash to escape quotes 
in raw strings seems like the black magic of 
necromancy to me. :)

To include quotes in a string, there are a couple of
known choices: If you need single quotes in the string, 
start the literal by a double-quote, and vice versa.
In case you need both, you can use a long string:

 rab\c'''

Note that when the last character is also a quote, we can
use the other type of quote three times to delimit the 
long string. Of course, there are still some corner cases:

1. when we need three consecutive single quotes AND three 
   consecutive double quotes in the string.

2. When the last is a single quote, and we also need 
   three consecutive double-quotes in the string, 
   or the other way around.

Then we can abandon the raw string literal, or use 
concatenation of string literals wisely to get it done.

But in total, I still would vote against the nacromancy.

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

with block for multiple files

2010-10-31 Thread Yingjie Lan

Hi,

Suppose I am working with two files simultaneously, 
it might make sense to do this:

with open('scores.csv'), open('grades.csv', wt) as f,g:
 g.write(f.read())

sure, you can do this with nested with-blocks, 
but the one above does not seem too complicated,
it is like having a multiple assignment...

Any thoughts?

Another mini-proposal: 

Allow the conditions in the if-, elif-, while-, for-, and 
with-clauses to span multiple lines without using a backlalsh, 
just like when you specify literal lists, tuples, dicts, etc.
across multiple lines (similar to comprehensions too).

My reason is this: 
   because they all must end with a required colon ':',
so nobody will mistake it.

Just some half-baked ideas, would appreciate 
thos who shed light on these issues.

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

please help explain this result

2010-10-17 Thread Yingjie Lan

Hi, 

I played with an example related to namespaces/scoping.
The result is a little confusing:

 a=1
 def f(): 
a = a + 1
return a

 f()

I suppose I will get 2 ( 'a' is redefined as a local variable, whose value is 
obtained by the value of the global variable 'a' plus 1). But this is what I 
got:

 a=1
 def f():
a = a + 1
return a

 f()
Traceback (most recent call last):
  File pyshell#39, line 1, in module
f()
  File pyshell#38, line 2, in f
a = a + 1
UnboundLocalError: local variable 'a' referenced before assignment


I'm not sure how to explain this? Thanks!

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: please help explain this result

2010-10-17 Thread Yingjie Lan

 From: Nobody nob...@nowhere.com

 The determination of local or global is made when the def
 statement is
 executed, not when the function is called.

Thanks a lot for your reply, which is of great help!

So, I assume that when the 'def' is executed, any 
name occurred will be categorized as either local 
or global (maybe nonlocal?).

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: please help explain this result

2010-10-17 Thread Yingjie Lan



--- On Sun, 10/17/10, Steven D'Aprano st...@remove-this-cybersource.com.au 
wrote:

 (1) If you assign to a variable *anywhere* in the function,
 it is a local 
 *everywhere* in the function.
 
 There is no way to have a variable refer to a local in some
 places of a 
 function and a global in other places of the same function.
 This is by 
 design.
 

Super crystal clear. Thanks a lot!

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

sequence multiplied by -1

2010-09-25 Thread Yingjie Lan

Hi, 

I noticed that in python3k, multiplying a sequence by a negative integer is the 
same as multiplying it by 0, and the result is an empty sequence. It seems to 
me that there is a more meaningful symantics. 

Simply put, a sequence multiplied by -1 can give a reversed sequence. 

Then for any sequence seq, and integer n0, we can have 

seq * -n producing (seq * -1) * n. 

Any thoughts?

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: sequence multiplied by -1

2010-09-25 Thread Yingjie Lan

--- On Sat, 9/25/10, Thomas Jollans tho...@jollybox.de wrote:
 for every list l and integer n = 0:
     len(l*n) == len(l)*n

Well, this invariance is indeed broken under my proposal.
But it is *already broken* in current python3k.
However, the following invariance is maintained under
my proposal:

len(l*n) == len(l) * abs(n),

which is also broken under current python3k.
if you think len(..) as a mathematical norm, 
the above invariance makes perfect sense:

|| a * b || == ||a|| * |b|, b is real


 
  Simply put, a sequence multiplied by -1 can give a
 reversed sequence.
 
 For that, we have slicing. A negative step value produces a
 reverse slice of 
 the list. You can't argue that this makes sense, can you
 
  [1,2,3,4][::-1]
 [4, 3, 2, 1]
  [1,2,3,4][::-2]
 [4, 2]
  

Having more than one way of doing things sometimes is good.
Slicing is a little more complex, what if you want this:

 ([1,2,3,4]*2)[::-1]
[4, 3, 2, 1, 4, 3, 2, 1]

under my new proposal, you simply do this:

 [1,2,3,4]*-2
[4, 3, 2, 1, 4, 3, 2, 1]

Cheers,

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: sequence multiplied by -1

2010-09-25 Thread Yingjie Lan

Hi,

 
 In my opinion this _isn't_ a situation where it's good. :)
 
     L[::-1]
 
 is only marginally longer than
 
     -1 * L
 
 I think this small gain doesn't justify violating this
 Python Zen rule (from `import this`):
 
     There should be one-- and preferably only one
 --obvious way to do it.
 

Thanks for the insightful remarks. For the rule above,
how about the case to reverse and multiply:

 L*-3 #L reversed and repeated three times

v.s.

 L[::-1]*3 #L reversed and repeated three times

The first one is simpler (4 chars v.s. 9 chars).
I thought it was also intuitive because if you multiply
a vector by -1, you should get a vector 
in the reversed direction. But, intuitiveness depends
on who you are, what you do, etc

Regards,

Yingjie





  
-- 
http://mail.python.org/mailman/listinfo/python-list

solve alphametic puzzles in just 9 lines of code

2010-09-25 Thread Yingjie Lan

Hi, 

I am teaching Python this semester and
as I am trying to explain the code by
Raymond Hettinger, I need to make it
simpler (this is an introductory course).
And it ends up to be just 9 lines of code.

Just for fun. See also:

http://diveintopython3.org/advanced-iterators.html


Regards,
Yingjie

Code starts here###

import itertools
def solve(puzzle):
solve alphametic puzzles in just 9 lines of code.
words = [w for w in puzzle.split() if w.isalpha()]
nonzeros = {w[0] for w in words}
others = {a for a in ''.join(words) if a not in nonzeros}
chars = [ord(c) for c in nonzeros]+[ord(c) for c in others]
assert len(chars) = 10, 'Too many letters'
for guess in itertools.permutations('0123456789', len(chars)):
if '0' not in guess[:len(nonzeros)]:
equation = puzzle.translate(dict(zip(chars, guess)))
if eval(equation): return equation


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: sequence multiplied by -1

2010-09-25 Thread Yingjie Lan

Hi all,

Thanks for considering this proposal seriously and 
all your discussions shed light on the pro's and cons 
(well, more cons than pros, to be honest).

It occurrs to me that this proposal is not a sound 
one, for the reasons already well documented in 
this thread, which I need not repeat.

Thanks all for participation!

Yingjie




  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: solve alphametic puzzles in just 9 lines of code

2010-09-25 Thread Yingjie Lan

Sorry, didn't document my code well enough.
Here is the code with an example.

Yingjie


#Code begins###

from itertools import permutations

def solve(puzzle):
solve alphametic puzzles in just 9 lines of code.
Make sure each operator is seperated from the words by
white-spaces, e.g.:

 solve('send + more == money')


words = [w for w in puzzle.split() if w.isalpha()]
nonzeros = {w[0] for w in words}
others = {a for a in ''.join(words) if a not in nonzeros}
chars = [ord(c) for c in nonzeros]+[ord(c) for c in others]
assert len(chars) = 10, 'Too many letters'
for guess in permutations('0123456789', len(chars)):
if '0' not in guess[:len(nonzeros)]:
equation = puzzle.translate(dict(zip(chars, guess)))
if eval(equation): return puzzle, equation

if __name__ == '__main__':
print ('\n'.join(solve(send + more == money)))


  
-- 
http://mail.python.org/mailman/listinfo/python-list

30 is True

2010-09-15 Thread Yingjie Lan

Hi, 

I am not sure how to interprete this, in the interactive mode:

 30 is True
False
 (30) is True
True
 3 (0 is True)
True

Why did I get the first 'False'? I'm a little confused.

Thanks in advance for anybody who shed some light on this. 

YL


  
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: 30 is True

2010-09-15 Thread Yingjie Lan

 From: Jon Siddle j...@corefiling.co.uk
 Subject: Re: 30 is True
 To: python-list@python.org
 Date: Wednesday, September 15, 2010, 5:04 PM
   As others have said, it's not
 a matter of precendence. Using the 
 compiler module
 you can see how python actually parses this:

 3  (0 is True)
 Compare(Const(3), [('', Compare(Const(0), [('is',
 Name('True'))]))])

 No great surprise there.

 3  0 is True
 Compare(Const(3), [('', Const(0)), ('is',
 Name('True'))])

 As you can see, it's not the same. Two comparisons are
 being done at 
 once, not
 one comparison on the result of another.

 Hope this helps

Thank you all for nailing down this itching issue for me!

All I can say is: Wow!

You all have a teribly nice day!

Yingjie

-- 
http://mail.python.org/mailman/listinfo/python-list

empty set and empty dict for Python 3

2010-07-15 Thread Yingjie Lan

Hi there,

Maybe somebody already suggested this:

How about {:} for the empty dict, 
so that {} can denote the empty set?


Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

ANN: expy 0.6.7 released!

2010-05-03 Thread Yingjie Lan

EXPY is an express way to extend Python!
 
EXPY provides a way to extend python in an elegant way. 
For more information and a tutorial, see: 

http://expy.sourceforge.net/

I'm glad to announce a new release again today. ^_^

What's new:

Version 0.6.7
1.  Now functions can have 'value on failure' via rawtype.
2.  Now property getters/setters can have @throws
3.  Bug fix: if __init__ wrapper fails, it must return -1.


Cheers,

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

ANN: expy 0.6.6 released!

2010-05-02 Thread Yingjie Lan

EXPY is an express way to extend Python!

EXPY provides a way to extend python in an elegant way. For more information 
and a tutorial, see: http://expy.sourceforge.net/

What's new:

1. Special methods can now take @throws decorators.
2. Added convenience macros type_NEW and type_CheckExact for extension 
types.
3. Give warnings of missing Py_INCREF on all methods/functions returning an 
object. 
4. And the responsibility of Py_INCREF is left for the developer.
5. Documentation update.

Cheers,

Yingjie


  
-- 
http://mail.python.org/mailman/listinfo/python-list

1 2 >

1 - 100 of 124 matches

Mail list logo