Re: Regex Python Help

2015-03-24 Thread Skip Montanaro

On Tue, Mar 24, 2015 at 1:13 PM, gdot...@gmail.com wrote:

 SyntaxError: Missing parentheses in call to 'print'


It appears you are attempting to use a Python 2.x print statement with
Python 3.x. Try changing the last line to

print(line.rstrip())
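
For illustration, a minimal sketch of the function-call form, assuming a loop that strips trailing whitespace from each line. The original poster's code is not shown in this message, so the `lines` data and the `cleaned` list are hypothetical.

```python
import io

# Hypothetical stand-in for the poster's input file.
lines = io.StringIO("first line   \nsecond line\n")

cleaned = []
for line in lines:
    # Python 2 accepted the statement form `print line.rstrip()`;
    # Python 3 requires the function-call form used here.
    print(line.rstrip())
    cleaned.append(line.rstrip())
```

The parenthesised form with a single argument also runs under Python 2, so it is a safe spelling when the interpreter version is uncertain.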

Skip
-- 
https://mail.python.org/mailman/listinfo/python-list

[issue2636] Adding a new regex module (compatible with re)

2015-03-18 Thread Evgeny Kapun


Changes by Evgeny Kapun abacabadabac...@gmail.com:


--
nosy: +abacabadabacaba

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue2636
___
___
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Re: regex help

2015-03-13 Thread Thomas 'PointedEars' Lahn

Larry Martell wrote:

 I need to remove all trailing zeros to the right of the decimal point,
 but leave one zero if it's whole number. For example, if I have this:
 
 
14S,5.,4.5686274500,3.7272727272727271,3.3947368421052630,5.7307692307692308,5.7547169811320753,4.9423076923076925,5.7884615384615383,5.13725490196
 
 I want to end up with:
 
 
14S,5.0,4.56862745,3.7272727272727271,3.394736842105263,5.7307692307692308,5.7547169811320753,4.9423076923076925,5.7884615384615383,5.13725490196
 
 I have a regex to remove the zeros:
 
 '0+[,$]', ''
 
 But I can't figure out how to get the 5. to be 5.0.
 I've been messing with the negative lookbehind, but I haven't found
 one that works for this.

First of all, I find it unlikely that you really want to solve your problem 
with regular expressions.  Google “X-Y problem”.

Second, if you must use regular expressions, the simplest approach is to 
use backreferences.

Third, you need to show the relevant (Python) code.

http://www.catb.org/~esr/faqs/smart-questions.html

-- 
PointedEars

Twitter: @PointedEars2
Please do not cc me. / Bitte keine Kopien per E-Mail.

Re: regex help

2015-03-13 Thread Tim Chase

On 2015-03-13 12:05, Larry Martell wrote:
 I need to remove all trailing zeros to the right of the decimal
 point, but leave one zero if it's whole number. 
 
 But I can't figure out how to get the 5. to be 5.0.
 I've been messing with the negative lookbehind, but I haven't found
 one that works for this.

You can do it with string-ops, or you can resort to regexp.
Personally, I like the clarity of the string-ops version, but use
what suits you.

-tkc

import re
input = [
'14S',
'5.',
'4.5686274500',
'3.7272727272727271',
'3.3947368421052630',
'5.7307692307692308',
'5.7547169811320753',
'4.9423076923076925',
'5.7884615384615383',
'5.13725490196',
]

output = [
'14S',
'5.0',
'4.56862745',
'3.7272727272727271',
'3.394736842105263',
'5.7307692307692308',
'5.7547169811320753',
'4.9423076923076925',
'5.7884615384615383',
'5.13725490196',
]


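def fn1(s):
    # String-ops version: only touch values that contain a decimal point.
    if '.' in s:
        s = s.rstrip('0')
        if s.endswith('.'):
            s += '0'
    return s

def fn2(s):
    # Regexp version: keeps at least one digit after the point.
    # A bare '5.' has no digit after the point, so it is left unchanged.
    return re.sub(r'(\.\d+?)0+$', r'\1', s)

for fn in (fn1, fn2):
    for i, o in zip(input, output):
        v = fn(i)
        print("%s: %s -> %s [%s]" % (v == o, i, v, o))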

Re: regex help

2015-03-13 Thread MRAB


On 2015-03-13 16:05, Larry Martell wrote:

I need to remove all trailing zeros to the right of the decimal point,
but leave one zero if it's whole number. For example, if I have this:

14S,5.,4.5686274500,3.7272727272727271,3.3947368421052630,5.7307692307692308,5.7547169811320753,4.9423076923076925,5.7884615384615383,5.13725490196

I want to end up with:

14S,5.0,4.56862745,3.7272727272727271,3.394736842105263,5.7307692307692308,5.7547169811320753,4.9423076923076925,5.7884615384615383,5.13725490196

I have a regex to remove the zeros:

'0+[,$]', ''

But I can't figure out how to get the 5. to be 5.0.
I've been messing with the negative lookbehind, but I haven't found
one that works for this.


Search: (\.\d+?)0+\b
Replace: \1

which is:

re.sub(r'(\.\d+?)0+\b', r'\1', string)
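
A minimal sketch of how this substitution behaves on a shortened version of the sample row. The first pass is the pattern from this message. The second pass is an assumption added here for illustration, since the pattern needs at least one digit after the point and therefore leaves a bare "5." untouched. The name `strip_trailing_zeros` is hypothetical.

```python
import re

def strip_trailing_zeros(text):
    # Pass 1 (pattern from the message above): drop trailing zeros
    # but keep at least one digit after the decimal point.
    text = re.sub(r'(\.\d+?)0+\b', r'\1', text)
    # Pass 2 (an assumption, not from the thread): a point that is not
    # followed by a digit gets a single zero appended, so "5." -> "5.0".
    return re.sub(r'\.(?!\d)', '.0', text)

row = '14S,5.,4.5686274500,3.3947368421052630,5.13725490196'
result = strip_trailing_zeros(row)
print(result)
```

The lazy `\d+?` is what lets `0+` claim every trailing zero, while the `\b` stops the match from ending at a zero in the middle of a number.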


Re: regex help

2015-03-13 Thread Larry Martell

On Fri, Mar 13, 2015 at 1:29 PM, MRAB pyt...@mrabarnett.plus.com wrote:
 On 2015-03-13 16:05, Larry Martell wrote:

 I need to remove all trailing zeros to the right of the decimal point,
 but leave one zero if it's whole number. For example, if I have this:


 14S,5.,4.5686274500,3.7272727272727271,3.3947368421052630,5.7307692307692308,5.7547169811320753,4.9423076923076925,5.7884615384615383,5.13725490196

 I want to end up with:


 14S,5.0,4.56862745,3.7272727272727271,3.394736842105263,5.7307692307692308,5.7547169811320753,4.9423076923076925,5.7884615384615383,5.13725490196

 I have a regex to remove the zeros:

 '0+[,$]', ''

 But I can't figure out how to get the 5. to be 5.0.
 I've been messing with the negative lookbehind, but I haven't found
 one that works for this.

 Search: (\.\d+?)0+\b
 Replace: \1

 which is:

 re.sub(r'(\.\d+?)0+\b', r'\1', string)

Thanks! That works perfectly.

Re: regex help

2015-03-13 Thread Cameron Simpson

On 13Mar2015 12:05, Larry Martell larry.mart...@gmail.com wrote:

I need to remove all trailing zeros to the right of the decimal point,
but leave one zero if it's whole number. For example, if I have this:

14S,5.,4.5686274500,3.7272727272727271,3.3947368421052630,5.7307692307692308,5.7547169811320753,4.9423076923076925,5.7884615384615383,5.13725490196

I want to end up with:

14S,5.0,4.56862745,3.7272727272727271,3.394736842105263,5.7307692307692308,5.7547169811320753,4.9423076923076925,5.7884615384615383,5.13725490196

I have a regex to remove the zeros:

'0+[,$]', ''

But I can't figure out how to get the 5. to be 5.0.
I've been messing with the negative lookbehind, but I haven't found
one that works for this.

Leaving aside the suggested non-greedy match, you can rephrase this: strip
trailing zeroes _after_ the first decimal digit. Then you can consider a number
to be:

digits
point
any digit
other digits to be right-zero stripped

so:

(\d+\.\d)(\d*[1-9])?0*\b

and keep .group(1) and .group(2) from the match.

Another way of considering the problem.

Or you could two step it. Strip all trailing zeroes. If the result ends in a
dot, add a single zero.
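
The "keep .group(1) and .group(2)" idea can be sketched with a replacement function. This is an illustration under assumptions: the names `NUMBER` and `keep_groups` are hypothetical, and the sample row uses "5.0000" rather than a bare "5." because the pattern requires one digit after the point.

```python
import re

NUMBER = re.compile(r'(\d+\.\d)(\d*[1-9])?0*\b')

def keep_groups(match):
    # group(2) is None when only zeros follow the first decimal digit.
    return match.group(1) + (match.group(2) or '')

row = '14S,5.0000,4.5686274500,3.3947368421052630,5.13725490196'
result = NUMBER.sub(keep_groups, row)
print(result)
```

Because group 2 must end in a non-zero digit, the trailing `0*` absorbs exactly the zeros that should be discarded.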

Cheers,
Cameron Simpson c...@zip.com.au

C'mon. Take the plunge. By the time you go through rehab the first time,
you'll be surrounded by the most interesting people, and if it takes years
off of your life, don't sweat it. They'll be the last ones anyway.
- Vinnie Jordan, alt.peeves

Re: regex help

2015-03-13 Thread Steven D'Aprano

Larry Martell wrote:

 I need to remove all trailing zeros to the right of the decimal point,
 but leave one zero if it's whole number. 


def strip_zero(s):
    if '.' not in s:
        return s
    s = s.rstrip('0')
    if s.endswith('.'):
        s += '0'
    return s


And in use:

py> strip_zero('-10.2500')
'-10.25'
py> strip_zero('123000')
'123000'
py> strip_zero('123000.')
'123000.0'


It doesn't support exponential format:

py> strip_zero('1.230e3')
'1.230e3'

because it isn't clear what you intend to do under those circumstances.


-- 
Steven


regex help

2015-03-13 Thread Larry Martell

I need to remove all trailing zeros to the right of the decimal point,
but leave one zero if it's whole number. For example, if I have this:

14S,5.,4.5686274500,3.7272727272727271,3.3947368421052630,5.7307692307692308,5.7547169811320753,4.9423076923076925,5.7884615384615383,5.13725490196

I want to end up with:

14S,5.0,4.56862745,3.7272727272727271,3.394736842105263,5.7307692307692308,5.7547169811320753,4.9423076923076925,5.7884615384615383,5.13725490196

I have a regex to remove the zeros:

'0+[,$]', ''

But I can't figure out how to get the 5. to be 5.0.
I've been messing with the negative lookbehind, but I haven't found
one that works for this.

[issue22364] Improve some re error messages using regex for hints

2015-03-01 Thread Serhiy Storchaka


Serhiy Storchaka added the comment:

Could anyone please make a review? This patch is a prerequisite of other 
patches.

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue22364
___

[issue23532] add example of 'first match wins' to regex | documentation?

2015-02-27 Thread Matthew Barnett


Matthew Barnett added the comment:

Not quite all. POSIX regexes will always look for the longest match, so the 
order of the alternatives doesn't matter, i.e. x|xy would give the same result 
as xy|x.
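
For contrast, a short sketch of how Python's `re` module treats the same two patterns. Its alternation is ordered rather than longest-match, so the order of the alternatives does matter. The variable names are chosen only for this illustration.

```python
import re

# Python tries alternatives left to right and stops at the first that matches.
x_first = re.match(r'x|xy', 'xy').group()
xy_first = re.match(r'xy|x', 'xy').group()

print(x_first)   # x
print(xy_first)  # xy
```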

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue23532
___

[issue23532] regex | behavior differs from documentation

2015-02-26 Thread Rick Otten


Changes by Rick Otten rottenwindf...@gmail.com:


--
components: Regular Expressions
nosy: Rick Otten, ezio.melotti, mrabarnett
priority: normal
severity: normal
status: open
title: regex | behavior differs from documentation
type: behavior
versions: Python 2.7

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue23532
___

[issue23532] regex | behavior differs from documentation

2015-02-26 Thread Mark Shannon


Mark Shannon added the comment:

This looks like the expected behaviour to me.
re.sub matches the leftmost occurrence and the regular expression is greedy, so 
(x|xy) will always match xy if it can.

--
nosy: +Mark.Shannon

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue23532
___

[issue23532] regex | behavior differs from documentation

2015-02-26 Thread Rick Otten


Rick Otten added the comment:

Can the documentation be updated to make this more clear?

I see now where the clause "As the target string is scanned, ..." is describing 
what you have listed here.

A coworker and I both read the description several times and missed that.  I 
thought it first tried "incorporated" against the whole string, then tried 
" inc" against the whole string, etc.  In fact it was trying each alternative, 
"incorporated" and " inc" and the others, against the first position of the 
string, and then again against the second position.

Since I want to force the order against the whole string before trying the next 
one for my particular use case, I'll do a series of re.subs instead of trying 
to do them all in one.  It makes sense now and is easy to fix.
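
A minimal sketch of that "series of re.subs" approach, assuming the suffixes should be tried in priority order against the whole string. The names `SUFFIXES` and `strip_suffixes` are hypothetical and not from the issue.

```python
import re

# Highest-priority suffix first; each one is applied to the whole string
# before the next one is considered.
SUFFIXES = ['incorporated', ' inc', 'llc', 'corporation', 'corp', ' co']

def strip_suffixes(name):
    for suffix in SUFFIXES:
        name = re.sub(re.escape(suffix), '', name)
    return name

print(repr(strip_suffixes('rwo incorporated')))
```

Since each suffix here is a literal string, `str.replace` would work equally well; `re.sub` is kept to mirror the calls in the report.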

Thanks for looking at it and explaining what is happening more clearly.  It was 
really not obvious.  I tried at least 100 variations and wasn't seeing the 
pattern.

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue23532
___

[issue23532] regex | behavior differs from documentation

2015-02-26 Thread Matthew Barnett


Matthew Barnett added the comment:

@Mark is correct, it's not a bug.

In the first example:

It tries to match each alternative at position 0. Failure.
It tries to match each alternative at position 1. Failure.
It tries to match each alternative at position 2. Failure.
It tries to match each alternative at position 3. Success. ' inc' matches.

In the second example:

It tries to match each alternative at position 0. Failure.
It tries to match each alternative at position 1. Failure.
It tries to match each alternative at position 2. Failure.
It tries to match each alternative at position 3. Failure.
It tries to match each alternative at position 4. Success. 'incorporated' 
matches. ('inc' is a later alternative; it's considered only if the earlier 
alternatives have failed to match at that position.)
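
The two traces above can be reproduced with `re.search`, which reports where the first match starts and which alternative produced it. This is a small sketch; the variable names are chosen only for illustration.

```python
import re

mystring = 'rwo incorporated'

# First example: ' inc' matches at position 3, before 'incorporated'
# ever gets a chance at position 4.
first = re.search('incorporated| inc|llc|corporation|corp| co', mystring)

# Second example: nothing matches at positions 0-3, so the scan reaches
# position 4, where 'incorporated' is the first alternative tried.
second = re.search('incorporated|inc|llc|corporation|corp| co', mystring)

print(first.start(), repr(first.group()))
print(second.start(), repr(second.group()))
```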

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue23532
___

[issue23532] regex | behavior differs from documentation

2015-02-26 Thread Rick Otten


New submission from Rick Otten:

The documentation states that "|" parsing goes from left to right.  This 
doesn't seem to be true when spaces (or \s) are involved.

Example:

In [40]: mystring
Out[40]: 'rwo incorporated'

In [41]: re.sub('incorporated| inc|llc|corporation|corp| co', '', mystring)
Out[41]: 'rwoorporated'

In this case " inc" was processed before "incorporated".
If I take the space out:

In [42]: re.sub('incorporated|inc|llc|corporation|corp| co', '', mystring)
Out[42]: 'rwo '

"incorporated" is processed first.

If I put a space with each, then " incorporated" is processed first:

In [43]: re.sub(' incorporated| inc|llc|corporation|corp| co', '', mystring)
Out[43]: 'rwo'

And if I use \s instead of a space, "\sinc" is processed first:

In [44]: re.sub('incorporated|\sinc|llc|corporation|corp| co', '', mystring)
Out[44]: 'rwoorporated'

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue23532
___

[issue23532] add example of 'first match wins' to regex | documentation?

2015-02-26 Thread R. David Murray


R. David Murray added the comment:

The thing is, what you describe is fundamental to how regular expressions work. 
 I'm not sure it makes sense to add a specific mention of it to the '|' docs, 
since it applies to all regexes.

--
assignee:  -> docs@python
components: +Documentation -Regular Expressions
nosy: +docs@python, r.david.murray
title: regex | behavior differs from documentation -> add example of 'first match wins' to regex | documentation?

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue23532
___

[issue22364] Improve some re error messages using regex for hints

2015-02-20 Thread Serhiy Storchaka


Serhiy Storchaka added the comment:

 Messages tend to be abbreviated, so I think that it would be better to just
 omit the article.

I agree, but this came from standard error messages, which are not 
consistent. I opened a thread on Python-Dev.

"expected a bytes-like object" and "expected str instance" are standard error 
messages raised in bytes.join and str.join, not in re. We could change them 
though.

 I don't think that the error message "bad repeat interval" is an improvement
 (Why is it bad? What is an interval?). I think that saying that the min
 is greater than the max is clearer.

Agree. I'll change this in re. What message is better in case of overflow: "the 
repetition number is too large" (in re) or "repeat count too big" (in regex)?

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue22364
___

[issue22364] Improve some re error messages using regex for hints

2015-02-18 Thread Serhiy Storchaka


Serhiy Storchaka added the comment:

Here is a patch for regex which makes some error messages be the same as in re 
with re_errors_2.patch. You could apply it to regex if new error messages look 
better than old error messages. Otherwise we could change re error messages to 
match regex, or discuss better variants.

--
Added file: http://bugs.python.org/file38171/regex_errors.diff

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue22364
___

[issue22364] Improve some re error messages using regex for hints

2015-02-18 Thread Matthew Barnett


Matthew Barnett added the comment:

Some error messages use the indefinite article:

expected a bytes-like object, %.200s found
cannot use a bytes pattern on a string-like object
cannot use a string pattern on a bytes-like object

but others don't:

expected string instance, %.200s found
expected str instance, %.200s found

Messages tend to be abbreviated, so I think that it would be better to just 
omit the article.

I don't think that the error message "bad repeat interval" is an improvement 
(Why is it bad? What is an interval?). I think that saying that the min is 
greater than the max is clearer.
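
To see which wording a given interpreter actually produces, the message can be captured from the exception. This is a minimal sketch; the helper name `describe_error` is hypothetical, and the exact text is not asserted here because it differs between Python versions.

```python
import re

def describe_error(pattern):
    # Return the compile-time error message, or None if the pattern is valid.
    try:
        re.compile(pattern)
    except re.error as exc:
        return str(exc)
    return None

# A repeat interval whose minimum (3) exceeds its maximum (1).
message = describe_error(r'a{3,1}')
print(message)
```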

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue22364
___

[issue22364] Improve some re error messages using regex for hints

2015-02-10 Thread Serhiy Storchaka


Serhiy Storchaka added the comment:

Updated patch addresses Ezio's comments.

--
Added file: http://bugs.python.org/file38080/re_errors_2.patch

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue22364
___

[issue22364] Improve some re error messages using regex for hints

2015-02-07 Thread Serhiy Storchaka


Serhiy Storchaka added the comment:

Here is a patch which unifies and improves re error messages. Added tests for all 
parsing errors. Now the error message always points to the start of the affected 
component, i.e. the start of the bad escape, group name or unterminated 
subpattern.

--
stage: needs patch -> patch review
Added file: http://bugs.python.org/file38035/re_errors.patch

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue22364
___

[issue22364] Improve some re error messages using regex for hints

2015-02-07 Thread Serhiy Storchaka


Serhiy Storchaka added the comment:

re_errors_diff.txt contains differences for all tested error messages.

--
Added file: http://bugs.python.org/file38036/re_errors_diff.txt

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue22364
___

[issue23191] fnmatch regex cache use is not threadsafe

2015-01-27 Thread Serhiy Storchaka


Changes by Serhiy Storchaka storch...@gmail.com:


--
resolution:  -> fixed
stage: patch review -> resolved
status: open -> closed

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue23191
___

[issue23191] fnmatch regex cache use is not threadsafe

2015-01-27 Thread Roundup Robot


Roundup Robot added the comment:

New changeset fe12c34c39eb by Serhiy Storchaka in branch '2.7':
Issue #23191: fnmatch functions that use caching are now threadsafe.
https://hg.python.org/cpython/rev/fe12c34c39eb

--
nosy: +python-dev

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue23191
___

[issue23191] fnmatch regex cache use is not threadsafe

2015-01-27 Thread Serhiy Storchaka


Changes by Serhiy Storchaka storch...@gmail.com:


--
assignee:  -> serhiy.storchaka

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue23191
___

[issue23318] (compiled RegEx).split gives unexpected results if () in pattern

2015-01-25 Thread Dave Notman


New submission from Dave Notman:

# Python 3.3.1 (default, Sep 25 2013, 19:30:50)
# Linux 3.8.0-35-generic #50-Ubuntu SMP Tue Dec 3 01:25:33 UTC 2013 i686 i686 i686 GNU/Linux

import re

splitter = re.compile( r'(\s*[+/;,]\s*)|(\s+and\s+)' )
ll = splitter.split( 'Dave  Sam, Jane and Zoe' )
print(repr(ll))

print( 'Try again with revised RegEx' )
splitter = re.compile( r'(?:(?:\s*[+/;,]\s*)|(?:\s+and\s+))' )
ll = splitter.split( 'Dave  Sam, Jane and Zoe' )
print(repr(ll))

Results:
['Dave', '  ', None, 'Sam', ', ', None, 'Jane', None, ' and ', 'Zoe']
Try again with revised RegEx
['Dave', 'Sam', 'Jane', 'Zoe']

--
components: Regular Expressions
messages: 234677
nosy: dnotmanj, ezio.melotti, mrabarnett
priority: normal
severity: normal
status: open
title: (compiled RegEx).split gives unexpected results if () in pattern
type: behavior
versions: Python 3.3

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue23318
___

[issue23318] (compiled RegEx).split gives unexpected results if () in pattern

2015-01-25 Thread SilentGhost


SilentGhost added the comment:

Looks like it works exactly as the docs[1] describe:

>>> re.split(r'\s*[+/;,]\s*|\s+and\s+', string)
['Dave', 'Sam', 'Jane', 'Zoe']

You're using capturing groups (parentheses) in your original regex, which 
causes the separators to be returned as part of the result.

[1] https://docs.python.org/3/library/re.html#re.split
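
A short sketch of the documented behaviour, using a sample string chosen for this illustration rather than the one from the report. With two capturing groups, every split point contributes one entry per group, and the group that did not take part in the match shows up as None.

```python
import re

text = 'a, b and c'

# Capturing groups: the separators (and None for the unused group) are kept.
with_groups = re.split(r'(\s*,\s*)|(\s+and\s+)', text)

# No capturing groups: only the pieces between separators are returned.
without_groups = re.split(r'\s*,\s*|\s+and\s+', text)

print(with_groups)
print(without_groups)
```

Non-capturing groups, `(?:...)`, give the second result while still allowing the alternatives to be grouped.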

--
nosy: +SilentGhost
resolution:  -> not a bug
status: open -> closed

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue23318
___
