me wrote:
<code>
I'm new to regexes and trying to get a list of all my C++ methods with balanced parentheses, as follows.


#find all c++ method prototypes with a '::' in the middle
#upto and including the 1st closing parenthesis
pattern_upto_1st_closed_parenth = re.compile('\w+::\w+\([^)]*\)')
match_upto_1st_closed_parenth = re.findall(pattern_upto_1st_closed_parenth,txt)
num_of_protos = len(match_upto_1st_closed_parenth)

for i in range (0,num_of_protos-1):
This should actually be range(0, num_of_protos).

   num_of_open_parenths = match_upto_1st_closed_parenth[i].count('(')
   #expand the pattern to get all of the prototype
   #ie upto the last closed parenthesis
   #saying something like
   pattern = re.compile(\
'match_upto_1st_closed_parenth[i]+\ (([^)]*\)){num_of_open_parenths-1}'\
                       )
   #====================================================================
   #HELP!!!!!! I'm not sure how to incorporate:
   #1 'match_upto_1st_closed_parenth[i]' into the above extended pattern???
   #2 the count 'num_of_open_parenths' instead of a literal ???
   #====================================================================




#=======================================
#if I could do it, this sort of thing would appear to offer the neatest solution
pattern_upto_last_balanced_parenthesis  = re.compile('
(\w+::\w+\([^)]*\))\
([^)]*\)){\1.count('(')-1}
')

#=======================================

Should I be using regexes to do this?

I've only put \ line extensions to separate the pattern components to assist readability

Thx
</code>
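
On the two numbered questions in the quoted message: a matched string can be embedded in a new pattern with re.escape(), and a computed count can be dropped into the {n} quantifier with ordinary string formatting. A minimal sketch, assuming a hypothetical helper name (extend_to_balanced) that is not from the original post:

```python
import re

def extend_to_balanced(txt, prefix):
    """Hypothetical helper: extend `prefix` (the text up to the first ')')
    by one further ')' group for each '(' it leaves unclosed."""
    # The first ')' is already part of prefix, hence the -1.
    extra = prefix.count('(') - 1
    # re.escape() makes the literal text safe to embed in a pattern;
    # %d puts the computed count into the {n} quantifier.
    pattern = re.compile(re.escape(prefix) + r'(?:[^)]*\)){%d}' % extra)
    match = pattern.search(txt)
    return match.group() if match else None
```

For example, extend_to_balanced('void Foo::bar(int (a), int b) { }', 'Foo::bar(int (a)') should give 'Foo::bar(int (a), int b)'.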

Not necessarily the best way, but:

methods = []
# The pattern for the start of the method's header.
start_pattern = re.compile(r'\w+::\w+\([^)]*\)')
# Start at the beginning of the text.
pos = 0
while True:
    # Search for the start of the next method's header.
    start_match = start_pattern.search(txt, pos)
    if not start_match:
        break
    # Search for the end of the method's header.
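    # Expect one closing parenthesis per opening one seen so far, and
    # search from the start of this header rather than from pos.
    end_pattern = re.compile(r'(?:[^)]*\)){%d}' % start_match.group().count('('))
    end_match = end_pattern.search(txt, start_match.start())
    if not end_match:
        break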
    methods.append(txt[start_match.start() : end_match.end()])
    # Continue the next search from where we left off.
    pos = end_match.end()
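
As for whether regexes are the right tool: counting the '(' seen before the first ')' can come up short when another group opens later in the header, as in A::f(int (*p)(int), int x). One alternative is to use the regex only to find the start and then track the nesting depth by hand. A rough sketch, assuming a hypothetical function name (find_method_headers) and ignoring parentheses inside comments or string literals:

```python
import re

# Like the start pattern in the reply, but stopping at the opening '('.
START = re.compile(r'\w+::\w+\(')

def find_method_headers(txt):
    """Return each 'Class::method(...)' span with balanced parentheses."""
    headers = []
    pos = 0
    while True:
        start = START.search(txt, pos)
        if not start:
            break
        depth = 1                  # START consumed the opening '('
        i = start.end()
        while i < len(txt) and depth:
            if txt[i] == '(':
                depth += 1
            elif txt[i] == ')':
                depth -= 1
            i += 1
        if depth:                  # ran out of text before balancing
            break
        headers.append(txt[start.start():i])
        pos = i                    # continue after this header
    return headers
```

Called on 'void A::f(int (*p)(int), int x) {}', this should return ['A::f(int (*p)(int), int x)'].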
--
http://mail.python.org/mailman/listinfo/python-list
