On 30/09/2013 21:29, Leena Gupta wrote:
Hello,

I have a TSV file that has the city,state,country information in this
format:
Name               Display name      Code
San Jose          SJC                     SJC - SJ (POP), CA (US)
San Francisco  SFO                    SFO - SF, CA (US)

I need to extract the state and country for each city from this file.
I'm trying to do this in python by using the following Regex:

s=re.search(',(.*?)\(',text)
                if s:
                    state=s.group(1).strip()
c=re.search('\((.*?)\)',text)
                if c:
                    country=c.group(1).strip()


This works well for the state. But for country for San Jose, it brings
the following:
country = POP

I think it maybe better to search from the end of the string,but I am
unable to get the right syntax. Could you please share any pointers?

Thanks!


I'd be strongly inclined to use the CSV module from the standard library with an excel-tab dialect name, see http://docs.python.org/3/library/csv.html#module-csv

Please try it and if you encounter any problems feel free to get back to us, we don't bite :)
--
Cheers.

Mark Lawrence

_______________________________________________
Tutor maillist  -  Tutor@python.org
To unsubscribe or change subscription options:
https://mail.python.org/mailman/listinfo/tutor

Reply via email to