On 03/06/14 17:24, jarod...@libero.it wrote:

with open("prova.csv") as p:

If processing csv files its usually better to use the csv modulew, it can handle many obscure quoting issues etc, especially if the data is sourced from, say, Excel.

In particular I recommend using the dict-reader which returns a
list of dictionaries based on your column names. This makes
writing list comprehensions to extract data sets much easier.

['programs ', 'sample', 'gene', 'values']
['program1', 'sample1', 'TP53', '2']
['program1', 'sample1', 'TP53', '3']
['program1', 'sample2', 'PRNP', '4']
['program1', 'sample2', 'ATF3', '3']
['program2', 'sample1', 'TP53', '2']
['program2', 'sample1', 'PRNP', '5']
['program2', 'sample2', 'TRIM32', '4']
['program2', 'sample2', 'TLK1', '4']


I want to create a dictionary with set data with the names of the genes:

example:
dic = {}

dic['program1-sample1] = set(TP53)

You could do this using the dict reader of csv module like

tp53set = [sample for sample in data if datum['gene'] == 'TP53']

Obviously you can select which sample columns to keep
as you require.

You ask for a set but don't illustrate what that set
would actually contain so I'm not sure what aspects of
the data you are trying to get at...

So If I have a dictionary like that I can compare two set  I will compare the
capacity of the programs in function of the gene show.

Sorry, that was just gobbledegook to me. No idea what you mean.


--
Alan G
Author of the Learn to Program web site
http://www.alan-g.me.uk/
http://www.flickr.com/photos/alangauldphotos

_______________________________________________
Tutor maillist  -  Tutor@python.org
To unsubscribe or change subscription options:
https://mail.python.org/mailman/listinfo/tutor

Reply via email to