Re: [Rdkit-discuss] Pandas dataframe manipulation

2016-03-11 Thread Paul Czodrowski
axis=0)

Paul

From: Maciek Wójcikowski [mailto:mac...@wojcikowski.pl]
Sent: Friday, 11 March 2016 12:29
To: Paul Czodrowski <paul.czodrow...@merckgroup.com>
Cc: rdkit <rdkit-discuss@lists.sourceforge.net>
Subject: Re: [Rdkit-discuss] Pandas dataframe manipulation

Hi Paul

Re: [Rdkit-discuss] Pandas dataframe manipulation

2016-03-11 Thread Maciek Wójcikowski
Hi Paul,

I would suggest:
- setting the dtype of the dataframe/column to str/np.object
- cleaning up the IC50s
- casting to float/int with dataframe.astype()

Alternatively, you could use the "converters" argument:

pd.read_csv('filename.csv', converters={'ic50_colname': lambda x: x.replace('>',
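
For anyone reading the archive, here is a minimal sketch of both routes suggested above. It is not the code from the original message, which is cut off in this preview. The column name 'ic50', the sample rows, and the helper clean_ic50 are hypothetical, and the sketch assumes the only non-numeric characters are '>' and '<' qualifiers.

```python
import io

import pandas as pd

# Hypothetical sample data; the real file and column names will differ.
CSV_TEXT = "compound,ic50\nA,12.5\nB,>1000\nC,<0.1\n"


def clean_ic50(value):
    """Strip '>' and '<' qualifiers (an assumed format) and parse as float."""
    return float(value.replace('>', '').replace('<', '').strip())


# Route 1: read the column as strings, clean it, then cast with astype().
df = pd.read_csv(io.StringIO(CSV_TEXT), dtype={'ic50': str})
df['ic50'] = (
    df['ic50']
    .str.replace(r'[<>]', '', regex=True)  # drop the qualifiers
    .str.strip()
    .astype(float)
)

# Route 2: clean each value while parsing, via the converters argument.
df2 = pd.read_csv(io.StringIO(CSV_TEXT), converters={'ic50': clean_ic50})

print(df)
print(df2)
```

Both routes should give a float column. Note that simply dropping the qualifier discards the information that a value was a bound rather than a measurement; if that matters, it may be worth keeping the qualifier in a separate column before stripping it. Empty cells would also need handling in clean_ic50, since float('') raises an error.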