Hi All---
Sorry for posting again, but I have a problem that I tried several different ways to solve w/o success. I approached the problem from one angle and asked about it here; I got some good input using pandas, and structured array, but I am new to python and not very familiar with either to use at this moment. I decided to go about in a different direction. I am hoping for a simpler solution using Numpy. I have a csv file with 4 columns and 2000 rows. There are 10 variables in column 1 and 4 variables on each column, 2 and 3. I read the csv file and converted it to arrays. The problem I ran into and could not resolve is 2-fold: (1) change the datatype for columns 1 and 4 to float and (2) then, I want to use Numpy-or simpler method- to calculate the mean of the data points on column 4 based on each variable on column 1 and column 2. Below is my code and sample data file. Here is part of my code: import numpy as np import csv TMatrix=[] np.set_printoptions(precision=2) " Converting csv to lists " with open('c:/Users/My Documents/AAA/temp1.csv') as temp: reader = csv.reader(temp, delimiter=',', quoting=csv.QUOTE_NONE) for row in reader: TMatrix.append(row) " converting lists to arrays " TMatrix=np.array(TMatrix) TMatrix=np.array(4,TMatrix[1:,::],dtype='float,int,int,float') # this statement is not working +++++++++++++++ This is a sample of my file +++++++++++++ ['19' 'A4' 'B2' '2'] ['19' 'A5' 'B1' '12'] ['18' 'A5' 'B2' '121']] Thanks in advance EK Esawi _______________________________________________ Tutor maillist - Tutor@python.org To unsubscribe or change subscription options: https://mail.python.org/mailman/listinfo/tutor