Hello,
I have a project and need someone to help with it. The details are below.
If you are able to help, please email me as soon as possible and let me know
how much it will cost.

Thanks

Details:
Subject: computer programming in Python


You will download two data files: the 2010 U.S. Mortality Data and the ICD10
code file. Both are freely available from the CDC website:
2010 U.S. Mortality data:
ftp://ftp.cdc.gov/pub/Health_Statistics/NCHS/Datasets/DVS/mortality/mort2010us.zip
ICD 10 code and description file:
ftp://ftp.cdc.gov/pub/Health_Statistics/NCHS/Publications/ICD10/allvalid2009(detailed titles headings).txt
For the second file, you are allowed to preprocess it before you analyze it. 
For instance, you may open the text file in Microsoft Excel and only keep the 
two needed columns (ICD10 code and the corresponding description) and remove 
all the other columns.
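
One way the preprocessed ICD10 file could be read is sketched below. It
assumes the two kept columns were saved from Excel as tab-separated text
(code, then description). The function name load_icd10_descriptions and the
sample codes and descriptions are placeholders for illustration only, not
real ICD10 entries.

```python
import csv
import io


def load_icd10_descriptions(lines):
    """Build a {code: description} dict from tab-separated lines.

    Assumes each line looks like 'CODE<TAB>DESCRIPTION'; lines with
    fewer than two columns (for example blank lines) are skipped.
    """
    descriptions = {}
    for row in csv.reader(lines, delimiter="\t"):
        if len(row) < 2:
            continue
        code = row[0].strip()
        description = row[1].strip()
        if code:
            descriptions[code] = description
    return descriptions


# In-memory stand-in for the preprocessed file (placeholder data).
sample_file = io.StringIO(
    "A000\tplaceholder description one\n"
    "B111\tplaceholder description two\n"
)
icd10 = load_icd10_descriptions(sample_file)
print(icd10.get("A000", "unknown code"))
```

For the real file, an open file object can be passed in place of the
in-memory sample.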
In this project, you are required to extract the following data items for each 
entry from the mortality data file: sex, age, race, education, marital status, 
manner of death, and ICD10 code for the reason of death. You are then required 
to analyze the extracted data and answer the following questions:
1) The male to female ratio (10 points)
2) The distribution of age. You may split all people into 12 groups according 
to their age: 0, 1-10, 11-20, 21-30, 31-40, 41-50, 51-60, 61-70, 71-80, 81-90, 
91-100, > 100. You may then count how many people were in each group. (10 
points)
3) The distribution of race. Similarly, you may categorize all people into 
groups according to their race, using the race categories recorded in the 
data file, and report how many people were in each group. (10 points)
4) The distribution of education. Same approach as above. (10 points)
5) The distribution of marital status. Same approach as above. (10 points)
6) The distribution of manner of death. Same approach as above. (10 points)
7) The top 10 leading causes of death. You may first find the 10 most 
frequent ICD10 codes by counting how often each code occurs, then look up 
the description of each code in the ICD10 code dictionary. (10 points)
8) Correlation between education and age at death. To calculate the 
correlation coefficient, you should convert both data columns into integers. 
(10 points)
9) Correlation between race and age at death. Same approach as above. (10 
points)
10) Correlation between marital status and age at death. Same approach as 
above. (10 points)
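
For questions 8 to 10, a minimal sketch of a correlation calculation is
given below. It assumes the Pearson correlation coefficient is what is
wanted (the assignment does not name one) and that both columns have already
been converted to integers. The function name and the sample numbers are
made up for illustration. For categorical fields such as race or marital
status, the result depends on which integer is assigned to each category.

```python
import math


def pearson_correlation(xs, ys):
    """Return the Pearson correlation coefficient of two numeric lists."""
    if len(xs) != len(ys):
        raise ValueError("both columns must have the same length")
    if len(xs) < 2:
        raise ValueError("at least two data points are needed")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations, and sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant column")
    return cov / math.sqrt(var_x * var_y)


# Made-up integer columns, only to show the call.
education_codes = [1, 2, 3, 4, 5]
ages_at_death = [60, 65, 63, 70, 72]
print(pearson_correlation(education_codes, ages_at_death))
```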
Hint: For questions 2 through 6, you may write a single function to do the 
counting, since they share common steps. (Not mandatory)
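
The hint could be followed roughly as sketched below: one counting function
shared by questions 2 through 6, plus a helper that maps an age in whole
years to the 12 groups listed in question 2. The names distribution and
age_group, the sample values, and the "M"/"F" sex codes are assumptions for
illustration. The actual codes and the way age is stored must be taken from
the data file's documentation.

```python
from collections import Counter


def age_group(age):
    """Map an age in whole years to one of the 12 groups in question 2."""
    if age < 0:
        raise ValueError("age cannot be negative")
    if age == 0:
        return "0"
    if age > 100:
        return "> 100"
    lower = ((age - 1) // 10) * 10 + 1  # 1, 11, 21, ..., 91
    return f"{lower}-{lower + 9}"


def distribution(values, key=None):
    """Count how often each value (or key(value), if given) occurs."""
    if key is None:
        return Counter(values)
    return Counter(key(value) for value in values)


# Made-up sample columns, only to show the calls.
ages = [0, 5, 10, 11, 47, 100, 101]
sexes = ["M", "F", "M", "F", "M", "M", "F"]  # hypothetical codes

print(distribution(ages, key=age_group))
sex_counts = distribution(sexes)
print(sex_counts["M"] / sex_counts["F"])  # male to female ratio (question 1)
```

The same distribution function can be called on the race, education,
marital status, and manner of death columns without a key.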


_______________________________________________
Tutor maillist  -  Tutor@python.org
To unsubscribe or change subscription options:
https://mail.python.org/mailman/listinfo/tutor
