Dear Sir/Madam,

I am a PhD student at the University of Oxford. I recently came across the Google Summer of Code program and am keen to participate.

After looking through the ideas of a number of mentoring organizations, I found that developing advanced data analysis tools for the OpenOffice.org Calc application would be the most suitable project for me, as it would make use of my research experience in data analysis and programming. Please find attached to this email my detailed specification and a description of the outcome for the proposed project. I look forward to hearing from you shortly and hope that you will be willing to support my application to the Summer of Code program as my mentor.

Yours sincerely,
Jason Wong
Name: Jason Wong
Email: [EMAIL PROTECTED]
Project Title: Development of an advanced data analysis package for OpenOffice.org Calc
Synopsis: Create one or more tools that perform advanced statistical and machine learning analysis tasks, such as ANOVA, FFT cross-correlation and neural networks, to complement the existing tools and integrate with OpenOffice.org Calc.
Project specifications: The need to analyze datasets is becoming increasingly common in many disciplines of research, from biology to sociology. To gain information from these datasets, researchers often have to apply a range of multivariate analysis methods. Spreadsheet programs remain one of the most common tools in which such data analyses take place. While simple statistical analysis tools such as t-tests, correlation and linear regression are available, a more advanced set of tools would undoubtedly be of interest to many users.
This project proposes that a set of advanced data analysis tools be developed and integrated with Calc, with cell access through the OpenOffice.org API.
Examples of the tools to be developed include:
The tools will be accessed through a dialog that enables users to select the cells for analysis and to enter the relevant parameters.
Further, documentation with examples will be written for all implemented functions to help users understand their usage.
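To illustrate the kind of worked example such documentation might contain, below is a minimal sketch of a one-way ANOVA F statistic in Java. It is not the proposed implementation: the class name `AnovaSketch` and the method `oneWayF` are hypothetical, the input is a plain array rather than a cell range, and access to cells through the OpenOffice.org API is deliberately left out.

```java
/**
 * Minimal sketch of a one-way ANOVA F statistic.
 * Class and method names are hypothetical (illustration only);
 * no OpenOffice.org API calls are made here.
 */
public final class AnovaSketch {

    private AnovaSketch() {
    }

    /**
     * Returns F = (between-group mean square) / (within-group mean square).
     * Each inner array holds the observations of one group.
     */
    public static double oneWayF(double[][] groups) {
        int k = groups.length;
        if (k < 2) {
            throw new IllegalArgumentException("at least two groups are required");
        }

        // Total number of observations and grand mean.
        int n = 0;
        double grandSum = 0.0;
        for (double[] group : groups) {
            if (group.length == 0) {
                throw new IllegalArgumentException("groups must not be empty");
            }
            for (double value : group) {
                grandSum += value;
            }
            n += group.length;
        }
        if (n <= k) {
            throw new IllegalArgumentException("more observations than groups are required");
        }
        double grandMean = grandSum / n;

        // Sums of squares between groups and within groups.
        double ssBetween = 0.0;
        double ssWithin = 0.0;
        for (double[] group : groups) {
            double sum = 0.0;
            for (double value : group) {
                sum += value;
            }
            double mean = sum / group.length;
            ssBetween += group.length * (mean - grandMean) * (mean - grandMean);
            for (double value : group) {
                ssWithin += (value - mean) * (value - mean);
            }
        }

        double msBetween = ssBetween / (k - 1); // k - 1 degrees of freedom
        double msWithin = ssWithin / (n - k);   // n - k degrees of freedom
        return msBetween / msWithin;
    }

    public static void main(String[] args) {
        // Made-up example data: three groups of three observations each.
        double[][] groups = {{1, 2, 3}, {2, 3, 4}, {5, 6, 7}};
        System.out.println("F = " + oneWayF(groups));
    }
}
```

For the made-up data in `main`, a hand calculation gives a between-group mean square of 13 and a within-group mean square of 1, so F should be 13 up to floating-point rounding. A result of this kind could be compared with the output of an existing package, in line with the verification described in the outcome below.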
Description of outcome: A set of data analysis tools, as described in the specification, will be developed for OpenOffice.org Calc such that they can be used directly within the spreadsheet via the OpenOffice.org API. All tools will be thoroughly tested and verified against existing tools from other software vendors (such as Matlab).
Benefits to OpenOffice: An advanced data analysis package will be a significant feature of Calc, as it will give users the ability to perform rapid analysis of data within the spreadsheet program. The inclusion of such a package with OpenOffice.org Calc will undoubtedly increase its appeal to users, as such a feature is not available even in other well-known commercial packages.
Project Schedule: The duration of the Summer of Code, followed by any updates or fixes as necessary.
Bio: I am a PhD research student in the Physical and Theoretical Chemistry research laboratory at the University of Oxford. My current research involves the processing and analysis of large spectral datasets for the purpose of disease diagnosis.
My work involves applying a large number of statistical and machine learning methods to the analysis and deconvolution of these datasets. Further, I have solid programming experience in C++, Java and Visual Basic, gained through developing a number of custom in-house software tools to analyze my datasets.
I wish to contribute to the development of a statistical package for OpenOffice.org for a number of reasons:
