Richard,
There's the corresp() function in library(MASS) and Fionn Murtagh's code to go with his correspondence analysis book is available over the web.
This is very common practice with R books.
While playing with the data, I was struck by two prominent lines I kept seeing: table(loc_written) 0 1 2 3 4 5 6 7 2 3 3 8 1 7 3 3 ^ ^ I don't suppose it has any significance at all for your results, but I wonder why the loc_written data were so clumpy.
That 8 caught my eye, it should be 7 (a typo). I checked the other numbers and they are correct. What this is saying is that developers don't have a clue how many lines of code they have read/written (see extract of question below). In places they are not even consistent and there is a poor correlation with experience (0s indicate no answer given, which should really be NA). ----------------------- How many lines of code would you estimate you have \fBwritten\fR in different languages over your career: .RS .IP i) 50,000 .IP ii) 75,000 .IP iii) 100,000 .IP iv) 150,000 .IP v) 200,000 .IP vi) 275,000 .IP vii) 350,000+ .RE .IP b) How many lines of code would you estimate you have \fBread\fR in different languages over your career: .RS .IP i) 75,000 .IP ii) 100,000 .IP iii) 150,000 .IP iv) 200,000 .IP v) 300,000 .IP vi) 500,000 .IP vii) 800,000+
-- Derek M. Jones tel: +44 (0) 1252 520 667 Knowledge Software Ltd blog:shape-of-code.coding-guidelines.com Source code analysis http://www.knosof.co.uk -- The Open University is incorporated by Royal Charter (RC 000391), an exempt charity in England & Wales and a charity registered in Scotland (SC 038302).