[R] [R-pkgs] The *makesweave* Package for Using Make with Sweave Efficiently

2010-11-18 Thread Charlotte Maia
Hi R Community,

I've just completed the initial version of (Linux-based) R package,
called *makesweave*, for using Make to build Sweave documents
efficiently.

The idea is that R is started once (per shell) as a background
process, then Make builds each Sweave source file, using the same R
instance.

More information in the package vignette.

Feedback very welcome.


regards
-- 
Charlotte Maia
http://sites.google.com/site/maiagx

___
R-packages mailing list
r-packa...@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-packages

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] [R-pkgs] Yet Another Package for Time Data

2010-06-01 Thread Charlotte Maia
Hi fellow R developers/users,

I've recently revised a package called rtv, and now consider it
reasonably stable.

Description: A package for conveniently representing, manipulating and
visualising time data. Here, time is regarded as a random variable,
and objects are used to represent realisations of that random
variable. This is particularly useful for change points, irregular
timeseries and failure events. There's a strong emphasis on continuous
representations of time, with user-specified origins and units.

http://cran.r-project.org/web/packages/rtv/index.html

The package contains classes notably similar to the POSIXt classes.
However, the classes are based on a somewhat different philosophy.
The major advantage of the package, is that crtv objects (similar to
POSIXct objects) allow user-specified origins and units.
It also allows time events to be represented as fractional months or
fractional years.

More info in the package vignette.

Bug reports and feature requests welcome.


kind regards
-- 
Charlotte Maia
http://sites.google.com/site/maiagx

___
R-packages mailing list
r-packa...@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-packages

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] The KJV

2010-02-05 Thread Charlotte Maia
Hey all,

Does anyone know if there are any R packages with a copy of the KJV?
I'm guessing the answer is no...

So the next question, and the more important one is:
Does anyone think it would be useful (e.g. for text-mining purposes)?
I know almost nothing about theology,
so I'm not sure what kind of questions theologists might have (that R
could answer).

An alternative, that would achieve a similar result (I think),
would be an R interface to another open source system, such as Sword.


kind regards
-- 
Charlotte Maia
http://sites.google.com/site/maiagx

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Emacs vs Eclipse vs Rcmdr

2010-01-03 Thread Charlotte Maia
Hi everyone,

I could have posted this on R-devel or the GUI list, however don't
feel it's that serious.
Hence, decided R-help would be the most appropriate.

I'm not so much interested in which is the best user interface for R.
Rather which is the best ***platform*** for developing ***new*** user
interfaces for R.
Noting I'm using the term user interface is a very general sense.
(i.e. Can include anything from console/pseudoterminal widgets, to
text editors with customised syntax highlighting, to elaborate menus
and dialog boxes).

Here are my initial thoughts:

Emacs Pros:
- A lot of computer experts use it.
- Plus some high profile R people are involved in the development of ESS.
- High level of customisation.

Emacs Cons:
- Need to know Lisp.
- Counter intuitive.
- It's really ugly.
- No decent widget set (which is probably why it's ugly).

Eclipse Pros:
- It's kind of fashionable and nice looking.

Eclipse Cons:
- Unnecessarily complicated.
- Need to know SWT (and maybe XML too?).
- The process for installing (and finding) add on packages, is terrible.

Rcmdr Pros and Cons:
- I haven't used it for a long time, so can't really comment.
- However, I was surprised by how many reverse dependencies it has. So
I will assume it has some potential.

Other people's thoughts welcome...


kind regards
-- 
Charlotte Maia
Open Source Developer and Statistician
http://sites.google.com/site/maiagx

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] [R-pkgs] Multivariate ECDFs

2009-12-10 Thread Charlotte Maia
Hey R people,

I have just put a package on CRAN, mecdf 0.2.1.

It computes multivariate ECDFs.
i.e. Estimates (or perhaps I should say evaluates) a multivariate
cumulative distribution function, using data, without any assumptions
per se.
Plus contrary to my own advise, the vignette contains some pretty
pictures of the bivariate normal...

The current package is relatively simple.
However, I have searched for this topic, and found very little
information, so I regard this as a new(ish) topic.

The package is still new and relatively untested.
I'm planning to implement smooth(ish) distribution functions in the
near future. e.g. Interpolating the step function (possibly with some
other adjustments).
Suggestions (including criticisms) welcome, especially while the package is new.


Merry Christmas, group hug...
-- 
Charlotte Maia
http://sites.google.com/site/maiagx/home

___
R-packages mailing list
r-packa...@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-packages

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Replying to Posts Within Same Thread

2009-12-03 Thread Charlotte Maia
On Thu, Dec 3, 2009 at 1:15 AM, Ted Harding
ted.hard...@manchester.ac.uk wrote:
 There is an issue which does not seem to have been raised yet in
 this thread. In her original post, Charlotte Maia said she was
 receiving messages in Digest form.

 There are two Digest options: Get MIME or Plain Text Digests?
 You can choose which one to use, if you opt for Digest format,
 when setting your subscription options in the R-help configuration
 page at

  https://stat.ethz.ch/mailman/options/r-help

 In a plain-text Digest, all the messages are bundled into a single
 message. Therefore when you reply to one of these you have to edit
 out everything except the bit you are replying to; and also the
 Thread-relevant headers of the original messages will heve been
 suppressed -- only headers for the Dugest message will be present,
 and these will have nothing to do with the threads that the separate
 original messages belonged to.

 On the other hand, in a MIME Digest, each of the original messages
 is attached as a separate attachment, along with its original headers.
 A mail agent which is MIME-Digest-aware will allow the user to
 open up each attachment separately, as a single message, and reply
 to that. Then the mail agent *should* incorporate these headers,
 including those which identify the thread.

 So, if you use Digest format, choose MIME Digest, and use a mail
 agent which works properly!

 Ted.

Hi,

I have changed my settings from plain text to MIME (doesn't apply to
this post, which is a response to a regular email, with r-help on the
cc line).

We'll see how things go, if my future posts appear threaded properly
then we can assume this was the issue. If not, then I'll keep playing
around.

Me being the old-school kind of computer geek that I am, naturally go
for plain text...

thanks to all those who responded to this topic
-- 
Charlotte Maia
http://sites.google.com/site/maiagx/home

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Re Off topic - Compendium of distributions

2009-12-03 Thread Charlotte Maia
Hi,

I am going to sound mean here, however I don't feel the document is
very comprehensive. Maybe concise is a better word.

I quickly looked through the document.

The biggest problem is that there is very little discussion on
multivariate distributions. Noting that multivariate distributions
play a critical role in statistical theory, plus are gaining an
increasing number of applications in various fields.

I saw no discussion of distributions with circular, cylindrical, or
spherical sample spaces (and for that matter the descriptions of
bounded and unbounded are wrong).
No discussion of copulas and no discussion of distributions that mix
discrete and continuous random variables.

Plus despite saying that your organisation supports nonparametric
approaches, there was no obvious serious examples (maybe they are
hidden in there somewhere...).

It's important to note that as this is an R mailing list, many readers
will have degrees in computer science, mathematics and statistics.
Plus many will have looked at CRANs task view for distributions. I'd
be surprised if there are many readers who are not already familiar
with basic distributions such as normal, univariate, etc.

Perhaps for people that have done one, maybe two, statistics courses
at university, and majored in something entirely different, they very
well may be quite impressed.

Whilst this post is critical, I genuinely wish you the best of luck
with your business, I hope your stakeholders and clientele are
impressed by it.


regards
-- 
Charlotte Maia
http://sites.google.com/site/maiagx/home

(This is a response to website, not the MIME digest).

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Re shorten str() output for long list

2009-12-03 Thread Charlotte Maia
I doubt str is intended to work the way you want it to.
I certainly wouldn't use it that way.

The choice of data structure here is inappropriate, use a vector not a list.

If you absolutely must do what you are asking, then simply write your
own function. Use a heuristic for object length, say if less than 200
do such and such, otherwise do something else.

In regards to some of your other posts, I think the R documentation is
very good.
Have you ever tried writing software documentation? It's a lot of
work, which makes the R documentation in the standard packages quite
impressive.

If using a function for a subject area I'm not familiar with, I expect
the unexpected.

For mature software, the arguments section is extremely useful. Often
when I'm using a function I forget the exact arguments, so have to
look them up. For functions that I'm not familiar with, or unclear
about, then I read the details section. Sometimes I might read it
several times.


-- 
Charlotte Maia
http://sites.google.com/site/maiagx/home

(this is a response to the website, not the MIME digest)

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Again on overlaying plots (a plot region within a plot region)

2009-12-02 Thread Charlotte Maia
Hi Ottorino-Luca,

I'm not sure how flexible the layout command is off the top of my head.
I have never seen an example of what you describe using layout, so I
will assume it can't be done.

One option is to use grid and lattice.
This is possibly the best option if you are planning to do a lot of
this sort of thing.

Another option is to plot the second plot, on top of the first plot,
using primitives such as points and lines, transforming your
coordinates so that they match the coordinates of the first plot. This
will be very difficult if you wish to put titles and axis on the
nested plot, however simple if you restrict yourself to points and
lines. Finding documentation and examples of R's graphical primitives
should be easy.

At the risk of sounding crude, I'm pretty sure both latex and many
word processors allow one position one figure on top of another. This
would seem the simplest solution for a one off job.

Further to my post on R-devel, I recommend not using png format,
except where absolutely necessary.

regards
-- 
Charlotte Maia
http://sites.google.com/site/maiagx/home

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Distance between sets of points in transformed environmental space

2009-12-01 Thread Charlotte Maia
Well, here's another naive post from me (hopefully better than the last one).

Firstly I'm not sure computing euclidean distance is that simple. I
would assume temperatures and precipitation would need to be
standardised in some way.

I think the notion of how far away something is, and how distinct
location wise something is, are quite different, so maybe two
measures?

For distance per se, I think your first idea is the best.
Plus simple is always good...

For distinctness, given one one of two sets, for each point, you could
just compute the closest point to it. If the closest point is a member
of the same set, we will call that a + point, if the closest point is
a member of the other set, we will call it a - point. In principle the
measure of distinctness would be the sum of the +'s, however there
might need to be some scaling to take into account the number of
points in each set.

There are also a lot of fancy things out there, so someone will
probably come up with a much fancier (and possibly better) idea than
this.

Well, that's just my rant, before I go to bed.


kind regards
-- 
Charlotte Maia
http://sites.google.com/site/maiagx/home

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Distance between sets of points in transformed environmental space

2009-12-01 Thread Charlotte Maia
Hi Corrado,

I was thinking about this some more.

Maybe you could use a linear discriminate, i.e. a (hyper)plane that
partitions your points into two sets, such that the misclassification
rate is minimised.

Closeness could be regarded as the number of misclassified points.
Two sets would be distant, if no points are misclassified.

I am assuming there is a standard function in R to do this, no idea
what it is though. Plus this is a reasonably well known technique.

Again the size of the sets needs to be accounted for.
As well as the question, does the distance of set A from B, need to be
the same as the distance of set B from A. Both the nearest neighbour
approach and the discriminant approach, don't necessarily satisfy this
condition.

regards
-- 
Charlotte Maia
http://sites.google.com/site/maiagx/home

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Replying to Posts Within Same Thread

2009-12-01 Thread Charlotte Maia
Hi,

When I reply to a post, it generally appears as a separate thread,
rather than branching off the original post.

Anyone know what I'm doing wrong?

Noting I am currently receiving a couple of the lists in digest form.


kind regards
-- 
Charlotte Maia
http://sites.google.com/site/maiagx/home

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


Re: [R] Replying to Posts Within Same Thread

2009-12-01 Thread Charlotte Maia
On 12/2/09, John Sorkin jsor...@grecc.umaryland.edu wrote:
 I don't know what you are doing wrong because I don't know exactly what you 
 are doing. I do know that I don't have your problem when I simply reply to a 
 message without touching the subject line.
 John

Could you clarify the notion of simply reply.
There is no obvious reply links in either the archive page, or the digest.

In the case, where someone sends me an email, such have you have done,
I can simply reply.

However, in general this option is not available.

-- 
Charlotte Maia
http://sites.google.com/site/maiagx/home

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Re How to remove R banner?

2009-12-01 Thread Charlotte Maia
Hi,

Annoying
I love it (except possibly when re-directing standard out to a file).

I think it's one of the command line options (in which case, it will
be clearly documented).
No idea how it works on the GUI systems (maybe an option somewhere...).

In saying that, I have to assume that if people (you?) want control
over R, they are using the *command line in the first place. The GUI
systems represent a complete loss of control (esp the mac version,
omg).

There's always that in-between group who use Emacs/ESS, won't go there...

*I use gnome-terminal + bash, plus a mixture of text editors.

regards
-- 
Charlotte Maia
http://sites.google.com/site/maiagx/home

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] R] Re: Histogram and Density on the the same graph

2009-11-30 Thread Charlotte Maia
Hi,

The computer I'm using now doesn't have R on it, so this is a naive guess:

lines (density (y) )


regards
-- 
Charlotte Maia
http://sites.google.com/site/maiagx/home

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


[R] Backfitting with Missing Explanatory Values

2009-11-24 Thread Charlotte Maia
Hi, I just wanted to check I'm not re-inventing the wheel here.

I'm developing a new algorithm for backfitting (i.e. additive models)
and for computing partial residuals, where partial residuals are still
computed even where there are missing values. Noting additive models
here contain both linear terms and smooth terms.

If I am re-inventing the wheel could some one please let me know. I'm
kind of on my own at the moment, and don't have quite as much academic
support as I would like.

Here's an excerpt from my incomplete package (on cran), amba.

One way to think of residuals, is as some vector of values. If we
start with the response values and
subtract the overall mean, we get values with relatively high
variance. If we then subtract the fitted values
for the first term, the variance decreases. If we repeat for each
term, the variance gradually decreases,
until we are left with values with relatively low variance. In the
ideal case, the residuals would have zero
variance. If we apply certain special conditions, then it is possible
to only subtract a fitted value, where the
corresponding explanatory value is valid (i.e. not missing). Where it
is not valid, we just skip that
subtraction operation (i.e. for that particular observation, the
variance is not reduced as much). For
this to work, each explanatory variable's partial residuals for each
fit (not just the final fit) must be
zero-centered. For smoothers this isn't a big issue, however
conventional linear terms often do not satisfy
this zero-centered condition. Noting the centering condition applies
to partial residuals in relation to an
explanatory variable (not in relation to a parameter) and each
explanatory may have multiple parameters
associated with it. For our linear terms to satisfy it, we require
extra parameters. Categorical terms
require one parameter for each level, and polynomial terms, their own
intercepts.


kind regards
Charlotte

__
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.