"David C. Howell" wrote:
> 
> My daughter just asked me a question that I should be able to answer,
> but can't. Any help would be appreciated.
> 
> It is well know that the sum of normally distributed random variables
> is itself normally distributed. But I want to know the sampling
> distribution of the sum of lognormal distributions.
> 
> My first thought would be that the central limit theorem would suggest
> that the sampling distribution would approach normal as N increases.
> On the other hand, when I generate multiple independent lognormal
> distributions and then take their sum, a P-P plot tells me that the
> sum is nicely lognormal, not normal.

        Did you do the plot correctly?

Even for the sum of 2 lognormal distributions I get this (below) for
a q-q plot against another lognormal. It is far from straight:


          -
     21.0+
         -                                                      *
 2sort   -
         -
         -
     14.0+                          *
         -                    *
         -                  2
         -
         -            **
      7.0+         2*2
         -       34*
         -     273
         -   6+7
         -  ++
      0.0+ 5*
          
+---------+---------+---------+---------+---------+------1sort   
         0.0       4.0       8.0      12.0      16.0      20.0


but also far from normal (plot against sorted normals below)


         -
     21.0+
         -                                                         *
 2sort   -
         -
         -
     14.0+                                                      *
         -                                                 *
         -                                            * *
         -
         -                                           **
      7.0+                                         22*
         -                                     *4*2
         -                                 *2522
         -                          36324343
         -           23 2 6**23452***
      0.0+   2*  ** *
          
--------+---------+---------+---------+---------+--------Nsort   
               -1.60     -0.80      0.00      0.80      1.60





Adding 10 lognormals and plotting against an eleventh I get the even
less straight


         -
         -                                                      *
       36+
         -                    *     *
 10sort  -                  2
         -           ***
         -         3**
       24+       *4
         -       5
         -     67
         -    +3
         -   +3
       12+  98
         -  +
         - 5
         -
         -
          
+---------+---------+---------+---------+---------+------1sort   
         0.0       4.0       8.0      12.0      16.0      20.0

Whereas plotting against sorted normals I get


        -
         -                                                         *
       36+
         -                                                 *    *
 10sort  -                                            * *
         -                                           2*
         -                                        *22
       24+                                      3**
         -                                    *3*
         -                                *425*
         -                            32433
         -                       2**46
       12+                ***2345
         -          *23 2 5
         -   2*  **
         -
         -
          
--------+---------+---------+---------+---------+--------Nsort   
               -1.60     -0.80      0.00      0.80      1.60

which is considerably closer.

Repeating for n=100 I get (against sorted lognormals)


        -
      210+                    *     *                           *
         -            **    2
 100sort -         2*2
         -       *4*
         -      75
      175+    39
         -    +
         -   +*
         -  39
         -  8
      140+  6
         - *5
         - 2
         - 2
         -
      105+
          
+---------+---------+---------+---------+---------+------1sort   
         0.0       4.0       8.0      12.0      16.0      20.0

and against sorted normals: 

        -
      210+                                                 *    *  *
         -                                           *2 *
 100sort -                                         22*
         -                                      3*2
         -                                  *523*
      175+                               344*
         -                           2324
         -                       ***44
         -                    245*
         -                3**2*
      140+            * 2 3
         -        * *22
         -    *  *
         -   2
         -
      105+
          
--------+---------+---------+---------+---------+--------Nsort   
               -1.60     -0.80      0.00      0.80      1.60

The twitches at the ends are because I was lazy and used sorted
pseudorandom numbers rather than theoretical normal quantiles - but we
see that as n gets larger the sum clearly *does* approach normality. I
doubt if there is any 
really nice form for the intermediate distributions, though.

        -Robert Dawson
.
.
=================================================================
Instructions for joining and leaving this list, remarks about the
problem of INAPPROPRIATE MESSAGES, and archives are available at:
.                  http://jse.stat.ncsu.edu/                    .
=================================================================

Reply via email to