Skip to main content

evaluation - Expectation of a family of random variables


I did some search here, but seems that no one asked about this.


I want to define a family of independent and identically distributed random variables, $x_1,...,x_n$, and then calculates the expected value of some expressions like $\sum_{i,j=1}^nx_ix_j$. The result would be some function depending on $n$. Is there a way to do this?



Answer



Here is a general solution for any distribution whose moments exist ...


Notation Define the power sum $s_r$:



$$s_r=\sum _{i=1}^n X_i^r$$


The Problem


Let $\left(X_1,\ldots,X_n\right)$ denote $n$ iid random variables. This is the same problem as drawing a random sample of size $n$ from a population random variable $X$. The problem is to find:


$$E\Big(\sum_{i,j=1}^n X_i X_j\Big) = E\Big [\Big (\sum_{i=1}^n X_i\Big)^2\Big ] = E\Big [s_1^2\Big]$$


This is a problem known as finding moments of moments: they can be very difficult to solve by hand, but quite easy to solve with the help of a computer algebra system, for any arbitrary symmetric power sum. In this instance, we seek the expectation of $s_1^2$ ... i.e. the 1st raw moment of $s_1^2$ ... so the solution (expressed ToRaw moments of the population) is:


enter image description here


where RawMomentToRaw is a function from the mathStatica package for Mathematica, and where $\acute{\mu }_1$ and $\acute{\mu }_2$ denote the 1st and 2nd raw moments of random variable $X$, whatever its distribution (assuming they exist). All done.


More detail


There is an extensive discussion of moments of moments in Chapter 7 of our book:




  • Rose and Smith, "Mathematical Statistics with Mathematica", Springer, NY


A free download of the chapter is available here:


http://www.mathstatica.com/book/Rose_and_Smith_2002edition_Chapter7.pdf





Example 1: The Normal Distribution


If $X \sim N(\mu, \sigma^2)$, then: $$\acute{\mu }_1 = E[X] = \mu \quad \text{ and } \quad \acute{\mu }_2 = E[X^2] = \mu^2 + \sigma^2 $$


Substituting in $\acute{\mu }_1$ and $\acute{\mu }_2$ in Out[1] yields the solution: $$E\Big(\sum_{i,j=1}^n X_i X_j\Big) = n \left(n \mu ^2 + \sigma ^2\right)$$


Simple check: The Normal case with $n = 3$



In the case of $n = 3$, the joint pdf of $(X_1, X_2, X_3)$ is say $f(x_1, x_2, x_3)$:


enter image description here


The sum of products we are interested in is:


enter image description here


and the desired expectation is:


enter image description here


which matches perfectly the general $n$-Normal solution derived above, but with $n = 3$.




Example 2: The Uniform Distribution


If $X \sim Uniform(a,b)$ (as considered in both other answers), then: $$\acute{\mu }_1 = E[X] = \frac{a+b}{2} \quad \text{ and } \quad \acute{\mu }_2 = E[X^2] = \frac{1}{3} \left(a^2+a b+b^2\right)$$



Substituting in $\acute{\mu }_1$ and $\acute{\mu }_2$ in Out[1] yields the solution: $$E\Big(\sum_{i,j=1}^n X_i X_j\Big) =\frac{1}{3} n \left(a^2+a b+b^2\right)+\frac{1}{4} (n-1) n (a+b)^2$$


Again, this is different to the other answers posted - and much more complicated. Again, it is easy to perform a quick check:


Simple check: The Uniform case with $n = 3$


In the case of $n = 3$, the joint pdf of $(X_1, X_2, X_3)$ is say $g(x_1, x_2, x_3)$:


enter image description here


and the desired expectation is:


enter image description here


which matches perfectly our general $n$-Uniform solution derived above, with $n = 3$.


Comments

Popular posts from this blog

plotting - Plot 4D data with color as 4th dimension

I have a list of 4D data (x position, y position, amplitude, wavelength). I want to plot x, y, and amplitude on a 3D plot and have the color of the points correspond to the wavelength. I have seen many examples using functions to define color but my wavelength cannot be expressed by an analytic function. Is there a simple way to do this? Answer Here a another possible way to visualize 4D data: data = Flatten[Table[{x, y, x^2 + y^2, Sin[x - y]}, {x, -Pi, Pi,Pi/10}, {y,-Pi,Pi, Pi/10}], 1]; You can use the function Point along with VertexColors . Now the points are places using the first three elements and the color is determined by the fourth. In this case I used Hue, but you can use whatever you prefer. Graphics3D[ Point[data[[All, 1 ;; 3]], VertexColors -> Hue /@ data[[All, 4]]], Axes -> True, BoxRatios -> {1, 1, 1/GoldenRatio}]

plotting - Mathematica: 3D plot based on combined 2D graphs

I have several sigmoidal fits to 3 different datasets, with mean fit predictions plus the 95% confidence limits (not symmetrical around the mean) and the actual data. I would now like to show these different 2D plots projected in 3D as in but then using proper perspective. In the link here they give some solutions to combine the plots using isometric perspective, but I would like to use proper 3 point perspective. Any thoughts? Also any way to show the mean points per time point for each series plus or minus the standard error on the mean would be cool too, either using points+vertical bars, or using spheres plus tubes. Below are some test data and the fit function I am using. Note that I am working on a logit(proportion) scale and that the final vertical scale is Log10(percentage). (* some test data *) data = Table[Null, {i, 4}]; data[[1]] = {{1, -5.8}, {2, -5.4}, {3, -0.8}, {4, -0.2}, {5, 4.6}, {1, -6.4}, {2, -5.6}, {3, -0.7}, {4, 0.04}, {5, 1.0}, {1, -6.8}, {2, -4.7}, {3, -1....

functions - Get leading series expansion term?

Given a function f[x] , I would like to have a function leadingSeries that returns just the leading term in the series around x=0 . For example: leadingSeries[(1/x + 2)/(4 + 1/x^2 + x)] x and leadingSeries[(1/x + 2 + (1 - 1/x^3)/4)/(4 + x)] -(1/(16 x^3)) Is there such a function in Mathematica? Or maybe one can implement it efficiently? EDIT I finally went with the following implementation, based on Carl Woll 's answer: lds[ex_,x_]:=( (ex/.x->(x+O[x]^2))/.SeriesData[U_,Z_,L_List,Mi_,Ma_,De_]:>SeriesData[U,Z,{L[[1]]},Mi,Mi+1,De]//Quiet//Normal) The advantage is, that this one also properly works with functions whose leading term is a constant: lds[Exp[x],x] 1 Answer Update 1 Updated to eliminate SeriesData and to not return additional terms Perhaps you could use: leadingSeries[expr_, x_] := Normal[expr /. x->(x+O[x]^2) /. a_List :> Take[a, 1]] Then for your examples: leadingSeries[(1/x + 2)/(4 + 1/x^2 + x), x] leadingSeries[Exp[x], x] leadingSeries[(1/x + 2 + (1 - 1/x...