Skip to main content

probability or statistics - Generate a random PDF


How can I generate a random probability density function $p(x)$ in the domain $[0,1]$? Is there a single Mathematica function to do so?



Answer



There are many ways to generate random distributions on the unit interval. I do not know of a function in Mathematica that will do that by itself, but it is straightforward to write a simple function that will.


The approach presented here is based on random Bernstein polynomials, which in this context become random mixtures of beta distributions. Here is a reference:



Sonia Petrone (1999) Bayesian Density Estimation Using Bernstein Polynomials, The Canadian Journal of Statistics, Vol. 27, No. 1 (Mar., 1999), pp. 105-126



Both the number of mixture components and the mixture weights are random. There is a certain amount of flexibility in choosing the underlying random distributions. In what follows, I use the geometric distribution to determine the number of mixture components and the Dirichlet distribution to determine the mixture weights.



As I have implemented this, there are two parameters to be chosen by the user, one for each of the underlying distributions. The first parameter is the mean geometric distribution (which must be greater than or equal to one).


The second parameter is the concentration parameter for the Dirichlet distribution, which determines how concentrated the distribution is around its mean. (The concentration parameter must be positive.) I have fixed the mean at $(1/k, \ldots, 1/k)$ where $k$ is the number of mixture components. As a consequence, the average random distribution is uniform. (There are a number of ways to produce non-uniform average random distributions. An important way involves applying a suitable transformation to the beta distributions.)


Programming notes:


I do not use the built-in DirichletDistribution because it only generates $k-1$ random weights, instead of $k$. In principle the missing weight could be computed via $w_k = 1 - \sum_{j=1}^{k-1} w_j$, but in practice this fails because $w_k$ ends up being negative too often to be useful. (It's a very small negative number, but negative nonetheless.) Instead I generate the gamma variates that Dirichlet variates are based on. (There is no need to normalize these gamma variates, since MixutureDistribution does that automatically.)


For efficiency, I use Chop to kill off those mixture components with trivially small mixture weights. (This is not necessary.)


randomDistribution[xi_, alpha_] :=
With[{k = RandomVariate[GeometricDistribution[1/xi]] + 1},
MixtureDistribution[
RandomVariate[GammaDistribution[alpha/k, 1], k],
Table[BetaDistribution[j, k - j + 1], {j, k}]

] // Chop
]

Here is some usage. I have set the mean of the geometric distribution to 100 and the concentration parameter to 1. (The use of Evaluate in the plot can speed things up significantly.)


dist = randomDistribution[100, 1];
Plot[PDF[dist, x] // Evaluate, {x, 0, 1}, PlotRange -> {0, All}, Filling -> 0]

Comments

Popular posts from this blog

front end - keyboard shortcut to invoke Insert new matrix

I frequently need to type in some matrices, and the menu command Insert > Table/Matrix > New... allows matrices with lines drawn between columns and rows, which is very helpful. I would like to make a keyboard shortcut for it, but cannot find the relevant frontend token command (4209405) for it. Since the FullForm[] and InputForm[] of matrices with lines drawn between rows and columns is the same as those without lines, it's hard to do this via 3rd party system-wide text expanders (e.g. autohotkey or atext on mac). How does one assign a keyboard shortcut for the menu item Insert > Table/Matrix > New... , preferably using only mathematica? Thanks! Answer In the MenuSetup.tr (for linux located in the $InstallationDirectory/SystemFiles/FrontEnd/TextResources/X/ directory), I changed the line MenuItem["&New...", "CreateGridBoxDialog"] to read MenuItem["&New...", "CreateGridBoxDialog", MenuKey["m", Modifiers-...

How to thread a list

I have data in format data = {{a1, a2}, {b1, b2}, {c1, c2}, {d1, d2}} Tableform: I want to thread it to : tdata = {{{a1, b1}, {a2, b2}}, {{a1, c1}, {a2, c2}}, {{a1, d1}, {a2, d2}}} Tableform: And I would like to do better then pseudofunction[n_] := Transpose[{data2[[1]], data2[[n]]}]; SetAttributes[pseudofunction, Listable]; Range[2, 4] // pseudofunction Here is my benchmark data, where data3 is normal sample of real data. data3 = Drop[ExcelWorkBook[[Column1 ;; Column4]], None, 1]; data2 = {a #, b #, c #, d #} & /@ Range[1, 10^5]; data = RandomReal[{0, 1}, {10^6, 4}]; Here is my benchmark code kptnw[list_] := Transpose[{Table[First@#, {Length@# - 1}], Rest@#}, {3, 1, 2}] &@list kptnw2[list_] := Transpose[{ConstantArray[First@#, Length@# - 1], Rest@#}, {3, 1, 2}] &@list OleksandrR[list_] := Flatten[Outer[List, List@First[list], Rest[list], 1], {{2}, {1, 4}}] paradox2[list_] := Partition[Riffle[list[[1]], #], 2] & /@ Drop[list, 1] RM[list_] := FoldList[Transpose[{First@li...

plotting - How to draw lines between specified dots on ListPlot?

I would like to create a plot where I have unconnected dots and some connected. So far, I have figured out how to draw the dots. My code is the following: ListPlot[{{1, 1}, {2, 2}, {3, 3}, {4, 4}, {1, 4}, {2, 5}, {3, 6}, {4, 7}, {1, 7}, {2, 8}, {3, 9}, {4, 10}, {1, 10}, {2, 11}, {3, 12}, {4,13}, {2.5, 7}}, Ticks -> {{1, 2, 3, 4}, None}, AxesStyle -> Thin, TicksStyle -> Directive[Black, Bold, 12], Mesh -> Full] I have thought using ListLinePlot command, but I don't know how to specify to the command to draw only selected lines between the dots. Do have any suggestions/hints on how to do that? Thank you. Answer One possibility would be to use Epilog with Line : ListPlot[ {{1, 1}, {2, 2}, {3, 3}, {4, 4}, {1, 4}, {2, 5}, {3, 6}, {4, 7}, {1, 7}, {2, 8}, {3, 9}, {4, 10}, {1, 10}, {2, 11}, {3, 12}, {4, 13}, {2.5, 7}}, Ticks -> {{1, 2, 3, 4}, None}, AxesStyle -> Thin, TicksStyle -> Directive[Black, Bold, 12], Mesh -> Full, Epilog -> { Line[ ...