Probability Distributions

Probability distributions describe how the values of a random variable are distributed. In R, probability distributions are used for statistical modeling, simulations, and calculating probabilities. R provides built-in functions for common distributions such as normal, binomial, Poisson, and uniform.

1. Types of Probability Distributions

a) Discrete Distributions

  • Binomial Distribution: Models the number of successes in a fixed number of independent trials.
  • Poisson Distribution: Models the number of events occurring in a fixed interval of time or space.

b) Continuous Distributions

  • Normal Distribution: Bell-shaped curve used for many natural phenomena.
  • Uniform Distribution: All outcomes are equally likely.
  • Exponential Distribution: Models time between events in a Poisson process.

2. Functions for Probability Distributions in R

R uses a consistent naming convention for distribution functions:

Function PrefixDescriptionExample
dDensity or probability mass functiondnorm(x, mean=0, sd=1)
pCumulative distribution function (CDF)pnorm(1.96, mean=0, sd=1)
qQuantile function (inverse CDF)qnorm(0.975, mean=0, sd=1)
rRandom generationrnorm(5, mean=0, sd=1)

3. Normal Distribution

# Probability density at x = 1
dnorm(1, mean = 0, sd = 1)# Cumulative probability P(X <= 1)
pnorm(1, mean = 0, sd = 1)# 95th percentile
qnorm(0.95, mean = 0, sd = 1)# Generate 10 random numbers
rnorm(10, mean = 0, sd = 1)

4. Binomial Distribution

# Probability of 3 successes in 5 trials with success probability 0.6
dbinom(3, size = 5, prob = 0.6)# Cumulative probability P(X <= 3)
pbinom(3, size = 5, prob = 0.6)# Generate 5 random numbers from binomial distribution
rbinom(5, size = 5, prob = 0.6)

5. Poisson Distribution

# Probability of 2 events when lambda = 3
dpois(2, lambda = 3)# Cumulative probability P(X <= 2)
ppois(2, lambda = 3)# Generate 5 random numbers from Poisson distribution
rpois(5, lambda = 3)

6. Uniform Distribution

# Probability density for continuous uniform between 0 and 1
dunif(0.5, min=0, max=1)# Cumulative probability
punif(0.5, min=0, max=1)# Generate 5 random numbers
runif(5, min=0, max=1)

7. Advantages of Using Probability Distributions

  • Model real-world phenomena and randomness
  • Calculate probabilities and quantiles
  • Simulate data for experiments and testing
  • Essential for statistical inference and hypothesis testing

Conclusion

Understanding probability distributions in R is crucial for data analysis, modeling, and simulation. By mastering functions like dnorm(), rbinom(), and ppois(), you can calculate probabilities, generate random samples, and perform statistical analysis efficiently. These distributions form the foundation of both descriptive and inferential statistics.

Home ยป R Programming (R Lang) > Statistics with R > Probability Distributions