Probability distributions describe how the values of a random variable are distributed. In R, probability distributions are used for statistical modeling, simulations, and calculating probabilities. R provides built-in functions for common distributions such as normal, binomial, Poisson, and uniform.
1. Types of Probability Distributions
a) Discrete Distributions
- Binomial Distribution: Models the number of successes in a fixed number of independent trials.
- Poisson Distribution: Models the number of events occurring in a fixed interval of time or space.
b) Continuous Distributions
- Normal Distribution: Bell-shaped curve used for many natural phenomena.
- Uniform Distribution: All outcomes are equally likely.
- Exponential Distribution: Models time between events in a Poisson process.
2. Functions for Probability Distributions in R
R uses a consistent naming convention for distribution functions:
| Function Prefix | Description | Example |
|---|---|---|
d | Density or probability mass function | dnorm(x, mean=0, sd=1) |
p | Cumulative distribution function (CDF) | pnorm(1.96, mean=0, sd=1) |
q | Quantile function (inverse CDF) | qnorm(0.975, mean=0, sd=1) |
r | Random generation | rnorm(5, mean=0, sd=1) |
3. Normal Distribution
# Probability density at x = 1
dnorm(1, mean = 0, sd = 1)# Cumulative probability P(X <= 1)
pnorm(1, mean = 0, sd = 1)# 95th percentile
qnorm(0.95, mean = 0, sd = 1)# Generate 10 random numbers
rnorm(10, mean = 0, sd = 1)
4. Binomial Distribution
# Probability of 3 successes in 5 trials with success probability 0.6
dbinom(3, size = 5, prob = 0.6)# Cumulative probability P(X <= 3)
pbinom(3, size = 5, prob = 0.6)# Generate 5 random numbers from binomial distribution
rbinom(5, size = 5, prob = 0.6)
5. Poisson Distribution
# Probability of 2 events when lambda = 3
dpois(2, lambda = 3)# Cumulative probability P(X <= 2)
ppois(2, lambda = 3)# Generate 5 random numbers from Poisson distribution
rpois(5, lambda = 3)
6. Uniform Distribution
# Probability density for continuous uniform between 0 and 1
dunif(0.5, min=0, max=1)# Cumulative probability
punif(0.5, min=0, max=1)# Generate 5 random numbers
runif(5, min=0, max=1)
7. Advantages of Using Probability Distributions
- Model real-world phenomena and randomness
- Calculate probabilities and quantiles
- Simulate data for experiments and testing
- Essential for statistical inference and hypothesis testing
Conclusion
Understanding probability distributions in R is crucial for data analysis, modeling, and simulation. By mastering functions like dnorm(), rbinom(), and ppois(), you can calculate probabilities, generate random samples, and perform statistical analysis efficiently. These distributions form the foundation of both descriptive and inferential statistics.