Site hosted by Angelfire.com: Build your free website today!

Chapter 6: Probability

Concept of probability forms the foundation of inferential statistics

Basic example: coin-flip

So, probability is describing likelihood of getting heads or tails on each flip

In formal terms, the probability of a given event (A) is defined as the number of events favoring event A divided by the total number of possible observations

p(A) = number of observations favoring/satisfying event A

total number of possible observations

Coin example: there are two possible observations (heads or tails). If event A is getting tails, then probability is ½ or .50

 

Also possible to determine probability of events when measuring more than one variable

Another useful way to analyze data is in terms of Conditional Probability

p (A/B) = number of observations favoring both event A and event B

number of observations favoring event B

This type of probability analysis is useful to examine the independence of events

 

Another useful way to analyze data is in terms of Joint Probability

p (A,B) = number of observations favoring both event A and event B

total number of observations

What does any of this have to do with Statistics? Remember that often we cannot measure an entire population we are interested in studying and instead have to take a random sample of people from that population to investigate. This relates to concept of sampling with or without replacement. Be sure to read this part of the chapter in your text!

We are often interested in answering a specific question or testing a hypothesis.

If our test has 20 questions, there are 20 trials on which a person can either have a success or a failure. We can calculate the mean score predicted for a person who is just guessing on the questions:

m = np

where n = number of trials and p = probability of success

So, in our example, if a person was just guessing on the questions they should expect to get 10 questions right.

This would relate to the concept of hypothesis testing.

Problem of how to determine chance vs. non-chance results under the assumption that the null hypothesis is true.