Created: Jul 25, 2013
Inferential Statistics Report
Society is diverse in many ways: socioeconomically, educationally, and in how people view the need to further their education. The purpose of this survey was to get a snapshot of education level and income within the population. Overall, it was intended to show the difference in income between people with different educational levels. An interesting aspect of the survey is that it also shows respondents' motivation to continue their professional development by pursuing degrees beyond what they initially achieved.


The population varied in age, educational level, and income. The sample was collected from various kinds of people: co-workers, friends, and people walking in the park, down the street, and at a local bakery were all part of the sample. The data were collected through face-to-face interviews.


The survey consisted of the following four questions:


Do you plan to further your degree?


What is your highest education level?


What is your age (years)?


What is your annual salary ($)?


Categorical Variable:

A- Based on the question: Do you plan to further your degree?

The pie chart shows the results for this categorical variable: 62.07% of the sample plan to continue their education and 37.93% do not.

B- One-sample proportion: n = 116 and yes = 72. The lower limit is 0.53239108 and the upper limit is 0.70898823. A 95% confidence level was used because it is a common choice that balances precision and reliability. We are therefore 95% confident that the interval from 0.532 to 0.709 contains the true value of the population proportion p (the proportion who will continue their education).

Result 2: One Sample Proportion with data (yes/no)
95% confidence interval results:
Outcomes in : cont?
Success : yes
p : Proportion of successes
Method: Standard-Wald
Variable  Count  Total  Sample Prop.  Std. Err.    L. Limit    U. Limit
cont?     72     116    0.62068966    0.045051121  0.53239108  0.70898823
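
The interval above can be reproduced by hand from the count and the sample size. The following is a minimal sketch, assuming the Standard-Wald method named in the output is the usual normal-approximation interval (sample proportion plus or minus z times the standard error); the function name `wald_interval` is hypothetical and is not part of StatCrunch.

```python
from math import sqrt
from statistics import NormalDist


def wald_interval(successes, n, confidence=0.95):
    """Return (p_hat, std_err, lower, upper) for a one-sample proportion.

    Assumption: the standard Wald (normal approximation) interval.
    """
    p_hat = successes / n                      # sample proportion
    std_err = sqrt(p_hat * (1 - p_hat) / n)    # standard error of p_hat
    # Two-sided critical value, e.g. about 1.96 for 95% confidence
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z * std_err
    return p_hat, std_err, p_hat - margin, p_hat + margin


# Survey values from the report: 72 "yes" answers out of 116 respondents
p_hat, std_err, lower, upper = wald_interval(72, 116)
print(f"p_hat = {p_hat:.4f}, SE = {std_err:.4f}")
print(f"95% CI: ({lower:.4f}, {upper:.4f})")
```

The printed limits can be compared against the L. Limit and U. Limit columns in the StatCrunch output.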

Numerical Variable:

A- The numerical variable is age, based on the question: What is your age (years)?

Summary statistics:
Column  n    Mean       Variance   Std. Dev.  Std. Err.  Median  Range  Min  Max  Q1    Q3
age     116  38.594828  147.87789  12.160505  1.1290746  38      52     18   70   28.5  46.5

The results above give a summary, or snapshot, of the sample collected. From the graphs displayed, we see that the box plot shows no outliers that would affect the results. In addition, the histogram appears slightly skewed to the right. The mean for the 116 people surveyed is 38.59 and the median is 38, which is very close to the mean; in a right-skewed histogram it is typical for the mean to be larger than the median.

The minimum age was 18 and the maximum age was 70, giving a midrange of 44, that is, (18 + 70)/2. The standard deviation was 12.16, and the quartiles were Q1 = 28.5 and Q3 = 46.5. The range of 52 shows the wide spread of ages present in this survey. The IQR is 18 (Q3 - Q1 = 46.5 - 28.5). By the range rule of thumb, Range/4 = 13, which is a reasonable approximation of the SD of 12.16.
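
The descriptive measures in this paragraph are simple arithmetic on the reported minimum, maximum, and quartiles, so they can be checked directly. The sketch below recomputes them and also applies the common 1.5 x IQR fence rule for outliers; that rule is an assumption here, since the report does not say how the box plot flagged outliers.

```python
# Values taken from the summary statistics in the report
minimum, maximum = 18, 70
q1, q3 = 28.5, 46.5
std_dev = 12.160505

data_range = maximum - minimum        # spread between youngest and oldest
midrange = (minimum + maximum) / 2    # midpoint of min and max
iqr = q3 - q1                         # interquartile range
sd_estimate = data_range / 4          # range rule of thumb for the SD

# Assumed outlier rule: values beyond 1.5 * IQR from the quartiles
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
no_outliers = lower_fence <= minimum and maximum <= upper_fence

print(f"range = {data_range}, midrange = {midrange}, IQR = {iqr}")
print(f"range/4 = {sd_estimate} vs. SD = {std_dev:.2f}")
print(f"fences = ({lower_fence}, {upper_fence}), no outliers: {no_outliers}")
```

Because both the minimum and the maximum fall inside the fences, this check agrees with the box plot showing no outliers.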


B- Based on the results, we are 95% confident that the interval from 36.36 years to 40.83 years contains the true value of the population mean age. A t interval was used because the population standard deviation is not known and the sample size satisfies n > 30 (here n = 116). In addition, the sample shows no outliers and appears roughly symmetric, so the conditions for the interval are reasonable. According to the student textbook, the population standard deviation is typically unknown in real circumstances such as the one in this report, which is why I used the t interval in StatCrunch. The sample mean, 38.59 years, is the best point estimate of the population mean.

95% confidence interval results:
μ : Mean of variable
Variable  Sample Mean  Std. Err.  DF   L. Limit   U. Limit
age       38.594828    1.1290746  115  36.358348  40.831307
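
The t interval above follows the formula: sample mean plus or minus t* times (s / sqrt(n)), with n - 1 = 115 degrees of freedom. The following is a rough sketch that rebuilds it using only the Python standard library. Since the standard library has no t distribution, the critical value is found by numerically integrating the t density and bisecting; the helper names `t_pdf`, `t_cdf`, and `t_critical` are hypothetical, and this is not necessarily how StatCrunch computes the interval.

```python
import math


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    log_const = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                 - 0.5 * math.log(df * math.pi))
    return math.exp(log_const - (df + 1) / 2 * math.log1p(x * x / df))


def t_cdf(x, df, steps=2000):
    """P(T <= x) for x >= 0: 0.5 plus Simpson's rule on [0, x]."""
    if x == 0:
        return 0.5
    h = x / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(x, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 + total * h / 3


def t_critical(confidence, df):
    """Two-sided critical value t*, found by bisection on the CDF."""
    target = 1 - (1 - confidence) / 2
    lo, hi = 0.0, 50.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if t_cdf(mid, df) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


# Values from the report's summary statistics for age
n, mean, std_dev = 116, 38.594828, 12.160505
std_err = std_dev / math.sqrt(n)       # standard error of the mean
t_star = t_critical(0.95, n - 1)       # critical value for df = 115
lower = mean - t_star * std_err
upper = mean + t_star * std_err
print(f"SE = {std_err:.4f}, t* = {t_star:.4f}")
print(f"95% CI: ({lower:.2f}, {upper:.2f})")
```

The printed limits can be compared with the L. Limit and U. Limit values in the StatCrunch output. Where SciPy is available, a library routine for the t quantile would be a simpler choice than the hand-rolled integration used here.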

