StatCrunch logo (home)

Report Properties

from Flickr
Owner: vta15
Created: Nov 12, 2017
Share: yes
Views: 347
Results in this report
Data sets in this report
Need help?
To copy selected text, right click to Copy or choose the Copy option under your browser's Edit menu. Text copied in this manner can be pasted directly into most documents with formatting maintained.
To copy selected graphs, right click on the graph to Copy. When pasting into a document, make sure to paste the graph content rather than a link to the graph. For example, to paste in MS Word choose Edit > Paste Special, and select the Device Independent Bitmap option.
You can now also Mail results and reports. The email may contain a simple link to the StatCrunch site or the complete output with data and graphics attached. In addition to being a great way to deliver output to someone else, this is also a great way to save your own hard copy. To try it out, simply click on the Mail link.
Forbes - America's Best Colleges, 2010 Project Part 2
Mail   Print   Twitter   Facebook

Forbes - America's Best Colleges, 2010 Owner: cdcummings12


Data set 1. DATA 1 Original Set   [Info]
To analyze this data, please sign in.

Result 1: Scatter Plot 1   [Info]
Right click to copy


Comment on your scatter plot: 

- The 2 quantitative variables that I used were Cost and Total Student Population.  

- My scatterplot displays a weak negative correlation. The x was the total student population and the y was the cost. The data is very scattered, however it does seem that the less students there are, the more expensive the university is. 

-There are no outliers in this data.

-An appropriate significance level for this 0.05. Thr margin of error is not very small and not very large for the data, so the significance level would be 0.05 and a confidence interval of 95%. 

Result 2: Simple Linear Regression 1   [Info]
Simple linear regression results:
Dependent Variable: Cost
Independent Variable: Total Student Population
Cost = 46225.923 - 0.029189982 Total Student Population
Sample size: 98
R (correlation coefficient) = -0.031224727
R-sq = 0.00097498359
Estimate of error standard deviation: 9809.4697

Parameter estimates:
ParameterEstimateStd. Err.AlternativeDFT-StatP-value
Intercept46225.9231222.731 ≠ 09637.805472<0.0001
Slope-0.0291899820.095364722 ≠ 096-0.306087850.7602

Analysis of variance table for regression model:

Line of Best Fit: 

y= cost

x= total student population

y= 46225.923 - 0.029189982x

or Cost = 46225.923 - 0.029189982 Total Student Population

Comment on your correlation coefficient. 

R (correlation coefficient) = -0.031224727.  This shows that my data set is very weak negative, practically no correlation. 

Are your terms significant?

Cost and Total Student Population are significant.   

Result 3: Line of Best Fit Scatter   [Info]
Right click to copy


Interpret your line of best fit 

The line of best fit shows that my data is slightly negative.  A lot of the points are at the top left side of the plot, and very few lead to the bottom right of the plot, so since there are so many at the top left, the line of best fit almost looks like a perfect horizontal line, but is slightly negative.  This shows how it is a very weak negative correlation. This means that most of the best universities on the list have a low student population and a high cost. 

Interpret r^2

R-sq = 0.00097498359. The R^2 is very small so this shows that the line of best fit is not a strong representation of the data. This is due to the fact that the majority of the schools on the Forbes Best Colleges in the USA for 2010 have a low student population and a high cost.  

Correlation and Causation 

The Cost and Total Student Population are correlated and causation. The majority of schools that were ranked best in the USA according to Forbes in 2010 were schools that have a low student population. This allows for a more hands-on learning experience and helps makes there colleges as great as they are. However, the smaller the school means they have a more personalized education so this can cause the cost to be higher. Except for 3 colleges, the others were all private colleges which in general tend to have small classroom sizes and a high cost, so therefore the data is correlated and has causation.


Extra Credit: QQ Plots


Result 4: QQ Plot Cost   [Info]
Right click to copy

 Do your expected values follow a normal distribution?

Yes, they followed normal distribution. 



Result 5: QQ Plot Student Pop   [Info]
Right click to copy

Yes, they follow normal distribution. 



HTML link:
<A href="">Forbes - America's Best Colleges, 2010 Project Part 2</A>

Want to comment? Subscribe
Already a member? Sign in.
By xg15
Nov 20, 2017

For the scatterplot, I am thinking probably the points that are far way from the rest can be considered as outliers.

When determine whether a slope is significant or not, we compare the P-value with the significance level
R^2 is the percentage of variance of response explained by linear model

Always Learning