1: Introduction
My group and I designed a survey to assess and learn about the health rating and habits of our friends, families and acquaintances. The population that we sampled were adults, 2545 and nonmilitary status. The information was collected from Facebook friends so the sample is not random. It is a convenience sample with a voluntary response.
The survey contained the following questions:
1: How many days in a week do you do at least 20 minutes of moderate physical activity (an activity that increases heart rate, increases respiration/breathing rate, increases sweating and/or causes muscle fatigue)?
2: How many hours in a day do you spend sedentary/sitting (ex: eating, commuting, working, watching tv, etc)?
3: Are you Male or Female?
4: How would you rate your overall health: Poor, Good, Very Good, Excellent
2: Looking at a Categorical Variable
The pie chart below displays the results of “How would you rate your overall health?”
Result 1: Pie Chart of Health Rating
The majority of the subjects questioned rating their health as Good, 38%, with Very Good being second with 35% of participants using that rating. 19% rated themselves as Excellent health and 7% stated their health as Poor.
To examine the difference of how males and females of this survey differed in their self health ratings, the results are displayed in a bar plot below:
Result 2: Bar Plot of Health Rating for Males and Females
The majority of females from this survey labeled their health as Good while the majority of males picked Excellent. The majority of male survey members rated themselves above Poor while the females were more variable in their choices and less likely to rate themselves above Good.
3: Looking at a Numerical Variable
The results for the question “How many hours in a day do you spend sedentary/sitting (ex: eating, commuting, working, watching tv, etc)?” are displayed in the histogram, boxplot and summary statistics below.
Results 3: Histogram of Hours per Day Spent Sedentary/Sitting
Result 4: Boxplot of Hours per Day Spent Sedentary/Sitting
Result 5: Summary Statistics of Hours per Day Spent Sedentary/Sitting
Summary statistics:

The histogram is displaying the results for how many hours in a day a person rated they were sedentary. Its shape is right skewed with a median of 7 hours, mode of 6 hours and a mean of 7.69 hours which is verified since the mean is greater than the median. Since it is skewed to the right, the best measure of center is the median and the IQR of 5 since they are less affected by the outliers. There were outliers of 20, 20, 20 and 21 hours, which I believe is an error. With only 24 hours in a day, it is very unlikely that a person is sitting for 20 hours in day. This was most likely due to a misunderstanding of the questions and possibly including sleeping in their number. These outliers should be discarded.
There is a lot of variability in the results since the range is 20 and the standard deviation is 3.67. The range/4 is 5 hours and with a standard deviation of 3.67 the range rule of thumb does not apply and isn’t a good approximation of the standard deviation which is explained by the skewness and outliers of the data.
Looking for a Relationship between Two Numerical Variables
To determine whether or not there is relationship between the questions “How many days in a week do you do at least 20 minutes of moderate physical activity (an activity that increases heart rate, increases respiration/breathing rate, increases sweating and/or causes muscle fatigue)?” and “How many hours in a day do you spend sedentary/sitting (ex: eating, commuting, working, watching tv, etc)?” I applied their values to a scatter plot.
Result 6: Scatter Plot of Active Days/week vs Hours in a Day Spent Sedentary
The scatter plot reveals a slight negative trend, as the x values increase the y values decrease as well. Which makes sense, if a person is reporting that they spend more days being active, they are more likely to not spend time being sedentary.
Result 7: Correlation of Active Days and Sedentary Hours in a Day
Correlation between N1:DAYS OF ACTIVITY and N2:SEDENTARY HOURS is: 0.30760913 
The correlation coefficient is 0.308, which shows that there is a negative correlation and a weak linear relationship between the two variables. With an n of 165 the critical value is 0.153 which is les than .196 so there is no statistical significance.
Already a member? Sign in.