StatCrunch logo (home)

Data sets shared by StatCrunch members
Showing 1 to 15 of 45 data sets matching Association
Data Set/Description Owner Last edited Size Views
Average Female Height, by Age
This data used as an example for review of logarithmic regression.
kcramerDec 3, 2019200B199
Major League Players Elected to Hall of Fame as Players
Includes 2019 BBWAA-elected inductees Mariano Rivera, Edgar Martinez, Roy Halladay, and Mike Mussina. 31 variables for each player. Team=primary team; BBWAA=Baseball Writers Association of America; Bat: R=right, L=left, B=both; WAR=Wins Against Replacement: number of wins the player added to the team above what an "average" replacement player would add. CS=caught stealing. OPS=On-base Plus Slugging; as a rule of thumb, a "good" OPS is a value that when divided by 3 results in a value that would be considered a "good" batting average. Other variables are hopefully self-explanatory.

This data set was originally uploaded to StatCrunch by the treiland user.
statcrunch_featuredNov 13, 201937KB262
Professor Salaries 2018-2019
This data came from the following website: Tenured/Tenure-Track Faculty Salaries

Included are the average salaries for tenured/tenure-track faculty from the 2018-19 Faculty in Higher Education Survey conducted by The College and University Professional Association for Human Resources (CUPA-HR). Findings detailed aggregate salary information from 847 institutions for 171,487 full-time tenure-track faculty in the US. Columns indicate the type of professor, with "All" referring to any type, "Research" being professors who primarily research, "Other Doctoral" being non-research professors with a doctoral degree, and "Master's" being non-research professors with a Master's degree.
statcrunch_featuredNov 8, 20197KB388
Body Temperature
Data taken from the Journal of Statistics Education online data archive. That archive in turn got the data from an article in the Journal of the American Medical Association. (Mackowiak, et al., "A Critical Appraisal of 98.6 Degrees F …", vol. 268, pp. 1578-80, 1992).
"Body Temp" is measured in degrees fahrenheit
"Heart rate" is the resting beats per minute
alexandria.cullyNov 25, 20193KB159
Body Temperature
Data taken from the Journal of Statistics Education online data archive. That archive in turn got the data from an article in the Journal of the American Medical Association. (Mackowiak, et al., "A Critical Appraisal of 98.6 Degrees F …", vol. 268, pp. 1578-80, 1992).
"Body Temp" is measured in degrees fahrenheit
"Heart rate" is the resting beats per minute
statcrunch_featuredJun 27, 20172KB15384
Mean Weights of Boys Ages 2 to 12
I'm using this for Modeling Linear Associations. It has a decent linear correlation coefficient. A linear regression produces the stats and scatter plot with a polynomial of order one trend line overlay which can be used to illustrate extrapolation/interpolation, error estimates, and model breakdown. For over/underestimates and error, interpolate mean weights for 3 and 5 year olds and compare with observed mean weights of 31.0 pounds and 40.5 pounds, respectively. For model breakdown, adjust the x-axis of the scatter plot to range between 0 and 20, with integer tick marks, and the y-axis to range between 0 and 200, with tick marks 0, 10, 20, ..., 200, and an extrapolation for mean weight at age 20 will suggest a weight somewhere near 135 lbs for a 20 year old male.
kcramerOct 26, 2019110B543
Major League Players Elected to Hall of Fame as Players
Includes 2019 BBWAA-elected inductees Mariano Rivera, Edgar Martinez, Roy Halladay, and Mike Mussina. 31 variables for each player. Team=primary team; BBWAA=Baseball Writers Association of America; Bat: R=right, L=left, B=both; WAR=Wins Against Replacement: number of wins the player added to the team above what an "average" replacement player would add. CS=caught stealing. OPS=On-base Plus Slugging; as a rule of thumb, a "good" OPS is a value that when divided by 3 results in a value that would be considered a "good" batting average. Other variables are hopefully self-explanatory.
treilandJan 25, 201937KB6284
Best Selling Music Artists in US (top 100)
This is the list of the top 100 best-selling music artists in the United States only as of December 2008 and based on certifications of albums by the Recording Industry Association of America (RIAA). The RIAA does not deal in sales of albums, only certifications of shipped units. Therefore this best-selling list is based on total accumulated certifications given per artist and may not reflect the true physical sales obtained by these artists. The RIAA began its certifications in 1952. This list therefore excludes large-selling artists like Bing Crosby whose sales in the 1930s-1950s were not certified. Also affected are artists such as Frank Sinatra, who although they were successful after 1952, were also best-selling prior to 1952. Therefore their positions in the chart reflect certifications from 1952 onwards.
cdcummings12Aug 24, 20095KB1716
Body Temperature
Data taken from the Journal of Statistics Education online data archive. That archive in turn got the data from an article in the Journal of the American Medical Association. (Mackowiak, et al., "A Critical Appraisal of 98.6 Degrees F …", vol. 268, pp. 1578-80, 1992).
"Body Temp" is measured in degrees fahrenheit
"Heart rate" is the resting beats per minute
statcrunchhelpMar 8, 20162KB4707
Cigarette Consumption vs CHD Mortality
Now that cigarette smoking has been clearly tied to lung cancer, researchers are focusing on possible links to other diseases. The data below show annual rates of cigarette consumption and deaths from coronary heart disease for several nations. Some public health officials are urging that the US adopt a national goal of cutting cigarette consumption in half over the next decade. Examine these data and write a report. In your report you should: 1. Include appropriate graphs (e.g. scatterplot, residual plot) and statistics (e.g. mean and SD); 2. Describe the association between cigarette smoking and coronary heart disease; 3. Create a linear model; 4. Evaluate the strength and appropriateness of your model; 5. Interpret the slope and y-intercept of the line; 6. Use your model to estimate the potential benefits of reaching the national goal proposed for the US. That is, based on your linear model, if the US were to cut its cigarette consumption in half (from 3900 to 1950), what does the linear model predict would happen to the CHD rate. 7. You should use Statcrunch to generate nice looking graphs and output as needed. Be sure to size them appropriately. No need for a 8x10 scatterplot; Make your graphs about 3x3. You should scale them in Statcrunch first, then copy and paste into Word.
smcdaniel04Sep 29, 2011267B5788
Professor Median Salaries by Discipline, Level, and Institution (2012-2013)
This data comes from the source listed above. The webpage gives the following description "These are the results of the 2012-13 Faculty in Higher Education Salary Survey by Discipline, Rank and Tenure Status in Four-Year Colleges and Universities conducted by The College and University Professional Association for Human Resources (CUPA-HR). Findings reflect the salaries of 184,924 tenured/tenure-track faculty members at 794 institutions nationwide. Salaries were reported by 794 institutions, including 478 private institutions and 316 public institutions, for 31 academic disciplines"
The dataset originally was created by Keisha Brown from Georgia Perimeter College. Each salary is the median salary based on the responses to the survey.
statcrunchhelpMar 14, 201612KB958
Body Temperature, Alexia Denson and Josiah Hendon
Data taken from the Journal of Statistics Education online data archive. That archive in turn got the data from an article in the Journal of the American Medical Association. (Mackowiak, et al., "A Critical Appraisal of 98.6 Degrees F …", vol. 268, pp. 1578-80, 1992).
"Body Temp" is measured in degrees fahrenheit
"Heart rate" is the resting beats per minute
justkiyaOct 22, 20182KB373
NIST Longley Regresson Test Data Set
Longley, J. W. (1967). An Appraisal of Least Squares Programs for the Electronic Computer from the Viewpoint of the User. Journal of the American Statistical Association, 62, pp. 819-841.
craig_slinkmanMar 18, 2010665B93
ICU.xlsx
The data consist of 200 subjects selected at random from a larger study done in 1988 on the survival of patients following admission to an adult intensive care unit (ICU). Data were collected at Baystate Medical Center in Springfield, MA. The purpose of the study was to develop a model to predict the probability of survival until hospital discharge and to study the risk factors associated with ICU mortality. Results have been published in a number of articles, one of which is by S. Lemeshow, D. Teres, J.S. Avrunin, and H. Pastides, “Predicting the Outcome of Intensive Care Unit Patients”, Journal of American Statistical Association, 83, 348-356 (1988). You must include the following citation in your report as the source of the data set or you will be in violation of copyright laws: Hosmer, D.W., Lemeshow, S. and Sturdivant, R.X. (2013), Applied Logistic Regression: Third Edition, Section 1.6.1, p. 23. These data are copyrighted by John Wiley & Sons Inc., and must be acknowledged and used accordingly.
anderson_instructorJan 9, 201510KB536
CAP 3CSN Movie Data
Title: Movie title Year: Year movie was released Studio: Studios categorized as Big6 or Other Studio Name: Name of studio producing the movie Genre: Action/Adventure or Other Budget: Movie budget in millions of dollars US Box Office: total box office sales in millions of dollars Opening Week: box office sales for opening week in millions of dollars Movie Length: length of movie in minutes Trailer Length: length of advertising trailer in seconds Director: Name of the movie’s director Director Gender: male or female Director Race: W (white) or POC (person of color) Star: Name of the movie’s lead star Star Gender: male or female Star Race: W (white) or POC (person of color) Costar: Name of the movie’s main costar Costar Gender: male or female Costar Race: W (white) or POC (person of color) IMDb_Rating: Average IMDb user rating on a scale of 1-10 Metascore: Score out of 100, based on major critic reviews as provided by Metacritic.com Metacriticcom_rating: Number of critic reviews used to calculate the Metascore Rotten_Tomatoes: Score out of 100, based on authors from writing guilds or film critic associations Number of Oscars: number of Oscars won by the movie Oscar Nominations: number of Oscar nominations for the movie Oscar Winner: whether the movie won an Oscar (yes or no)
alysonbutcherSep 5, 201312KB576

1 2 3   >

Always Learning