StatCrunch logo (home)

Data sets shared by StatCrunch members
Showing 1 to 15 of 818 data sets matching one
Data Set/Description Owner Last edited Size Views
Tattoo and HepC 136
HepC gives whether individual tests positive/negative for Hepatitis C. Comm is individual has tattoos from commercial tattoo parlor, Else is tattoo from somewhere other than commercial parlor, None implied has no tattoos
kfoongDec 9, 201963B26
PRRS Vaccine Final
A researcher was testing the effectiveness of vaccines on the swine disease called PRRS. The researcher randomly split a group of 650 swine into 13 groups of 50 swine. Each group was randomly assigned to one of 4 treatment groups. Each treatment group was given the specific treatment and was then injected with the PRRS virus. The results show the number of swine that test positive for PRRS 30 days after infection.
mariebuseDec 2, 2019127B54
Criminal Recidivism in Iowa: 2010-2014
Recidivism is defined as the "tendency of a convicted criminal to reoffend". This dataset tracks former criminals from Iowa over a 3 year period after their release from prison to see whether or not they were convicted of a new crime during that time. The recidivism reporting year is the fiscal year (year ending June 30) marking the end of the three year tracking period. Included are the following variables: Fiscal Year Released (the year the individual was released from Prison), the Race, Ethnicity, Sex, and Age of individual when released. Also included are details about the original crime committed along with whether that individual committed a new crime (Recidivism - Return to Prison) within the 3 year window.
statcrunch_featuredMar 21, 20183MB4436
USA Car Accidents in 2011
This data set contains information for drivers/passengers involved in fatal car accidents in the United States during 2011. The variables include the age in years of the person (Age), the gender of the person (Gender), the month in which the accident occurred (Month), and the day of the week of the accident (DayOfWeek).
statcrunch_featuredSep 12, 2017919KB12756
Movie Budgets and Box Office Earnings (Updated Spring 2018)
This data all comes from the following website the tracks the financial performance of movies:
http://www.the-numbers.com/movie/budgets/all

The “Budget”, “Domestic Gross”, and “Worldwide Gross” columns each are in millions of dollars.

statcrunch_featuredOct 4, 2018270KB18172
Thanksgiving 2015 Poll Data
This data was collected using a SurveyMonkey poll conducted on November 17th, 2015. Originally there were 1,058 respondents. The following where the original questions summarized in this data set:
Do you celebrate Thanksgiving?
What is typically the main dish at your Thanksgiving dinner?
How is the main dish typically cooked?
What kind of stuffing/dressing do you typically have?
What type of cranberry sauce do you typically have?
Do you typically have gravy?
Which of these side dishes are typically served at your Thanksgiving dinner? Please select all that apply.
Corn
Green beans/green bean casserole
Mashed potatoes
Rolls/biscuits
Yams/sweet potato casserole
Which type of pie is typically served at your Thanksgiving dinner? Please select all that apply.
Apple
Pecan
Pumpkin
Which of these desserts do you typically have at Thanksgiving dinner? Please select all that apply.
Do you typically pray before or after the Thanksgiving meal?
How far will you travel for Thanksgiving?
Will you watch any of the following programs on Thanksgiving?
Macy's Parade What's the age cutoff at your "kids' table" at Thanksgiving?
Have you ever tried to meet up with hometown friends on Thanksgiving night?
Have you ever attended a "Friendsgiving?"
Will you shop any Black Friday sales on Thanksgiving Day?
Do you work in retail?
Will you employer make you work on Black Friday?
How would you describe where you live?
Age
What is your gender?
How much total combined money did all members of your HOUSEHOLD earn last year?
What US Region do you live in?
statcrunch_featuredNov 14, 2018204KB5807
2014 MLB Top 100 Batters
This data came from ESPN.com and has the top 100 batters by WAR (wins above replacement). AB: At bats R: Runs H: Hits 2B: Doubles 3B: Triples RBI: Runs batted in SB: Stolen Bases BB: Walks SO: Strikeouts AVG: Batting average OBP: On Base Percentage SLG: Slugging Percentage OPS: OBP + SLG WAR: Wins Above Replacement
statcrunch_featuredApr 3, 20179KB4717
Top 100 Retailers 2015
This dataset comes from the National Retail Federation and tracks the top retail chains in the US for 2015 based on their 2014 sales. The original data can be found at the webpage listed as the source. Note that these retailers include all sorts of avenues including internet sales.
statcrunch_featuredNov 10, 20178KB5282
US States Population Change 2010
This data set comes from the 2010 US Census. The states are ranked by their total population on 2010. Percent change is calculated by taking the change in population (2010-2000) divided by the 2000 population.
This data set was pulled into StatCrunch using StatCrunchThis from http://en.wikipedia.org/wiki/2010_United_States_Census.
statcrunch_featuredJan 2, 20182KB3800
All MLB Salaries (1985-2015)
This data has all MLB player salaries between 1985-2015 including the team played for, the city, and a unique ID for each player. Total this includes 25,575 salaries for 4,963 different baseball players.
The player ID is the first 5 letters from the last name, followed by the first two letters from the first name, followed by a number in case of duplicate names. For example, bondsba01 stands for Barry Bonds with "01" because he's the first with the "bondsba" name ID.
statcrunch_featuredJun 27, 20171MB5815
Roller Coasters Data
This dataset looks at some of the roller coasters across the US and various other countries.
ColumnDescription
NameName of roller coaster
ParkAmusement park for roller coaster
CityCity for amusement park
StateState abbreviation
CountryCountry of the roller coaster. US: United States, MX: Mexico, CR: Costa Rica, GT: Guatemala, CO: Columbia, VE: Venezuela, BR: Brazil, AR: Argentina, CL: Chile, EQ: Ecuador, PE: Peru, F: France, D: Germany
TypeS: Steel, W: Wood
ConstructorType of build for the roller coaster
HeightHeight in meters
SpeedSpeed in kilometers per hour (km/h)
LengthLength in meters
InversionsYes if there are inversions, no if not
DurationDuration of ride in seconds
GForceMax g-force
OpenedYear it opened
RegionGeographic region for the roller coaster
statcrunch_featuredApr 3, 201748KB8597
Cereal Brands
Data on several variable of different brands of cereal. Number of cases: 77 Variable Names: Name: Name of cereal mfr: Manufacturer of cereal where A = American Home Food Products; G = General Mills; K = Kelloggs; N = Nabisco; P = Post; Q = Quaker Oats; R = Ralston Purina type: cold or hot calories: calories per serving protein: grams of protein fat: grams of fat sodium: milligrams of sodium fiber: grams of dietary fiber carbo: grams of complex carbohydrates sugars: grams of sugars potass: milligrams of potassium vitamins: vitamins and minerals - 0, 25, or 100, indicating the typical percentage of FDA recommended shelf: display shelf (1, 2, or 3, counting from the floor) weight: weight in ounces of one serving cups: number of cups in one serving rating: a rating of the cereals
statcrunch_featuredApr 3, 20174KB9167
Body Temperature
Data taken from the Journal of Statistics Education online data archive. That archive in turn got the data from an article in the Journal of the American Medical Association. (Mackowiak, et al., "A Critical Appraisal of 98.6 Degrees F …", vol. 268, pp. 1578-80, 1992).
"Body Temp" is measured in degrees fahrenheit
"Heart rate" is the resting beats per minute
statcrunch_featuredJun 27, 20172KB15383
Mean Weights of Boys Ages 2 to 12
I'm using this for Modeling Linear Associations. It has a decent linear correlation coefficient. A linear regression produces the stats and scatter plot with a polynomial of order one trend line overlay which can be used to illustrate extrapolation/interpolation, error estimates, and model breakdown. For over/underestimates and error, interpolate mean weights for 3 and 5 year olds and compare with observed mean weights of 31.0 pounds and 40.5 pounds, respectively. For model breakdown, adjust the x-axis of the scatter plot to range between 0 and 20, with integer tick marks, and the y-axis to range between 0 and 200, with tick marks 0, 10, 20, ..., 200, and an extrapolation for mean weight at age 20 will suggest a weight somewhere near 135 lbs for a 20 year old male.
kcramerOct 26, 2019110B543
Death rolls
"Death roll of the Alligator, Mechanics of Twist and Feeding in Water" by Fish, et al. One variable measured was the degree of the angle between the body and head of the alligator while performing the roll. Data is in degrees
m.smith96Sep 26, 2019142B346

1 2 3 4 5 6 7 8 9 10   >

Always Learning