
Data sets shared by StatCrunch members
Showing 1 to 15 of 1309 data sets matching SAMPLE
Data Set/Description | Owner | Last edited | Size | Views
Criminal Recidivism in Iowa: 2010-2014
Recidivism is defined as the "tendency of a convicted criminal to reoffend". This dataset tracks former criminals from Iowa over a three-year period after their release from prison to see whether or not they were convicted of a new crime during that time. The recidivism reporting year is the fiscal year (year ending June 30) marking the end of the three-year tracking period. Included are the following variables: Fiscal Year Released (the year the individual was released from prison) and the Race, Ethnicity, Sex, and Age of the individual when released. Also included are details about the original crime committed, along with whether that individual committed a new crime (Recidivism - Return to Prison) within the three-year window.
Owner: statcrunch_featured | Last edited: Mar 21, 2018 | Size: 3MB | Views: 4149
USA Car Accidents in 2011
This data set contains information for drivers involved in car accidents in the United States during 2011. The variables include the age in years of the person (Age), the gender of the person (Gender), the month in which the accident occurred (Month), and the day of the week of the accident (DayOfWeek).
Owner: statcrunch_featured | Last edited: Sep 12, 2017 | Size: 919KB | Views: 11998
National Longitudinal Youth Survey: Weight Perception
The Youth survey consists of a nationally representative sample of youths who were 14 to 20 years old as of December 31, 1999.
This dataset tracks the Age, Height (in inches), Weight (in pounds), Gender, and the self-reported multiple-choice answers to "How would you describe your weight?" for the individuals.
Owner: statcrunch_featured | Last edited: Nov 10, 2017 | Size: 330KB | Views: 9138
2014 MLB Top 100 Batters
This data came from and has the top 100 batters by WAR (wins above replacement).
AB: At bats
R: Runs
H: Hits
2B: Doubles
3B: Triples
RBI: Runs batted in
SB: Stolen Bases
BB: Walks
SO: Strikeouts
AVG: Batting average
OBP: On Base Percentage
SLG: Slugging Percentage
OPS: OBP + SLG
WAR: Wins Above Replacement
Owner: statcrunch_featured | Last edited: Apr 3, 2017 | Size: 9KB | Views: 4442
Top 100 Retailers 2015
This dataset comes from the National Retail Federation and tracks the top retail chains in the US for 2015 based on their 2014 sales. The original data can be found at the webpage listed as the source. Note that these retailers include all sorts of avenues including internet sales.
Owner: statcrunch_featured | Last edited: Nov 10, 2017 | Size: 8KB | Views: 5042
US States Population Change 2010
This data set comes from the 2010 US Census. The states are ranked by their total population in 2010. Percent change is calculated by taking the change in population (2010 minus 2000) divided by the 2000 population.
This data set was pulled into StatCrunch using StatCrunchThis from
Owner: statcrunch_featured | Last edited: Jan 2, 2018 | Size: 2KB | Views: 3603
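The percent change described for the US States Population Change 2010 data set can be sketched in Python as follows. This is only an illustration: the function name `percent_change` and the sample populations are hypothetical, and multiplying by 100 assumes the column is stored as a percentage rather than a proportion.

```python
def percent_change(pop_2000: float, pop_2010: float) -> float:
    """Change in population (2010 minus 2000) divided by the 2000 population.

    The factor of 100 is an assumption: it expresses the ratio as a percentage.
    """
    if pop_2000 == 0:
        raise ValueError("2000 population must be non-zero")
    return 100.0 * (pop_2010 - pop_2000) / pop_2000


# Made-up figures for illustration only, not values from the data set.
example = percent_change(pop_2000=1000, pop_2010=1100)
print(example)
```

A negative result indicates that the population fell between the two censuses.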
All MLB Salaries (1985-2015)
This data has all MLB player salaries between 1985 and 2015, including the team played for, the city, and a unique ID for each player. In total, this includes 25,575 salaries for 4,963 different baseball players.
The player ID is the first 5 letters from the last name, followed by the first two letters from the first name, followed by a number in case of duplicate names. For example, bondsba01 stands for Barry Bonds with "01" because he's the first with the "bondsba" name ID.
Owner: statcrunch_featured | Last edited: Jun 27, 2017 | Size: 1MB | Views: 5565
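The player ID rule described for the All MLB Salaries data set can be sketched in Python as below. The helper `make_player_id` is hypothetical, not part of the data set or of StatCrunch. Lowercasing, dropping non-letter characters, and zero-padding the number to two digits are assumptions inferred from the single example "bondsba01".

```python
def make_player_id(first_name: str, last_name: str, sequence: int = 1) -> str:
    """Build an ID: first 5 letters of the last name, first 2 letters of the
    first name, then a number that distinguishes duplicate name IDs.

    Assumptions (not stated in the source): names are lowercased, non-letter
    characters are dropped, and the number is zero-padded to two digits.
    """
    last_part = "".join(ch for ch in last_name.lower() if ch.isalpha())[:5]
    first_part = "".join(ch for ch in first_name.lower() if ch.isalpha())[:2]
    return f"{last_part}{first_part}{sequence:02d}"


print(make_player_id("Barry", "Bonds"))  # bondsba01
```

A second player who produces the same "bondsba" prefix would be passed `sequence=2` under this sketch.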
Roller Coasters Data
This dataset looks at some of the roller coasters across the US and various other countries.
Name: Name of roller coaster
Park: Amusement park for roller coaster
City: City for amusement park
State: State abbreviation
Country: Country of the roller coaster. US: United States, MX: Mexico, CR: Costa Rica, GT: Guatemala, CO: Colombia, VE: Venezuela, BR: Brazil, AR: Argentina, CL: Chile, EQ: Ecuador, PE: Peru, F: France, D: Germany
Type: S: Steel, W: Wood
Constructor: Type of build for the roller coaster
Height: Height in meters
Speed: Speed in kilometers per hour (km/h)
Length: Length in meters
Inversions: Yes if there are inversions, no if not
Duration: Duration of ride in seconds
GForce: Max g-force
Opened: Year it opened
Region: Geographic region for the roller coaster
Owner: statcrunch_featured | Last edited: Apr 3, 2017 | Size: 48KB | Views: 8303
Cereal Brands
Data on several variables for different brands of cereal.
Number of cases: 77
Variable names:
Name: Name of cereal
mfr: Manufacturer of cereal, where A = American Home Food Products; G = General Mills; K = Kelloggs; N = Nabisco; P = Post; Q = Quaker Oats; R = Ralston Purina
type: cold or hot
calories: calories per serving
protein: grams of protein
fat: grams of fat
sodium: milligrams of sodium
fiber: grams of dietary fiber
carbo: grams of complex carbohydrates
sugars: grams of sugars
potass: milligrams of potassium
vitamins: vitamins and minerals - 0, 25, or 100, indicating the typical percentage of FDA recommended
shelf: display shelf (1, 2, or 3, counting from the floor)
weight: weight in ounces of one serving
cups: number of cups in one serving
rating: a rating of the cereals
Owner: statcrunch_featured | Last edited: Apr 3, 2017 | Size: 4KB | Views: 8646
Body Temperature
Data taken from the Journal of Statistics Education online data archive. That archive in turn got the data from an article in the Journal of the American Medical Association. (Mackowiak, et al., "A Critical Appraisal of 98.6 Degrees F …", vol. 268, pp. 1578-80, 1992).
"Body Temp" is measured in degrees Fahrenheit.
"Heart rate" is the resting heart rate in beats per minute.
Owner: statcrunch_featured | Last edited: Jun 27, 2017 | Size: 2KB | Views: 14570
Sample College Data
The following data is for the year 2011 for colleges and universities in Delaware, the District of Columbia, Maryland, Pennsylvania, Virginia and West Virginia. Variables include the college, type of college, location of college, admissions rate, SAT scores (Reading and Math, 75th percentile), tuition and fees, average amount of financial aid, enrollment (total and undergraduate), freshman retention rate, student/teacher ratio and graduation rate (5 year).
Owner: ppoconno | Last edited: Sep 16, 2019 | Size: 19KB | Views: 440
National Longitudinal Youth Survey
The Youth survey consists of a nationally representative sample of youths who were 14 to 20 years old as of December 31, 1999.
This dataset tracks the Age, Height (in inches), Weight (in pounds), Gender, and the self-reported multiple-choice answers to "How would you describe your weight?" for the individuals.
Owner: statcrunchhelp | Last edited: Mar 8, 2016 | Size: 330KB | Views: 2109
Year of 800 pennies from a local bank, sampled in 2011 (which is why frequency for 2011 is low).
Owner: anderson_instructor | Last edited: Oct 29, 2018 | Size: 4KB | Views: 1479
North Carolina birth data
A random sample of 1000 births from the state of North Carolina. Plurality refers to the number of children associated with the birth. Gender 1=Male, 2=Female. fage is age of father (years), mage is age of mother (years), visits is number of pre-natal medical visits, marital is 1=married, 2=unmarried, racemom is Race of Mother (0=Other Non-white, 1=White, 2=Black, 3=American Indian, 4=Chinese, 5=Japanese, 6=Hawaiian, 7=Filipino, 8=Other Asian or Pacific Islander), hispmom is whether mother is of Hispanic origin (C=Cuban, M=Mexican, N=Non-Hispanic, O=Other and Unknown Hispanic, P=Puerto Rican, S=Central/South American, U=Not Classifiable), gained is weight gain during pregnancy (pounds), lowbw is if birth weight is 2500 grams or lower, tpounds is birthweight in pounds, smoke is 0=no, 1=yes for mother admitted to smoking, mature is 0=no, 1=yes for mother is 35 or older, premie is 0=no, 1=yes to being born 36 weeks or sooner.
Owner: jph422 | Last edited: Sep 8, 2008 | Size: 37KB | Views: 5619
Cost of living by country
Apartment (3 bedrooms) in City Centre. Based on 0-50 contributions for Afghanistan, Aland Islands, Andorra and 81 more countries and 50-100 contributions for Albania, Algeria, Armenia and 19 more countries and over 100 contributions for Argentina, Australia, Austria and 82 more countries. The surveys were conducted by from May, 2011 to February, 2014. See this sample survey for the United States, respondents were asked "Apartment (3 bedrooms) in City Centre". Prices in current USD.
Owner: ldox618 | Last edited: Jan 6, 2015 | Size: 5KB | Views: 1350
