
Data sets shared by StatCrunch members
Showing 1 to 15 of 381 data sets matching VARIABLE
Data Set/Description 
Owner 
Last edited 
Size 
Views 
Criminal Recidivism in Iowa: 20102014
Recidivism is defined as the "tendency of a convicted criminal to reoffend". This dataset tracks former criminals from Iowa over a 3 year period after their release from prison to see whether or not they were convicted of a new crime during that time. The recidivism reporting year is the fiscal year (year ending June 30) marking the end of the three year tracking period. Included are the following variables: Fiscal Year Released (the year the individual was released from Prison), the Race, Ethnicity, Sex, and Age of individual when released. Also included are details about the original crime committed along with whether that individual committed a new crime (Recidivism  Return to Prison) within the 3 year window.  statcrunch_featured  Mar 21, 2018  3MB  3644 
USA Car Accidents in 2011
This data set contains information for drivers involved in car accidents in the United States during 2011. The variables include the age in years of the person (Age), the gender of the person (Gender), the month in which the accident occurred (Month), and the day of the week of the accident (DayOfWeek).  statcrunch_featured  Sep 12, 2017  919KB  9797 
Cereal Brands
Data on several variable of different brands of cereal. Number of cases: 77 Variable Names: Name: Name of cereal mfr: Manufacturer of cereal where A = American Home Food Products; G = General Mills; K = Kelloggs; N = Nabisco; P = Post; Q = Quaker Oats; R = Ralston Purina type: cold or hot calories: calories per serving protein: grams of protein fat: grams of fat sodium: milligrams of sodium fiber: grams of dietary fiber carbo: grams of complex carbohydrates sugars: grams of sugars potass: milligrams of potassium vitamins: vitamins and minerals  0, 25, or 100, indicating the typical percentage of FDA recommended shelf: display shelf (1, 2, or 3, counting from the floor) weight: weight in ounces of one serving cups: number of cups in one serving rating: a rating of the cereals  statcrunch_featured  Apr 3, 2017  4KB  7594 
cell_phones.xls
Q1. Based on a recent study, roughly 80% of college students in the U.S. own a cell phone. Do the data provide evidence that the proportion of students who own cell phones in this university is lower than the national figure?
Answer. Most likely not. Ownership of cellphones and ratios do not depend on anything.
Relevant Variables  The cell is the relevant variable and it is categorical.
Analyze Data  The formal analysis of Q1 will pinpoint on searching the population proportion. The correct statistical test is the one sample ztest for the proportion.
Null Hypothesis  Ho: p = 8
Alternative Hypothesis  Ha: p < .8
Outcomes: Cell
Success: yes
Test stat z = .71, pvalue is .239 > .05, so Ho cannot be rejected.
Roughly 78% of the students sampled own a cellphone. Even though 78% percent is less than 80%, there is not enough support to conclude that the exact data holds right for the whole college or that it would be different from the national proportion.  faithnwanne  May 3, 2019  8KB  380 
Major League Players Elected to Hall of Fame as Players
Includes 2019 BBWAAelected inductees Mariano Rivera, Edgar Martinez, Roy Halladay, and Mike Mussina. 31 variables for each player. Team=primary team; BBWAA=Baseball Writers Association of America; Bat: R=right, L=left, B=both;
WAR=Wins Against Replacement: number of wins the player added to the team above what an "average" replacement player would add.
CS=caught stealing.
OPS=Onbase Plus Slugging; as a rule of thumb, a "good" OPS is a value that when divided by 3 results in a value that would be considered a "good" batting average.
Other variables are hopefully selfexplanatory.  treiland  Jan 25, 2019  37KB  5750 
Attendance Vs. Grade
Compares percent of classes attended with final grade in the class.
If you use % missed as the independent variable, you end up with a regression model that allows for interpretation of the intercept and has a negative slope.  lbgreen  Jan 28, 2019  744B  1175 
US Counties and Presidential Voting Dataset
Sampling Unit county
3141 observations and 19 variables, maximum # NAs:2956
Name
county  County
state  State
msa  Metropolitan Statistical Area
pmsa  Primary Metropolitan Statistical Area
pop.density  1992 pop per 1990 miles^2
pop  1990 population
pop.change  Percent population change 19801992
age6574  Percent age 6574, 1990
age75  Percent age >= 75, 1990
crime  serious crimes per 100,000 1991
college  Percent with bachelor's degree or higher of those
age>=25
income  median family income, 1989 dollars
farm  farm population, % of total, 1990
democrat  Percent votes cast for democratic president
republican  Percent votes cast for republican president
Perot  Percent votes cast for Ross Perot
white  Percent white, 1990
black  Percent black, 1990
turnout  1992 votes for president / 1990 pop x 100  craig_slinkman  Apr 12, 2011  755KB  2244 
Class Seating vs Grade
From Body Image Data Set:
"A student survey was conducted at a major university. Data were collected from a random sample of 239 undergraduate students".
Variables:
Gender  Male or Female,
GPA  Student's cumulative college GPA.
GPA is then converted to Grades
(where,
4.33 = A+,
4.00 = A,
3.67 = A,
3.33 = B+,
3.00 = B,
2.67 = B,
2.33 = C+,
2.00 = C,
1.67 = C).
Seat  Typical classroom seat location (Front & Back)  mallirhea86  Oct 26, 2018  2KB  3605 
Nfl draft combine results 19992013
The NFL Combine occurs once per year and is used to measure the physical characteristics of potential NFL draft picks.
The data covers 19992013.
Variables include college, position, height, weight, 40 yard dash time, etc.
 daniel.inghram  Feb 14, 2014  324KB  2344 
The Unofficial 2014 NFL Player Census
This data set contains a number of variables on every NFL player participating in the 2014 season. Most of the variables should be self explanatory. Salary represents the average annual salary for the player under their existing contract. Exp represents years of experience. Pro Bowler represents the number of years the player was selected for the pro bowl. Champ provides the number of championship teams on which the player has played. Heisman represents whether or not the player won the Heisman trophy in college.  websterwest  May 5, 2015  321KB  1982 
Skyscrapers in the U.S.
Data for buildings in the United States that are 100 meters tall or higher. The variables include the rank in terms of height (Rank), the building name (Building), the height in meters (Height), the number of floors (Floors), the year of completion (Year), materials used in construction (Materials), and the use of the building (Use). The last two variables contain multiple outcomes delimited by /. When considering these columns, consider an outcomes table (Stat > Tables > Outcomes) with / as a delimiter.  websterwest  Jan 14, 2015  148KB  2556 
Titanic.xlsx
Report on the Loss of the â€˜Titanicâ€™ (S.S.) (1990), British Board of Trade Inquiry Report (reprint), Gloucester, UK: Allan Sutton Publishing. Taken from the Journal on Statistical Education Archive, submitted by rdawson@husky1.stmarys.ca. Dr. Craig Slinkman has recoded the data as selfexplanatory nominal variables. yes craig_slinkman Mar 23, 2010 68KB 5
 craig_slinkman  Mar 23, 2010  61KB  2283 
Field Goal Data for the 2013 NFL Season
This data set contains information for every field goal attempted in the 2013 NFL season. The variables include the name of the team, the name of the kicker, the number of yards for the attempt, the outcome of the kick (Made, Missed or Blocked), the quarter of the game in which the attempt was made, the minutes/seconds left in the quarter and the number of points the kicking team was ahead before the kick (if negative the team was behind).  websterwest  Oct 10, 2014  43KB  1514 
% voting for Obama and other state statistics
This data set has over 100 statistics (current for 201011) for U.S. states obtained from Measure of America. Each state's percentage voting for President Obama in 2012 has been added. Which of the original variables is most highly correlated with this voting percentage? How does this data match the ideas provided by political pundits? See the source for a complete description of all variables.  websterwest  Feb 16, 2013  36KB  6435 
Advanced NBA Statistics for 20132014 Season
N = 342; only players with at least 40 games played are included.
These are advanced metrics which attempt to evaluate, relatively speaking, how good an NBA basketball player was during the 20132014 (in which Kevin Durant won the MVP Award).
Variables..........Position  what position did they play?..... Age  How old was the player as of February 1, 2014?..... Team  Obvious..... PER  Player Efficiency Rating; a measure of perminute production standardized such that the league average is 15.....
TS  True Shooting Percentage; a measure of shooting effeciency that takes into account 2point field goals, 3point field goals, and free throws.....
ORB  Offensive Rebound Percentage; an estimate of the percentage of available offensive rebounds a player grabbed while he was on the floor.....
DRB  Defensive Rebound Percentage; an estimate of the percentage of available defensive rebounds a player grabbed while he was on the floor.....
TRB  Total Rebound Percentage; an estimate of the percentage of available rebounds a player grabbed while he was on the floor.....
AST  Assist Percentage; an estimate of the percentage of teammate field goals a player assisted while he was on the floor.....
STL  Steal Percentage; an estimate of the percentage of opponent possessions that end with a steal by the player while he was on the floor.....
BLK  Block Percentage; an estimate of the percentage of opponent twopoint field goal attempts blocked by the player while he was on the floor.....
TOV  Turnover Percentage; an estimate of turnovers per 100 plays.....
USG  Usage Percentage; an estimate of the percentage of team plays used by a player while he was on the floor.....
ORtg  Offensive Rating: An estimate of points produced (players) or scored (teams) per 100 possessions.....
DRtg  Defensive Rating: An estimate of points allowed per 100 possessions.....
OWS  Offensive Win Shares; an estimate of the number of wins contributed by a player due to his offense.....
DWS  Defensive Win Shares; an estimate of the number of wins contributed by a player due to his defense.....
WS  Win Shares; an estimate of the number of wins contributed by a player.
 daniel.inghram  May 22, 2014  33KB  3953 

