Data Set/Description  Owner  Last edited  Size  Views
Criminal Recidivism in Iowa: 2010-2014
Recidivism is defined as the "tendency of a convicted criminal to reoffend". This dataset tracks former criminals from Iowa over a 3-year period after their release from prison to see whether or not they were convicted of a new crime during that time. The recidivism reporting year is the fiscal year (year ending June 30) marking the end of the 3-year tracking period. Included are the following variables: Fiscal Year Released (the year the individual was released from prison), and the Race, Ethnicity, Sex, and Age of the individual when released. Also included are details about the original crime committed along with whether that individual committed a new crime (Recidivism - Return to Prison) within the 3-year window.  statcrunch_featured  Mar 21, 2018  3MB  3817
USA Car Accidents in 2011
This data set contains information for drivers involved in car accidents in the United States during 2011. The variables include the age in years of the person (Age), the gender of the person (Gender), the month in which the accident occurred (Month), and the day of the week of the accident (DayOfWeek).  statcrunch_featured  Sep 12, 2017  919KB  10354 
Cereal Brands
Data on several variables for different brands of cereal. Number of cases: 77
Variable Names:
Name: Name of cereal
mfr: Manufacturer of cereal, where A = American Home Food Products; G = General Mills; K = Kelloggs; N = Nabisco; P = Post; Q = Quaker Oats; R = Ralston Purina
type: cold or hot
calories: calories per serving
protein: grams of protein
fat: grams of fat
sodium: milligrams of sodium
fiber: grams of dietary fiber
carbo: grams of complex carbohydrates
sugars: grams of sugars
potass: milligrams of potassium
vitamins: vitamins and minerals - 0, 25, or 100, indicating the typical percentage of FDA recommended
shelf: display shelf (1, 2, or 3, counting from the floor)
weight: weight in ounces of one serving
cups: number of cups in one serving
rating: a rating of the cereals  statcrunch_featured  Apr 3, 2017  4KB  8002
D1.6
Dataset: airline_costs.dat
Source: J.W. Proctor and J.S. Duncan (1954). "A Regression Analysis
of Airline Costs," Journal of Air Law and Commerce, Vol. 21, #3, pp. 282-292.
Description: Regression relating Operating Costs per revenue ton-mile to 7 factors: length of flight, speed of plane, daily flight time per aircraft, population served, ton-mile load factor, available tons per aircraft mile, and firm's net assets. Regression based on natural logarithms of all factors, except load factor. Load factor and available tons (capacity) for Northeast Airlines were imputed from summary calculations.
Variables/columns
Airline 1-20
Length of flight (miles) 22-28
L_Group (inserted) Long (>175), Med (>60), Short (<69)
Speed of Plane (miles per hour) 30-36
Daily Flight Time per plane (hours) 38-44
Population served (1000s) 46-52
Total Operating Cost (cents per revenue ton-mile) 54-60
Revenue Tons per Aircraft mile 62-68
Ton-Mile load factor (proportion) 70-76
Available Capacity (Tons per mile) 78-84
Total Assets ($100,000s) 86-92
Investments and Special Funds ($100,000s) 94-100
Adjusted Assets ($100,000s) 102-108  housew1  Jul 3, 2019  2KB  67
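A minimal sketch of reading one record from a fixed-width file such as airline_costs.dat, assuming each variable occupies the character columns given in the listing (for example, columns 22 through 28 for length of flight) and that the inserted L_Group column is not part of the raw file. The short field names, the `parse_line` helper, and the sample row are hypothetical and are not taken from the actual data.

```python
# (field name, first column, last column), 1-indexed and inclusive.
# Field names are hypothetical; column positions follow the listing.
COLUMNS = [
    ("airline", 1, 20),
    ("length", 22, 28),
    ("speed", 30, 36),
    ("daily_hours", 38, 44),
    ("population", 46, 52),
    ("cost", 54, 60),
    ("revenue_tons", 62, 68),
    ("load_factor", 70, 76),
    ("capacity", 78, 84),
    ("total_assets", 86, 92),
    ("investments", 94, 100),
    ("adjusted_assets", 102, 108),
]


def parse_line(line):
    """Slice one fixed-width line into a dict; blank numeric fields become None."""
    record = {}
    for name, first, last in COLUMNS:
        text = line[first - 1:last].strip()
        if name == "airline":
            record[name] = text
        else:
            record[name] = float(text) if text else None
    return record


# Made-up row, built only to exercise the parser.
made_up_values = [100, 200, 5.5, 1000, 50.0, 0.1, 0.4, 0.25, 3, 2, 5]
sample = "Example Air".ljust(20) + "".join(f" {v:>7}" for v in made_up_values)

record = parse_line(sample)
print(record["airline"], record["length"], record["load_factor"])
```

Slicing by position keeps the sketch dependency-free; a library reader for fixed-width files would do the same job given the same column boundaries.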
Major League Players Elected to Hall of Fame as Players
Includes 2019 BBWAA-elected inductees Mariano Rivera, Edgar Martinez, Roy Halladay, and Mike Mussina. 31 variables for each player. Team=primary team; BBWAA=Baseball Writers Association of America; Bat: R=right, L=left, B=both;
WAR=Wins Above Replacement: number of wins the player added to the team above what an "average" replacement player would add.
CS=caught stealing.
OPS=On-base Plus Slugging; as a rule of thumb, a "good" OPS is a value that when divided by 3 results in a value that would be considered a "good" batting average.
Other variables are hopefully self-explanatory.  treiland  Jan 25, 2019  37KB  5792
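The OPS rule of thumb in this description amounts to a single division. As a rough sketch (the function name and the example value are hypothetical, not drawn from the dataset):

```python
def ops_as_batting_average(ops):
    """Apply the rule of thumb: divide OPS by 3 to get a batting-average-like value."""
    return ops / 3


# Made-up example: an OPS of .900 maps to roughly a .300 batting average.
equivalent = ops_as_batting_average(0.900)
print(f"{equivalent:.3f}")
```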
US Counties and Presidential Voting Dataset
Sampling Unit county
3141 observations and 19 variables, maximum # NAs:2956
Name
county  County
state  State
msa  Metropolitan Statistical Area
pmsa  Primary Metropolitan Statistical Area
pop.density  1992 pop per 1990 miles^2
pop  1990 population
pop.change  Percent population change 1980-1992
age6574  Percent age 65-74, 1990
age75  Percent age >= 75, 1990
crime  serious crimes per 100,000 1991
college  Percent with bachelor's degree or higher of those age>=25
income  median family income, 1989 dollars
farm  farm population, % of total, 1990
democrat  Percent votes cast for democratic president
republican  Percent votes cast for republican president
Perot  Percent votes cast for Ross Perot
white  Percent white, 1990
black  Percent black, 1990
turnout  1992 votes for president / 1990 pop x 100  craig_slinkman  Apr 12, 2011  755KB  2283 
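The turnout variable is defined as 1992 votes for president divided by 1990 population, times 100. A small sketch of that calculation, with a hypothetical function name and made-up inputs:

```python
def turnout(votes_1992, pop_1990):
    """Turnout as defined in the listing: 1992 votes / 1990 population x 100."""
    if pop_1990 <= 0:
        raise ValueError("population must be positive")
    return 100.0 * votes_1992 / pop_1990


# Made-up county: 450 votes cast among a 1990 population of 1,000.
print(turnout(450, 1000))
```

Because the numerator and denominator come from different years, values above 100 are possible in principle for fast-growing counties.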
NFL draft combine results 1999-2013
The NFL Combine occurs once per year and is used to measure the physical characteristics of potential NFL draft picks.
The data covers 1999-2013.
Variables include college, position, height, weight, 40-yard dash time, etc.
 daniel.inghram  Feb 14, 2014  324KB  2404 
The Unofficial 2014 NFL Player Census
This data set contains a number of variables on every NFL player participating in the 2014 season. Most of the variables should be self-explanatory. Salary represents the average annual salary for the player under their existing contract. Exp represents years of experience. Pro Bowler represents the number of years the player was selected for the Pro Bowl. Champ provides the number of championship teams on which the player has played. Heisman represents whether or not the player won the Heisman Trophy in college.  websterwest  May 5, 2015  321KB  2006
Skyscrapers in the U.S.
Data for buildings in the United States that are 100 meters tall or higher. The variables include the rank in terms of height (Rank), the building name (Building), the height in meters (Height), the number of floors (Floors), the year of completion (Year), materials used in construction (Materials), and the use of the building (Use). The last two variables contain multiple outcomes delimited by /. When considering these columns, consider an outcomes table (Stat > Tables > Outcomes) with / as a delimiter.  websterwest  Jan 14, 2015  148KB  2638 
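Outside StatCrunch, the same kind of outcomes table for a column with "/"-delimited values can be approximated by splitting each cell and tallying the pieces. The sketch below assumes plain strings as input; the `outcome_counts` helper and the sample values are hypothetical and do not come from the dataset.

```python
from collections import Counter


def outcome_counts(values, delimiter="/"):
    """Count each individual outcome across cells that may hold several outcomes."""
    counts = Counter()
    for value in values:
        for outcome in value.split(delimiter):
            outcome = outcome.strip()
            if outcome:  # skip empty pieces from stray delimiters
                counts[outcome] += 1
    return counts


# Made-up Materials cells, for illustration only.
materials = ["steel/concrete", "steel", "concrete/glass/steel"]
counts = outcome_counts(materials)
for outcome, n in counts.most_common():
    print(outcome, n)
```

Note that the counts sum to more than the number of buildings, since one building can contribute several outcomes.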
Titanic.xlsx
Report on the Loss of the 'Titanic' (S.S.) (1990), British Board of Trade Inquiry Report (reprint), Gloucester, UK: Allan Sutton Publishing. Taken from the Journal on Statistical Education Archive, submitted by rdawson@husky1.stmarys.ca. Dr. Craig Slinkman has recoded the data as self-explanatory nominal variables.
 craig_slinkman  Mar 23, 2010  61KB  2345 
Field Goal Data for the 2013 NFL Season
This data set contains information for every field goal attempted in the 2013 NFL season. The variables include the name of the team, the name of the kicker, the number of yards for the attempt, the outcome of the kick (Made, Missed or Blocked), the quarter of the game in which the attempt was made, the minutes/seconds left in the quarter and the number of points the kicking team was ahead before the kick (if negative the team was behind).  websterwest  Oct 10, 2014  43KB  1534 
% voting for Obama and other state statistics
This data set has over 100 statistics (current for 2010-11) for U.S. states obtained from Measure of America. Each state's percentage voting for President Obama in 2012 has been added. Which of the original variables is most highly correlated with this voting percentage? How does this data match the ideas provided by political pundits? See the source for a complete description of all variables.  websterwest  Feb 16, 2013  36KB  6480
Advanced NBA Statistics for 2013-2014 Season
N = 342; only players with at least 40 games played are included.
These are advanced metrics which attempt to evaluate, relatively speaking, how good an NBA basketball player was during the 2013-2014 season (in which Kevin Durant won the MVP Award).
Variables:
Position - what position did they play?
Age - How old was the player as of February 1, 2014?
Team - the player's team.
PER - Player Efficiency Rating; a measure of per-minute production standardized such that the league average is 15.
TS - True Shooting Percentage; a measure of shooting efficiency that takes into account 2-point field goals, 3-point field goals, and free throws.
ORB - Offensive Rebound Percentage; an estimate of the percentage of available offensive rebounds a player grabbed while he was on the floor.
DRB - Defensive Rebound Percentage; an estimate of the percentage of available defensive rebounds a player grabbed while he was on the floor.
TRB - Total Rebound Percentage; an estimate of the percentage of available rebounds a player grabbed while he was on the floor.
AST - Assist Percentage; an estimate of the percentage of teammate field goals a player assisted while he was on the floor.
STL - Steal Percentage; an estimate of the percentage of opponent possessions that end with a steal by the player while he was on the floor.
BLK - Block Percentage; an estimate of the percentage of opponent two-point field goal attempts blocked by the player while he was on the floor.
TOV - Turnover Percentage; an estimate of turnovers per 100 plays.
USG - Usage Percentage; an estimate of the percentage of team plays used by a player while he was on the floor.
ORtg - Offensive Rating; an estimate of points produced (players) or scored (teams) per 100 possessions.
DRtg - Defensive Rating; an estimate of points allowed per 100 possessions.
OWS - Offensive Win Shares; an estimate of the number of wins contributed by a player due to his offense.
DWS - Defensive Win Shares; an estimate of the number of wins contributed by a player due to his defense.
WS - Win Shares; an estimate of the number of wins contributed by a player.
 daniel.inghram  May 22, 2014  33KB  4045 
NFL Scores from 2013
Data for every NFL game of the 2013 season. Variables include the week of the game (Week), day of the week (Day), calendar date of the game (Date), winning team (Winner), losing team (Loser), whether the winning team was playing at home or away (WinnerAt), points for the winning team (PtsW), points for the losing team (PtsL), total points scored by both teams (TotalPts), yards for the winning team (YdsW), yards for the losing team (YdsL), turnovers for the winning team (TOW) and turnovers for the losing team (TOL). There are also additional columns of TieHome and TieAway indicating the two teams that played the only tie game of the 2013 season in week 12. The Week column also denotes the nature of each playoff game.  statcrunchhelp  Oct 3, 2014  26KB  1322
Local College Data
The following data is for the year 2011 for colleges and universities in Delaware, the District of Columbia, Maryland, Pennsylvania, Virginia and West Virginia. Variables in the data set include the college, type of college, location of college, admissions rate, SAT scores (Reading and Math, 75th percentile), tuition and fees, average amount of financial aid, enrollment (total and undergraduate), freshman retention rate, student/teacher ratio and graduation rate (5-year).  jpalmateer  Sep 23, 2013  24KB  1668
