
Data sets shared by StatCrunch members
Showing 1 to 15 of 381 data sets matching VARIABLE
Data Set/Description Owner Last edited Size Views
Criminal Recidivism in Iowa: 2010-2014
Recidivism is defined as the "tendency of a convicted criminal to reoffend". This dataset tracks former criminals from Iowa over a 3-year period after their release from prison to see whether or not they were convicted of a new crime during that time. The recidivism reporting year is the fiscal year (year ending June 30) marking the end of the three-year tracking period. Included are the following variables: Fiscal Year Released (the year the individual was released from prison), and the Race, Ethnicity, Sex, and Age of the individual when released. Also included are details about the original crime committed, along with whether that individual committed a new crime (Recidivism - Return to Prison) within the 3-year window.
statcrunch_featured | Mar 21, 2018 | 3MB | 3644
USA Car Accidents in 2011
This data set contains information for drivers involved in car accidents in the United States during 2011. The variables include the age in years of the person (Age), the gender of the person (Gender), the month in which the accident occurred (Month), and the day of the week of the accident (DayOfWeek).
statcrunch_featured | Sep 12, 2017 | 919KB | 9797
Cereal Brands
Data on several variables for different brands of cereal. Number of cases: 77
Variable Names:
Name: Name of cereal
mfr: Manufacturer of cereal, where A = American Home Food Products; G = General Mills; K = Kelloggs; N = Nabisco; P = Post; Q = Quaker Oats; R = Ralston Purina
type: cold or hot
calories: calories per serving
protein: grams of protein
fat: grams of fat
sodium: milligrams of sodium
fiber: grams of dietary fiber
carbo: grams of complex carbohydrates
sugars: grams of sugars
potass: milligrams of potassium
vitamins: vitamins and minerals - 0, 25, or 100, indicating the typical percentage of FDA recommended
shelf: display shelf (1, 2, or 3, counting from the floor)
weight: weight in ounces of one serving
cups: number of cups in one serving
rating: a rating of the cereals
statcrunch_featured | Apr 3, 2017 | 4KB | 7594
cell_phones.xls
Q1. Based on a recent study, roughly 80% of college students in the U.S. own a cell phone. Do the data provide evidence that the proportion of students who own cell phones at this university is lower than the national figure? Answer. Most likely not. Ownership of cell phones and ratios do not depend on anything. Relevant Variables - Cell is the relevant variable, and it is categorical. Analyze Data - The formal analysis of Q1 focuses on the population proportion. The appropriate statistical test is the one-sample z-test for a proportion. Null Hypothesis - Ho: p = .8 Alternative Hypothesis - Ha: p < .8 Outcomes: Cell; Success: yes. Test statistic z = -.71; the p-value is .239 > .05, so Ho cannot be rejected. Roughly 78% of the students sampled own a cell phone. Even though 78% is less than 80%, there is not enough evidence to conclude that the proportion for the whole university differs from the national proportion.
faithnwanne | May 3, 2019 | 8KB | 380
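The one-sample z-test described in the entry above can be sketched as follows. This is only an illustration: the description does not state the sample size, so the counts used here (156 of 200) are hypothetical values chosen to give a sample proportion of 78%, and the function names are made up for this sketch.

```python
import math
from typing import Tuple


def normal_cdf(z: float) -> float:
    """Standard normal CDF, computed from the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def lower_tail_prop_ztest(successes: int, n: int, p0: float) -> Tuple[float, float]:
    """Lower-tailed z-test of Ho: p = p0 against Ha: p < p0."""
    p_hat = successes / n
    se = math.sqrt(p0 * (1.0 - p0) / n)  # standard error under Ho
    z = (p_hat - p0) / se
    return z, normal_cdf(z)  # lower-tail p-value


# p-value implied by the reported test statistic z = -.71
p_reported = normal_cdf(-0.71)
print(round(p_reported, 3))

# Hypothetical counts (NOT from the data set): 156 owners out of 200 students
z, p_value = lower_tail_prop_ztest(successes=156, n=200, p0=0.8)
print(round(z, 2), round(p_value, 3))
```

Because the p-value exceeds .05 in both cases, Ho is not rejected, which matches the conclusion in the description.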
Major League Players Elected to Hall of Fame as Players
Includes 2019 BBWAA-elected inductees Mariano Rivera, Edgar Martinez, Roy Halladay, and Mike Mussina. 31 variables for each player. Team=primary team; BBWAA=Baseball Writers Association of America; Bat: R=right, L=left, B=both; WAR=Wins Above Replacement: number of wins the player added to the team above what an "average" replacement player would add. CS=caught stealing. OPS=On-base Plus Slugging; as a rule of thumb, a "good" OPS is a value that when divided by 3 results in a value that would be considered a "good" batting average. Other variables are hopefully self-explanatory.
treiland | Jan 25, 2019 | 37KB | 5750
Attendance Vs. Grade
Compares percent of classes attended with final grade in the class. If you use % missed as the independent variable, you end up with a regression model that allows for interpretation of the intercept and has a negative slope.
lbgreen | Jan 28, 2019 | 744B | 1175
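The remark in the entry above about using % missed as the independent variable can be illustrated with a small least-squares sketch. The attendance and grade values below are hypothetical, not taken from the data set, and `fit_line` is a helper written for this example.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: percent of classes attended and final grade
attended = [100, 95, 90, 80, 70, 60]
grade = [94, 90, 85, 78, 70, 61]
missed = [100 - a for a in attended]  # percent of classes missed

slope_att, intercept_att = fit_line(attended, grade)
slope_mis, intercept_mis = fit_line(missed, grade)

# With % attended, the intercept is the predicted grade at 0% attendance,
# an extrapolation far outside the observed data.
print(slope_att, intercept_att)

# With % missed, the slope changes sign and the intercept is the predicted
# grade for a student who missed no classes.
print(slope_mis, intercept_mis)
```

The two fits describe the same line; only the intercept's meaning changes, which is why the % missed version is easier to interpret.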
US Counties and Presidential Voting Dataset
Sampling unit: county. 3141 observations and 19 variables, maximum # NAs: 2956
Variables:
county -- County
state -- State
msa -- Metropolitan Statistical Area
pmsa -- Primary Metropolitan Statistical Area
pop.density -- 1992 pop per 1990 miles^2
pop -- 1990 population
pop.change -- Percent population change 1980-1992
age6574 -- Percent age 65-74, 1990
age75 -- Percent age >= 75, 1990
crime -- serious crimes per 100,000 1991
college -- Percent with bachelor's degree or higher of those age>=25
income -- median family income, 1989 dollars
farm -- farm population, % of total, 1990
democrat -- Percent votes cast for democratic president
republican -- Percent votes cast for republican president
Perot -- Percent votes cast for Ross Perot
white -- Percent white, 1990
black -- Percent black, 1990
turnout -- 1992 votes for president / 1990 pop x 100
craig_slinkman | Apr 12, 2011 | 755KB | 2244
Class Seating vs Grade
From Body Image Data Set: "A student survey was conducted at a major university. Data were collected from a random sample of 239 undergraduate students". Variables: Gender - Male or Female, GPA - Student's cumulative college GPA. GPA is then converted to Grades (where, 4.33 = A+, 4.00 = A, 3.67 = A-, 3.33 = B+, 3.00 = B, 2.67 = B-, 2.33 = C+, 2.00 = C, 1.67 = C-). Seat - Typical classroom seat location (Front & Back)
mallirhea86 | Oct 26, 2018 | 2KB | 3605
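One way the GPA-to-grade conversion in the entry above could work is sketched below. The description lists only exact grade points, so the rule used here (map each GPA to the nearest listed grade point, and return nothing for GPAs below 1.50 or above 4.33) is an assumption made for this example, not the data set owner's stated method.

```python
# Grade points exactly as listed in the description
GRADE_POINTS = [
    (4.33, "A+"), (4.00, "A"), (3.67, "A-"),
    (3.33, "B+"), (3.00, "B"), (2.67, "B-"),
    (2.33, "C+"), (2.00, "C"), (1.67, "C-"),
]


def gpa_to_grade(gpa):
    """Map a GPA to the letter grade with the nearest listed grade point.

    Assumption: GPAs outside the listed scale (below 1.50 or above 4.33)
    have no grade in the description, so None is returned for them.
    """
    if gpa < 1.50 or gpa > 4.33:
        return None
    return min(GRADE_POINTS, key=lambda gp: abs(gp[0] - gpa))[1]


print(gpa_to_grade(4.00))
print(gpa_to_grade(3.60))
print(gpa_to_grade(1.00))
```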
NFL draft combine results 1999-2013
The NFL Combine occurs once per year and is used to measure the physical characteristics of potential NFL draft picks. The data covers 1999-2013. Variables include college, position, height, weight, 40 yard dash time, etc.
daniel.inghram | Feb 14, 2014 | 324KB | 2344
The Unofficial 2014 NFL Player Census
This data set contains a number of variables on every NFL player participating in the 2014 season. Most of the variables should be self-explanatory. Salary represents the average annual salary for the player under their existing contract. Exp represents years of experience. Pro Bowler represents the number of years the player was selected for the Pro Bowl. Champ provides the number of championship teams on which the player has played. Heisman represents whether or not the player won the Heisman Trophy in college.
websterwest | May 5, 2015 | 321KB | 1982
Skyscrapers in the U.S.
Data for buildings in the United States that are 100 meters tall or higher. The variables include the rank in terms of height (Rank), the building name (Building), the height in meters (Height), the number of floors (Floors), the year of completion (Year), materials used in construction (Materials), and the use of the building (Use). The last two variables contain multiple outcomes delimited by /. When considering these columns, consider an outcomes table (Stat > Tables > Outcomes) with / as a delimiter.
websterwest | Jan 14, 2015 | 148KB | 2556
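The idea behind tabulating a column whose cells hold several outcomes delimited by / can be sketched as follows. This is not StatCrunch's implementation of Stat > Tables > Outcomes; the `outcome_counts` helper and the sample values are hypothetical.

```python
from collections import Counter


def outcome_counts(values, delimiter="/"):
    """Count each individual outcome in cells that may hold several outcomes."""
    counts = Counter()
    for value in values:
        for outcome in value.split(delimiter):
            outcome = outcome.strip()
            if outcome:  # skip empty pieces such as a trailing delimiter
                counts[outcome] += 1
    return counts


# Hypothetical values for a column like Use (NOT taken from the data set)
uses = ["office", "hotel/residential", "office/hotel", "residential"]
counts = outcome_counts(uses)
for outcome, count in sorted(counts.items()):
    print(outcome, count)
```

A building with several uses contributes to each of its outcomes, so the counts can add up to more than the number of rows.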
Titanic.xlsx
Report on the Loss of the ‘Titanic’ (S.S.) (1990), British Board of Trade Inquiry Report (reprint), Gloucester, UK: Alan Sutton Publishing. Taken from the Journal of Statistics Education Archive, submitted by rdawson@husky1.stmarys.ca. Dr. Craig Slinkman has recoded the data as self-explanatory nominal variables.
craig_slinkman | Mar 23, 2010 | 61KB | 2283
Field Goal Data for the 2013 NFL Season
This data set contains information for every field goal attempted in the 2013 NFL season. The variables include the name of the team, the name of the kicker, the number of yards for the attempt, the outcome of the kick (Made, Missed or Blocked), the quarter of the game in which the attempt was made, the minutes/seconds left in the quarter and the number of points the kicking team was ahead before the kick (if negative the team was behind).
websterwest | Oct 10, 2014 | 43KB | 1514
% voting for Obama and other state statistics
This data set has over 100 statistics (current for 2010-11) for U.S. states obtained from Measure of America. Each state's percentage voting for President Obama in 2012 has been added. Which of the original variables is most highly correlated with this voting percentage? How does this data match the ideas provided by political pundits? See the source for a complete description of all variables.
websterwest | Feb 16, 2013 | 36KB | 6435
Advanced NBA Statistics for 2013-2014 Season
N = 342; only players with at least 40 games played are included. These are advanced metrics which attempt to evaluate, relatively speaking, how good an NBA basketball player was during the 2013-2014 season (in which Kevin Durant won the MVP Award).
Variables:
Position -- what position did they play?
Age -- How old was the player as of February 1, 2014?
Team -- Obvious
PER -- Player Efficiency Rating; a measure of per-minute production standardized such that the league average is 15
TS -- True Shooting Percentage; a measure of shooting efficiency that takes into account 2-point field goals, 3-point field goals, and free throws
ORB -- Offensive Rebound Percentage; an estimate of the percentage of available offensive rebounds a player grabbed while he was on the floor
DRB -- Defensive Rebound Percentage; an estimate of the percentage of available defensive rebounds a player grabbed while he was on the floor
TRB -- Total Rebound Percentage; an estimate of the percentage of available rebounds a player grabbed while he was on the floor
AST -- Assist Percentage; an estimate of the percentage of teammate field goals a player assisted while he was on the floor
STL -- Steal Percentage; an estimate of the percentage of opponent possessions that end with a steal by the player while he was on the floor
BLK -- Block Percentage; an estimate of the percentage of opponent two-point field goal attempts blocked by the player while he was on the floor
TOV -- Turnover Percentage; an estimate of turnovers per 100 plays
USG -- Usage Percentage; an estimate of the percentage of team plays used by a player while he was on the floor
ORtg -- Offensive Rating; an estimate of points produced (players) or scored (teams) per 100 possessions
DRtg -- Defensive Rating; an estimate of points allowed per 100 possessions
OWS -- Offensive Win Shares; an estimate of the number of wins contributed by a player due to his offense
DWS -- Defensive Win Shares; an estimate of the number of wins contributed by a player due to his defense
WS -- Win Shares; an estimate of the number of wins contributed by a player
daniel.inghram | May 22, 2014 | 33KB | 3953

