StatCrunch logo (home)

Data sets shared by StatCrunch members
Showing 1 to 15 of 226 data sets matching name
Data Set/Description Owner Last edited Size Views
Super Heroes
This data set originally came from the following website: It contains various physical characteristics for over 700 fictional comic book super heroes.
statcrunch_featuredAug 1, 201847KB6834
Flight Delay Data For July 2014
This data set contains information on the flight delays for each airline at each U.S. airport in July of 2014. The columns include the carrier, airport city/state, airport code, airport name, total number of flights (Flights), the number of delayed flights (Delayed), the number of cancelled flights (Cancelled), the number of diverted flights (Diverted), the number of on-time flights (On-time), and the on-time percentage (On-time Percentage).
statcrunch_featuredJan 2, 201888KB6359
US Presidential Election History
This dataset tracks the US presidential election results dating back to 1824. Included is the winning candidate, winning party, popular voting totals, margin of victory, and the electoral college totals. Also included is the name and party of the runner-up along with the percentage of all eligible voters that turned out for the election (Voter Turnout Percentage).
statcrunch_featuredFeb 20, 20185KB2704
All MLB Salaries (1985-2015)
This data has all MLB player salaries between 1985-2015 including the team played for, the city, and a unique ID for each player. Total this includes 25,575 salaries for 4,963 different baseball players.
The player ID is the first 5 letters from the last name, followed by the first two letters from the first name, followed by a number in case of duplicate names. For example, bondsba01 stands for Barry Bonds with "01" because he's the first with the "bondsba" name ID.
statcrunch_featuredJun 27, 20171MB5130
Roller Coasters Data
This dataset looks at some of the roller coasters across the US and various other countries.
NameName of roller coaster
ParkAmusement park for roller coaster
CityCity for amusement park
StateState abbreviation
CountryCountry of the roller coaster. US: United States, MX: Mexico, CR: Costa Rica, GT: Guatemala, CO: Columbia, VE: Venezuela, BR: Brazil, AR: Argentina, CL: Chile, EQ: Ecuador, PE: Peru, F: France, D: Germany
TypeS: Steel, W: Wood
ConstructorType of build for the roller coaster
HeightHeight in meters
SpeedSpeed in kilometers per hour (km/h)
LengthLength in meters
InversionsYes if there are inversions, no if not
DurationDuration of ride in seconds
GForceMax g-force
OpenedYear it opened
RegionGeographic region for the roller coaster
statcrunch_featuredApr 3, 201748KB6544
Cereal Brands
Data on several variable of different brands of cereal. Number of cases: 77 Variable Names: Name: Name of cereal mfr: Manufacturer of cereal where A = American Home Food Products; G = General Mills; K = Kelloggs; N = Nabisco; P = Post; Q = Quaker Oats; R = Ralston Purina type: cold or hot calories: calories per serving protein: grams of protein fat: grams of fat sodium: milligrams of sodium fiber: grams of dietary fiber carbo: grams of complex carbohydrates sugars: grams of sugars potass: milligrams of potassium vitamins: vitamins and minerals - 0, 25, or 100, indicating the typical percentage of FDA recommended shelf: display shelf (1, 2, or 3, counting from the floor) weight: weight in ounces of one serving cups: number of cups in one serving rating: a rating of the cereals
statcrunch_featuredApr 3, 20174KB7594
Fatal Encounters Updated September 2018
This data set was downloaded from Fatal Encounters, a non-profit organization that is collecting data on Police Involved Deaths. This data set has been truncated to include the subject's name, age at time of death, subject's gender, subjects race, location of death, cause of death and year of death. This does not only include people shot by police, but there are also instances of police that died during fatal encounters. It is good to remind students using this data set that this is a volunteer agency collecting the data from people that are scouring news articles for evidence of these fatal encounters, thus it is not a complete population of fatal encounters, only a very, very large sample.
habarkerApr 8, 20193MB162
US Counties and Presidential Voting Dataset
Sampling Unit county 3141 observations and 19 variables, maximum # NAs:2956 Name county -- County state -- State msa -- Metropolitan Statistical Area pmsa -- Primary Metropolitan Statistical Area pop.density -- 1992 pop per 1990 miles^2 pop -- 1990 population pop.change -- Percent population change 1980-1992 age6574 -- Percent age 65-74, 1990 age75 -- Percent age >= 75, 1990 crime -- serious crimes per 100,000 1991 college -- Percent with bachelor's degree or higher of those age>=25 income -- median family income, 1989 dollars farm -- farm population, % of total, 1990 democrat -- Percent votes cast for democratic president republican -- Percent votes cast for republican president Perot -- Percent votes cast for Ross Perot white -- Percent white, 1990 black -- Percent black, 1990 turnout -- 1992 votes for president / 1990 pop x 100
craig_slinkmanApr 12, 2011755KB2244
Pharmaceutical Sales (2018)
Code Book: Country (where products were sold) Sales Rep (Last Name) Gross Sales (U.S. $).
ctorresrApr 14, 201916KB135
Times World University Rankings (2011-2016)
This data comes from the annual Times magazine rankings of universities across the world. The webpage for the Times 2016 rankings is listed above in the source.
The formula for the 2016 rankings is as follows:
30% for Teaching Rating
7.5% for International Outlook Rating
30% for Research Rating
30% for Citations Rating
2.5% for Industry Income Rating.
The “Total Score” from 2016 can be recreated using this formula.

World_RankUniversity rank for a given year
University_NameThe name of the university
CountryLocation of university
Teaching_Rating Rating from a 0-100 scale of the quality of teaching at the university. This rating is based on the institution’s reputation for teaching, it’s student/staff ratio, it’s PhD’s/ undergraduate degrees awarded ratio, and it’s institutional income/ academic staff ratio.
Inter_Outlook_Rating Rating from a 0-100 scale of the international makeup of a university. This rating is based the international student percentage, international staff percentage, and the percentage of research papers from the university that include at least one international author.
Research_Rating Rating from a 0-100 scale of quality of research at the university. This rating is based on the university’s reputation, it’s research income/ academic staff ratio, and it’s production of scholarly papers.
Citations_Rating Rating from a 0-100 scale of based on the normalized average of citations by other papers per paper from the university (how often the research from the university is cited by other papers).
Industry_Income_Rating Rating from a 0-100 scale grading how much companies are willing to invest in the universities research. The rating is calculated based on the research income from businesses per academic staff member.
Total_ScoreThe final score used to determine the university ranking based on Teaching_Rating, International_Outlook_Rating, Research_Rating, Citations_Rating, and Industrial_Income_Rating.
Num_StudentsTotal number of students in a given year
Student/Staff_RatioNumber of students per academic staff member
%_Inter_StudentsPercentage of student body who come from a foreign county
%_Female_Students Percentage of student body that is female.
YearAcademic year that the ranking was released. For example, 2016 denotes the 2015-2016 academic year.
statcrunchhelpApr 5, 2016254KB3979
NFL Players 2016
This file lists the 2,764 NFL players for all team rosters as of July 22, 2016. Information includes jersey number, name, position, age, height (in inches), weight (in lbs), years in the NFL, college they graduated from, NFL team, position grouping (OL, QB, tailback, TE, WR, Front 7, DB, special teams), side of the football (offensive, defense or special teams), and their experience level by years played.
je175Jul 24, 2016192KB2280
Skyscrapers in the U.S.
Data for buildings in the United States that are 100 meters tall or higher. The variables include the rank in terms of height (Rank), the building name (Building), the height in meters (Height), the number of floors (Floors), the year of completion (Year), materials used in construction (Materials), and the use of the building (Use). The last two variables contain multiple outcomes delimited by /. When considering these columns, consider an outcomes table (Stat > Tables > Outcomes) with / as a delimiter.
websterwestJan 14, 2015148KB2556
Flight Delay Data For July 2014
This data set contains information on the flight delays for each airline at each U.S. airport in July of 2014. The columns include the carrier, airport city/state, airport code, airport name, total number of flights (Flights), the number of delayed flights (Delayed), the number of cancelled flights (Cancelled), the number of diverted flights (Diverted), the number of on-time flights (On-time), and the on-time percentage (On-time Percentage).
websterwestOct 3, 201488KB1765
U.S. House Candidates Fund Raising (In-District vs Out-of-District)
This data set contains each candidate's name, party affiliation, state/district of race, total funds raised, funds raised in-district, funds raised out-of-district, funds raised with no district information, percentage of funds raised in-district and percentage of funds raised out-of-district.
websterwestDec 2, 201454KB1441
Field Goal Data for the 2013 NFL Season
This data set contains information for every field goal attempted in the 2013 NFL season. The variables include the name of the team, the name of the kicker, the number of yards for the attempt, the outcome of the kick (Made, Missed or Blocked), the quarter of the game in which the attempt was made, the minutes/seconds left in the quarter and the number of points the kicking team was ahead before the kick (if negative the team was behind).
websterwestOct 10, 201443KB1514

1 2 3 4 5 6 7 8 9 10   >

Always Learning