StatCrunch logo (home)

Data sets shared by StatCrunch members
Showing 1 to 15 of 183 data sets matching national
Data Set/Description Owner Last edited Size Views
Impaired Driving Death Rate by Age and Gender 2012 to 2014 All States
Rate of deaths by age/gender (per 100,000 population) for people killed in crashes involving a driver with BAC =>0.08%, 2012. 2012 Source: Fatality Analysis Reporting System (FARS)Note: Blank cells indicate data are suppressed. 2014 Source: Source: National Highway Traffic Administration's (NHTSA) Fatality Analysis Reporting System (FARS), 2014 Annual Report File. Fatality rates based on fewer than 20 deaths are suppressed.
lmcmath34Aug 19, 20196KB55
National Longitudinal Youth Survey: Weight Perception
The Youth survey consists of a nationally representative sample of youths who were 14 to 20 years old as of December 31, 1999.
This dataset tracks the Age, Height (in inches), Weight (in pounds), Gender, and the self reported "How would you describe your weight?" multiple choice answers for the individuals.
statcrunch_featuredNov 10, 2017330KB8395
FIFA World Cup Mens Players 2018
This data set records information for all 736 players for the 2018 FIFA World Cup. Included for each player is their national team (Team) along with their club team (Club).
statcrunch_featuredAug 1, 201863KB3904
Top 100 Retailers 2015
This dataset comes from the National Retail Federation and tracks the top retail chains in the US for 2015 based on their 2014 sales. The original data can be found at the webpage listed as the source. Note that these retailers include all sorts of avenues including internet sales.
statcrunch_featuredNov 10, 20178KB4702
All MLB Salaries (1985-2015)
This data has all MLB player salaries between 1985-2015 including the team played for, the city, and a unique ID for each player. Total this includes 25,575 salaries for 4,963 different baseball players.
The player ID is the first 5 letters from the last name, followed by the first two letters from the first name, followed by a number in case of duplicate names. For example, bondsba01 stands for Barry Bonds with "01" because he's the first with the "bondsba" name ID.
statcrunch_featuredJun 27, 20171MB5253
USDA Nutrition Data
This dataset has the nutritional values per serving size for a large variety of foods as calculated by the USDA.

US Department of Agriculture, Agricultural Research Service, Nutrient Data Laboratory. USDA National Nutrient Database for Standard Reference, Release 28. Version Current: September 2015. Internet: http://www.ars.usda.gov/nea/bhnrc/ndl
statcrunchhelpJan 13, 2016832KB1961
National Longitudinal Youth Survey
The Youth survey consists of a nationally representative sample of youths who were 14 to 20 years old as of December 31, 1999.
This dataset tracks the Age, Height (in inches), Weight (in pounds), Gender, and the self reported "How would you describe your weight?" multiple choice answers for the individuals.
statcrunchhelpMar 8, 2016330KB2034
Times World University Rankings (2011-2016)
This data comes from the annual Times magazine rankings of universities across the world. The webpage for the Times 2016 rankings is listed above in the source.
The formula for the 2016 rankings is as follows:
30% for Teaching Rating
7.5% for International Outlook Rating
30% for Research Rating
30% for Citations Rating
2.5% for Industry Income Rating.
The “Total Score” from 2016 can be recreated using this formula.

ColumnDescription
World_RankUniversity rank for a given year
University_NameThe name of the university
CountryLocation of university
Teaching_Rating Rating from a 0-100 scale of the quality of teaching at the university. This rating is based on the institution’s reputation for teaching, it’s student/staff ratio, it’s PhD’s/ undergraduate degrees awarded ratio, and it’s institutional income/ academic staff ratio.
Inter_Outlook_Rating Rating from a 0-100 scale of the international makeup of a university. This rating is based the international student percentage, international staff percentage, and the percentage of research papers from the university that include at least one international author.
Research_Rating Rating from a 0-100 scale of quality of research at the university. This rating is based on the university’s reputation, it’s research income/ academic staff ratio, and it’s production of scholarly papers.
Citations_Rating Rating from a 0-100 scale of based on the normalized average of citations by other papers per paper from the university (how often the research from the university is cited by other papers).
Industry_Income_Rating Rating from a 0-100 scale grading how much companies are willing to invest in the universities research. The rating is calculated based on the research income from businesses per academic staff member.
Total_ScoreThe final score used to determine the university ranking based on Teaching_Rating, International_Outlook_Rating, Research_Rating, Citations_Rating, and Industrial_Income_Rating.
Num_StudentsTotal number of students in a given year
Student/Staff_RatioNumber of students per academic staff member
%_Inter_StudentsPercentage of student body who come from a foreign county
%_Female_Students Percentage of student body that is female.
YearAcademic year that the ranking was released. For example, 2016 denotes the 2015-2016 academic year.
statcrunchhelpApr 5, 2016254KB4024
US News National University Rankings
Ranking of U.S. national universities in 2014. Variables include the ranking, university name, city, state, type (public or private), tuition in-state, tuition out-of-state, enrollment, acceptance rate, freshman retention rate and 6-year graduation rate.
websterwestSep 9, 201423KB2324
Top 100 Retailers 2015
This dataset comes from the National Retail Federation and tracks the top retail chains in the US for 2015 based on their 2014 sales. The original data can be found at the webpage listed as the source. Note that these retailer include all sorts of avenues including internet sales.
statcrunchhelpMar 14, 20167KB4540
Baseball2013.xlsx
Stats from the major league baseball teams for 2013. The last column I added denotes AL for American League and NL for National League. One could possibly conduct a two-sample means test, for example, to find out whether the average runs for the two leagues are equal. Or there are of course lots of regressions one could run.
eykoloNov 4, 20133KB2060
oldfaith.xls
The data in the Old Faithful file gives data about eruptions of the Old Faithful Geyser during October 1980. Variables are Duration in seconds of the current eruption, and Interval, the time to the next eruption. Old Faithful is an important tourist attraction, with up to a thousand people watching it erupt on pleasant summer days. The National Park Service uses data to obtain a prediction of the time to the next eruption.
craig_slinkmanMay 4, 20102KB2505
Acid rain data
The EPA states that any area where the pH of rain is less than 5.6 on average has an acid rain problem. The pH values collected over 90 rain falls at Shenandoah National Park are given in this data set.
websterwestJul 12, 2007564B1800
Deflategate data
After the AFC championship game on January 18th, 2015, there was a controversy about whether the New England Patriots might have deflated their footballs during the game to gain an advantage. During the game with the Indianapolis Colts, two officials, Blakeman and Prioleau, measured the PSI in 11 balls from the Patriots and 4 balls from the Colts. These measurements are included in this data set. The Patriots balls are labeled with a "P" prefix, and the Colts balls are labeled with a "C" prefix. The official rules of the National Football League require footballs to be inflated to between 12.5 and 13.5 PSI. Does this data set support the deflation claim?
websterwestMay 8, 2015375B1770
Annual Newspaper Ad Expenditures
Annual expenditures on newspaper advertisements broken down into national, retail, classified and online categories. The national, retail and classified categories are all print ads.
websterwestSep 21, 2011252B1947

1 2 3 4 5 6 7 8 9 10   >

Always Learning