StatCrunch logo (home)

Data sets shared by StatCrunch members
Showing 1 to 15 of 32 data sets matching release
Data Set/Description Owner Last edited Size Views
Movie Budgets and Box Office Earnings (Updated Spring 2018)
This data all comes from the following website the tracks the financial performance of movies:
http://www.the-numbers.com/movie/budgets/all

The “Budget”, “Domestic Gross”, and “Worldwide Gross” columns each are in millions of dollars.

statcrunch_featuredOct 4, 2018270KB9104
Criminal Recidivism in Iowa: 2010-2014
Recidivism is defined as the "tendency of a convicted criminal to reoffend". This dataset tracks former criminals from Iowa over a 3 year period after their release from prison to see whether or not they were convicted of a new crime during that time. The recidivism reporting year is the fiscal year (year ending June 30) marking the end of the three year tracking period. Included are the following variables: Fiscal Year Released (the year the individual was released from Prison), the Race, Ethnicity, Sex, and Age of individual when released. Also included are details about the original crime committed along with whether that individual committed a new crime (Recidivism - Return to Prison) within the 3 year window.
statcrunch_featuredMar 21, 20183MB2794
US Workforce Participation
This data primarily comes from two sources: Federal Reserve Bank of St. Louis and the US Bureau of Labor Statistics .
ColumnDescription
YearThe calendar year for each value
Annual Average Workforce ParticipationDefined by the Bureau of Labor Statistics as "the percentage of the population [16 years and older] that is either employed or unemployed (that is, either working or actively seeking work). Note that 2015's Annual Average is calculated using the first 11 months."
Male Workforce Participation RateAnnual workforce participation rate for males.
Female Workforce Participation RateAnnual workforce participation rate for females.
Male Inactivity Rate Aged 25-54Defined as the proportion of the male population aged 25-54 that is not in the labour force. Common reasons for leaving labour force: college, retirement, stay at home, can't find work and no longer try.
Change in Rate (Male Inactivity Rate Aged 25-54)The change in the inactivity rate calculated as the current year minus the previous year.
Female Inactivity Rate Aged 25-54Defined as the proportion of the female population aged 25-54 that is not in the labour force.
Change in Rate (Female Inactivity Rate Aged 25-54)The change in the inactivity rate calculated as the current year minus the previous year.
Presidential ControlPolitical party of president.
Senate ControlPolitical party of the Senate majority
House ControlPolitical party of the House of Representatives majority.
Legislative Branch (House and Senate)Combined control of Senate and House of Representativs
statcrunch_featuredJun 27, 201710KB2204
USDA Nutrition Data
This dataset has the nutritional values per serving size for a large variety of foods as calculated by the USDA.

US Department of Agriculture, Agricultural Research Service, Nutrient Data Laboratory. USDA National Nutrient Database for Standard Reference, Release 28. Version Current: September 2015. Internet: http://www.ars.usda.gov/nea/bhnrc/ndl
statcrunchhelpJan 13, 2016832KB1593
Times World University Rankings (2011-2016)
This data comes from the annual Times magazine rankings of universities across the world. The webpage for the Times 2016 rankings is listed above in the source.
The formula for the 2016 rankings is as follows:
30% for Teaching Rating
7.5% for International Outlook Rating
30% for Research Rating
30% for Citations Rating
2.5% for Industry Income Rating.
The “Total Score” from 2016 can be recreated using this formula.

ColumnDescription
World_RankUniversity rank for a given year
University_NameThe name of the university
CountryLocation of university
Teaching_Rating Rating from a 0-100 scale of the quality of teaching at the university. This rating is based on the institution’s reputation for teaching, it’s student/staff ratio, it’s PhD’s/ undergraduate degrees awarded ratio, and it’s institutional income/ academic staff ratio.
Inter_Outlook_Rating Rating from a 0-100 scale of the international makeup of a university. This rating is based the international student percentage, international staff percentage, and the percentage of research papers from the university that include at least one international author.
Research_Rating Rating from a 0-100 scale of quality of research at the university. This rating is based on the university’s reputation, it’s research income/ academic staff ratio, and it’s production of scholarly papers.
Citations_Rating Rating from a 0-100 scale of based on the normalized average of citations by other papers per paper from the university (how often the research from the university is cited by other papers).
Industry_Income_Rating Rating from a 0-100 scale grading how much companies are willing to invest in the universities research. The rating is calculated based on the research income from businesses per academic staff member.
Total_ScoreThe final score used to determine the university ranking based on Teaching_Rating, International_Outlook_Rating, Research_Rating, Citations_Rating, and Industrial_Income_Rating.
Num_StudentsTotal number of students in a given year
Student/Staff_RatioNumber of students per academic staff member
%_Inter_StudentsPercentage of student body who come from a foreign county
%_Female_Students Percentage of student body that is female.
YearAcademic year that the ranking was released. For example, 2016 denotes the 2015-2016 academic year.
statcrunchhelpApr 5, 2016254KB3701
Marvel vs. DC at the Box Office
This data set contains information on how the two comic book companies have faired at the box office. Each movie from both Marvel and DC is listed with name of the film and release date. The Domestic and Foreign gross of each film is provided in millions of dollars along with the total Worldwide gross. The Adjusted column modifies this total for inflation.
websterwestAug 23, 20144KB2512
Rock'n'Roll Hall of Fame
This dataset has information on a selected group of members of The Rock'n'Roll Hall of Fame, including information on the number of people in the group, if the group had a female member, if the person/group was a double inductee into the Hall, how many studio albums they had, the number of #1 hits, the number of top 40 hits, how many music videos they had (this needs work) and the year of release of their first album.
jpalmateerMay 27, 20163KB1514
US Emissions of Greenhouse Gases Based on Global Warming Potential 1990-2007 Energy Information Administration.xls
U.S. Emissions of Greenhouse Gases, Based on Global Warming Potential, 1990-2007 Units are Million Metric Tons of Carbon Dioxide Equivalent Report #: DOE/EIA-0573(2007) Released Date: December 3, 2008   Next Release Date: November 2009 P = Preliminary Note: Data in this table are revised from the data contained in the previous EIA report, Emissions of Greenhouse Gases in the United States 2006, DOE/EIA-0573(2006) (Washington, DC, November 2007). Sources: Emissions of carbon dioxide, methane and nitrous oxide EIA. Emissions of HFCs, PFCs, and SF6, U.S. Environmental Protection Agency, preliminary data. Global Warming Potentials: United Nations, Intergovernmental Panel on Climate Change, Climate Change 2007 - The Physical Science Basis (Cambridge, UK: Cambridge University Press, 2007)
deathbysteveoMay 26, 2009813B2405
AP Statistics Predictions 2013-16
GPA = Student's Weighted GPA before beginning AP Statistics PrevMath = The highest math course the student completed at our school prior to AP Stats AP.Ave = The student's average score on the AP exams taken (if available) MathGPA = Unweighted GPA of student's work in math courses MT.MC = Students number correct (out of 40) on the multiple choice section of their midterm (MT) MT.Raw = Student's raw score (out of 100) on the multiple choice and free response sections of a previously released AP exam Locus.Aug = Student's score (out of 100) on the LOCUS diagnostic test in the beginning of the school year S1P = Student's first semester grade as a percentage S1G = Student's first semester letter grade S1F = Student's (scaled) first semester final exam grade (a.k.a. midterm test grade) S2P = Student's second semester grade as a percentage S2G = Student's second semester letter grade Ch 1-4 = Student's raw test average on ch. 1-4 Ch 1-6 = Student's raw test average on ch. 1-6 Ch 1-8 = Students raw test average on ch. 1-8 MT = Student's raw test average on the midterm Ch 1-12=Student's raw test average on ch. 1-12 (entire textbook) Mock 1 = Student's raw score on first mock exam (mid-March) Mock 2 = Student's raw score on second mock exam (late April) Mock 1&2 = Student's average on two mock exams MT&Mock1&2 = Student's average on midterm and two mock exams MT.AP = Student's converted score (1-5) on midterm Mocks.AP = Student's converted score (1-5) on average of two mock exams MT&Mocks.AP = Student's converted score (1-5) on average of MT and two mock exams ACTUAL = student's actual performance on AP exam (blank means student opted out of taking exam) MT.Resid = Actual score - Midterm score Mocks.Resid = Actual score - average Mock exam score MT&Mocks.Resid = Actual score - average midterm and mock exam score
je175Jul 5, 20169KB1939
IMDB Movie Ratings
This data set contains the title, original year of production, the number of user votes, and the average user rating for a number of movies listed at imdb.com. The data set also contains the source of the content, which unless otherwise noted is a film released for theaters. The data set does contain a number of items made for television including awards shows and also movies that went straight to a video release. The data set is restricted to relatively popular movies obtaining at least 500 votes. It was created using data that was current as of May 28th, 2014 and revised to remove video games on June 2nd, 2014.
websterwestSep 29, 20141MB1174
2010 Movie Revenue
This dataset has the revenues for each movie released in 2010. Toy Story 3 tops the charts -
cecil_collegeJan 22, 201135KB5782
US Workforce Participation
This data primarily comes from two sources: Federal Reserve Bank of St. Louis and the US Bureau of Labor Statistics .
ColumnDescription
YearThe calendar year for each value
Annual Average Workforce ParticipationDefined by the Bureau of Labor Statistics as "the percentage of the population [16 years and older] that is either employed or unemployed (that is, either working or actively seeking work). Note that 2015's Annual Average is calculated using the first 11 months."
Male Workforce Participation RateAnnual workforce participation rate for males.
Female Workforce Participation RateAnnual workforce participation rate for females.
Male Inactivity Rate Aged 25-54Defined as the proportion of the male population aged 25-54 that is not in the labour force. Common reasons for leaving labour force: college, retirement, stay at home, can't find work and no longer try.
Change in Rate (Male Inactivity Rate Aged 25-54)The change in the inactivity rate calculated as the current year minus the previous year.
Female Inactivity Rate Aged 25-54Defined as the proportion of the female population aged 25-54 that is not in the labour force.
Change in Rate (Female Inactivity Rate Aged 25-54)The change in the inactivity rate calculated as the current year minus the previous year.
Presidential ControlPolitical party of president.
Senate ControlPolitical party of the Senate majority
House ControlPolitical party of the House of Representatives majority.
Legislative Branch (House and Senate)Combined control of Senate and House of Representativs
statcrunchhelpJan 7, 201610KB744
Annual Movie Data 2008 Random Sampling.txt
This data is a random sampling of movies that played in theaters in 2008. It includes movies released in previous years that earned money during 2008. For example, a movie released over Thanksgiving in 2007 will most likely earn money in 2007 and 2008. Each box office year ends on the first Sunday of the following year. The next year starts the following day (Monday). For example, the "2004 box office year" ended on Sunday, January 2, 2005. Inflation-adjusted figures are based ticket sale estimates, and may not be precise due to rounding errors.
wikipetersonOct 7, 20098KB472
greenhouse-transposed.xls
TRANSPOSED U.S. Emissions of Greenhouse Gases, Based on Global Warming Potential, 1990-2007 Units are Million Metric Tons of Carbon Dioxide Equivalent Report #: DOE/EIA-0573(2007) Released Date: December 3, 2008 Next Release Date: November 2009 P = Preliminary Note: Data in this table are revised from the data contained in the previous EIA report, Emissions of Greenhouse Gases in the United States 2006, DOE/EIA-0573(2006) (Washington, DC, November 2007). Sources: Emissions of carbon dioxide, methane and nitrous oxide EIA. Emissions of HFCs, PFCs, and SF6, U.S. Environmental Protection Agency, preliminary data. Global Warming Potentials: United Nations, Intergovernmental Panel on Climate Change, Climate Change 2007 - The Physical Science Basis (Cambridge, UK: Cambridge University Press, 2007)
deathbysteveoMay 26, 2009809B464
Annual Movie Data 2008.txt
This chart ranks movies by the amount they earned during 2008. It includes movies released in previous years that earned money during 2008. For example, a movie released over Thanksgiving in 2007 will most likely earn money in 2007 and 2008. Each box office year ends on the first Sunday of the following year. The next year starts the following day (Monday). For example, the "2004 box office year" ended on Sunday, January 2, 2005. Inflation-adjusted figures are based ticket sale estimates, and may not be precise due to rounding errors.
wikipetersonOct 7, 200970KB454

1 2 3   >

Always Learning