Data Set/Description |
Owner |
Last edited |
Size |
Views |
Movie Budgets and Box Office Earnings (Updated Spring 2018)
This data all comes from the following website the tracks the financial performance of movies:
http://www.the-numbers.com/movie/budgets/all
The “Budget”, “Domestic Gross”, and “Worldwide Gross” columns each are in millions of dollars. | statcrunch_featured | Oct 4, 2018 | 270KB | 10681 |
Shark Attacks Worldwide
This data comes from www.sharkattackfile.net. It records data on all shark attacks in recorded history including attacks before 1800. Included is all known information on the shark attack including the date, location, information on the individual who was attacked, details on the injuries sustained by the victim, and the species of the shark | statcrunch_featured | Jun 28, 2017 | 1MB | 9450 |
National Longitudinal Youth Survey: Weight Perception
The Youth survey consists of a nationally representative sample of youths who were 14 to 20 years old as of December 31, 1999.
This dataset tracks the Age, Height (in inches), Weight (in pounds), Gender, and the self reported "How would you describe your weight?" multiple choice answers for the individuals. | statcrunch_featured | Nov 10, 2017 | 330KB | 6821 |
Flight Delay Data For July 2014
This data set contains information on the flight delays for each airline at each U.S. airport in July of 2014. The columns include the carrier, airport city/state, airport code, airport name, total number of flights (Flights), the number of delayed flights (Delayed), the number of cancelled flights (Cancelled), the number of diverted flights (Diverted), the number of on-time flights (On-time), and the on-time percentage (On-time Percentage). | statcrunch_featured | Jan 2, 2018 | 88KB | 5520 |
California Home Prices, 2009
This dataset is a collection of real estate listings from San Luis Obispo county, California, and some locations around it from 2009. The prices are their list price at the creation of this dataset. For more information about this data, go to the website source listed above. | statcrunch_featured | Apr 3, 2017 | 46KB | 6818 |
US Workforce Participation
This data primarily comes from two sources: Federal Reserve Bank of St. Louis and the US Bureau of Labor Statistics .
Column | Description | Year | The calendar year for each value | Annual Average Workforce Participation | Defined by the Bureau of Labor Statistics as "the percentage of the population [16 years and older] that is either employed or unemployed (that is, either working or actively seeking work). Note that 2015's Annual Average is calculated using the first 11 months." | Male Workforce Participation Rate | Annual workforce participation rate for males. | Female Workforce Participation Rate | Annual workforce participation rate for females. | Male Inactivity Rate Aged 25-54 | Defined as the proportion of the male population aged 25-54 that is not in the labour force. Common reasons for leaving labour force: college, retirement, stay at home, can't find work and no longer try. | Change in Rate (Male Inactivity Rate Aged 25-54) | The change in the inactivity rate calculated as the current year minus the previous year. | Female Inactivity Rate Aged 25-54 | Defined as the proportion of the female population aged 25-54 that is not in the labour force. | Change in Rate (Female Inactivity Rate Aged 25-54) | The change in the inactivity rate calculated as the current year minus the previous year. | Presidential Control | Political party of president. | Senate Control | Political party of the Senate majority | House Control | Political party of the House of Representatives majority. | Legislative Branch (House and Senate) | Combined control of Senate and House of Representativs |
| statcrunch_featured | Jun 27, 2017 | 10KB | 2372 |
2014 MLB Top 100 Batters
This data came from ESPN.com and has the top 100 batters by WAR (wins above replacement).
AB: At bats
R: Runs
H: Hits
2B: Doubles
3B: Triples
RBI: Runs batted in
SB: Stolen Bases
BB: Walks
SO: Strikeouts
AVG: Batting average
OBP: On Base Percentage
SLG: Slugging Percentage
OPS: OBP + SLG
WAR: Wins Above Replacement | statcrunch_featured | Apr 3, 2017 | 9KB | 3100 |
All MLB Salaries (1985-2015)
This data has all MLB player salaries between 1985-2015 including the team played for, the city, and a unique ID for each player. Total this includes 25,575 salaries for 4,963 different baseball players.
The player ID is the first 5 letters from the last name, followed by the first two letters from the first name, followed by a number in case of duplicate names. For example, bondsba01 stands for Barry Bonds with "01" because he's the first with the "bondsba" name ID. | statcrunch_featured | Jun 27, 2017 | 1MB | 4621 |
Roller Coasters Data
This dataset looks at some of the roller coasters across the US and various other countries.
Column | Description | Name | Name of roller coaster | Park | Amusement park for roller coaster | City | City for amusement park | State | State abbreviation | Country | Country of the roller coaster. US: United States, MX: Mexico, CR: Costa Rica, GT: Guatemala, CO: Columbia, VE: Venezuela, BR: Brazil, AR: Argentina, CL: Chile, EQ: Ecuador, PE: Peru, F: France, D: Germany | Type | S: Steel, W: Wood | Constructor | Type of build for the roller coaster | Height | Height in meters | Speed | Speed in kilometers per hour (km/h) | Length | Length in meters | Inversions | Yes if there are inversions, no if not | Duration | Duration of ride in seconds | GForce | Max g-force | Opened | Year it opened | Region | Geographic region for the roller coaster |
| statcrunch_featured | Apr 3, 2017 | 48KB | 5780 |
Cereal Brands
Data on several variable of different brands of cereal. Number of cases: 77 Variable Names: Name: Name of cereal mfr: Manufacturer of cereal where A = American Home Food Products; G = General Mills; K = Kelloggs; N = Nabisco; P = Post; Q = Quaker Oats; R = Ralston Purina type: cold or hot calories: calories per serving protein: grams of protein fat: grams of fat sodium: milligrams of sodium fiber: grams of dietary fiber carbo: grams of complex carbohydrates sugars: grams of sugars potass: milligrams of potassium vitamins: vitamins and minerals - 0, 25, or 100, indicating the typical percentage of FDA recommended shelf: display shelf (1, 2, or 3, counting from the floor) weight: weight in ounces of one serving cups: number of cups in one serving rating: a rating of the cereals | statcrunch_featured | Apr 3, 2017 | 4KB | 6481 |
Body Temperature
Data taken from the Journal of Statistics Education online data archive. That archive in turn got the data from an article in the Journal of the American Medical Association. (Mackowiak, et al., "A Critical Appraisal of 98.6 Degrees F …", vol. 268, pp. 1578-80, 1992).
"Body Temp" is measured in degrees fahrenheit
"Heart rate" is the resting beats per minute | statcrunch_featured | Jun 27, 2017 | 2KB | 12253 |
Marigold Fertilizer Comparison
Data represent the height in inches after 90 days of marigolds raised from seed in 3 treatment groups. | janet.schlaak | Sep 14, 2015 | 1KB | 266 | Singers
Heights of singers in the NY Choral Society in 1979. Self-report, to the nearest inch. Voice parts in order from highest pitch to lowest pitch are Soprano, Alto, Tenor, Bass. The first two are female voices and the last two are male voices. The original dataset included two divisions for each voice part. This dataset reports only soprano 1, alto 1, tenor 1, and bass 1 from the original dataset. Reference: Chambers, Cleveland, Kleiner, and Tukey. (1983). Graphical Methods for Data Analysis[Pooled t-test , ANOVA , Boxplot]
Variable | Description |
Soprano | Heights of sopranos (in inches) |
Alto | Heights of altos (in inches) |
Tenor | Heights of tenors (in inches) |
Bass | Heights of basses (in inches) |
| ds-231%sc | Aug 11, 2008 | 485B | 536 | Cancer Survival
Patients with advanced cancers of the stomach, bronchus, colon, ovary or breast were treated with ascorbate. The purpose of the study was to determine if the survival times differ with respect to the organ
affected by the cancer.
Number of cases: 64
Reference:Cameron, E. and Pauling, L. (1978) Supplemental ascorbate in the supportive treatment of cancer: re-evaluation of prolongation of survival times in terminal human cancer. Proceedings of the National Academy of Science USA, 75, 4538Ã4542. Also found in: Manly, B.F.J. (1986) Multivariate Statistical Methods: A Primer, New York: Chapman & Hall, 11. Also found in: Hand, D.J., et al. (1994) A Handbook of Small Data Sets, London: Chapman & Hall, 255.
[ANOVA , Boxplot , Transformation]
Variable | Description |
Survival | Survival time (in days?) |
Organ | Organ affected by the cancer |
| ds-231%sc | Aug 11, 2008 | 817B | 510 | CEO Salaries
Small companies were defined as those with annual sales greater than five and less than $350 million. Companies were ranked according to 5-year average return on investment. This data covers the first 60 ranked firms.
Reference: Forbes, November 8, 1993, "America's Best Small Companies,".
[Outlier , Histogram , Mean , Median , Boxplot , Distribution]
Variable | Description |
Age: | Age of chief executive officer
| Sal: | Salary of chief executive officer (including bonuses), $thousands |
| ds-231%sc | Aug 11, 2008 | 681B | 422 |
|