StatCrunch logo (home)

Data sets shared by StatCrunch members
Showing 1 to 15 of 225 data sets matching DIE
Data Set/Description Owner Last edited Size Views
US Presidential Data
This data set gives information on the US Presidents from 1789-2019. The following data is included: College, Religion, Number of Children, Age at Inauguration, Year of Inauguration, Date at Death, Political Party, and Occupation

This data set was originally uploaded to StatCrunch via the khoffman2014 user.
statcrunch_featuredNov 13, 20193KB305
New York City Leading Causes of Death (2007-2014)
This data set breaks down the leading causes of death in New York City between 2007-2014. Included is the number of Deaths (Deaths) for each combination of Sex and Race Ethnicity. The Death Rate represents the rate within that Sex/ Race Ethnicity category. Age Adjusted Death Rate adjusts the Death Rate by the ages of those who died.
statcrunch_featuredAug 1, 201896KB5473
Cereal Brands
Data on several variable of different brands of cereal. Number of cases: 77 Variable Names: Name: Name of cereal mfr: Manufacturer of cereal where A = American Home Food Products; G = General Mills; K = Kelloggs; N = Nabisco; P = Post; Q = Quaker Oats; R = Ralston Purina type: cold or hot calories: calories per serving protein: grams of protein fat: grams of fat sodium: milligrams of sodium fiber: grams of dietary fiber carbo: grams of complex carbohydrates sugars: grams of sugars potass: milligrams of potassium vitamins: vitamins and minerals - 0, 25, or 100, indicating the typical percentage of FDA recommended shelf: display shelf (1, 2, or 3, counting from the floor) weight: weight in ounces of one serving cups: number of cups in one serving rating: a rating of the cereals
statcrunch_featuredApr 3, 20174KB9152
USDA Nutrition Data
This dataset has the nutritional values per serving size for a large variety of foods as calculated by the USDA.

US Department of Agriculture, Agricultural Research Service, Nutrient Data Laboratory. USDA National Nutrient Database for Standard Reference, Release 28. Version Current: September 2015. Internet:
statcrunchhelpJan 13, 2016832KB2159
Wimbledon Ladies Singles Champions
Names and Nationalities of the Wimbledon Ladies Singles Champions each year since the beginning of the Open Era
groomdjSep 20, 20192KB292
Responses to Soda survey
A recent survey asked respondents what percentage of their daily fluid intake is soda. It also asked if they drink diet soda, their gender, and their age.

Check out the original survey here:

Feel free to copy it for your own use

scsurveyFeb 14, 20115KB3400
Diamond Ring Prices.xls
The source of the data is a full page advertisement placed in the Straits Times newspaper issue of February 29, 1992, by a Singapore-based retailer of diamond jewelry. The advertisement contained pictures of diamond rings and listed their prices, diamond content, and gold purity. Only 20K ladies' rings, each mounted with a single diamond stone, were considered for this study. 20K rings are made with gold of 20 carat purity. (Pure gold is rated as 24K.) There were 48 such rings of varying designs. The weights of the diamond stones ranged from 0.12 to 0.35 carats (a one carat diamond stone weighs 0.2 gram) and were priced between $223 and $1086. The jewelry store adopted a fixed-price policy. How Is Jewelry Priced? In Singapore, the pricing of gold jewelry is simple. The price equals the current market value of the gold content (i.e., weight times the going rate per gram of gold) plus a craftsmanship fee. However, the pricing of other jewelry like diamond rings is more complicated because they are not as standardized as gold jewelry. The price of diamond jewelry depends on the four C's: caratage, cut, colour, and clarity of the diamond stone. A good cut gives a diamond more sparkle. Colourless diamonds are the most prized. A flawless diamond has maximum clarity because the passage of light is unimpeded through the stone. Cut, colour, and clarity are subjective factors and are very hard for the layman to gauge.
craig_slinkmanApr 22, 2010586B2906
Pie Data
Contains name of pie, pie's country of origin, type of pie (savory, sweet, etc.), description of the pie, Baking Time (in minutes), Number of Ingredients, and Calories (per serving).
kelceyMar 11, 201420KB975
HeartAttack Data from 1993
Heart Attack Patients This set of data is all of the hospital discharges in New York State with an admitting diagnosis of an Acute Myocardial Infarction (AMI), also called a heart attack, who did not have surgery, in the year 1993. There are 12,844 cases. AGE gives age in years SEX is coded M for males F for females DIAGNOSIS is in the form of an International Classification of Diseases, 9th Edition, Clinical Modification code. These tell which part of the heart was affected. DRG is the Diagnosis Related Group. It groups together patients with similar management. In this data set there are just three different drgs. 121 for AMIs with cardiovascular complications who did not die. 122 for AMIs without cardiovascular complications who did not die. 123 for AMIs where the patient died. LOS gives the hospital length of stay in days. DIED has a 1 for patients who died in hospital and a 0 otherwise. CHARGES gives the total hospital charges in dollars. The SEX1 column converts F/M to 0/1 Data provided by Health Process Management of Doylestown, PA.
cdcummings12Aug 23, 2009412KB3706
Week 6 Group Quiz: Elderly Health Care Consumption
dlozimekJul 30, 2014192KB1331
Recent Weekly Gas Prices (Stacked)
Weekly Retail Gasoline and Diesel Prices 09:14:16 GMT-0400 (Eastern Daylight Time) Source: U.S. Energy Information Administration Stacked for use with two-way ANOVA
sbroadJun 27, 20122KB1064
Cereal nutrition
Name: Name of cereal, Manu: Manufacturer of cereal, Target: Target audience for cereal (adult, child), Shelf: Display shelf at the grocery store, Calories: Calories per serving, Carbs: Grams of complex carbohydrates in one serving, Fat: Grams of fat in one serving, Fiber: Grams of dietary fiber in one serving, Potassium: Milligrams of potassium in one serving

Protein: Grams of protein in one serving, Sodium: Milligrams of sodium in one serving, Sugars: Grams of sugars in one serving, Vitamins: Vitamins and minerals - 0, 25, or 100% of daily need in one serving, CRRating: Consumer Report rating, Cups: Number of cups in one serving, Weight: Weight in ounces of one serving

cdcummings12Sep 15, 20112KB3506
This is a small data set used to illustrate the failure of an inappropriate use of an independent means test compared with a paired test on the same data. The story is that we have before and after weights for 6 customers of a weight loss clinic. Visual observation makes it clear that the clinic is effective (except in one questionable case). Students can discuss what sources there are for the variation found in the data set and relate them to the assumptions of the independent versus paired analysis models. Application of classical techniques will produce an extremely large p_value for the independent analysis and a significant p_value for the paired analysis. To illustrate the difference with simulation techniques, first do a randomization for two means between the before and after data groups. This will spectacularly fail to show a difference, when in fact there is a clear difference. Then use a bootstrap to examine the 6 differences and it is clear that a zero difference is highly unlikely.
david.zeitlerMay 19, 201187B1954
This data was a pilot study of health outcomes related to domestic violence. Women from domestic violence shelters were asked about the severity of symptoms experienced in the last year. Women who were not abused were often workers at domestic violence shelters. The severity of the emotional, sexual, and physical violence was categorized into four groups 0 - no abuse, 1=least abuse, 2=middle level, and 3 = most. Abused is a dummy for whether abuse occurred. The main outcome variables are dummy variables for whether the woman experienced the health malady often in the last year. sxllhead=severe headaches, sxlinsom=insomnia, sxlchest=chest pain, sxlpelv=pelvic pain, sxlstom=stomach pain, sxlchok=sensation of choking, sxlbrea=shortness of breath,sxlvag=vaginal infection, sxlfat = fatigue. The women were age 18 - 48. Age25-34 is a dummy variable for age 25-34 and age35p is a dummy for ages 35 - 48. The variables least, middle, and most are dummy variables for level of violence. [Note: there are no easy labels for severity of violence. The label "Least" in no way implies that such a level is not important nor does it imply that there are no serious negative consequences on the life of the women experiencing the violence.] Creating contingency tables of abuse status vs. the health maladies is excellent practice for conditional probability and calculating a prevalence ratio to compare the two probabilities.
jph422Oct 15, 20086KB801
Low Birth Weight Study
SOURCE: Hosmer and Lemeshow (2000) Applied Logistic Regression: Second Edition Data were collected at Baystate Medical Center, Springfield, Massachusetts during 1986. DESCRIPTIVE ABSTRACT: The goal of this study was to identify risk factors associated with giving birth to a low birth weight baby (weighing less than 2500 grams). Data were collected on 189 women, 59 of which had low birth weight babies and 130 of which had normal birth weight babies. Four variables which were thought to be of importance were age, weight of the subject at her last menstrual period, race, and the number of physician visits during the first trimester of pregnancy. LIST OF VARIABLES: Columns Variable Abbreviation ----------------------------------------------------------------------------- 2-4 Identification Code ID 10 Low Birth Weight (0 = Birth Weight >= 2500g, LOW 1 = Birth Weight < 2500g) 17-18 Age of the Mother in Years AGE 23-25 Weight in Pounds at the Last Menstrual Period LWT 32 Race (1 = White, 2 = Black, 3 = Other) RACE 40 Smoking Status During Pregnancy (1 = Yes, 0 = No) SMOKE 48 History of Premature Labor (0 = None 1 = One, etc.) PTL 55 History of Hypertension (1 = Yes, 0 = No) HT 61 Presence of Uterine Irritability (1 = Yes, 0 = No) UI 67 Number of Physician Visits During the First Trimester FTV (0 = None, 1 = One, 2 = Two, etc.) 73-76 Birth Weight in Grams BWT ----------------------------------------------------------------------------- PEDAGOGICAL NOTES: These data have been used as an example of fitting a multiple logistic regression model. STORY BEHIND THE DATA: Low birth weight is an outcome that has been of concern to physicians for years. This is due to the fact that infant mortality rates and birth defect rates are very high for low birth weight babies. A woman's behavior during pregnancy (including diet, smoking habits, and receiving prenatal care) can greatly alter the chances of carrying the baby to term and, consequently, of delivering a baby of normal birth weight. The variables identified in the code sheet given in the table have been shown to be associated with low birth weight in the obstetrical literature. The goal of the current study was to ascertain if these variables were important in the population being served by the medical center where the data were collected. References: 1. Hosmer and Lemeshow, Applied Logistic Regression, Wiley, (1989).
wikipetersonJul 23, 20126KB7971

1 2 3 4 5 6 7 8 9 10   >

Always Learning