StatCrunch logo (home)

Data sets shared by StatCrunch members
Showing 1 to 15 of 218 data sets matching DIE
Data Set/Description Owner Last edited Size Views
Fictional Course Midterm Scores (2017)
Code Book: Degree Master of Arts 0 Undergraduate 1 Subject Bio 0 Chem 1 Chinese 2 English 3 Math 4 Music 5 Social Studies 6 Dance: 7 Grades PreK-12 0 7th-12th 1
ctorresrApr 14, 2019477B83
Fatal Encounters Updated September 2018
This data set was downloaded from Fatal Encounters, a non-profit organization that is collecting data on Police Involved Deaths. This data set has been truncated to include the subject's name, age at time of death, subject's gender, subjects race, location of death, cause of death and year of death. This does not only include people shot by police, but there are also instances of police that died during fatal encounters. It is good to remind students using this data set that this is a volunteer agency collecting the data from people that are scouring news articles for evidence of these fatal encounters, thus it is not a complete population of fatal encounters, only a very, very large sample.
habarkerApr 8, 20193MB117
New York City Leading Causes of Death (2007-2014)
This data set breaks down the leading causes of death in New York City between 2007-2014. Included is the number of Deaths (Deaths) for each combination of Sex and Race Ethnicity. The Death Rate represents the rate within that Sex/ Race Ethnicity category. Age Adjusted Death Rate adjusts the Death Rate by the ages of those who died.
statcrunch_featuredAug 1, 201896KB3909
Cereal Brands
Data on several variable of different brands of cereal. Number of cases: 77 Variable Names: Name: Name of cereal mfr: Manufacturer of cereal where A = American Home Food Products; G = General Mills; K = Kelloggs; N = Nabisco; P = Post; Q = Quaker Oats; R = Ralston Purina type: cold or hot calories: calories per serving protein: grams of protein fat: grams of fat sodium: milligrams of sodium fiber: grams of dietary fiber carbo: grams of complex carbohydrates sugars: grams of sugars potass: milligrams of potassium vitamins: vitamins and minerals - 0, 25, or 100, indicating the typical percentage of FDA recommended shelf: display shelf (1, 2, or 3, counting from the floor) weight: weight in ounces of one serving cups: number of cups in one serving rating: a rating of the cereals
statcrunch_featuredApr 3, 20174KB7230
Wimbledon Ladies Singles Champions
Names and Nationalities of the Wimbledon Ladies Singles Champions each year since the beginning of the Open Era
groomdjFeb 22, 20192KB112
USDA Nutrition Data
This dataset has the nutritional values per serving size for a large variety of foods as calculated by the USDA.

US Department of Agriculture, Agricultural Research Service, Nutrient Data Laboratory. USDA National Nutrient Database for Standard Reference, Release 28. Version Current: September 2015. Internet:
statcrunchhelpJan 13, 2016832KB1793
Responses to Soda survey
A recent survey asked respondents what percentage of their daily fluid intake is soda. It also asked if they drink diet soda, their gender, and their age.

Check out the original survey here:

Feel free to copy it for your own use

scsurveyFeb 14, 20115KB3178
Diamond Ring Prices.xls
The source of the data is a full page advertisement placed in the Straits Times newspaper issue of February 29, 1992, by a Singapore-based retailer of diamond jewelry. The advertisement contained pictures of diamond rings and listed their prices, diamond content, and gold purity. Only 20K ladies' rings, each mounted with a single diamond stone, were considered for this study. 20K rings are made with gold of 20 carat purity. (Pure gold is rated as 24K.) There were 48 such rings of varying designs. The weights of the diamond stones ranged from 0.12 to 0.35 carats (a one carat diamond stone weighs 0.2 gram) and were priced between $223 and $1086. The jewelry store adopted a fixed-price policy. How Is Jewelry Priced? In Singapore, the pricing of gold jewelry is simple. The price equals the current market value of the gold content (i.e., weight times the going rate per gram of gold) plus a craftsmanship fee. However, the pricing of other jewelry like diamond rings is more complicated because they are not as standardized as gold jewelry. The price of diamond jewelry depends on the four C's: caratage, cut, colour, and clarity of the diamond stone. A good cut gives a diamond more sparkle. Colourless diamonds are the most prized. A flawless diamond has maximum clarity because the passage of light is unimpeded through the stone. Cut, colour, and clarity are subjective factors and are very hard for the layman to gauge.
craig_slinkmanApr 22, 2010586B2614
HeartAttack Data from 1993
Heart Attack Patients This set of data is all of the hospital discharges in New York State with an admitting diagnosis of an Acute Myocardial Infarction (AMI), also called a heart attack, who did not have surgery, in the year 1993. There are 12,844 cases. AGE gives age in years SEX is coded M for males F for females DIAGNOSIS is in the form of an International Classification of Diseases, 9th Edition, Clinical Modification code. These tell which part of the heart was affected. DRG is the Diagnosis Related Group. It groups together patients with similar management. In this data set there are just three different drgs. 121 for AMIs with cardiovascular complications who did not die. 122 for AMIs without cardiovascular complications who did not die. 123 for AMIs where the patient died. LOS gives the hospital length of stay in days. DIED has a 1 for patients who died in hospital and a 0 otherwise. CHARGES gives the total hospital charges in dollars. The SEX1 column converts F/M to 0/1 Data provided by Health Process Management of Doylestown, PA.
cdcummings12Aug 23, 2009412KB3205
Week 6 Group Quiz: Elderly Health Care Consumption
dlozimekJul 30, 2014192KB1307
Cereal nutrition
Name: Name of cereal, Manu: Manufacturer of cereal, Target: Target audience for cereal (adult, child), Shelf: Display shelf at the grocery store, Calories: Calories per serving, Carbs: Grams of complex carbohydrates in one serving, Fat: Grams of fat in one serving, Fiber: Grams of dietary fiber in one serving, Potassium: Milligrams of potassium in one serving

Protein: Grams of protein in one serving, Sodium: Milligrams of sodium in one serving, Sugars: Grams of sugars in one serving, Vitamins: Vitamins and minerals - 0, 25, or 100% of daily need in one serving, CRRating: Consumer Report rating, Cups: Number of cups in one serving, Weight: Weight in ounces of one serving

cdcummings12Sep 15, 20112KB3236
This is a small data set used to illustrate the failure of an inappropriate use of an independent means test compared with a paired test on the same data. The story is that we have before and after weights for 6 customers of a weight loss clinic. Visual observation makes it clear that the clinic is effective (except in one questionable case). Students can discuss what sources there are for the variation found in the data set and relate them to the assumptions of the independent versus paired analysis models. Application of classical techniques will produce an extremely large p_value for the independent analysis and a significant p_value for the paired analysis. To illustrate the difference with simulation techniques, first do a randomization for two means between the before and after data groups. This will spectacularly fail to show a difference, when in fact there is a clear difference. Then use a bootstrap to examine the 6 differences and it is clear that a zero difference is highly unlikely.
david.zeitlerMay 19, 201187B1822
Recent Weekly Gas Prices (Stacked)
Weekly Retail Gasoline and Diesel Prices 09:14:16 GMT-0400 (Eastern Daylight Time) Source: U.S. Energy Information Administration Stacked for use with two-way ANOVA
sbroadJun 27, 20122KB938
Pie Data
Contains name of pie, pie's country of origin, type of pie (savory, sweet, etc.), description of the pie, Baking Time (in minutes), Number of Ingredients, and Calories (per serving).
kelceyMar 11, 201420KB811
Low Birth Weight Study
SOURCE: Hosmer and Lemeshow (2000) Applied Logistic Regression: Second Edition Data were collected at Baystate Medical Center, Springfield, Massachusetts during 1986. DESCRIPTIVE ABSTRACT: The goal of this study was to identify risk factors associated with giving birth to a low birth weight baby (weighing less than 2500 grams). Data were collected on 189 women, 59 of which had low birth weight babies and 130 of which had normal birth weight babies. Four variables which were thought to be of importance were age, weight of the subject at her last menstrual period, race, and the number of physician visits during the first trimester of pregnancy. LIST OF VARIABLES: Columns Variable Abbreviation ----------------------------------------------------------------------------- 2-4 Identification Code ID 10 Low Birth Weight (0 = Birth Weight >= 2500g, LOW 1 = Birth Weight < 2500g) 17-18 Age of the Mother in Years AGE 23-25 Weight in Pounds at the Last Menstrual Period LWT 32 Race (1 = White, 2 = Black, 3 = Other) RACE 40 Smoking Status During Pregnancy (1 = Yes, 0 = No) SMOKE 48 History of Premature Labor (0 = None 1 = One, etc.) PTL 55 History of Hypertension (1 = Yes, 0 = No) HT 61 Presence of Uterine Irritability (1 = Yes, 0 = No) UI 67 Number of Physician Visits During the First Trimester FTV (0 = None, 1 = One, 2 = Two, etc.) 73-76 Birth Weight in Grams BWT ----------------------------------------------------------------------------- PEDAGOGICAL NOTES: These data have been used as an example of fitting a multiple logistic regression model. STORY BEHIND THE DATA: Low birth weight is an outcome that has been of concern to physicians for years. This is due to the fact that infant mortality rates and birth defect rates are very high for low birth weight babies. A woman's behavior during pregnancy (including diet, smoking habits, and receiving prenatal care) can greatly alter the chances of carrying the baby to term and, consequently, of delivering a baby of normal birth weight. The variables identified in the code sheet given in the table have been shown to be associated with low birth weight in the obstetrical literature. The goal of the current study was to ascertain if these variables were important in the population being served by the medical center where the data were collected. References: 1. Hosmer and Lemeshow, Applied Logistic Regression, Wiley, (1989).
wikipetersonJul 23, 20126KB7544

1 2 3 4 5 6 7 8 9 10   >

Always Learning