StatCrunch logo (home)

Data sets shared by StatCrunch members
Showing 136 to 150 of 37995 data sets
Data Set/Description Owner Last edited Size Views
North Carolina premature births
A Random Sample of 1000 births from the state of North Carolina. Plurarility refers to the number of children associated with the birth. Gender 1=Male, 2=Female. fage is age of father (years), mage is age of mother (years), visits is number of pre-natal medical visits, marital is 1=married, 2=unmarried, racemom is Race of Mother (0=Other Non-white, 1=White, 2=Black 3=American Indian, 4=Chinese, 5=Japanese, 6=Hawaiian, 7=Filipino, 8=Other Asian or Pacific Islander), hispmom is whether mother is of Hispanic origin (C=Cuban, M=Mexican, N=Non-Hispanic, O=Other and Unknown Hispanic, P=Puerto Rican, S=Central/South American, U=Not Classifiable), gained is weight gain during pregnancy (pounds), lowbw is if birth weight is 2500 grams or lower, tpounds is birthweight in pounds, smoke is 0=no, 1=yes for mother admitted to smoking, mature is 0=no, 1-yes for mother is 35 or older, premie is 0=no, 1=yes to being born 36 weeks or sooner.
statcrunchhelpApr 10, 20144KB2271
Random Sample of 100 observations from NHANES (which contains more observations). GENDER (1=Male, 2=Female), AGE (years), WEIGHTENG (inches), HEIGHTENG (inches), SIXFOOT (0=No, 1=Yes to being 72 inches or taller), LEGENG (Leg length inches), WAISTENG (Waist circumference inches), THIGHENG (Thigh circumference inches), WAIST28 (0=No, 1=Yes to having waist 28 inches or smaller), HEIGHT65 (0=No, 1=Yes to being 65 inches tall or shorter), BMI30 (0=No, 1=Yes to having Body Mass Index 30 or higher), OVER200 (0=No, 1=Yes to weighing 200 pounds or more).
jph422Sep 16, 20084KB3967
Responses to physical fitness survey
This data set represents the results of StatCrunch survey administered between September 22nd, 2009 and October 2nd, 2009. Respondents provided the number of hours per week (Hours) that they exercise each week and the number of days per week (Days) that they exercise. Respondents also stated whether or not they were gym members (Gym), the type of exercise they do (Type - Cardio, Strength, Both or Neither), their age (Age) and their gender (Gender).
statcrunchOct 2, 20094KB4270
SAT 2012
Combined SAT scores (out of 2400) for college bound high school seniors in 2012 by gender and total, binned by score.
sblackJan 13, 20134KB2759
NY Times: The Most Economically Diverse Top Colleges
To measure top colleges efforts on economic diversity, The Upshot calculated a College Access Index, based on the percentage of freshmen in recent years who came from low-income families (measured by the share receiving a Pell grant) and on the net price of attendance for low- and middle-income families. The recent Pell (2012 - 2014) number for each college is the average percentage of the freshman class that received a Pell grant in 2011-12, 2012-13 and 2013-14; not all colleges had 2013 data yet. The earlier Pell (2008) value is for the fall of 2007. Average net price is the average total cost of attendance in 2012-13, including tuition, fees, room and board, after taking into account federal, state and institutional financial aid, for students who come from households earning between $30,000 and $48,000 a year and qualifying for federal aid. Endowment per student is for the year 2011-12 and includes graduate students. The College Access Index is a combination of net price and the Pell average for 2011, 2012 and 2013, using a statistical technique known as a z-score. A college with an average score on the two measures in combination will receive a zero.
statcrunchhelpSep 10, 20144KB1374
Driver Data for Traffic Tickets
These data are from a survey of traffic violations. Female: 0 = Male, 1 = Female; Had Ticket: 0 = No, 1 = Yes; Tickets = Number of tickets received in life; Warnings = Number of warnings (i.e., pulled over but no ticket) in life.
qtpie1480Dec 2, 20104KB1627
Survey on Exercise
- Value Key 2 Yes 1 No 2 Male 1 Female 1-Aesthetics 2-Fitness 3-Fun 4-Healthy 5-Hobby 6-Medical 7-No Motivation 8-Mental Motional 9-Requirement 10-Social 11-Time 1-Cardio 2-Combination 3-None 4-Resistance 5-yoga / Pilates Source of information on exercise. 1-Doctor 2-Friend or family 3-Media 4-N/A 5-professional 6-text 7-school 8-trainer 1- 2-.5-1 years 3- <.5 years 4-1-2 years 5- 2+ years 1=0 2=1-2 3=3-4 4=5+ 1=0 2- < 30 minutes 3. 31-60 4- 61-90 5- > 90 Source of information on exercise. 1-Doctor 2-Friend or family 3-Media 4-N/A 5-professional 6-text 7-school 8-trainer
jramirez514May 18, 20133KB2474
Stats from the major league baseball teams for 2013. The last column I added denotes AL for American League and NL for National League. One could possibly conduct a two-sample means test, for example, to find out whether the average runs for the two leagues are equal. Or there are of course lots of regressions one could run.
eykoloNov 4, 20133KB2082
Responses to you and your mate survey
This data set contains responses to a survey ran on the StatCrunch site between October 2nd and October 22nd (2009). Respondents provided their height (in inches), their opinion of the ideal height of their mate (in inches) and their gender. This data set has removed ten extreme observations.
statcrunchSep 27, 20133KB1189
Hot Dogs
The data set describes some of the nutritional properties of hot dogs.
craig_slinkmanSep 13, 20113KB1197
Crime: Walmart vs. Target
This data set comes directly from the "Is Wal-Mart Safe?" report. Are you tempted to do a paired t test to compare the mean number of police incidents at Walmart to the mean number of police incidents at Target? This was actually done in the report. I think this is a bad idea because only the Walmart stores with high incident reports were considered. See the excerpt from the report in the comments below.
websterwestFeb 29, 20083KB3978
Responses to opinions on H1N1
This data set consists of responses to a StatCrunch survey that was conducted between September 8th, 2009 and September 22nd, 2009. Respondents were asked if they were concerned about H1N1, if they were taking steps to avoid it, if they know anyone who has had it and whether or not they were a student.
websterwestSep 22, 20093KB2040
Rock'n'Roll Hall of Fame
This dataset has information on a selected group of members of The Rock'n'Roll Hall of Fame, including information on the number of people in the group, if the group had a female member, if the person/group was a double inductee into the Hall, how many studio albums they had, the number of #1 hits, the number of top 40 hits, how many music videos they had (this needs work) and the year of release of their first album.
jpalmateerMay 27, 20163KB1702
The data in the Old Faithful file gives data about eruptions of the Old Faithful Geyser during October 1980. Variables are Duration in seconds of the current eruption, and Interval, the time to the next eruption. Old Faithful is an important tourist attraction, with up to a thousand people watching it erupt on pleasant summer days. The National Park Service uses data to obtain a prediction of the time to the next eruption.
craig_slinkmanMay 4, 20102KB2547
NBA 2014-2015
This data set comes from ESPN and represents the regular season standings for the 2014-2015 NBA Season. Variables are as follows:
W = # of wins
L = # of loses
Win % = winning percentage
HOME = Home record
ROAD = Road Record
DIV = Division Record
CONF = Conference record
PPG = Points scored per game
OPP PPG = Opponent's points scored per game
Conference = Conference played in
ntorno8May 30, 20152KB5101

<   1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19   >

Always Learning