Data sets shared by StatCrunch members
Showing 1 to 15 of 1629 data sets matching SET
Data Set/Description Owner Last edited Size Views
chapter 9
This data set is Galton's Mother and Daughter data set as used in Sanford Weisberg's Applied Linear Regression, 3rd Edition.
Owner: katcrowe | Last edited: Apr 12, 2019 | Size: 847B | Views: 53
Thanksgiving 2015 Poll Data
This data was collected using a SurveyMonkey poll conducted on November 17th, 2015. Originally there were 1,058 respondents. The following were the original questions summarized in this data set:
Do you celebrate Thanksgiving?
What is typically the main dish at your Thanksgiving dinner?
How is the main dish typically cooked?
What kind of stuffing/dressing do you typically have?
What type of cranberry sauce do you typically have?
Do you typically have gravy?
Which of these side dishes are typically served at your Thanksgiving dinner? Please select all that apply.
Corn
Green beans/green bean casserole
Mashed potatoes
Rolls/biscuits
Yams/sweet potato casserole
Which type of pie is typically served at your Thanksgiving dinner? Please select all that apply.
Apple
Pecan
Pumpkin
Which of these desserts do you typically have at Thanksgiving dinner? Please select all that apply.
Do you typically pray before or after the Thanksgiving meal?
How far will you travel for Thanksgiving?
Will you watch any of the following programs on Thanksgiving?
Macy's Parade
What's the age cutoff at your "kids' table" at Thanksgiving?
Have you ever tried to meet up with hometown friends on Thanksgiving night?
Have you ever attended a "Friendsgiving?"
Will you shop any Black Friday sales on Thanksgiving Day?
Do you work in retail?
Will your employer make you work on Black Friday?
How would you describe where you live?
Age
What is your gender?
How much total combined money did all members of your HOUSEHOLD earn last year?
What US Region do you live in?
Owner: statcrunch_featured | Last edited: Nov 14, 2018 | Size: 204KB | Views: 2978
Christmas tree sales: Real vs. Fake 2004-2016
This data set contains the number of real and fake Christmas trees sold in the US between 2004 and 2016.
Owner: statcrunch_featured | Last edited: Nov 13, 2018 | Size: 398B | Views: 2826
Fatal Encounters Updated September 2018
This data set was downloaded from Fatal Encounters, a non-profit organization that collects data on police-involved deaths. It has been truncated to include the subject's name, age at time of death, gender, race, location of death, cause of death, and year of death. It is not limited to people shot by police; it also includes instances of police who died during fatal encounters. Students using this data set should be reminded that it is compiled by a volunteer agency from people scouring news articles for evidence of these fatal encounters, so it is not a complete population of fatal encounters, only a very large sample.
Owner: habarker | Last edited: Apr 8, 2019 | Size: 3MB | Views: 117
FIFA World Cup Match Results (1930-2014)
This data set records all men's World Cup soccer matches played between 1930 and 2014. Included are the date of the match, the location, the World Cup stage (Stage), both teams, the halftime score, the final score, and the attendance for the game.
Owner: statcrunch_featured | Last edited: Aug 1, 2018 | Size: 102KB | Views: 2067
New York City Leading Causes of Death (2007-2014)
This data set breaks down the leading causes of death in New York City between 2007 and 2014. Included is the number of deaths (Deaths) for each combination of Sex and Race Ethnicity. The Death Rate represents the rate within that Sex/Race Ethnicity category. Age Adjusted Death Rate adjusts the Death Rate by the ages of those who died.
Owner: statcrunch_featured | Last edited: Aug 1, 2018 | Size: 96KB | Views: 3909
Criminal Recidivism in Iowa: 2010-2014
Recidivism is defined as the "tendency of a convicted criminal to reoffend". This dataset tracks former criminals from Iowa over a 3-year period after their release from prison to see whether or not they were convicted of a new crime during that time. The recidivism reporting year is the fiscal year (year ending June 30) marking the end of the 3-year tracking period. Included are the following variables: Fiscal Year Released (the year the individual was released from prison) and the Race, Ethnicity, Sex, and Age of the individual when released. Also included are details about the original crime committed, along with whether that individual committed a new crime (Recidivism - Return to Prison) within the 3-year window.
Owner: statcrunch_featured | Last edited: Mar 21, 2018 | Size: 3MB | Views: 3422
FIFA World Cup Mens Players 2018
This data set records information for all 736 players for the 2018 FIFA World Cup. Included for each player is their national team (Team) along with their club team (Club).
Owner: statcrunch_featured | Last edited: Aug 1, 2018 | Size: 63KB | Views: 3210
Super Heroes
This data set originally came from the following website: https://www.kaggle.com/claudiodavi/superhero-set. It contains various physical characteristics for over 700 fictional comic book super heroes.
Owner: statcrunch_featured | Last edited: Aug 1, 2018 | Size: 47KB | Views: 6305
USA Car Accidents in 2011
This data set contains information for drivers involved in car accidents in the United States during 2011. The variables include the age in years of the person (Age), the gender of the person (Gender), the month in which the accident occurred (Month), and the day of the week of the accident (DayOfWeek).
Owner: statcrunch_featured | Last edited: Sep 12, 2017 | Size: 919KB | Views: 9249
National Longitudinal Youth Survey: Weight Perception
The Youth survey consists of a nationally representative sample of youths who were 14 to 20 years old as of December 31, 1999.
This dataset tracks the Age, Height (in inches), Weight (in pounds), Gender, and the self reported "How would you describe your weight?" multiple choice answers for the individuals.
Owner: statcrunch_featured | Last edited: Nov 10, 2017 | Size: 330KB | Views: 7566
Flight Delay Data For July 2014
This data set contains information on the flight delays for each airline at each U.S. airport in July of 2014. The columns include the carrier, airport city/state, airport code, airport name, total number of flights (Flights), the number of delayed flights (Delayed), the number of cancelled flights (Cancelled), the number of diverted flights (Diverted), the number of on-time flights (On-time), and the on-time percentage (On-time Percentage).
Owner: statcrunch_featured | Last edited: Jan 2, 2018 | Size: 88KB | Views: 6004
California Home Prices, 2009
This dataset is a collection of real estate listings from 2009 for San Luis Obispo County, California, and some surrounding locations. The prices are the list prices at the time this dataset was created. For more information about this data, go to the website source listed above.
Owner: statcrunch_featured | Last edited: Apr 3, 2017 | Size: 46KB | Views: 8042
Top 100 Retailers 2015
This dataset comes from the National Retail Federation and tracks the top retail chains in the US for 2015 based on their 2014 sales. The original data can be found at the webpage listed as the source. Note that these retailers' sales cover all channels, including internet sales.
Owner: statcrunch_featured | Last edited: Nov 10, 2017 | Size: 8KB | Views: 4358
US Presidential Election History
This dataset tracks US presidential election results dating back to 1824. Included are the winning candidate, winning party, popular vote totals, margin of victory, and the electoral college totals. Also included are the name and party of the runner-up, along with the percentage of all eligible voters who turned out for the election (Voter Turnout Percentage).
Owner: statcrunch_featured | Last edited: Feb 20, 2018 | Size: 5KB | Views: 2624
