
Data sets shared by StatCrunch members
Showing 1 to 15 of 160 data sets matching DESCRIPTION
Data Set/Description Owner Last edited Size Views
US Workforce Participation
This data primarily comes from two sources: the Federal Reserve Bank of St. Louis and the US Bureau of Labor Statistics.
Column: Description
Year: The calendar year for each value.
Annual Average Workforce Participation: Defined by the Bureau of Labor Statistics as "the percentage of the population [16 years and older] that is either employed or unemployed (that is, either working or actively seeking work)." Note that 2015's Annual Average is calculated using the first 11 months.
Male Workforce Participation Rate: Annual workforce participation rate for males.
Female Workforce Participation Rate: Annual workforce participation rate for females.
Male Inactivity Rate Aged 25-54: Defined as the proportion of the male population aged 25-54 that is not in the labour force. Common reasons for leaving the labour force: attending college, retirement, staying at home, or giving up the job search after failing to find work.
Change in Rate (Male Inactivity Rate Aged 25-54): The change in the inactivity rate, calculated as the current year minus the previous year.
Female Inactivity Rate Aged 25-54: Defined as the proportion of the female population aged 25-54 that is not in the labour force.
Change in Rate (Female Inactivity Rate Aged 25-54): The change in the inactivity rate, calculated as the current year minus the previous year.
Presidential Control: Political party of the president.
Senate Control: Political party of the Senate majority.
House Control: Political party of the House of Representatives majority.
Legislative Branch (House and Senate): Combined control of the Senate and House of Representatives.
Owner: statcrunch_featured | Last edited: Jun 27, 2017 | Size: 10KB | Views: 2194
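The "Change in Rate" columns in the US Workforce Participation data set are described as the current year minus the previous year. A minimal Python sketch of that calculation follows. The function name and the sample rates are hypothetical illustrations, not values from the data set, and rounding to two decimals is an assumption made only to keep the output readable.

```python
from typing import List, Optional


def year_over_year_change(rates: List[float]) -> List[Optional[float]]:
    """Return the change in rate for each year: current year minus previous year.

    The first year has no previous value, so its change is None.
    """
    if not rates:
        return []
    changes: List[Optional[float]] = [None]
    for previous, current in zip(rates, rates[1:]):
        # Rounding is an assumption for readability, not part of the definition.
        changes.append(round(current - previous, 2))
    return changes


# Hypothetical inactivity rates (percent) for three consecutive years.
example_rates = [10.0, 10.5, 10.2]
example_changes = year_over_year_change(example_rates)
```

A positive change means the rate rose relative to the previous year, and a negative change means it fell.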
Roller Coasters Data
This dataset looks at some of the roller coasters across the US and various other countries.
Column: Description
Name: Name of roller coaster
Park: Amusement park for roller coaster
City: City for amusement park
State: State abbreviation
Country: Country of the roller coaster. US = United States, MX = Mexico, CR = Costa Rica, GT = Guatemala, CO = Colombia, VE = Venezuela, BR = Brazil, AR = Argentina, CL = Chile, EQ = Ecuador, PE = Peru, F = France, D = Germany
Type: S = Steel, W = Wood
Constructor: Type of build for the roller coaster
Height: Height in meters
Speed: Speed in kilometers per hour (km/h)
Length: Length in meters
Inversions: Yes if there are inversions, no if not
Duration: Duration of ride in seconds
GForce: Max g-force
Opened: Year it opened
Region: Geographic region for the roller coaster
Owner: statcrunch_featured | Last edited: Apr 3, 2017 | Size: 48KB | Views: 5399
McKenna Morrissey: Depression and the Internet
This study was done to determine whether spending more time on the internet causes depression. This data set includes hours spent on the internet per week, depression before and after, gender, race, age, household income, and household size. (https://dasl.datadescription.com/datafile/depression-and-the-internet/?_sfm_cases=4+17504&sf_paged=6)
Owner: mckenrm | Last edited: Oct 24, 2018 | Size: 9KB | Views: 841
Times World University Rankings (2011-2016)
This data comes from the annual Times magazine rankings of universities across the world. The webpage for the Times 2016 rankings is listed above in the source.
The formula for the 2016 rankings is as follows:
30% for Teaching Rating
7.5% for International Outlook Rating
30% for Research Rating
30% for Citations Rating
2.5% for Industry Income Rating.
The “Total Score” from 2016 can be recreated using this formula.
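As a rough illustration of how the 2016 “Total Score” could be recreated from the weights listed above, here is a minimal Python sketch. The dictionary keys reuse the column names from this data set's description; the function name and the sample ratings are hypothetical and are not taken from the data.

```python
# Weights for the 2016 rankings, as listed in the data set description.
WEIGHTS = {
    "Teaching_Rating": 0.30,
    "Inter_Outlook_Rating": 0.075,
    "Research_Rating": 0.30,
    "Citations_Rating": 0.30,
    "Industry_Income_Rating": 0.025,
}


def total_score(ratings: dict) -> float:
    """Weighted sum of the five 0-100 ratings for one university."""
    return sum(weight * ratings[name] for name, weight in WEIGHTS.items())


# Hypothetical ratings for a single university (not real data).
example_ratings = {
    "Teaching_Rating": 80.0,
    "Inter_Outlook_Rating": 60.0,
    "Research_Rating": 90.0,
    "Citations_Rating": 70.0,
    "Industry_Income_Rating": 40.0,
}
example_total = total_score(example_ratings)
```

Because the five weights sum to 100%, a university scoring 100 on every rating would receive a total of 100. Small differences from the published score may appear because of rounding in the published ratings.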

Column: Description
World_Rank: University rank for a given year.
University_Name: The name of the university.
Country: Location of the university.
Teaching_Rating: Rating on a 0-100 scale of the quality of teaching at the university. This rating is based on the institution's reputation for teaching, its student/staff ratio, its ratio of PhDs to undergraduate degrees awarded, and its ratio of institutional income to academic staff.
Inter_Outlook_Rating: Rating on a 0-100 scale of the international makeup of a university. This rating is based on the international student percentage, the international staff percentage, and the percentage of research papers from the university that include at least one international author.
Research_Rating: Rating on a 0-100 scale of the quality of research at the university. This rating is based on the university's reputation, its ratio of research income to academic staff, and its production of scholarly papers.
Citations_Rating: Rating on a 0-100 scale based on the normalized average number of citations per paper from the university (how often research from the university is cited by other papers).
Industry_Income_Rating: Rating on a 0-100 scale grading how much companies are willing to invest in the university's research. The rating is calculated from the research income from businesses per academic staff member.
Total_Score: The final score used to determine the university ranking, based on Teaching_Rating, Inter_Outlook_Rating, Research_Rating, Citations_Rating, and Industry_Income_Rating.
Num_Students: Total number of students in a given year.
Student/Staff_Ratio: Number of students per academic staff member.
%_Inter_Students: Percentage of the student body that comes from a foreign country.
%_Female_Students: Percentage of the student body that is female.
Year: Academic year in which the ranking was released. For example, 2016 denotes the 2015-2016 academic year.
Owner: statcrunchhelp | Last edited: Apr 5, 2016 | Size: 254KB | Views: 3696
% voting for Obama and other state statistics
This data set has over 100 statistics (current for 2010-11) for U.S. states obtained from Measure of America. Each state's percentage voting for President Obama in 2012 has been added. Which of the original variables is most highly correlated with this voting percentage? How does this data match the ideas provided by political pundits? See the source for a complete description of all variables.
Owner: websterwest | Last edited: Feb 16, 2013 | Size: 36KB | Views: 6296
Federal Food Assistance Participation
This data primarily comes from the following source: United States Department of Agriculture: Food and Nutrition Service. This dataset also incorporates data from another StatCrunch dataset: US Workforce Participation.

Column: Description
Year: The year for each data value.
Average Federal Food Assistance Participation in Thousands: Number of individuals in the US who took part in SNAP (Supplemental Nutrition Assistance Program) during the given year.
% US Population on Federal Food Assitance: Percentage of the US population that is enrolled in SNAP and receiving food aid.
Change of % (US Population on Federal Food Assistance): The change in the percentage of the US population that is receiving food assistance from SNAP.
Presidential Control: Political party of the president.
Senate Control: Political party of the Senate majority.
House Control: Political party of the House of Representatives majority.
Legislative Branch (House and Senate): Combined control of the Senate and House of Representatives.
Male Inactivity Rate Aged 25-54: Defined as the proportion of the male population aged 25-54 that is not in the labour force. Common reasons for leaving the labour force: attending college, retirement, staying at home, or giving up the job search after failing to find work.
Change of Rate (Male Inactivity Rate Aged 25-54): The change in the inactivity rate, calculated as the current year minus the previous year.
Female Inactivity Rate Aged 25-54: Defined as the proportion of the female population aged 25-54 that is not in the labour force.
Change of Rate (Female Inactivity Rate Aged 25-54): The change in the inactivity rate, calculated as the current year minus the previous year.
Annual Average Workforce Participation Rate: Defined by the Bureau of Labor Statistics as "the percentage of the population [16 years and older] that is either employed or unemployed (that is, either working or actively seeking work)." Note that 2015's Annual Average is calculated using the first 11 months.
Change of Rate (Annual Workforce Participation Rate): The change in the workforce participation rate, calculated as the current year minus the previous year.
Owner: statcrunchhelp | Last edited: Jan 8, 2016 | Size: 10KB | Views: 1673
Top Rated Jobs 2014
This data was gathered from careercast.com and is available in its original form at the source listed above. The dataset was originally created by Keisha Brown from Georgia Perimeter College.

Column: Description
Ranking: Ranking from 0 to 200 based on the combined “Overall Rating”.
Job: Title for the job.
Median Annual Income: Based on Bureau of Labor Statistics data.
Overall Rating: Combined rating based on income, stress, hiring outlook, and work environment. The lower the rating, the better rated the job.
Stress Rating: A rating from 1 to 200 estimating the overall stress level of the job. This is essentially a ranking, with 1 being the least stressful job and 200 being the most stressful job.
Hiring Outlook Rating: A rating from 1 to 200 estimating the hiring outlook for the job. This is essentially a ranking, with 1 being the best hiring outlook and 200 being the worst hiring outlook.
Work Environment Rating: A rating from 1 to 200 estimating the work environment of the job. This is essentially a ranking, with 1 being the best work environment and 200 being the worst work environment.
Owner: statcrunchhelp | Last edited: Mar 14, 2016 | Size: 9KB | Views: 2594
Egyptian Skulls
Description: Four measurements of male Egyptian skulls from 5 different time periods. Thirty skulls are measured from each time period.
Number of cases: 150
Variable Names:
MB: Maximal Breadth of Skull
BH: Basibregmatic Height of Skull
BL: Basialveolar Length of Skull
NH: Nasal Height of Skull
Year: Approximate Year of Skull Formation (negative = B.C., positive = A.D.)
Owner: hassan.dayem | Last edited: Nov 19, 2012 | Size: 9KB | Views: 2240
titanic_full.xls
VARIABLE DESCRIPTIONS:
survival: Survival (0 = No; 1 = Yes)
pclass: Passenger Class (1 = 1st; 2 = 2nd; 3 = 3rd)
name: Name
sex: Sex
age: Age
sibsp: Number of Siblings/Spouses Aboard
parch: Number of Parents/Children Aboard
ticket: Ticket Number
fare: Passenger Fare
cabin: Cabin
embarked: Port of Embarkation (C = Cherbourg; Q = Queenstown; S = Southampton)
boat: Lifeboat
body: Body Identification Number
home.dest: Home/Destination
Owner: swhardy | Last edited: Oct 25, 2015 | Size: 110KB | Views: 1274
Roller Coasters Data
This dataset looks at some of the roller coasters across the US and various other countries.
Column: Description
Name: Name of roller coaster
Park: Amusement park for roller coaster
City: City for amusement park
State: State abbreviation
Country: Country of the roller coaster. US = United States, MX = Mexico, CR = Costa Rica, GT = Guatemala, CO = Colombia, VE = Venezuela, BR = Brazil, AR = Argentina, CL = Chile, EQ = Ecuador, PE = Peru, F = France, D = Germany
Type: S = Steel, W = Wood
Constructor: Type of build for the roller coaster
Height: Height in meters
Speed: Speed in kilometers per hour (km/h)
Length: Length in meters
Inversions: Yes if there are inversions, no if not
Duration: Duration of ride in seconds
GForce: Max g-force
Opened: Year it opened
Region: Geographic region for the roller coaster
Owner: ntorno8 | Last edited: Sep 15, 2016 | Size: 48KB | Views: 15448
Treatment Effects of a Drug on Cognitive Functioning in Children with Mental Retardation and ADHD
Research conducted by: Pearson et al.
Case study prepared by: David Lane and Emily Zitek
Overview: This study investigated the cognitive effects of stimulant medication in children with mental retardation and Attention-Deficit/Hyperactivity Disorder. This case study shows the data for the Delay of Gratification (DOG) task. Children were given various dosages of a drug, methylphenidate (MPH), and then completed this task as part of a larger battery of tests. The order of doses was counterbalanced so that each dose appeared equally often in each position. For example, six children received the lowest dose first, six received it second, etc. The children were on each dose for one week before testing. This task, adapted from the preschool delay task of the Gordon Diagnostic System (Gordon, 1983), measures the ability to suppress or delay impulsive behavioral responses. Children were told that a star would appear on the computer screen if they waited long enough to press a response key. If a child responded less than four seconds after their previous response, they did not earn a star, and the 4-second counter restarted. The DOG differentiates children with and without ADHD of normal intelligence (e.g., Mayes et al., 2001), and is sensitive to MPH treatment in these children (Hall & Kataria, 1992).
Questions to Answer: Does higher dosage lead to higher cognitive performance (measured by the number of correct responses to the DOG task)?
Design Issues: This is a repeated-measures design because each participant performed the task after each dosage.
Variable Description:
Placebo: Number of correct responses after taking a placebo
d15: Number of correct responses after taking .15 mg/kg of the drug
d30: Number of correct responses after taking .30 mg/kg of the drug
d60: Number of correct responses after taking .60 mg/kg of the drug
Owner: kari.taylor | Last edited: Oct 22, 2014 | Size: 434B | Views: 1387
1st: helium football
Datafile Name: Helium football
Datafile Subjects: Sports
Story Names: Helium football
Reference: Lafferty, M. B. (1993), "OSU scientists get a kick out of sports controversy," The Columbus Dispatch (November 21, 1993), B7.
Authorization: Contact authors
Description: Two identical footballs, one air-filled and one helium-filled, were used outdoors on a windless day at The Ohio State University's athletic complex. Each football was kicked 39 times and the two footballs were alternated with each kick. The experimenter recorded the distance traveled by each ball.
Number of cases: 39
Variable Names:
Trial: Trial number
Air: Distance in yards for air-filled football
Helium: Distance in yards for helium-filled football
Owner: phil_larson | Last edited: Sep 13, 2012 | Size: 359B | Views: 980
Titanic Passenger List
VARIABLE DESCRIPTIONS:
survival: Survival (0 = No; 1 = Yes)
pclass: Passenger Class (1 = 1st; 2 = 2nd; 3 = 3rd)
name: Name
sex: Sex
age: Age
sibsp: Number of Siblings/Spouses Aboard
parch: Number of Parents/Children Aboard
ticket: Ticket Number
fare: Passenger Fare
cabin: Cabin
embarked: Port of Embarkation (C = Cherbourg; Q = Queenstown; S = Southampton)
boat: Lifeboat
body: Body Identification Number
home.dest: Home/Destination
Owner: lmcmath34 | Last edited: Jun 29, 2016 | Size: 123KB | Views: 1344
CEREAL BRANDS
Datafile Subjects: Food, Health
Story Names: Healthy Breakfast
Reference: Data available at many grocery stores
Authorization: free use
Description: Data on several variables for different brands of cereal. A value of -1 for nutrients indicates a missing observation.
Number of cases: 77
Variable Names:
Name: Name of cereal
mfr: Manufacturer of cereal, where A = American Home Food Products; G = General Mills; K = Kelloggs; N = Nabisco; P = Post; Q = Quaker Oats; R = Ralston Purina
type: cold or hot
calories: calories per serving
protein: grams of protein
fat: grams of fat
sodium: milligrams of sodium
fiber: grams of dietary fiber
carbo: grams of complex carbohydrates
sugars: grams of sugars
potass: milligrams of potassium
vitamins: vitamins and minerals - 0, 25, or 100, indicating the typical percentage of FDA recommended
shelf: display shelf (1, 2, or 3, counting from the floor)
weight: weight in ounces of one serving
cups: number of cups in one serving
rating: a rating of the cereals
Owner: lil_biz | Last edited: Aug 27, 2009 | Size: 5KB | Views: 3095
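The CEREAL BRANDS description notes that a value of -1 for nutrients indicates a missing observation, so those values should be excluded before computing summaries. The following Python sketch shows one way to do that. The function names and sample values are hypothetical, and the sketch assumes a nutrient column has already been read into a plain list of numbers.

```python
from typing import List, Optional

# Sentinel used in the cereal data to mark a missing nutrient value.
MISSING_SENTINEL = -1


def clean_nutrient(values: List[float]) -> List[Optional[float]]:
    """Replace the -1 sentinel with None so it is not treated as a real amount."""
    return [None if value == MISSING_SENTINEL else value for value in values]


def mean_ignoring_missing(values: List[float]) -> Optional[float]:
    """Average the observed values only; return None if nothing was observed."""
    present = [value for value in clean_nutrient(values) if value is not None]
    if not present:
        return None
    return sum(present) / len(present)


# Hypothetical column with one missing observation (not real data).
example_column = [5, -1, 3]
example_mean = mean_ignoring_missing(example_column)
```

Leaving the -1 values in place would pull averages downward, which is why they are filtered out before any summary is computed.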
Home prices in Albuquerque
The data are a random sample of 117 records of resales of homes from Feb 15 to Apr 30, 1993 from the files maintained by the Albuquerque Board of Realtors. This type of data is collected by multiple listing agencies in many cities and is used by realtors as an information base.
Column: Description
PRICE: Selling price in hundreds of dollars
SQFT: Square feet of living space
AGE: Age of home in years
FEATS: Number out of 11 features (dishwasher, refrigerator, microwave, disposer, washer, intercom, skylight(s), compactor, dryer, handicap fit, cable TV access)
NE: Located in northeast sector of city (1) or not (0)
COR: Corner location (1) or not (0)
TAX: Annual taxes in dollars
Owner: statcrunchhelp | Last edited: Sep 4, 2014 | Size: 3KB | Views: 2370
