Public profile for statcrunchhelp
Showing 1 to 82 of 82 data sets
Data Set/Description | Owner | Last edited | Size | Views |
Fictional Super Hero Good/Evil
This data set originally came from the following website: https://www.kaggle.com/claudiodavi/superhero-set. It contains various physical characteristics for over 700 fictional comic book super heroes. | statcrunchhelp | Dec 10, 2018 | 5KB | 11 |
US Birthrates over Years
US birth rates over the years. Rates are per 1,000 population, estimated as of July 1 for each year. | statcrunchhelp | Feb 28, 2017 | 1KB | 509 |
Nutritional Data for Fast Food 2017
The dataset was collected in January of 2017 by looking through online nutritional information provided by fast food restaurant chains. Nutrition data on various burgers, a breaded chicken sandwich, a grilled chicken sandwich, chicken nuggets, french fries, and a chocolate milkshake were collected for each restaurant (when applicable). For each chain the smallest hamburger, the smallest cheeseburger, and a variety of their most well known larger burgers were selected. | statcrunchhelp | Jan 6, 2017 | 11KB | 3209 |
Fat and calorie content for a sample of seven chicken sandwiches
Fat is measured in grams. | statcrunchhelp | Jun 20, 2016 | 124B | 251 |
Math expressions with bad parameters
This data set is useful only for testing purposes, to confirm that StatCrunch gives valid results for out-of-range parameters or edge cases. Use Data>Compute>From Column and select the "Expression" column. In most cases, the result should be blank, but in some cases it should be zero or one. This data set is used in an autotest, so please do not modify it without updating the corresponding autotest.
The same functions with valid parameters are tested in the file dataid=1712005. | statcrunchhelp | Jun 13, 2016 | 5KB | 119 |
Question Library Q1 Distribution6 | statcrunchhelp | May 3, 2016 | 1KB | 35 |
Question Library Q1 Distribution4 | statcrunchhelp | May 3, 2016 | 2KB | 27 |
Question Library Q1 Distribution3 | statcrunchhelp | May 3, 2016 | 2KB | 41 |
Question Library Q1 Distribution2 | statcrunchhelp | May 3, 2016 | 1KB | 11 |
Question Library Q1 Distribution1 | statcrunchhelp | May 3, 2016 | 1KB | 49 |
Granola comparison
Ten subjects in this fictional study were each asked to sample three kinds of granola cereal, labelled simply "A", "B", and "C", and to rate the granola's taste on a scale of 1 to 10. Each subject was given the three granola samples in random order. | statcrunchhelp | Apr 19, 2016 | 223B | 918 |
Times World University Rankings (2011-2016)
This data comes from the annual Times magazine rankings of universities across the world. The webpage for the Times 2016 rankings is listed above in the source.
The formula for the 2016 rankings is as follows: 30% for Teaching Rating, 7.5% for International Outlook Rating, 30% for Research Rating, 30% for Citations Rating, and 2.5% for Industry Income Rating. The “Total Score” from 2016 can be recreated using this formula.
Column | Description |
World_Rank | University rank for a given year |
University_Name | The name of the university |
Country | Location of university |
Teaching_Rating | Rating on a 0-100 scale of the quality of teaching at the university. This rating is based on the institution’s reputation for teaching, its student/staff ratio, its ratio of PhDs to undergraduate degrees awarded, and its ratio of institutional income to academic staff. |
Inter_Outlook_Rating | Rating on a 0-100 scale of the international makeup of a university. This rating is based on the international student percentage, the international staff percentage, and the percentage of research papers from the university that include at least one international author. |
Research_Rating | Rating on a 0-100 scale of the quality of research at the university. This rating is based on the university’s reputation, its ratio of research income to academic staff, and its production of scholarly papers. |
Citations_Rating | Rating on a 0-100 scale based on the normalized average number of citations per paper from the university (how often the research from the university is cited by other papers). |
Industry_Income_Rating | Rating on a 0-100 scale grading how much companies are willing to invest in the university’s research. The rating is calculated from the research income from businesses per academic staff member. |
Total_Score | The final score used to determine the university ranking, based on Teaching_Rating, Inter_Outlook_Rating, Research_Rating, Citations_Rating, and Industry_Income_Rating. |
Num_Students | Total number of students in a given year |
Student/Staff_Ratio | Number of students per academic staff member |
%_Inter_Students | Percentage of the student body who come from a foreign country |
%_Female_Students | Percentage of student body that is female. |
Year | Academic year that the ranking was released. For example, 2016 denotes the 2015-2016 academic year. |
| statcrunchhelp | Apr 5, 2016 | 254KB | 3805 |
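The 2016 weighting described for the Times World University Rankings data set can be sketched as a weighted sum. This is only an illustration: the weights come from the description, the dictionary keys reuse the column names from the table, and the sample ratings are hypothetical values rather than a row from the data set.

```python
# Weights from the 2016 formula quoted in the data set description.
WEIGHTS = {
    "Teaching_Rating": 0.30,
    "Inter_Outlook_Rating": 0.075,
    "Research_Rating": 0.30,
    "Citations_Rating": 0.30,
    "Industry_Income_Rating": 0.025,
}


def total_score(ratings):
    """Return the weighted sum of the five 0-100 ratings."""
    return sum(weight * ratings[name] for name, weight in WEIGHTS.items())


# Hypothetical ratings for illustration only (not taken from the data set).
example_ratings = {
    "Teaching_Rating": 80.0,
    "Inter_Outlook_Rating": 60.0,
    "Research_Rating": 90.0,
    "Citations_Rating": 70.0,
    "Industry_Income_Rating": 40.0,
}

print(round(total_score(example_ratings), 1))
```

Comparing such a computed value with the Total_Score column is one way to check that the formula reproduces the 2016 scores; small differences could arise from rounding in the published ratings.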
Popular US Baby Names (1990-2014)
The US government tracks the frequency of each name given to newborns in each birth year. This dataset excludes any name that had fewer than 10 occurrences in a given year.
Year: the year of birth for the babies with that name.
Frequency: the number of newborns with the specified name, birth year, and gender. | statcrunchhelp | Apr 5, 2016 | 7MB | 428 |
All MLB Salaries (1985-2015)
This data has all MLB player salaries from 1985 to 2015, including the team played for, the city, and a unique ID for each player. In total, this includes 25,575 salaries for 4,963 different baseball players.
The player ID is the first 5 letters from the last name, followed by the first two letters from the first name, followed by a number in case of duplicate names. For example, bondsba01 stands for Barry Bonds with "01" because he's the first with the "bondsba" name ID. | statcrunchhelp | Mar 15, 2016 | 1MB | 1519 |
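The player ID rule described for the MLB salaries data set can be sketched as follows. The function name `make_player_id` is hypothetical, and the handling of non-letter characters and of names shorter than the stated lengths is an assumption, since the description does not cover those cases.

```python
def make_player_id(first_name, last_name, sequence=1):
    """Build an ID such as 'bondsba01' from a player's name.

    Assumption: non-letter characters are dropped and short names
    simply contribute fewer letters; the source does not say.
    """
    last = "".join(ch for ch in last_name.lower() if ch.isalpha())[:5]
    first = "".join(ch for ch in first_name.lower() if ch.isalpha())[:2]
    # The trailing two-digit number separates players whose name parts collide.
    return f"{last}{first}{sequence:02d}"


print(make_player_id("Barry", "Bonds"))  # bondsba01, matching the example in the description
```

A second player whose name produced the same "bondsba" prefix would be given `sequence=2`, yielding `bondsba02`.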
Professor Median Salaries by Discipline, Level, and Institution (2012-2013)
This data comes from the source listed above. The webpage gives the following description: "These are the results of the 2012-13 Faculty in Higher Education Salary Survey by Discipline, Rank and Tenure Status in Four-Year Colleges and Universities conducted by The College and University Professional Association for Human Resources (CUPA-HR). Findings reflect the salaries of 184,924 tenured/tenure-track faculty members at 794 institutions nationwide. Salaries were reported by 794 institutions, including 478 private institutions and 316 public institutions, for 31 academic disciplines."
The dataset originally was created by Keisha Brown from Georgia Perimeter College. Each salary is the median salary based on the responses to the survey. | statcrunchhelp | Mar 14, 2016 | 12KB | 878 |
Top Rated Jobs 2014
This data is gathered from careercast.com and is available in its original form at the source listed above. The dataset originally was created by Keisha Brown from Georgia Perimeter College.
Column | Description |
Ranking | Ranking from 0 to 200 based on the combined “Overall Rating” |
Job | Title for the job. |
Median Annual Income | Based on Bureau of Labor Statistics |
Overall Rating | Combined rating based on income, stress, hiring outlook, and work environment. The lower the rating, the better rated the job. |
Stress Rating | A rating from 1 to 200 estimating the overall stress level from the job. This essentially is a ranking with 1 being the least stressful job and 200 being the most stressful job. |
Hiring Outlook Rating | A rating from 1 to 200 estimating the hiring outlook for the job. This essentially is a ranking with 1 being the best hiring outlook and 200 being the worst hiring outlook. |
Work Environment Rating | A rating from 1 to 200 estimating the work environment of the job. This essentially is a ranking with 1 being the best work environment and 200 being the worst work environment. |
| statcrunchhelp | Mar 14, 2016 | 9KB | 2690 |
Top 100 Retailers 2015
This dataset comes from the National Retail Federation and tracks the top retail chains in the US for 2015 based on their 2014 sales. The original data can be found at the webpage listed as the source. Note that these retailers' sales include all sorts of avenues, including internet sales. | statcrunchhelp | Mar 14, 2016 | 7KB | 4335 |
Companies Cutting Most Jobs 2013
This dataset comes from a Yahoo finance article from October 9th, 2013. The dataset originally was created by Keisha Brown from Georgia Perimeter College. | statcrunchhelp | Mar 14, 2016 | 540B | 335 |
California Home Prices, 2009
This dataset is a collection of real estate listings from 2009 for San Luis Obispo County, California, and some locations around it. The prices are the list prices at the time this dataset was created. For more information about this data, go to the website source listed above. | statcrunchhelp | Mar 11, 2016 | 46KB | 2167 |
Maryland Number of Drug and Alcohol-Related Intoxication Deaths by County of Incident, 2007-2013
Cited from the source: "Drug and alcohol-related Intoxication death data is prepared using drug and alcohol intoxication data housed in a registry developed and maintained by the Vital Statistics Administration (VSA) of the Maryland Department of Health and Mental Hygiene (DHMH). The methodology for reporting on drug-related intoxication deaths in Maryland was developed by VSA with assistance from the DHMH Alcohol and Drug Abuse Administration, the Office of the Chief Medical Examiner (OCME) and the Maryland Poison Control Center. Assistance was also provided by authors of a 2008 Baltimore City Health Department report on intoxication deaths. Data in this table is by incident location, where the death occurred, rather than by county of residence." | statcrunchhelp | Mar 9, 2016 | 849B | 433 |
Percent of Adult Current Smokers by Sex and Race/Ethnicity, 1995-2010
The original data comes from the U.S. Department of Health and Human Services.
Cited from the source: Adults are defined as 18 years of age and older. The CDC defines a "Current Smoker" as an adult who has smoked at least 100 cigarettes (5 packs) in their lifetime and currently smokes either "Every Day" or "Some Days." BRFSS data methodology changed in 2011; therefore, 2011 and after is not comparable to 2010 data and before. | statcrunchhelp | Mar 9, 2016 | 1KB | 855 |
Most Popular Baby Names NYC, 2011
This dataset tracks baby names for New York City in 2011. The "# with Name" is the total number of babies with that given name for the specified Gender and Ethnicity. The "Rank" tracks the most popular names within each combination of gender and ethnicity (1 being the most popular). | statcrunchhelp | Mar 9, 2016 | 74KB | 209 |
US Population Estimates by Gender and Age (2010-2014)
Find a description at the following webpage: Census Data Description | statcrunchhelp | Mar 9, 2016 | 17KB | 415 |
National Longitudinal Youth Survey
The Youth survey consists of a nationally representative sample of youths who were 14 to 20 years old as of December 31, 1999.
This dataset tracks the Age, Height (in inches), Weight (in pounds), Gender, and the self reported "How would you describe your weight?" multiple choice answers for the individuals. | statcrunchhelp | Mar 8, 2016 | 330KB | 1759 |
Body Temperature
Data taken from the Journal of Statistics Education online data archive. That archive in turn got the data from an article in the Journal of the American Medical Association. (Mackowiak, et al., "A Critical Appraisal of 98.6 Degrees F …", vol. 268, pp. 1578-80, 1992).
"Body Temp" is measured in degrees Fahrenheit.
"Heart rate" is the resting beats per minute. | statcrunchhelp | Mar 8, 2016 | 2KB | 4259 |
Sample expressions
This data set gives examples of basic StatCrunch expressions. Data> Compute > From Column, with the first column selected, should yield the results in ExpectedResults column.
Data > Compute Expression can be used to subtract Column 3 from ExpectedResults. This should yield blank for the first row and zero for all the other rows. Data > Validate can be used on the column of differences to verify this.
These expressions are used by autotest, so they should not be modified unless the corresponding autotest is modified. | statcrunchhelp | Feb 4, 2016 | 2KB | 273 |
USDA Nutrition Data
This dataset has the nutritional values per serving size for a large variety of foods as calculated by the USDA.
US Department of Agriculture, Agricultural Research Service, Nutrient Data Laboratory. USDA National Nutrient Database for Standard Reference, Release 28. Version Current: September 2015. Internet: http://www.ars.usda.gov/nea/bhnrc/ndl | statcrunchhelp | Jan 13, 2016 | 832KB | 1654 |
Federal Food Assistance Participation
This primarily comes from the following source: United States Department of Agriculture: Food and Nutrition Service. This dataset also incorporates data from another StatCrunch dataset: US Workforce Participation
Column | Description |
Year | The year for each data value |
Average Federal Food Assistance Participation in Thousands | Number of individuals in the US who took part in SNAP (Supplemental Nutrition Assistance Program) during the given year. |
% US Population on Federal Food Assitance | % of US population that is currently in the SNAP program and is receiving aid with food. |
Change of % (US Population on Federal Food Assistance) | The change in the percentage of the US population that is receiving food assistance from SNAP. |
Presidential Control | Political party of president. |
Senate Control | Political party of the Senate majority |
House Control | Political party of the House of Representatives majority. |
Legislative Branch (House and Senate) | Combined control of Senate and House of Representatives |
Male Inactivity Rate Aged 25-54 | Defined as the proportion of the male population aged 25-54 that is not in the labour force. Common reasons for leaving the labour force: college, retirement, staying at home, or being unable to find work and no longer trying. |
Change of Rate (Male Inactivity Rate Aged 25-54) | The change in the inactivity rate calculated as the current year minus the previous year. |
Female Inactivity Rate Aged 25-54 | Defined as the proportion of the female population aged 25-54 that is not in the labour force. |
Change of Rate (Female Inactivity Rate Aged 25-54) | The change in the inactivity rate calculated as the current year minus the previous year. |
Annual Average Workforce Participation Rate | Defined by the Bureau of Labor Statistics as "the percentage of the population [16 years and older] that is either employed or unemployed (that is, either working or actively seeking work)." Note that 2015's Annual Average is calculated using the first 11 months. |
Change of Rate (Annual Workforce Participation Rate) | The change in the workforce participation rate calculated as the current year minus the previous year. |
| statcrunchhelp | Jan 8, 2016 | 10KB | 1726 |
All Texas Executions from 1982-2015
This data set records all executions in Texas from 1982-2015 and comes from the following website: Texas Executions. The data includes a variety of information about each execution including their last statement.
| statcrunchhelp | Jan 7, 2016 | 242KB | 2479 |
US Workforce Participation
This data primarily comes from two sources: Federal Reserve Bank of St. Louis and the US Bureau of Labor Statistics.
Column | Description |
Year | The calendar year for each value |
Annual Average Workforce Participation | Defined by the Bureau of Labor Statistics as "the percentage of the population [16 years and older] that is either employed or unemployed (that is, either working or actively seeking work)." Note that 2015's Annual Average is calculated using the first 11 months. |
Male Workforce Participation Rate | Annual workforce participation rate for males. |
Female Workforce Participation Rate | Annual workforce participation rate for females. |
Male Inactivity Rate Aged 25-54 | Defined as the proportion of the male population aged 25-54 that is not in the labour force. Common reasons for leaving the labour force: college, retirement, staying at home, or being unable to find work and no longer trying. |
Change in Rate (Male Inactivity Rate Aged 25-54) | The change in the inactivity rate calculated as the current year minus the previous year. |
Female Inactivity Rate Aged 25-54 | Defined as the proportion of the female population aged 25-54 that is not in the labour force. |
Change in Rate (Female Inactivity Rate Aged 25-54) | The change in the inactivity rate calculated as the current year minus the previous year. |
Presidential Control | Political party of president. |
Senate Control | Political party of the Senate majority |
House Control | Political party of the House of Representatives majority. |
Legislative Branch (House and Senate) | Combined control of Senate and House of Representatives |
| statcrunchhelp | Jan 7, 2016 | 10KB | 752 |
NBA 2014-2015
This data set comes from ESPN and represents the regular season standings for the 2014-2015 NBA Season. Variables are as follows:
W = # of wins
L = # of losses
Win % = winning percentage
HOME = Home record
ROAD = Road Record
DIV = Division Record
CONF = Conference record
PPG = Points scored per game
OPP PPG = Opponent's points scored per game
DIFF = (PPG - OPP PPG)
Conference = Conference played in | statcrunchhelp | Jan 5, 2016 | 2KB | 1414 |
2014 MLB Top 100 Batters
This data came from ESPN.com and has the top 100 batters by WAR (wins above replacement).
AB: At bats
R: Runs
H: Hits
2B: Doubles
3B: Triples
RBI: Runs batted in
SB: Stolen Bases
BB: Walks
SO: Strikeouts
AVG: Batting average
OBP: On Base Percentage
SLG: Slugging Percentage
OPS: OBP + SLG
WAR: Wins Above Replacement | statcrunchhelp | Jan 5, 2016 | 9KB | 848 |
MidWestern Volleyball Heights (cm) | statcrunchhelp | Oct 19, 2015 | 144B | 1244 |
Colleges' SAT Math Scores | statcrunchhelp | Oct 9, 2015 | 745B | 64 |
Yellow-White exam data (split format) | statcrunchhelp | Sep 22, 2015 | 109B | 25 |
AgeDrugHDL
For testing two-way ANOVA | statcrunchhelp | Jul 28, 2015 | 496B | 345 |
Bone 2-way unbalanced ANOVA with Ortho contrasts
The first 3 columns (Growth, Gender, Bone) can be used to demonstrate unbalanced 2-way ANOVA. The same results can be obtained using MLR with Growth as the Y variable and the last five columns (g1, b1, b2, g1*b1, g1*b2) as X variables. Run subsets and subtract the Model SS of each subset from the Model SS with all 5 variables. This will match the ANOVA. | statcrunchhelp | Jun 25, 2015 | 422B | 808 |
StatCrunch Auto Testing Data set
This data set should be modified very cautiously since lots of tests depend on it. Suggestions: Save a backup before modifying. After modifying, run all auto tests to verify that they still work. This is based on Nathan's dataset 1471057. | statcrunchhelp | Jun 25, 2015 | 5KB | 12779 |
Summary of all Home Runs Hit During the 2014 Season | statcrunchhelp | Mar 18, 2015 | 411KB | 140 |
College Worth It?
This data set was collected via a StatCrunch survey. Respondents were asked if they think college is a good financial decision, if they currently attend or have attended college, their gender and their age. Check out the original survey here: http://www.statcrunch.com/5.0/survey.php?surveyid=3007&code=OYAVB&groupid=256 Feel free to copy this survey and use for your own data collection. | statcrunchhelp | Feb 4, 2015 | 18KB | 9289 |
U.S. Senate Election Results For NC Counties
This dataset contains the population size and the number of votes cast for Thom Tillis (R), Kay Hagan (D), Sean Haugh (L) and Write-In candidates. The last variable is the percent of the vote cast for Hagan within each county. | statcrunchhelp | Nov 7, 2014 | 4KB | 148 |
Heights of Females
This data set contains the heights in inches for 428 women taking an introductory statistics course at a Midwestern college. This dataset was originally created by Dr. Jim Albert. | statcrunchhelp | Oct 28, 2014 | 1KB | 2434 |
Activity: Sampling Distribution for a Mean | statcrunchhelp | Oct 28, 2014 | 69B | 126 |
North Carolina Pick 4 Results
Daily daytime/evening results for the North Carolina Pick 4 lottery from January 2012 through September 2014. Each time the game is played four numbers between 0 and 9 are selected with replacement. Each sequence of four numbers is stored in the Numbers column with a hyphen separator. Try using the Data > Arrange > Slice menu option with the Numbers column and a hyphen delimiter to break the individual numbers out into four separate columns. Then stack the four columns using the Data > Arrange > Stack menu option to get all of the results into a single column for analysis. | statcrunchhelp | Oct 22, 2014 | 84KB | 844 |
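The slice-then-stack procedure suggested for the Pick 4 data can be sketched outside StatCrunch as well. This is a minimal illustration, not a reproduction of the Data > Arrange menu options; the function name `slice_and_stack` and the two sample draws are hypothetical.

```python
from collections import Counter


def slice_and_stack(numbers_column, delimiter="-"):
    """Split each 'd-d-d-d' entry into digits, then stack the columns."""
    # Slice: one list of four integers per draw.
    sliced = [[int(part) for part in entry.split(delimiter)]
              for entry in numbers_column]
    # Transpose so each tuple holds one position across all draws.
    columns = list(zip(*sliced))
    # Stack: append the four position columns into a single column.
    stacked = [digit for column in columns for digit in column]
    return columns, stacked


# Hypothetical draws for illustration only (not taken from the data set).
draws = ["3-1-4-1", "5-9-2-6"]
columns, stacked = slice_and_stack(draws)
digit_counts = Counter(stacked)  # frequency of each digit 0-9
print(stacked)
```

With the real Numbers column in place of `draws`, the digit frequencies in `digit_counts` could be compared against the equal frequencies expected when each digit is equally likely.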
NHL game summaries for the 2013-2014 season
For each game, the variables include month (Month), day of the month (Day), year (Year), day of the week (DayOfWeek), attendance (Attendance), visiting team (Visitor), visiting team score (VisitorScr), home team (Home), home team score (HomeScr), whether the game ended in overtime or a shoot out (O/S), name of the winning goalie (WinGoalie), the name of the player who scored the winning goal (WinGoal), visiting team shots on goal (VisitorShots), visiting team power play goals (VisitorPowerPlayGoals), visiting team power play opportunities (VisitorPowerPlayOpp), visiting team penalty minutes (VisitorPenaltyMInutes), home team shots on goal (HomeShots), home team power play goals (HomePowerPlayGoals), home team power play opportunities (HomePowerPlayOpp) and home team penalty minutes (HomePenaltyMInutes). | statcrunchhelp | Oct 14, 2014 | 117KB | 493 |
Metropolitan Statistical Areas in the U.S. - Population, Location
The variables for each metro area include the population rank (PopRank), cities/states included (MSA), 2013 population estimate (Pop2013), 2010 population from census (Pop2010), percent change from 2010 (Change%), the binned percent change (ChangePercent), latitude and longitude. | statcrunchhelp | Oct 14, 2014 | 34KB | 4356 |
Most Populous U.S. Cities - Population, Area, Location
This data set contains information for all U.S. cities with a population of over 100,000 people. The variables for each city include the population rank (PopRank), city, state, 2013 population estimate (Pop2013), 2010 population from census (Pop2010), percent change from 2010 (Change%), the binned percent change (ChangePercent), land area in square miles (SqMi), 2013 population density (PopPerSqMi), latitude and longitude. | statcrunchhelp | Oct 12, 2014 | 24KB | 583 |
NFL Scores from 2013
Data for every NFL game of the 2013 season. Variables include the week of the game (Week), day of the week (Day), calendar date of the game (Date), winning team (Winner), losing team (Loser), whether the winning team was playing at home or away (WinnerAt), points for the winning team (PtsW), points for the losing team (PtsL), total points scored by both teams (TotalPts), yards for the winning team (YdsW), yards for the losing team (YdsL), turnovers for the winning team (TOW) and turnovers for the losing team (TOL). There are also additional columns, TieHome and TieAway, indicating the two teams that played the only tie game of the 2013 season in week 12. The Week column also denotes the nature of each playoff game. | statcrunchhelp | Oct 3, 2014 | 26KB | 1222 |
Favorite Browser Survey (Needs Cleaning)
This data set contains some incorrect data that can easily be found by using Data > Validate. (The data is from a survey about favorite Web browsers.)
| statcrunchhelp | Sep 30, 2014 | 14KB | 151 |
NHL Attendance by Season (2001-2014)
This data set contains the attendance figures for every NHL team for the seasons ending in 2001 through 2014. The variables include the season, ranking (by home average attendance), home games, home total attendance, home average attendance, away games, away total attendance, away average attendance, total games, total attendance and total average attendance. The data set also contains the percentage capacity for home games, away games and total games. | statcrunchhelp | Sep 26, 2014 | 25KB | 514 |
Telephone Holding Times
An airline has a toll-free phone number that they use for reservations. Sometimes callers have to be placed on hold. The airline conducted a randomized experiment to determine if there was a significant difference in how long a caller would remain on hold depending on what is playing on the call. The airline randomly selected one out of every 1000 calls to be placed on hold with either an advertisement of current promotions, with Muzak playing (elevator music), or with classical music playing. In total, 15 callers were sampled for this study. Each column is the number of minutes that the random caller remained on the line until they hung up for each type of recorded message. This data set comes from "Statistics: The Art and Science of Learning from Data" by Alan Agresti and Christine Franklin. | statcrunchhelp | Sep 17, 2014 | 85B | 1521 |
Distribution of US Population
This dataset is based on the distribution of the US population. 1500 households are randomly selected, and the number from each region is recorded in the Observed Count column. The breakdown by region is then compared to the Expected Count column, which is based on the percentages for each region in the year 2000 (19.0% for Northeast, 22.9% for Midwest, 35.6% for South, and 22.5% for West). This data set comes from "Statistics: Informed Decisions Using Data" by Michael Sullivan. | statcrunchhelp | Sep 16, 2014 | 140B | 510 |
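The expected counts for the US population distribution data set follow directly from the sample size and the year-2000 regional percentages. The sketch below computes them and defines the usual chi-square goodness-of-fit statistic; the function names are hypothetical, and the observed counts are deliberately left out because they must come from the data set's Observed Count column.

```python
SAMPLE_SIZE = 1500

# Year-2000 regional shares quoted in the description.
REGION_SHARE = {
    "Northeast": 0.190,
    "Midwest": 0.229,
    "South": 0.356,
    "West": 0.225,
}


def expected_counts(n, shares):
    """Expected count per region: sample size times the regional share."""
    return {region: n * share for region, share in shares.items()}


def chi_square_statistic(observed, expected):
    """Sum of (observed - expected)^2 / expected over all regions."""
    return sum((observed[r] - expected[r]) ** 2 / expected[r] for r in expected)


expected = expected_counts(SAMPLE_SIZE, REGION_SHARE)
for region, count in expected.items():
    print(region, round(count, 1))
```

Passing the Observed Count values (keyed by region) together with `expected` to `chi_square_statistic` gives the test statistic, which has 3 degrees of freedom for four regions.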
Weight Loss Program
This dataset is for a hypothetical weight loss program. Each row represents an individual in the program. Their weight is measured before and after the imaginary weight loss program. Try doing a paired t-test to see if this weight loss program had a significant effect. | statcrunchhelp | Sep 10, 2014 | 173B | 901 |
NY Times: The Most Economically Diverse Top Colleges
To measure top colleges' efforts on economic diversity, The Upshot calculated a College Access Index, based on the percentage of freshmen in recent years who came from low-income families (measured by the share receiving a Pell grant) and on the net price of attendance for low- and middle-income families. The recent Pell (2012 - 2014) number for each college is the average percentage of the freshman class that received a Pell grant in 2011-12, 2012-13 and 2013-14; not all colleges had 2013 data yet. The earlier Pell (2008) value is for the fall of 2007. Average net price is the average total cost of attendance in 2012-13, including tuition, fees, room and board, after taking into account federal, state and institutional financial aid, for students who come from households earning between $30,000 and $48,000 a year and qualifying for federal aid.
Endowment per student is for the year 2011-12 and includes graduate students. The College Access Index is a combination of net price and the Pell average for 2011, 2012 and 2013, using a statistical technique known as a z-score. A college with an average score on the two measures in combination will receive a zero. | statcrunchhelp | Sep 10, 2014 | 4KB | 1267 |
Yellow-White exam data
An instructor printed 10 copies of an exam on yellow paper and 10 copies on white paper, and randomly gave these yellow and white exams to 20 students. | statcrunchhelp | Sep 5, 2014 | 226B | 530 |
Improving Reading Ability
Results of an experiment to test whether directed reading activities in the classroom help elementary school students improve aspects of their reading ability. A treatment class of 21 third-grade students participated in these activities for eight weeks, and a control class of 23 third-graders followed the same curriculum without the activities. After the eight-week period, students in both classes took a Degree of Reading Power (DRP) test which measures the aspects of reading ability that the treatment is designed to improve. Number of cases: 44 Reference: Moore, David S., and George P. McCabe (1989). Introduction to the Practice of Statistics [Two sample t-test , Summary statistics]
Variable | Description |
Treatment | Whether student participated in activities (treated) or not (control) |
Response | Score on Degree of Reading Power test |
| statcrunchhelp | Sep 4, 2014 | 527B | 1213 |
Exam Scores Transposed
These are the grades for the second exam of an introductory Statistics course. The "Exam 2 Score" row is the grade for each student. | statcrunchhelp | Sep 4, 2014 | 189B | 817 |
Home prices in Albuquerque
The data are a random sample of 117 records of resales of homes from Feb 15 to Apr 30, 1993 from the files maintained by the Albuquerque Board of Realtors. This type of data is collected by multiple listing agencies in many cities and is used by realtors as an information base.
Column | Description |
PRICE | Selling price in hundreds of dollars |
SQFT | Square feet of living space |
AGE | Age of home in years |
FEATS | Number out of 11 features (dishwasher, refrigerator, microwave, disposer, washer, intercom, skylight(s), compactor, dryer, handicap fit, cable TV access) |
NE | Located in northeast sector of city (1) or not (0) |
COR | Corner location (1) or not (0) |
TAX | Annual taxes in dollars |
| statcrunchhelp | Sep 4, 2014 | 3KB | 2414 |
Binned Exam scores | statcrunchhelp | Aug 29, 2014 | 627B | 276 |
Top US Problems
A Gallup survey taken in July and August 2014 asked 945 Republicans and 854 Democrats to name top U.S. problems. Only the top four responses are tabulated here: Immigration, Dysfunctional Government, Economy and Unemployment. The remaining survey responses are listed as Other. | statcrunchhelp | Aug 29, 2014 | 34KB | 578 |
Apple Juice Bottles | statcrunchhelp | Aug 15, 2014 | 167B | 691 |
Exam scores | statcrunchhelp | Aug 12, 2014 | 104B | 2341 |
50 Coin Flips | statcrunchhelp | Aug 11, 2014 | 327B | 534 |
Pairwise Counts For Two Categorical Variables | statcrunchhelp | Jul 31, 2014 | 58B | 271 |
Categorical Variable In Summary Form | statcrunchhelp | Jul 29, 2014 | 55B | 391 |
Two Categorical Variables | statcrunchhelp | Jul 22, 2014 | 72B | 1935 |
Survey: Your height and ideal height of mate
This data set contains responses to a StatCrunch survey. Respondents provided their height (in inches), their opinion of the ideal height of their mate (in inches) and their gender. Ten extreme observations have been removed from this data set.
| statcrunchhelp | Apr 15, 2014 | 2KB | 1956 |
Sound type, volume level, and task time
Sound type (Ad or Music), Volume level (Low or High), and Time to complete a task (in minutes). | statcrunchhelp | Apr 10, 2014 | 144B | 736 |
Minutes before hanging up when on hold
Time, in minutes, that a caller stays on hold before hanging up under three different treatments. | statcrunchhelp | Apr 10, 2014 | 100B | 359 |
Full-time and part-time graduation rates
Graduation rate of full-time and part-time students in a random sample of four colleges | statcrunchhelp | Apr 10, 2014 | 211B | 626 |
North Carolina premature births
A random sample of 1000 births from the state of North Carolina. Plurality refers to the number of children associated with the birth. Gender is 1=Male, 2=Female. fage is age of father (years), mage is age of mother (years), visits is number of pre-natal medical visits, marital is 1=married, 2=unmarried, racemom is Race of Mother (0=Other Non-white, 1=White, 2=Black, 3=American Indian, 4=Chinese, 5=Japanese, 6=Hawaiian, 7=Filipino, 8=Other Asian or Pacific Islander), hispmom is whether mother is of Hispanic origin (C=Cuban, M=Mexican, N=Non-Hispanic, O=Other and Unknown Hispanic, P=Puerto Rican, S=Central/South American, U=Not Classifiable), gained is weight gain during pregnancy (pounds), lowbw is whether birth weight is 2500 grams or lower, tpounds is birth weight in pounds, smoke is 0=no, 1=yes for mother admitted to smoking, mature is 0=no, 1=yes for mother is 35 or older, premie is 0=no, 1=yes for being born at 36 weeks or sooner.
| statcrunchhelp | Apr 10, 2014 | 4KB | 2025 |
Back problems and gender | statcrunchhelp | Apr 10, 2014 | 60B | 947 |
Survey: Is college worth it?
This data set was collected via a StatCrunch survey. Respondents were asked if they think college is a good financial decision, if they currently attend or have attended college, their gender and their age.
Check out the original survey here: http://www.statcrunch.com/5.0/survey.php?surveyid=3007&code=OYAVB&groupid=256
Feel free to copy this survey and use for your own data collection. | statcrunchhelp | Apr 10, 2014 | 22KB | 1975 |
Height
The height in inches of 50 students is given. One can create a histogram and see the bimodal nature of the data. There is an interesting article that argues against human height being bimodal: http://faculty.washington.edu/tamre/IsHumanHeightBimodal.pdf | statcrunchhelp | Apr 9, 2014 | 156B | 1010 |
Asking prices for 4-bedroom homes in Bryan-College Station TX
Random sample of 30 four-bedroom homes listed for sale in the Bryan-College Station, Texas, area. For each home, the data set contains the list price in thousands of dollars (Price), square footage (Sqft), number of bathrooms (Baths) and location (Bryan, TX or College Station, TX). | statcrunchhelp | Apr 4, 2014 | 951B | 3935 |
Effect of Smoke on infants
Data was collected by a random survey of mothers in KY through a dance studio during November 2010 by SABRINA LAFFERTY & KAREN HOLLAND (ST 291 Fall 2010 candidates at HCTC) as a requirement for semester project. They asked 57 mothers about the gestation period for their pregnancies, the birth weight, the length of their newborns and whether they smoked while they were pregnant. | statcrunchhelp | Mar 6, 2014 | 1KB | 6984 |
Old Faithful
Old Faithful Geyser, Yellowstone National Park. Duration (in seconds). Interval (in minutes). | statcrunchhelp | Feb 18, 2014 | 2KB | 2175 |
hotdog | statcrunchhelp | Feb 17, 2014 | 823B | 1866 |
Favorite Web Browser | statcrunchhelp | Feb 17, 2014 | 15KB | 451 |
Responses to Intro Statistics Class Survey Fall 2012 | statcrunchhelp | Feb 14, 2014 | 3KB | 175 |
Responses to Sullivan Statistics Survey | statcrunchhelp | Feb 13, 2014 | 31KB | 160 |
Facebook Friend Data
This is the data that came from the StatCrunch Friend Data application for one 27-year-old who graduated from Texas A&M University. All information that could compromise anonymity was excluded. | statcrunchhelp | Feb 13, 2014 | 575KB | 623 |