StatCrunch logo (home)

Data sets shared by StatCrunch members
Showing 1 to 15 of 1874 data sets matching TXT
Data Set/Description Owner Last edited Size Views
Measurements of weight and tar, nicotine, and carbon monoxide content for 25 brands of domestic cigarettes
** This is not my data set ** Please see the following link for source information and an overview of the context of the data.: http://www.amstat.org/publications/jse/datasets/cigarettes.txt These data were obtained from the following link: http://www.amstat.org/publications/jse/v2n1/datasets.mcintyre.html
dlozimekJan 13, 20181KB1404
nlbatting2009.txt
This dataset contains batting statistics for all National League teams in the 2009 baseball season. The goal of batting is to score runs and the dataset contains the number of runs scored per game. An interesting activity is find which offensive measures (batting average, OBP, SLG, OPS) are most helpful in predicting runs scored.
bayesballJun 8, 2010958B986
olympics.run.txt
This dataset contains the winning time in seconds of each of the men sprint running events in the Olympics from 1972 to 2008.
bayesballMay 14, 2010376B1204
anscombe.txt
Anscombe's 4 data sets for regression. They are very different, yet have the same correlation and regression coefficients.
butlerMay 31, 2011360B744
HusbandsAndWives.txt
Marsh, C. (1988) Exploring Data. Cambridge, UK: Polity Press, 315. These data are taken from the OPCS Study of the heights and weights of the adult population of Great Britain in 1980. They represent a random sample of two hundred married men and their wives. The five variables are husband's age (years), husband's height (mm), wife's age (years), wife's height (mm) and husband's age at the time of the marriage.
lakestatsOct 5, 20076KB800
BodyMeasurementHeights.txt
The heights of over 500 males and females
smcdanie%scJul 2, 200810KB593
quarterbacks2009.txt
This dataset contains statistics for all NFL quarterbacks in the 2009 season.
bayesballMay 14, 20103KB723
Annual Movie Data 2008 Random Sampling.txt
This data is a random sampling of movies that played in theaters in 2008. It includes movies released in previous years that earned money during 2008. For example, a movie released over Thanksgiving in 2007 will most likely earn money in 2007 and 2008. Each box office year ends on the first Sunday of the following year. The next year starts the following day (Monday). For example, the "2004 box office year" ended on Sunday, January 2, 2005. Inflation-adjusted figures are based ticket sale estimates, and may not be precise due to rounding errors.
wikipetersonOct 7, 20098KB472
Annual Movie Data 2008.txt
This chart ranks movies by the amount they earned during 2008. It includes movies released in previous years that earned money during 2008. For example, a movie released over Thanksgiving in 2007 will most likely earn money in 2007 and 2008. Each box office year ends on the first Sunday of the following year. The next year starts the following day (Monday). For example, the "2004 box office year" ended on Sunday, January 2, 2005. Inflation-adjusted figures are based ticket sale estimates, and may not be precise due to rounding errors.
wikipetersonOct 7, 200970KB454
Annual Movie Data 2008 Random Sampling.txt
This chart ranks movies by the amount they earned during 2008.
wikipetersonOct 14, 20096KB396
BodyMeasurements.txt
Contains body girth measurements and skeletal diameter measurements on 23 variables for a group of 507 Physically active individuals (most in their 20s and early 30s within normal weight range.
smcdanie%scJul 2, 200873KB412
titanicwithtext.dat
Is the same data as posted earlier (titanic.dat - info at http://www.amstat.org/publications/jse/datasets/titanic.txt), but with text categories instead of numbers in the later columns. The original variables are retained. The names of the new variables (obviously needing amendment) illustrate how the textual categories can be created using the "ifelse" function to do the recoding.
martineconFeb 14, 200866KB299
YMS 2.27.TXT
lenghts, in feet, of 44 great white sharks.
lakestatsSep 28, 2006224B237
Sullivan_SIDUD4_03_04_22.txt Hemoglobin in cats
hemoglobin (in g/dL) for 20 randomly selected cats
phil_larsonJan 24, 201395B228
steven.9.18.10.txt
Statistics taken for all points during a college tennis match in the Fall of 2010. Variables are the Game: the game number, Server: the server (Wittenberg or Otterbein), Length: the number of shots in the rally, and Winner: the winner of the point.
albertcb1Sep 3, 20131KB165

1 2 3 4 5 6 7 8 9 10   >

Always Learning