Data sets shared by StatCrunch members
Showing 1 to 15 of 219 data sets matching COLUMN
Data Set/Description | Owner | Last edited | Size | Views
Movie Budgets and Box Office Earnings (Updated Spring 2018)
This data all comes from the following website that tracks the financial performance of movies:
http://www.the-numbers.com/movie/budgets/all

The “Budget”, “Domestic Gross”, and “Worldwide Gross” columns each are in millions of dollars.

statcrunch_featured | Oct 4, 2018 | 270KB | 15100
Flight Delay Data For July 2014
This data set contains information on the flight delays for each airline at each U.S. airport in July of 2014. The columns include the carrier, airport city/state, airport code, airport name, total number of flights (Flights), the number of delayed flights (Delayed), the number of cancelled flights (Cancelled), the number of diverted flights (Diverted), the number of on-time flights (On-time), and the on-time percentage (On-time Percentage).
statcrunch_featured | Jan 2, 2018 | 88KB | 6803
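The description does not say how the On-time Percentage column is derived. A minimal sketch, assuming it is the number of on-time flights divided by the total number of flights (the function name and the counts below are hypothetical, not taken from the data set):

```python
def on_time_percentage(on_time: int, flights: int) -> float:
    """Return on-time flights as a percentage of total flights.

    Assumption: On-time Percentage = 100 * On-time / Flights.
    """
    if flights <= 0:
        raise ValueError("flights must be positive")
    return 100.0 * on_time / flights


# Hypothetical counts for one carrier at one airport.
example = on_time_percentage(on_time=150, flights=200)
print(example)
```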
US Workforce Participation
This data primarily comes from two sources: Federal Reserve Bank of St. Louis and the US Bureau of Labor Statistics.
Column | Description
Year | The calendar year for each value
Annual Average Workforce Participation | Defined by the Bureau of Labor Statistics as "the percentage of the population [16 years and older] that is either employed or unemployed (that is, either working or actively seeking work). Note that 2015's Annual Average is calculated using the first 11 months."
Male Workforce Participation Rate | Annual workforce participation rate for males.
Female Workforce Participation Rate | Annual workforce participation rate for females.
Male Inactivity Rate Aged 25-54 | Defined as the proportion of the male population aged 25-54 that is not in the labour force. Common reasons for leaving labour force: college, retirement, stay at home, can't find work and no longer try.
Change in Rate (Male Inactivity Rate Aged 25-54) | The change in the inactivity rate calculated as the current year minus the previous year.
Female Inactivity Rate Aged 25-54 | Defined as the proportion of the female population aged 25-54 that is not in the labour force.
Change in Rate (Female Inactivity Rate Aged 25-54) | The change in the inactivity rate calculated as the current year minus the previous year.
Presidential Control | Political party of president.
Senate Control | Political party of the Senate majority.
House Control | Political party of the House of Representatives majority.
Legislative Branch (House and Senate) | Combined control of Senate and House of Representatives.
statcrunch_featured | Jun 27, 2017 | 10KB | 2963
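The "Change in Rate" columns are described as the current year minus the previous year. A minimal sketch of that calculation, assuming the rates are listed in chronological order and that the first year has no change value (the function name and the sample rates are hypothetical):

```python
def year_over_year_change(rates):
    """Return current-minus-previous differences for a chronological list.

    The first entry is None because no previous year is available.
    """
    if not rates:
        return []
    changes = [None]
    for previous, current in zip(rates, rates[1:]):
        changes.append(current - previous)
    return changes


# Hypothetical inactivity rates for three consecutive years.
rates = [10.0, 10.5, 10.25]
print(year_over_year_change(rates))
```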
Seattle Monthly Rain Gauge Accumulations (2003-2017)
Monthly accumulations for Seattle Pacific University's rain gauges located throughout Seattle city limits. Each column represents the amount of water accumulated (in inches) over the past month at a different rain gauge.
statcrunch_featured | Aug 1, 2018 | 16KB | 956
Roller Coasters Data
This dataset looks at some of the roller coasters across the US and various other countries.
Column | Description
Name | Name of roller coaster
Park | Amusement park for roller coaster
City | City for amusement park
State | State abbreviation
Country | Country of the roller coaster. US: United States, MX: Mexico, CR: Costa Rica, GT: Guatemala, CO: Colombia, VE: Venezuela, BR: Brazil, AR: Argentina, CL: Chile, EQ: Ecuador, PE: Peru, F: France, D: Germany
Type | S: Steel, W: Wood
Constructor | Type of build for the roller coaster
Height | Height in meters
Speed | Speed in kilometers per hour (km/h)
Length | Length in meters
Inversions | Yes if there are inversions, no if not
Duration | Duration of ride in seconds
GForce | Max g-force
Opened | Year it opened
Region | Geographic region for the roller coaster
statcrunch_featured | Apr 3, 2017 | 48KB | 6876
Bill's Island
The X column represents the number of years since 1900. The Y column is the sea level change in centimeters.
burgin.bill | Jul 29, 2019 | 83B | 63
D1.6
Dataset: airline_costs.dat
Source: J.W. Proctor and J.S. Duncan (1954). "A Regression Analysis of Airline Costs," Journal of Air Law and Commerce, Vol.21, #3, pp.282-292.
Description: Regression relating Operating Costs per revenue ton-mile to 7 factors: length of flight, speed of plane, daily flight time per aircraft, population served, ton-mile load factor, available tons per aircraft mile, and firm's net assets. Regression based on natural logarithms of all factors, except load factor. Load factor and available tons (capacity) for Northeast Airlines were imputed from summary calculations.
Variables/columns
Airline 1-20
Length of flight (miles) 22-28
L_Group (inserted) Long (>175), Med (>60), Short (<69)
Speed of Plane (miles per hour) 30-36
Daily Flight Time per plane (hours) 38-44
Population served (1000s) 46-52
Total Operating Cost (cents per revenue ton-mile) 54-60
Revenue Tons per Aircraft mile 62-68
Ton-Mile load factor (proportion) 70-76
Available Capacity (Tons per mile) 78-84
Total Assets ($100,000s) 86-92
Investments and Special Funds ($100,000s) 94-100
Adjusted Assets ($100,000s) 102-108
housew1 | Jul 3, 2019 | 2KB | 67
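The number ranges in the D1.6 description (for example, Airline 1-20) read like character positions in the original fixed-width file airline_costs.dat. A rough sketch of slicing a line that way, assuming the positions are 1-based and inclusive; the StatCrunch copy may already be split into columns, and the sample line below is made up for illustration:

```python
# Assumed 1-based, inclusive character positions for the first few fields.
FIELDS = {
    "Airline": (1, 20),
    "Length of flight (miles)": (22, 28),
    "Speed of Plane (miles per hour)": (30, 36),
}


def parse_line(line):
    """Slice one fixed-width line into a dict of stripped strings."""
    record = {}
    for name, (start, end) in FIELDS.items():
        # Convert the 1-based inclusive range to a Python slice.
        record[name] = line[start - 1:end].strip()
    return record


# Hypothetical line laid out to match the assumed positions.
sample = "Example Air".ljust(20) + " " + "100".rjust(7) + " " + "200".rjust(7)
print(parse_line(sample))
```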
D2.4
Data set with a mean = 40 and SD = 10. Explore how z-scores were computed. You will see [8] above the StatCrunch button. Explore the tables, visuals, and calculated columns (Z-score and DATA2).
housew1 | Jun 16, 2019 | 53KB | 53
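The D2.4 description asks how the z-scores were computed but does not show the calculation. A minimal sketch of the standard formula, z = (x - mean) / SD, assuming the stated mean of 40 and SD of 10 are the values used (the data set itself may use the sample statistics instead, and the input values here are hypothetical):

```python
def z_score(x, mean=40.0, sd=10.0):
    """Return how many standard deviations x lies from the mean."""
    if sd <= 0:
        raise ValueError("sd must be positive")
    return (x - mean) / sd


# Hypothetical observations, not taken from the data set.
for value in (25, 40, 55):
    print(value, z_score(value))
```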
Movie Budgets and Box Office Earnings (Updated Fall 2016)
This data all comes from the following website that tracks the financial performance of movies:
http://www.the-numbers.com/movie/budgets/all

The “Budget”, “Domestic Gross”, and “Worldwide Gross” columns each are in millions of dollars.

ntorno8 | Jun 30, 2017 | 266KB | 6121
Times World University Rankings (2011-2016)
This data comes from the annual Times magazine rankings of universities across the world. The webpage for the Times 2016 rankings is listed above in the source.
The formula for the 2016 rankings is as follows:
30% for Teaching Rating
7.5% for International Outlook Rating
30% for Research Rating
30% for Citations Rating
2.5% for Industry Income Rating.
The “Total Score” from 2016 can be recreated using this formula.
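The weighting above can be sketched as a weighted sum. This is an illustration only: the column names are the ones used in this data set, the ratings in the example are hypothetical, and the published scores may differ slightly because of rounding.

```python
# Weights from the 2016 formula listed above; they sum to 100%.
WEIGHTS = {
    "Teaching_Rating": 0.30,
    "Inter_Outlook_Rating": 0.075,
    "Research_Rating": 0.30,
    "Citations_Rating": 0.30,
    "Industry_Income_Rating": 0.025,
}


def total_score(ratings):
    """Return the weighted sum of the five ratings (each on a 0-100 scale)."""
    return sum(weight * ratings[name] for name, weight in WEIGHTS.items())


# Hypothetical ratings for one university.
example = {
    "Teaching_Rating": 80.0,
    "Inter_Outlook_Rating": 60.0,
    "Research_Rating": 90.0,
    "Citations_Rating": 70.0,
    "Industry_Income_Rating": 40.0,
}
print(round(total_score(example), 1))
```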

Column | Description
World_Rank | University rank for a given year
University_Name | The name of the university
Country | Location of university
Teaching_Rating | Rating on a 0-100 scale of the quality of teaching at the university. This rating is based on the institution’s reputation for teaching, its student/staff ratio, its ratio of PhDs to undergraduate degrees awarded, and its institutional income/academic staff ratio.
Inter_Outlook_Rating | Rating on a 0-100 scale of the international makeup of a university. This rating is based on the international student percentage, international staff percentage, and the percentage of research papers from the university that include at least one international author.
Research_Rating | Rating on a 0-100 scale of the quality of research at the university. This rating is based on the university’s reputation, its research income/academic staff ratio, and its production of scholarly papers.
Citations_Rating | Rating on a 0-100 scale based on the normalized average of citations by other papers per paper from the university (how often the research from the university is cited by other papers).
Industry_Income_Rating | Rating on a 0-100 scale grading how much companies are willing to invest in the university’s research. The rating is calculated based on the research income from businesses per academic staff member.
Total_Score | The final score used to determine the university ranking based on Teaching_Rating, Inter_Outlook_Rating, Research_Rating, Citations_Rating, and Industry_Income_Rating.
Num_Students | Total number of students in a given year
Student/Staff_Ratio | Number of students per academic staff member
%_Inter_Students | Percentage of student body who come from a foreign country
%_Female_Students | Percentage of student body that is female.
Year | Academic year that the ranking was released. For example, 2016 denotes the 2015-2016 academic year.
statcrunchhelp | Apr 5, 2016 | 254KB | 4021
Skyscrapers in the U.S.
Data for buildings in the United States that are 100 meters tall or higher. The variables include the rank in terms of height (Rank), the building name (Building), the height in meters (Height), the number of floors (Floors), the year of completion (Year), materials used in construction (Materials), and the use of the building (Use). The last two variables contain multiple outcomes delimited by /. When considering these columns, consider an outcomes table (Stat > Tables > Outcomes) with / as a delimiter.
websterwest | Jan 14, 2015 | 148KB | 2638
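Outside StatCrunch, the idea behind counting outcomes in a /-delimited column can be sketched as follows. This is a rough analogue of an outcomes table, not StatCrunch's own implementation, and the sample Materials values are hypothetical:

```python
from collections import Counter


def outcome_counts(values, delimiter="/"):
    """Count each outcome across cells that hold several delimited outcomes."""
    counts = Counter()
    for value in values:
        for outcome in value.split(delimiter):
            outcome = outcome.strip()
            if outcome:  # skip empty pieces such as a trailing delimiter
                counts[outcome] += 1
    return counts


# Hypothetical Materials cells, one per building.
materials = ["steel/concrete", "concrete", "steel"]
print(outcome_counts(materials))
```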
Flight Delay Data For July 2014
This data set contains information on the flight delays for each airline at each U.S. airport in July of 2014. The columns include the carrier, airport city/state, airport code, airport name, total number of flights (Flights), the number of delayed flights (Delayed), the number of cancelled flights (Cancelled), the number of diverted flights (Diverted), the number of on-time flights (On-time), and the on-time percentage (On-time Percentage).
websterwest | Oct 3, 2014 | 88KB | 1803
NFL Scores from 2013
Data for every NFL game of the 2013 season. Variables include the week of the game (Week), day of the week (Day), calendar date of the game (Date), winning team (Winner), losing team (Loser), whether the winning team was playing at home or away (WinnerAt), points for the winning team (PtsW), points for the losing team (PtsL), total points scored by both teams (TotalPts), yards for the winning team (YdsW), yards for the losing team (YdsL), turnovers for the winning team (TOW), and turnovers for the losing team (TOL). There are also additional columns, TieHome and TieAway, indicating the two teams that played the only tie game of the 2013 season in week 12. The Week column also denotes the nature of each playoff game.
statcrunchhelp | Oct 3, 2014 | 26KB | 1322
Federal Food Assistance Participation
This data primarily comes from the following source: United States Department of Agriculture: Food and Nutrition Service. This dataset also incorporates data from another StatCrunch dataset: US Workforce Participation.

Column | Description
Year | The year for each data value
Average Federal Food Assistance Participation in Thousands | Number of individuals in the US who took part in SNAP (Supplemental Nutrition Assistance Program) during the given year.
% US Population on Federal Food Assistance | % of US population that is currently in the SNAP program and is receiving aid with food.
Change of % (US Population on Federal Food Assistance) | The change in the percentage of the US population that is receiving food assistance from SNAP.
Presidential Control | Political party of president.
Senate Control | Political party of the Senate majority.
House Control | Political party of the House of Representatives majority.
Legislative Branch (House and Senate) | Combined control of Senate and House of Representatives.
Male Inactivity Rate Aged 25-54 | Defined as the proportion of the male population aged 25-54 that is not in the labour force. Common reasons for leaving labour force: college, retirement, stay at home, can't find work and no longer try.
Change of Rate (Male Inactivity Rate Aged 25-54) | The change in the inactivity rate calculated as the current year minus the previous year.
Female Inactivity Rate Aged 25-54 | Defined as the proportion of the female population aged 25-54 that is not in the labour force.
Change of Rate (Female Inactivity Rate Aged 25-54) | The change in the inactivity rate calculated as the current year minus the previous year.
Annual Average Workforce Participation Rate | Defined by the Bureau of Labor Statistics as "the percentage of the population [16 years and older] that is either employed or unemployed (that is, either working or actively seeking work). Note that 2015's Annual Average is calculated using the first 11 months."
Change of Rate (Annual Workforce Participation Rate) | The change in the workforce participation rate calculated as the current year minus the previous year.
statcrunchhelp | Jan 8, 2016 | 10KB | 1887
Top Rated Jobs 2014
This data is gathered from careercast.com and is available in its original form at the source listed above. The dataset was originally created by Keisha Brown from Georgia Perimeter College.

Column | Description
Ranking | Ranking from 0 to 200 based on the combined “Overall Rating”
Job | Title for the job.
Median Annual Income | Based on Bureau of Labor Statistics
Overall Rating | Combined rating based on income, stress, hiring outlook, and work environment. The lower the rating, the better rated the job.
Stress Rating | A rating from 1 to 200 estimating the overall stress level from the job. This essentially is a ranking with 1 being the least stressful job and 200 being the most stressful job.
Hiring Outlook Rating | A rating from 1 to 200 estimating the hiring outlook for the job. This essentially is a ranking with 1 being the best hiring outlook and 200 being the worst hiring outlook.
Work Environment Rating | A rating from 1 to 200 estimating the work environment of the job. This essentially is a ranking with 1 being the best work environment and 200 being the worst work environment.
statcrunchhelp | Mar 14, 2016 | 9KB | 2884