Title: | Datasets to Accompany Wolfe and Schneider - Intuitive Introductory Statistics |
---|---|
Description: | These datasets and functions accompany Wolfe and Schneider (2017) - Intuitive Introductory Statistics (ISBN: 978-3-319-56070-0) <doi:10.1007/978-3-319-56072-4>. They are used in the examples throughout the text and in the end-of-chapter exercises. The datasets are meant to cover a broad range of topics in order to appeal to the diverse set of interests and backgrounds typically present in an introductory Statistics class. |
Authors: | Grant Schneider [aut, cre] |
Maintainer: | Grant Schneider <[email protected]> |
License: | GPL-2 |
Version: | 1.1 |
Built: | 2025-03-14 05:46:16 UTC |
Source: | https://github.com/cran/IIS |
This package contains the R datasets and functions referenced in the text and exercises of Wolfe and Schneider - Intuitive Introductory Statistics (2017)
Grant Schneider
Maintainer: Grant Schneider <[email protected]>
Wolfe and Schneider - Intuitive Introductory Statistics (2017)
This data set contains twenty-five years (1991 through 2015) of quarterly charge-off rates for agricultural loans as reported by the Federal Reserve. This is the full dataset corresponding to the selected rows displayed in Table 2.5 of Wolfe and Schneider - Intuitive Introductory Statistics.
agricultural_chargeoff_rates_by_quarter
agricultural_chargeoff_rates_by_quarter
A data.frame
with 100 rows and 3 columns indicating the chargeoff rates for agricultural loans for each quarter.
https://www.federalreserve.gov/releases/chargeoff/chgallnsa.htm
data(agricultural_chargeoff_rates_by_quarter) summary(agricultural_chargeoff_rates_by_quarter)
data(agricultural_chargeoff_rates_by_quarter) summary(agricultural_chargeoff_rates_by_quarter)
This data set contains on-time arrival records for U.S. flight carriers in for all four quarters of the year 2015, the month of December 2015, and the entire year 2015. This data is displayed in Table 2.9 of Wolfe and Schneider - Intuitive Introductory Statistics.
airline_arrivals
airline_arrivals
A data.frame
with 13 rows and 13 columns indicating the percentages and ranks for each airline.
U.S. Department of Transportation (2016)
data(airline_arrivals) summary(airline_arrivals)
data(airline_arrivals) summary(airline_arrivals)
This data set contains the 2014 salaries (as of March 26, 2014) for all baseball players in the American League.
american_league_salary_2014
american_league_salary_2014
A data.frame
with 447 rows and 3 columns corresponding to the name, salary, and team for each player.
Petchesky (2014)
data(american_league_salary_2014) summary(american_league_salary_2014)
data(american_league_salary_2014) summary(american_league_salary_2014)
This data set contains the Acceptability Indices (AI) for Arion Subfuscus from woodland and waste sites with the toxic woodland plant Allium Ursinum (wild garlic) as test gel as reported by Whelan (1982). This data is displayed in Table 9.3 of Wolfe and Schneider - Intuitive Introductory Statistics.
arion_subfuscus
arion_subfuscus
A data.frame
with 10 rows and 2 columns indicating the AI for the woodland and waste sites.
Whelan (1982)
data(arion_subfuscus) summary(arion_subfuscus)
data(arion_subfuscus) summary(arion_subfuscus)
This data set contains the averages of two HDL measurements taken 24 hours apart for each of twelve women who participated in Kerr (1983). This data is displayed in Table 7.1 of Wolfe and Schneider - Intuitive Introductory Statistics.
average_HDL_levels
average_HDL_levels
A vector
with 12 observations corresponding to the average HDL measurements for each woman.
Kerr (1983)
data(average_HDL_levels) summary(average_HDL_levels)
data(average_HDL_levels) summary(average_HDL_levels)
This data set contains the height of the initial head formation upon pouring, the percentage adhesion of the head, and the percentage collapse of the head 4 min after pouring for 20 bottles selected from two different production lots of beer as reported by Ault et al. (1967).
beer_head
beer_head
A list
containing 2 data.frame
s (one for each of the two production lots). Each data.frame
has 20 rows and 3 columns corresponding to the attributes for each bottle.
Ault et al. (1967)
data(beer_head) summary(beer_head)
data(beer_head) summary(beer_head)
This data set contains body temperature values artificially generated by Shoemaker (1996), to closely recreate the original data obtained by Mackowiak et al. (1992) for 65 male and 65 female subjects.
body_temperature_and_heart_rate
body_temperature_and_heart_rate
A data.frame
with 130 rows and 3 columns corresponding to the body temperature, gender (1 for male, 2 for female), and heart rate for each subject.
Shoemaker (1996)
data(body_temperature_and_heart_rate) summary(body_temperature_and_heart_rate)
data(body_temperature_and_heart_rate) summary(body_temperature_and_heart_rate)
This data set contains twenty-five years (1991 through 2015) of quarterly charge-off rates for eight different types of loans as reported by the Federal Reserve. This data is displayed in Table 2.4 of Wolfe and Schneider - Intuitive Introductory Statistics.
chargeoff_rates
chargeoff_rates
A data.frame
with 100 rows and 9 columns indicating the chargeoff rates for each of the eight loan types for each quarter.
https://www.federalreserve.gov/releases/chargeoff/chgallnsa.htm
data(chargeoff_rates) summary(chargeoff_rates)
data(chargeoff_rates) summary(chargeoff_rates)
This data set contains a subset of the College Scorecard Data reported by the U.S. Department of Education.
college_rankings_2012
college_rankings_2012
A data.frame
with 7793 rows and 9 columns indicating various scorecard metrics for each school.
United States Department of Education (2016)
data(college_rankings_2012) summary(college_rankings_2012)
data(college_rankings_2012) summary(college_rankings_2012)
This data set contains twenty-five years (1991 through 2015) of quarterly delinquency rates for eight different types of loans as reported by the Federal Reserve.
delinquency_rates
delinquency_rates
A data.frame
with 100 rows and 9 columns indicating the delinquency rates for each of the eight loan types for each quarter.
https://www.federalreserve.gov/releases/chargeoff/delallnsa.htm
data(delinquency_rates) summary(delinquency_rates)
data(delinquency_rates) summary(delinquency_rates)
This data set contains the body weight of 12 subjects with anorexia nervosa before and after treatment with the medication desimipramine as reported by Brambilla et al. (1985). This data is displayed in Table 8.3 of Wolfe and Schneider - Intuitive Introductory Statistics.
desimipramine
desimipramine
A data.frame
with 12 rows and 2 columns indicating the pre-treatment and post-treatment weights (in kg) of each subject.
Brambilla et al. (1985)
data(desimipramine) summary(desimipramine)
data(desimipramine) summary(desimipramine)
This data set contains weight in carats, color purity, grade of clarity, certification body, and value in Singapore dollars for 308 round diamond stones from an advertisement in the February 18, 2000 edition of Singapore's Business Times as discussed in Chu (2001).
diamonds_carats_color_cost
diamonds_carats_color_cost
A data.frame
with 308 rows and 5 columns indicating the attributes of each diamond.
Chu (2001)
data(diamonds_carats_color_cost) summary(diamonds_carats_color_cost)
data(diamonds_carats_color_cost) summary(diamonds_carats_color_cost)
This data set contains annual emissions data (in million metric tons of carbon dioxide equivalents) for four common air pollutants from 1990 to 2014 in the United States. This data is displayed in Table 2.3 of Wolfe and Schneider - Intuitive Introductory Statistics.
emissions
emissions
A data.frame
with 25 rows and 5 columns indicating the amount of the four pollutants for each year.
U.S. Environmental Protection Agency (2016)
data(emissions) summary(emissions)
data(emissions) summary(emissions)
This data set contains the total engineering drawing hours contributing to the cost of pieces of machinery/mechanical devices for a large Ohio-based company that is displayed in Table 1.3 of Wolfe and Schneider - Intuitive Introductory Statistics.
engineering_drawing_hours
engineering_drawing_hours
A vector
with 96 observations corresponding to the hours for each piece of machinery/mechanical device.
Ohio-based company
data(engineering_drawing_hours) summary(engineering_drawing_hours)
data(engineering_drawing_hours) summary(engineering_drawing_hours)
This data set contains the stature of 20 female Amerindians from the Turner site in Cincinnati, Ohio, representing the Ohio Hopewell period (200-400 AD) studied by Sciulli and Carlisle (1975). This data is displayed in Table 7.8 of Wolfe and Schneider - Intuitive Introductory Statistics.
female_amerindians
female_amerindians
A vector
with 20 observations corresponding to the stature (in centimeters) for each female Amerindian.
Sciulli and Carlisle (1975)
data(female_amerindians) summary(female_amerindians)
data(female_amerindians) summary(female_amerindians)
FindTriples
returns the number of left triples, right triples, and triples that are neither left nor right for the given vector.
FindTriples(z)
FindTriples(z)
z |
Numeric vector |
Triples are formed by taking any three data values from a vector and order this triple of data values from smallest to largest. If the middle ordered item is closer to the smallest than to the largest, the triple is said to be a right triple. If the middle ordered item is closer to the largest, the triple is said to be a left triple. If the middle ordered value is exactly halfway between the other two, the triple is neither right nor left. This function will do the necessary computations and return the number of each type of triple.
A list with three components:
right
- Number of right triples (where middle value is closer to smallest)
left
- Number of left triples (where middle value is closer to largest)
neither
- Number of triples that are neither left nor right (where middle value is exactly halfway between the other two)
FindTriples(1:10) FindTriples(rnorm(20))
FindTriples(1:10) FindTriples(rnorm(20))
This data set contains data on the fasting metabolic rate (FMR) of white-tailed deer for different two-month periods of the year. This data is displayed in Table 12.9 of Wolfe and Schneider - Intuitive Introductory Statistics.
fmr_white_tailed_deer
fmr_white_tailed_deer
A list
with 3 vector
s, each containing FMR (kcal/kg/day) for the deer observed in the corresponding time period.
Silver et al. (1969)
data(fmr_white_tailed_deer) summary(fmr_white_tailed_deer)
data(fmr_white_tailed_deer) summary(fmr_white_tailed_deer)
This data set contains counts of gender 7685 individuals for which gender could be identified into the seven role categories. This data was used by Vigorito and Curry (1998) to investigate whether there is any relationship between gender and the roles that individuals play in magazine ad illustrations. This data is displayed in Table 10.12 of Wolfe and Schneider - Intuitive Introductory Statistics.
gender_roles
gender_roles
A data.frame
with 7 rows and 2 columns indicating the counts for each gender and role category.
Vigorito and Curry (1998)
data(gender_roles) summary(gender_roles)
data(gender_roles) summary(gender_roles)
This data set contains the number of times that 36 turtle hatchlings split into two groups 'circled', in an attempt to determine whether light had an effect on the orientation and sea-finding ability of such hatchlings as reported by Mrosovsky and Shettleworth (1974). This data is displayed in Table 9.6 of Wolfe and Schneider - Intuitive Introductory Statistics.
goggled_green_turtles
goggled_green_turtles
A data.frame
with 18 rows and 2 columns indicating the number of circles in a two-minute period for the nasal field goggles and for the harlequin goggles.
Mrosovsky and Shettleworth (1974)
data(goggled_green_turtles) summary(goggled_green_turtles)
data(goggled_green_turtles) summary(goggled_green_turtles)
This data set contains counts of respondents' preferences between two statements about healthcare grouped by political party from the report "Generation to Generation: American Values about Taking Care of Each Other" (1998). This data is displayed in Table 10.20 of Wolfe and Schneider - Intuitive Introductory Statistics.
health_care_by_affiliation
health_care_by_affiliation
A data.frame
with 2 rows and 2 columns indicating the counts for each statement and each party.
Princeton Survey Research Associates of Princeton, New Jersey (1998)
data(health_care_by_affiliation) summary(health_care_by_affiliation)
data(health_care_by_affiliation) summary(health_care_by_affiliation)
This data set contains the information about house prices for a random sample of 100 properties in Wake County, North Carolina, as collected by Woodard and Leone (2008).
homes_prices
homes_prices
A data.frame
with 100 rows and 6 columns indicating the attributes of each house.
Woodard and Leone (2008)
data(homes_prices) summary(homes_prices)
data(homes_prices) summary(homes_prices)
This data set contains the information about house and lot sizes for a random sample of 100 properties in Wake County, North Carolina, as collected by Woodard and Leone (2008).
house_lot_sizes
house_lot_sizes
A data.frame
with 100 rows and 8 columns indicating the attributes of each house.
Woodard and Leone (2008)
data(house_lot_sizes) summary(house_lot_sizes)
data(house_lot_sizes) summary(house_lot_sizes)
This data set contains the ages at which 12 infants were reported to have started walking by their mothers. The infants were randomly split into "exercise" and "no-exercise" groups" as part of the study conducted by Zelazo et al. (1972). This data is displayed in Table 9.8 of Wolfe and Schneider - Intuitive Introductory Statistics.
infant_walking
infant_walking
A data.frame
with 6 rows and 2 columns indicating the age (in months) at which infants first walked for the exercise and no-exercise groups.
Zelazo et al. (1972)
data(infant_walking) summary(infant_walking)
data(infant_walking) summary(infant_walking)
This data set contains the averages (mm) of interstitial lengths (distances between midpoints) of ten pairs of secondary branches for each of twelve blue spruce and twelve white pine trees. This data is displayed in Table 1.24 of Wolfe and Schneider - Intuitive Introductory Statistics.
interstitial_lengths
interstitial_lengths
A data.frame
with 12 rows and 2 columns corresponding to 12 observations each of blue spruce and white pine trees.
Kayle (1984)
data(interstitial_lengths) summary(interstitial_lengths)
data(interstitial_lengths) summary(interstitial_lengths)
This data set contains race statistics for the Kentucky Derby for each year from 1990 to 2012. This data is displayed in Table 2.7 of Wolfe and Schneider - Intuitive Introductory Statistics.
kentucky_derby_2012
kentucky_derby_2012
A data.frame
with 23 rows and 6 columns indicating various statistics for each year.
data(kentucky_derby_2012) summary(kentucky_derby_2012)
data(kentucky_derby_2012) summary(kentucky_derby_2012)
This data set contains the load at failure for 18 cadaveric menisci repaired by one of three techniques: the FasT-Fix Meniscal Repair Suture System (FasT-Fix), the use of biodegradable Meniscus Arrows (MA), and the Vertical Mattress Sutures (VMS) approach. This data is displayed in Table 12.1 of Wolfe and Schneider - Intuitive Introductory Statistics.
meniscal_repairs_load_at_failure
meniscal_repairs_load_at_failure
A list
with 3 vector
s, each containing the load (in Newtons) at failure of the 6 menisci assigned to the corresponding group.
Borden et al. (2003)
data(meniscal_repairs_load_at_failure) summary(meniscal_repairs_load_at_failure)
data(meniscal_repairs_load_at_failure) summary(meniscal_repairs_load_at_failure)
This data set contains the percentages of mothers in the United States who smoked during pregnancies from 2010 to 2014 for various age categories. This data is displayed in Table 2.14 of Wolfe and Schneider - Intuitive Introductory Statistics.
mother_smoking_age
mother_smoking_age
A data.frame
with 45 rows and 4 columns indicating the age group, number of births, and percentage of smoking mothers for each year.
Centers for Disease Control and Prevention (CDC) "WONDER Online Database" (2016)
data(mother_smoking_age) summary(mother_smoking_age)
data(mother_smoking_age) summary(mother_smoking_age)
This data set contains the percentages of mothers in the United States from 2010 to 2014 who smoked during pregnancy for various categories of educational level. A subset of this data is displayed in Table 2.13 of Wolfe and Schneider - Intuitive Introductory Statistics.
mother_smoking_education
mother_smoking_education
A data.frame
with 40 rows and 4 columns indicating the education, births, and percentage of smoking mothers for each year.
Centers for Disease Control and Prevention (CDC) "WONDER Online Database" (2016)
data(mother_smoking_education) summary(mother_smoking_education)
data(mother_smoking_education) summary(mother_smoking_education)
This data set contains the percentages of mothers in the United States from 1989 to 1993 who smoked during pregnancy for various categories of educational level. This data is displayed in Table 2.15 of Wolfe and Schneider - Intuitive Introductory Statistics.
mother_smoking_education_1989_1993
mother_smoking_education_1989_1993
A data.frame
with 25 rows and 3 columns indicating the education and percentage of smoking mothers for each year.
Centers for Disease Control and Prevention, National Center for Health Statistics (1995)
data(mother_smoking_education_1989_1993) summary(mother_smoking_education_1989_1993)
data(mother_smoking_education_1989_1993) summary(mother_smoking_education_1989_1993)
This data set contains the percentages of mothers in the United States in 2010 who smoked during pregnancy for various categories of educational level. This is a subset of the mother_smoking_education dataset.
mother_smoking_education_2010
mother_smoking_education_2010
A data.frame
with 8 rows and 2 columns indicating the percentage of smoking mothers for each educational level in the year 2010.
Centers for Disease Control and Prevention (CDC) "WONDER Online Database" (2016)
data(mother_smoking_education_2010) summary(mother_smoking_education_2010)
data(mother_smoking_education_2010) summary(mother_smoking_education_2010)
This data set contains the total motor-vehicle fatalities per 100 million vehicle miles traveled in the 2012 calendar year that is displayed in Table 1.16 of Wolfe and Schneider - Intuitive Introductory Statistics.
motor_vehicle_death_rate_2012
motor_vehicle_death_rate_2012
A data.frame
with 50 rows and 3 columns corresponding to the name, fatality rate, and region for each state.
National Highway Traffic Safety Administration (2013)
data(motor_vehicle_death_rate_2012) summary(motor_vehicle_death_rate_2012)
data(motor_vehicle_death_rate_2012) summary(motor_vehicle_death_rate_2012)
This data set contains a random sample of 100 movies from the 1996 edition of The Movie and Video Guide prepared annually by Leonard Maltin as selected by Moore (2006).
movie_facts
movie_facts
A data.frame
with 100 rows and 6 columns corresponding to the title, release year, length (minutes), number of cast members, rating, and number of lines of description for each movie.
Moore (2006)
data(movie_facts) summary(movie_facts)
data(movie_facts) summary(movie_facts)
This data set contains the 2014 salaries (as of March 26, 2014) for all baseball players in the National League.
national_league_salary_2014
national_league_salary_2014
A data.frame
with 437 rows and 3 columns corresponding to the name, salary, and team for each player.
Petchesky (2014)
data(national_league_salary_2014) summary(national_league_salary_2014)
data(national_league_salary_2014) summary(national_league_salary_2014)
This data set contains NBA teams' performance in the 2015-2016 season as provided by the NBA's official website.
nba_2015_2016
nba_2015_2016
A data.frame
with 30 rows and 19 columns indicating various performance statistics for each team.
https://www.nba.com/stats/teams/traditional?Season=2015-16
data(nba_2015_2016) summary(nba_2015_2016)
data(nba_2015_2016) summary(nba_2015_2016)
This data set contains 2015 salaries of faculty and staff in the Mathematics Department at The Ohio State University. This data is displayed in Table 3.1 of Wolfe and Schneider - Intuitive Introductory Statistics.
osu_math_salaries_2015
osu_math_salaries_2015
A vector
with 226 observations corresponding to each reported salary.
Cleveland.com (2016)
data(osu_math_salaries_2015) summary(osu_math_salaries_2015)
data(osu_math_salaries_2015) summary(osu_math_salaries_2015)
This data set contains the ages (based on minting dates) of 374 United States pennies collected by an introductory class of 25 students.
pennies_age
pennies_age
A vector
with 374 observations corresponding to the ages of the pennies.
Classroom Experiment
data(pennies_age) summary(pennies_age)
data(pennies_age) summary(pennies_age)
This data set contains the percentages of eggs that eventually hatched for dump (communal) nests and normal nests each year from 1966 through 1974 as reported by Clawson et al. (1979). This data is displayed in Table 8.1 of Wolfe and Schneider - Intuitive Introductory Statistics.
percentage_hatched_eggs
percentage_hatched_eggs
A data.frame
with 9 rows and 2 columns indicating the percentage of hatched eggs for the two nest types for each year.
Clawson et al. (1979)
data(percentage_hatched_eggs) summary(percentage_hatched_eggs)
data(percentage_hatched_eggs) summary(percentage_hatched_eggs)
This data set contains the percentages of people who answered "yes" to various questions about scientific topics in a survey conducted by the Pew Research Center grouped by age group. This data is displayed in Table 2.18 of Wolfe and Schneider - Intuitive Introductory Statistics.
pew_science_survey_data_by_age_group
pew_science_survey_data_by_age_group
A data.frame
with 5 rows and 4 columns indicating the proportion of respondents answering "yes" to each of the four questions indicated in Exercise 2.3.4. of Wolfe and Schneider - Intuitive Introductory Statistics by age group.
Pew Internet & American Life Project, Science Issues (2014)
data(pew_science_survey_data_by_age_group) summary(pew_science_survey_data_by_age_group)
data(pew_science_survey_data_by_age_group) summary(pew_science_survey_data_by_age_group)
This data set contains the percentages of people who answered "yes" to various questions about scientific topics in a survey conducted by the Pew Research Center grouped by political affiliation. This data is displayed in Table 2.17 of Wolfe and Schneider - Intuitive Introductory Statistics.
pew_science_survey_data_by_party
pew_science_survey_data_by_party
A data.frame
with 3 rows and 4 columns indicating the proportion of respondents answering "yes" to each of the four questions indicated in Exercise 2.3.4. of Wolfe and Schneider - Intuitive Introductory Statistics by party.
Pew Internet & American Life Project, Science Issues (2014)
data(pew_science_survey_data_by_party) summary(pew_science_survey_data_by_party)
data(pew_science_survey_data_by_party) summary(pew_science_survey_data_by_party)
This data set contains a subset of the data collected by biology students at Kenyon College to determine which factors cause pines in the Kenyon Center for Environmental Study to vary in growth rates. A description of the dataset is displayed in Table 2.2 and Example 3.4 of Wolfe and Schneider - Intuitive Introductory Statistics.
pines_1997
pines_1997
A data.frame
with 1000 rows and 5 columns corresponding to variables described in Table 2.2 of Intuitive Introductory Statistics for each tree.
Kenyon Center for Environmental Study (1997)
data(pines_1997) summary(pines_1997)
data(pines_1997) summary(pines_1997)
This data set contains the polymorphonuclear leukocytes (PMN) migration in the sera of the 8 patients with acute pancreatitis measured by Perez et al. (1983). This data is displayed in Table 7.4 of Wolfe and Schneider - Intuitive Introductory Statistics.
pmn_migration
pmn_migration
A vector
with 8 observations corresponding to the PMN migration for each patient.
Perez et al. (1983)
data(pmn_migration) summary(pmn_migration)
data(pmn_migration) summary(pmn_migration)
This data set contains population estimates, birth rates (per 1000 population), and net migration (per 1000 population) for each year 2011 through 2015 at the state level as reported by the U.S. Census Bureau.
population_estimates_2015
population_estimates_2015
A data.frame
with 50 rows and 17 columns indicating each metric over the four years for each state.
United States Census Bureau (2016)
data(population_estimates_2015) summary(population_estimates_2015)
data(population_estimates_2015) summary(population_estimates_2015)
This data set contains the results of the final pre-election Gallup Voter Opinion Poll and the eventual election results for every presidential election from 1936 through 2012.
presidential_election_polls
presidential_election_polls
A data.frame
with 46 rows and 5 columns corresponding to the year, name, final polling estimate, actual election results, and deviation between polling and actual results for each candidate.
Gallup (2015)
data(presidential_election_polls) summary(presidential_election_polls)
data(presidential_election_polls) summary(presidential_election_polls)
This data set contains data on the proportion of a state's hospitals that are for-profit for a subset of states from each of four regions of the country. This data is displayed in Table 12.5 of Wolfe and Schneider - Intuitive Introductory Statistics.
proportion_for_profit_hospitals
proportion_for_profit_hospitals
A data.frame
with 20 rows and 3 columns indicating the region and proportion of for-profit hospitals for each state.
American Hospital Association (2012)
data(proportion_for_profit_hospitals) summary(proportion_for_profit_hospitals)
data(proportion_for_profit_hospitals) summary(proportion_for_profit_hospitals)
This data set contains twenty-five years (1991 through 2015) of Q2 and Q4 charge-off rates for agricultural loans as reported by the Federal Reserve. This is a subset of the agricultural_chargeoff_rates_by_quarter dataset.
q2_q4_agricultural_chargeoff_rates
q2_q4_agricultural_chargeoff_rates
A data.frame
with 50 rows and 3 columns indicating the chargeoff rates for agricultural loans for each quarter.
https://www.federalreserve.gov/releases/chargeoff/chgallnsa.htm
data(q2_q4_agricultural_chargeoff_rates) summary(q2_q4_agricultural_chargeoff_rates)
data(q2_q4_agricultural_chargeoff_rates) summary(q2_q4_agricultural_chargeoff_rates)
This data set contains data from the 2011 Pew Research Center report titled "The Rise of E-reading" intended to investigate how reading and e-reading habits vary by demographic and socio-economic categories. A subset of this data is displayed in Table 3.4 of Wolfe and Schneider - Intuitive Introductory Statistics.
reading_habits_2011
reading_habits_2011
A data.frame
with 2986 rows and 7 columns indicating various pieces of demographic information and reading habits for each study participant.
Pew Internet & American Life Project: The rise of e-reading (2016)
data(reading_habits_2011) summary(reading_habits_2011)
data(reading_habits_2011) summary(reading_habits_2011)
This data set contains 2014 school performance data, or "School Quality Snapshots", for high schools in New York City. A subset of this data is displayed in Table 3.2 of Wolfe and Schneider - Intuitive Introductory Statistics.
school_report_cards_2014
school_report_cards_2014
A data.frame
with 484 rows and 8 columns indicating various performance statistics for each high school.
New York City Department of Education (2016)
data(school_report_cards_2014) summary(school_report_cards_2014)
data(school_report_cards_2014) summary(school_report_cards_2014)
This data set contains the mother's mating weight and her lamb offspring's weight at age 7 months for a subset of twenty ewe-lamb pairs from the Research Farm at Ataturk University, Erzurum, Turkey. This data is displayed in Table 11.6 of Wolfe and Schneider - Intuitive Introductory Statistics.
sheep_weight
sheep_weight
A data.frame
with 20 rows and 2 columns indicating the weight (in kg) for the mother and the lamb of each pair.
Ozturk et al. (2005)
data(sheep_weight) summary(sheep_weight)
data(sheep_weight) summary(sheep_weight)
This data set contains a sample of "chronic disease indicators", as reported by the CDC's Division of Population Health, for the four states California, Michigan, Ohio, and West Virginia. This data is displayed in Table 2.16 of Wolfe and Schneider - Intuitive Introductory Statistics.
state_cdi
state_cdi
A data.frame
with 16 rows and 3 columns indicating the indicator and the percentage of the popoulation having such an indicator for each state.
Centers for Disease Control and Prevention (2016)
data(state_cdi) summary(state_cdi)
data(state_cdi) summary(state_cdi)
This data set contains the number of individuals in each state whose annual income was below the established poverty level in 2013 at a state-by-state level. This data is displayed in Table 1.4 of Wolfe and Schneider - Intuitive Introductory Statistics.
state_poverty_levels_2013
state_poverty_levels_2013
A data.frame
with 50 rows and 3 columns corresponding to the name, total population, and number of people at or below the poverty line in each state.
United States Census Bureau: Population Division (2014) & American Community Surveys (2014)
data(state_poverty_levels_2013) summary(state_poverty_levels_2013)
data(state_poverty_levels_2013) summary(state_poverty_levels_2013)
This data set contains the historical unit values from January 2, 2015 to March 15, 2016 for seven TIAA and CREF variable annuities. This is the full dataset corresponding to the selected rows displayed in Table 2.6 of Wolfe and Schneider - Intuitive Introductory Statistics.
tiaa_cref
tiaa_cref
A data.frame
with 302 rows and 8 columns indicating the unit values for each of the seven annuities for each day.
data(tiaa_cref) summary(tiaa_cref)
data(tiaa_cref) summary(tiaa_cref)
This data set contains traffic accident data from 1949 and 1985 that is displayed in Table 1.2 of Wolfe and Schneider - Intuitive Introductory Statistics.
traffic_accidents
traffic_accidents
A data.frame
with 9 rows and 2 columns. The rows correspond to various types of accidents and the two columns correspond to the years 1945 and 1985, respectively.
National Safety Council (1996)
data(traffic_accidents) summary(traffic_accidents)
data(traffic_accidents) summary(traffic_accidents)
This data set contains the median weekly earnings of male and female employees in service occupations, sales and office occupations, and construction and extraction occupations for each quarter from 2005 to 2015. This data is displayed in Table 2.1 of Wolfe and Schneider - Intuitive Introductory Statistics.
weekly_salaries
weekly_salaries
A data.frame
with 44 rows and 6 columns corresponding to male and female earnings for each of the three occupation types for each quarter.
U.S. Department of Labor, Bureau of Labor Statistics (2016)
data(weekly_salaries) summary(weekly_salaries)
data(weekly_salaries) summary(weekly_salaries)
This data set contains the weight of 2000 coins from 8 separate packages of 250 Euros each as weighed and reported by Shkedy et al. (2006).
weight_of_Euros
weight_of_Euros
A data.frame
with 2000 rows and 3 columns corresponding to the ID, weight, and batch/package number for each coin.
Shkedy et al. (2006)
data(weight_of_Euros) summary(weight_of_Euros)
data(weight_of_Euros) summary(weight_of_Euros)