R Dataset / Package Zelig / homerun
This R-data statistics page describes the homerun data set, which contains sample data on home runs hit by Mark McGwire and Sammy Sosa in 1998. The homerun data set is found in the Zelig R package. Once the package is attached with library(Zelig), you can load the data set by issuing the command data("homerun") at the console. This loads the data into a variable called homerun. If R says the homerun data set is not found, try installing the package with install.packages("Zelig"), then attach it with library(Zelig) and run data("homerun") again. If you need to download R, you can go to the R project website. A CSV (comma-separated values) version of the homerun data set is also available for download; the file is about 7,582 bytes.
Sample Data on Home Runs Hit By Mark McGwire and Sammy Sosa in 1998.
Description
Game-by-game information for the 1998 season for Mark McGwire and Sammy Sosa. Data are a subset of the dataset provided in Simonoff (1998).
Usage
data(homerun)
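The call above only finds the data set when Zelig is attached. As a minimal sketch, assuming Zelig may or may not be installed on your machine, the data can also be loaded without attaching the package. The helper name load_homerun is hypothetical and is not part of Zelig.

```r
# Hypothetical helper (not part of Zelig): returns the homerun data frame,
# or NULL when the Zelig package is not installed.
load_homerun <- function() {
  if (!requireNamespace("Zelig", quietly = TRUE)) {
    message("Zelig is not installed; try install.packages(\"Zelig\")")
    return(NULL)
  }
  # Load into a private environment so the global workspace is untouched.
  env <- new.env()
  utils::data("homerun", package = "Zelig", envir = env)
  env$homerun
}

homerun <- load_homerun()
if (!is.null(homerun)) {
  str(homerun)  # inspect column names and types
}
```

Loading into a separate environment is a design choice of this sketch; a plain data("homerun", package = "Zelig") at the console would create the homerun variable directly in the workspace.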
Format
A data frame containing 5 variables ("gameno", "month", "homeruns", "playerstatus", "player") and 326 observations.
gameno - an integer variable denoting the game number
month - a factor variable with levels "March" through "September" denoting the month of the game
homeruns - an integer vector denoting the number of home runs hit in that game by that player
playerstatus - an integer vector equal to "0" if the player played in the game, and "1" if they did not
player - an integer vector equal to "0" (McGwire) or "1" (Sosa)
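To illustrate how the integer codes can be used, here is a small sketch that totals home runs per player. It runs on a made-up stand-in data frame called toy with the same five columns. The rows are hypothetical and are not taken from the real homerun data.

```r
# Hypothetical stand-in rows with the same five columns as homerun.
# The values are invented for illustration only.
toy <- data.frame(
  gameno       = c(1L, 2L, 3L, 1L, 2L, 3L),
  month        = factor(
    c("March", "April", "April", "March", "April", "April"),
    levels = c("March", "April", "May", "June", "July", "August", "September")
  ),
  homeruns     = c(1L, 0L, 2L, 0L, 0L, 1L),
  playerstatus = c(0L, 0L, 0L, 0L, 1L, 0L),  # 0 = played, 1 = did not play
  player       = c(0L, 0L, 0L, 1L, 1L, 1L)   # 0 = McGwire, 1 = Sosa
)

# Turn the 0/1 player code into readable labels.
toy$player_name <- factor(toy$player, levels = c(0, 1),
                          labels = c("McGwire", "Sosa"))

# Keep only games the player actually played, then sum home runs per player.
played <- toy[toy$playerstatus == 0, ]
totals <- tapply(played$homeruns, played$player_name, sum)
print(totals)
```

The same steps should apply to the real data by replacing toy with homerun, assuming the columns match the format described above.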
Source
https://ww2.amstat.org/publications/jse/v6n3/datasets.simonoff.html
References
Simonoff, Jeffrey S. 1998. “Move Over, Roger Maris: Breaking Baseball's Most Famous Record.” Journal of Statistics Education 6(3). Data used are a subset of the data in the article.
Dataset imported from https://www.r-project.org.