R Dataset / Package ISLR / Hitters
On this R-data statistics page, you will find information about the Hitters data set which pertains to Baseball Data. The Hitters data set is found in the ISLR R package. You can load the Hitters data set in R by issuing the following command at the console data("Hitters"). This will load the data into a variable called Hitters. If R says the Hitters data set is not found, you can try installing the package by issuing this command install.packages("ISLR") and then attempt to reload the data with the library() command. If you need to download R, you can go to the R project website. You can download a CSV (comma separated values) version of the Hitters R data set. The size of this file is about 27,040 bytes.
Baseball Data
Description
Major League Baseball Data from the 1986 and 1987 seasons.
Usage
Hitters
Format
A data frame with 322 observations of major league players on the following 20 variables.
AtBat
-
Number of times at bat in 1986
Hits
-
Number of hits in 1986
HmRun
-
Number of home runs in 1986
Runs
-
Number of runs in 1986
RBI
-
Number of runs batted in in 1986
Walks
-
Number of walks in 1986
Years
-
Number of years in the major leagues
CAtBat
-
Number of times at bat during his career
CHits
-
Number of hits during his career
CHmRun
-
Number of home runs during his career
CRuns
-
Number of runs during his career
CRBI
-
Number of runs batted in during his career
CWalks
-
Number of walks during his career
League
-
A factor with levels
A
andN
indicating player's league at the end of 1986 Division
-
A factor with levels
E
andW
indicating player's division at the end of 1986 PutOuts
-
Number of put outs in 1986
Assists
-
Number of assists in 1986
Errors
-
Number of errors in 1986
Salary
-
1987 annual salary on opening day in thousands of dollars
NewLeague
-
A factor with levels
A
andN
indicating player's league at the beginning of 1987
Source
This dataset was taken from the StatLib library which is maintained at Carnegie Mellon University. This is part of the data that was used in the 1988 ASA Graphics Section Poster Session. The salary data were originally from Sports Illustrated, April 20, 1987. The 1986 and career statistics were obtained from The 1987 Baseball Encyclopedia Update published by Collier Books, Macmillan Publishing Company, New York.
References
Games, G., Witten, D., Hastie, T., and Tibshirani, R. (2013) An Introduction to Statistical Learning with applications in R, www.StatLearning.com, Springer-Verlag, New York
Examples
summary(Hitters) lm(Salary~AtBat+Hits,data=Hitters)
Dataset imported from https://www.r-project.org.