R Dataset / Package psych / income
On this R-data statistics page, you will find information about the income data set which pertains to US family income from US census 2008. The income data set is found in the psych R package. You can load the income data set in R by issuing the following command at the console data("income"). This will load the data into a variable called income. If R says the income data set is not found, you can try installing the package by issuing this command install.packages("psych") and then attempt to reload the data with the library() command. If you need to download R, you can go to the R project website. You can download a CSV (comma separated values) version of the income R data set. The size of this file is about 1,610 bytes.
US family income from US census 2008
Description
US census data on family income from 2008
Usage
data(income)
Format
A data frame with 44 observations on the following 4 variables.
value
-
lower boundary of the income group
count
-
Number of families within that income group
mean
-
Mean of the category
prop
-
proportion of families
Details
The distribution of income is a nice example of a log normal distribution. It is also an interesting example of the power of graphics. It is quite clear when graphing the data that income statistics are bunched to the nearest 5K. That is, there is a clear sawtooth pattern in the data.
The all.income set is interpolates intervening values for 100-150K, 150-200K and 200-250K
Source
US Census: Table HINC-06. Income Distribution to $250,000 or More for Households: 2008
http://www.census.gov/hhes/www/cpstables/032009/hhinc/new06_000.htm
Examples
data(income) with(income[1:40,], plot(mean,prop, main="US family income for 2008",xlab="income", ylab="Proportion of families",xlim=c(0,100000))) with (income[1:40,], points(lowess(mean,prop,f=.3),typ="l")) describe(income) with(all.income, plot(mean,prop, main="US family income for 2008",xlab="income", ylab="Proportion of families",xlim=c(0,250000))) with (all.income[1:50,], points(lowess(mean,prop,f=.25),typ="l")) #curve(100000* dlnorm(x, 10.8, .8), x = c(0,250000),ylab="Proportion")
Dataset imported from https://www.r-project.org.