This page describes the stagec data set, which pertains to Stage C prostate cancer. The stagec data set is part of the rpart R package. To load it, first attach the package with library(rpart), then run data("stagec") at the console. This loads the data into a variable called stagec. If R reports that the rpart package cannot be found, install it with install.packages("rpart"), attach it with library(rpart), and load the data again. If you need to download R, it is available from the R project website. A CSV (comma-separated values) version of the stagec data set is also available for download; the file is about 4,748 bytes.
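As a minimal sketch of the loading steps described above, assuming the rpart package is already installed:

```r
# Attach rpart so that its bundled data sets are visible to data()
library(rpart)

# Load the data set into a variable named `stagec`
data("stagec")

# Quick sanity checks against the shape described on this page
dim(stagec)    # number of rows and columns
names(stagec)  # variable names
```

If library(rpart) fails, run install.packages("rpart") first and retry.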
Stage C Prostate Cancer
Description
A set of 146 patients with stage C prostate cancer, from a study exploring the prognostic value of flow cytometry.
Usage
data(stagec)
Format
A data frame with 146 observations on the following 8 variables.
pgtime - Time to progression or last follow-up (years)
pgstat - 1 = progression observed, 0 = censored
age - age in years
eet - early endocrine therapy, 1 = no, 2 = yes
g2 - percent of cells in G2 phase, as found by flow cytometry
grade - grade of the tumor, Farrow system
gleason - grade of the tumor, Gleason system
ploidy - the ploidy status of the tumor, from flow cytometry. Values are diploid, tetraploid, and aneuploid
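The coded columns can be easier to read once converted to labeled factors. The sketch below uses the codings listed above. The names stagec_labeled, progressed, and eet_label are illustrative and not part of the data set.

```r
library(rpart)
data("stagec")

# Work on a copy so the original columns stay untouched
stagec_labeled <- stagec

# pgstat: 1 = progression observed, 0 = censored
stagec_labeled$progressed <- factor(stagec$pgstat, levels = c(0, 1),
                                    labels = c("censored", "progressed"))

# eet: 1 = no early endocrine therapy, 2 = yes
stagec_labeled$eet_label <- factor(stagec$eet, levels = c(1, 2),
                                   labels = c("no", "yes"))

# Cross-tabulate the two labeled columns, keeping missing values visible
table(stagec_labeled$progressed, stagec_labeled$eet_label, useNA = "ifany")
```

Keeping the original numeric columns alongside the labeled ones means pgstat can still be passed directly to functions that expect a 0/1 event indicator.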
Details
A tumor is called diploid (normal complement of dividing cells) if the fraction of cells in G2 phase was determined to be 13% or less. Aneuploid cells have a measurable fraction with a chromosome count that is neither 24 nor 48; for these, the G2 percent is difficult or impossible to measure.
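One way to see how the G2 measurements relate to ploidy is to summarize g2 within each ploidy group, counting missing values separately. This is only a sketch: it makes no assumption about what the counts will be, and the name g2_by_ploidy is illustrative.

```r
library(rpart)
data("stagec")

# For each ploidy group: number of tumors, number with missing g2,
# and the range of the observed g2 values
g2_by_ploidy <- do.call(rbind, lapply(split(stagec$g2, stagec$ploidy), function(g2) {
  observed <- g2[!is.na(g2)]
  data.frame(
    n         = length(g2),
    n_missing = sum(is.na(g2)),
    g2_min    = if (length(observed)) min(observed) else NA_real_,
    g2_max    = if (length(observed)) max(observed) else NA_real_
  )
}))

g2_by_ploidy
```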
Examples
library(rpart)
require(survival)
rpart(Surv(pgtime, pgstat) ~ ., stagec)
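To inspect the result of the example, the fitted tree can be stored in a variable and printed. The sketch below assumes the survival package is installed. The names fit and n_leaves are illustrative.

```r
library(rpart)
library(survival)  # provides Surv()
data("stagec")

# Fit a survival tree on all remaining variables, as in the example above
fit <- rpart(Surv(pgtime, pgstat) ~ ., data = stagec)

# Text summary of the splits
print(fit)

# Terminal nodes are marked "<leaf>" in the fitted object's frame
n_leaves <- sum(fit$frame$var == "<leaf>")
n_leaves
```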
Dataset imported from https://www.r-project.org.