Background
The NCCN Clinical Practice Guidelines in Oncology (NCCN Guidelines) for Prostate Cancer (PCa) support active surveillance (AS) in patients at very low risk, low risk, and even favorable intermediate risk for PCa.1 These definitions originate from studies that relied almost exclusively on White patients. For example, Epstein et al2 and Bastian et al3 validated the low-risk criteria in 157 and 237 patients, respectively, which included Black patients but did not stratify by race. Consequently, large-scale, population-based validation studies of these criteria are needed in racial groups other than White patients.
Our study addressed this void and tested for AS misclassification rates in Black patients who fulfilled at least 1 of the AS criteria but underwent radical prostatectomy (RP) with available pathologic staging. Rates were then compared with those for White patients. We hypothesized that equally low misclassification applies to both races.
Methods
Study Population
Data were obtained from the SEER database for the period 2010 through 2015.4 We focused on patients with histologically confirmed nonmetastatic adenocarcinoma of the prostate (ICD-O C61.9) who had available prostate-specific antigen (PSA) values and were treated using RP. Patients were categorized according to the AS criteria into 1 of 3 groups: very low, low, or favorable intermediate risk.1 The very-low-risk criteria required PSA level <10 ng/mL, Gleason grade group (GGG) 1, cT stage 1, and <3 positive prostate biopsy cores with <50% cancer per core. Low-risk PCa criteria required PSA level <10 ng/mL, GGG 1, and cT1–2a. Favorable intermediate-risk PCa criteria required GGG ≤2, <50% positive biopsy cores, and 1 intermediate risk factor, including cT2b/c or GGG 3 or PSA level of 10–20 ng/mL.1 Patients with an unknown GGG (n=2,893), an unknown cT stage (n=11,529), or an unknown number of positive and negative biopsy cores (n=49,969) were excluded. These selection criteria yielded 28,923 patients.
Variable Definition
Race was defined as Black or White. Covariables consisted of age at diagnosis, year of diagnosis, preoperative PSA level, total number of biopsy cores, number of positive biopsy cores, cT stage (cT1c, cT2a, cT2b, cT2c, cT2), and percentage of positive biopsy cores (core ratio), which was calculated by dividing the number of positive cores by the total number of cores (first quartile, <0.17; second quartile, 0.17 to <0.33; third quartile, 0.33 to <0.50; fourth quartile, ≥0.50).
Statistical Analyses
Descriptive statistics focused on frequencies and proportions for categorical variables. Means, medians, and interquartile ranges were reported for continuously coded variables. Chi-square testing was used for the statistical significance in proportion differences. The t test and Kruskal-Wallis test examined the statistical significance of differences in means and medians. The rates of upgrading, defined as an increase of at least 1 GGG from the prostate biopsy GGG to the GGG at final pathology at RP; the rates of upstaging, defined as a non–organ-confined tumor (stage pT3+/pN1) at RP; and the rates of upgrading and/or upstaging, defined as a combination of both, were tabulated for Black and White patients. Specifically, 3 particular endpoints were examined: rates of upgrading, rates of upstaging, and rates of combined upgrading and/or upstaging. The 3 endpoints were stratified according to Black versus White race. Additional stratifications of each endpoint were performed; in particular, rates of combined upgrading and/or upstaging were specified for upgrading to GGG 3, 4, or 5 and/or upstaging to pT3+,pN1. Subsequently, multivariate logistic regression (MLR) models tested the effect of Black race on upgrading, upstaging, and upgrading and/or upstaging (see supplemental eTables 1–3, available with this article at JNCCN.org). Finally, temporal trends for upgrading but not upstaging were examined due to very low observation counts. Statistical analyses were performed using R version 3.4.1 (R Foundation for Statistical Computing). All tests were 2-sided with a level of significance set at P<.05.
Results
Descriptive Characteristics
Of all 210,100 patients treated with RP, 4,704 (2.2%), 17,785 (8.4%), and 11,040 (5.2%) fulfilled the very-low-risk, low-risk, and favorable intermediate-risk AS criteria, respectively (Table 1). Of those, 14.9% (n=701; P<.001), 15.1% (n=2,696; P<.001), and 15.3% (n=1,693; P<.001), respectively, were Black. Overall, Black patients were younger than White patients (P<.001). No clinically meaningful or statistically significant differences in tumor characteristics were recorded in Black patients compared with White patients (Table 1).
Patient Characteristics
In patients with very low risk (Figure 1), Black patients had higher rates of upgrading (40.1% vs 36.3%; absolute Δ = +3.8%; P<.001), upstaging (10.1% vs 6.9%; absolute Δ = +3.2%; P=.01), and upgrading and/or upstaging (42.1% vs 37.7%; absolute Δ = +4.4%; P<.001). Similarly, as shown in Table 2, the rates of upgrading to GGG 3, 4, or 5 and/or upstaging were higher in Black versus White patients (8.9% vs 7.3%; P<.01). Upgrading rates into each GGG were as follows: GGG 2, 33.1% versus 30.3%; GGG 3, 5.4% versus 4.6%; GGG 4, 1.3% versus 0.9%; and GGG 5, 0.3% versus 0.6% in Black versus White patients, respectively (all P>.05). In the subgroup upgraded to GGG 2, Black patients exhibited a higher proportion of upgrading with concomitant upstaging than White patients (18.2% vs 13.4%; P<.01). Similarly, a higher proportion of upgrading with upstaging was also noted in those Black patients upgraded to GGG 3, 4, or 5 (30.6% vs 25.9%; P<.01). Finally, in upstaged patients, no statistically significant differences were recorded between Black and White patients (7.9% vs 4.6%; P=.2). In MLR models, Black race independently predicted higher rates of upgrading (odds ratio [OR], 1.23; 95% CI, 1.03–1.45; P=.01), upstaging (OR, 1.60; 95% CI, 1.19–2.13; P=.001), and upgrading and/or upstaging (OR, 1.26; 95% CI, 1.06–1.49; P=.007) (supplemental eTable 1). Upgrading rates increased significantly over time in Black patients (39.5% in 2010 vs 47.2% in 2015; estimated annual percentage change [EAPC], 4.95%; 95% CI, –0.3% to 10.6%; P=.01) and in White patients (31.4% in 2010 vs 42.5% in 2015; EAPC, 5.04%; 95% CI, 3.1%–7.0%; P<.006) (Figure 2).
Rates of Upgrading, Upstaging, and Upgrading and/or Upstaging
In patients with low risk (Figure 1), Black patients had higher rates of upgrading (46.0% vs 43.5%; absolute Δ = +2.5%; P<.001) and upgrading and/or upstaging (48.7% vs 46.0%; absolute Δ = +2.7%; P<.001), but not rates of upstaging alone (8.8% vs 9.7%; absolute Δ = –0.9%; P<.001). Moreover, as shown in Table 2, no statistically significant differences in upgrading to GGG 3, 4, or 5 and/or upstaging rates were recorded between Black and White patients (9.2% vs 8.2%; P<.01). However, upgrading rates into each GGG were as follows: GGG 2, 39.5% versus 37.8%; GGG 3, 4.9% versus 4.4%; GGG 4, 1.3% versus 0.8%; and GGG 5, 0.3% versus 0.5% in Black versus White patients, respectively (all P>.05). In the subgroup upgraded to GGG 2 and the subgroups upgraded to GGG 3, 4, or 5, no clinically meaningful or statistically significant differences (P=.3) were recorded between Black and White patients regarding the proportions of upgrading with concomitant upstaging. Finally, in upstaged patients, no statistically significant differences were recorded between Black and White patients (6.0% vs 7.2%; P=.4). In MLR models, Black race independently predicted higher rates of upgrading (OR, 1.12; 95% CI, 1.00–1.25; P=.03) and upgrading and/or upstaging (OR, 1.15; 95% CI, 1.03–1.28; P<.01) but not of upstaging (OR, 0.85; P=.09) (supplemental eTable 2). Upgrading rates increased significantly over time in Black patients (44.3% in 2010 vs 53.0% in 2015; EAPC, 5.2%; 95% CI, 2.1%–8.3%; P=.02) and in White patients (41.0% in 2010 vs 48.4% in 2015; EAPC, 3.8%; 95% CI, 3.1%–4.5%; P<.001) (Figure 2).
In patients with favorable intermediate risk, Black patients did not exhibit statistically significantly higher rates than White patients of either upgrading (24.0% vs 24.7%; absolute Δ = –0.7%; P=.2), upstaging (15.8% vs 17.4%; absolute Δ = –1.6%; P=.06), or upgrading and/or upstaging (33.8% vs 35.3%; absolute Δ = –1.5%; P=.5) (Figure 1). No statistically significant differences in upgrading to GGG 3, 4, or 5 and/or upstaging rates were recorded between Black and White patients (23.1% vs 25.2%; P=.2). Rates of upgrading into each GGG were not statistically significantly higher in Black than in White patients (P=.05); subgroup analyses according to upgrading into GGG 2 or GGG 3, 4, or 5 were not performed in patients with favorable intermediate risk, because biopsy at GGG 2 was allowed. Finally, in upstaged patients, no clinically meaningful or statistically significant differences (P=.2) were recorded between Black and White patients (Table 2). In MLR models, Black race failed to independently predict higher rates of upgrading, upstaging, and upgrading and/or upstaging (all P=.05) (supplemental eTable 3). Upgrading rates did not change over time in either Black (28.7% in 2010 vs 25.3% in 2015; EAPC, –4.8%; P=.2) or White patients (23.2% in 2010 vs 25.9% in 2015; EAPC, 0.7%; P=.7) (Figure 2).
Discussion
Large-scale population-based studies that have relied on modern definitions of AS and have focused on rates of upgrading and/or upstaging in Black patients are not available. This study addressed this void. Specifically, the objective of this study was to examine upgrading, upstaging, and combined upgrading and/or upstaging rates in Black and White patients who fulfilled very-low-risk, low-risk, or favorable intermediate-risk criteria as defined in the NCCN Guidelines,1 and in whom RP pathology–derived GGG and pT and pN stages were also available. We hypothesized that upgrading, upstaging, and upgrading and/or upstaging rates do not differ between Black and White patients. Our analyses yielded several noteworthy findings.
First, we found marginal or at best very small differences in the examined endpoints between Black and White patients. The largest-magnitude differences were identified in patients with very low risk; Black patients were invariably affected by higher rates of upgrading, upstaging, or upgrading and/or upstaging, and the differences only ranged from 3.2% to 4.4%. In MLR, these unadjusted rates were translated into ORs of 1.2, 1.6, and 1.3 for upgrading, upstaging, or upgrading and/or upstaging, respectively (all P<.05). However, considering the unadjusted observed rates, it is questionable whether such differences are clinically meaningful, other than to the design and planning of large-scale prospective trials that rely on the correct grade and stage assignment for randomization or stratification purposes. In patients with low risk, the differences between Black and White patients regarding upgrading, upstaging, or upgrading and/or upstaging rates were of even lesser magnitude (–0.9% to 2.7%). In those patients, Black race reached independent predictor status only for higher upgrading rates (OR, 1.12; P=.03) and higher upgrading and/or upstaging rates (OR, 1.15; P=.01) but not for higher upstaging rates (P>.05). Finally, in patients with favorable intermediate risk, marginally higher rates of upgrading, upstaging, or upgrading and/or upstaging were recorded in White and not in Black patients (–0.7% to –1.6%), and Black race failed to reach independent predictor status in each endpoint.
Taken together, these observations indicate that Black race does not invariably predispose to higher upgrading or upstaging, except for a very modestly increased risk in patients with very low risk but not in those with low or favorable intermediate risk. Consequently, Black patients may be given the same consideration as White patients when AS is an option based on the NCCN Guidelines1 very-low, low-, or favorable intermediate-risk criteria. The earlier observations validate the original hypothesis about the absence of meaningful differences between Black and White patients with respect to upgrading and/or upstaging. Moreover, these observations agree with previous historical studies in which Black race was not associated with higher misclassification rates.5,6 Finally, a recent population-based study showed that Black patients had similar 10-year PCa-specific mortality (PCSM) rates after RP relative to White patients.7
Second, temporal trend analyses that focused on upgrading rates over time (2010–2015) in the 3 AS categories showed no meaningful differences in either Black or White patients. In patients with very low risk, upgrading rates increased in Black patients from 39.5% to 47.2% (P=.01) and in White patients from 31.4% to 42.5% (P<.006). Similarly, in patients with low risk, an increase over time was recorded for Black patients (44.3%–53.0%; P=.02) and in White patients (41.0%–48.4%; P<.001). Conversely, in patients with favorable intermediate risk, no differences over time were recorded between Black and White patients (P>.05 for both). Taken together, the magnitude of increase in upgrading was very similar between Black (8%–9%) and White patients (7%–11%). However, the absolute starting point for the upgrading rates was invariably higher in Black compared with White patients. Consequently, the increase in upgrading rates also invariably culminated in higher absolute rates in Black than in White patients. Nonetheless, these differences may be not clinically meaningful in individual patients. However, they may be of importance in the context of clinical trial planning and design.
Third, unexpectedly elevated rates of upgrading were seen in both Black and White patients within all 3 AS categories. Specifically, in patients with very low risk and low risk, upgrading rates ranged from 36.4% to 46.0% compared with 24.0% to 24.7% in patients with favorable intermediate risk. Consequently, in patients with very low or low risk, upgrading should be expected in more than 1 in 3 individuals. Conversely, in patients with favorable intermediate risk, upgrading should be expected in as many as 1 in 4 individuals. Such misclassification rates in GGG seem elevated, especially in patients with very low and low risk. Although most upgrading in these individuals occurred from GGG 1 to GGG 2 (30.3%–39.5%), between 5.7% and 7.0% of these patients were upgraded to GGG 3, 4, or 5. The natural history of GGG 3, 4, or 5 is clearly different from that of GGG 1. Indeed, Pompe et al8 reported significantly worse 8-year PCSM-free survival rates in patients with RP GGG 3, 4, and 5 (85.8%–97.4%) than in those with RP GGG 1 or 2 (99.1%–99.5%; P<.001). However, the ProtecT trial focused on 10-year PCSM-free survival and did not detect important differences between the AS and RP arms (98.8% vs 99.0%; P=.5).9 Moreover, a recently published study of a prospective AS cohort (n=1,818) also reported a 15-year PCSM-free survival of 99.9%.10 Therefore, most upstaged and/or upgraded patients will respond to treatment without excess mortality. Nonetheless, upgrading and/or upstaging due to misclassification errors may result in more treatment-intense follow-up and in higher total health expenditures.
Fourth, unlike for upgrading rates that ranged from 24.0% to 46.0%, upstaging rates in patients with very low, low, and favorable intermediate risk were lower and ranged from 6.9% to 17.4%. However, upstaged patients exhibited pT3 or higher stages and/or pN1 stage. Therefore, the consequences of upstaging are worse than those of upgrading, even when upgrading results in GGG 3, 4, or 5. It is of interest that most upstaged patients will also be upgraded (37.0%–80.2%). Therefore, it can be argued that the seemingly low absolute rates of upstaging have dire consequences on the treated natural history of patients with initially very low, low, or favorable intermediate risk for PCa. Indeed, Gandaglia et al11 reported a 5-year biochemical recurrence–free survival of 79.1% in patients upgraded to GGG 3, 4, or 5 or those upstaged to pT3+/pN1 compared with 97.0% in patients upgraded to GGG 2 (P<.001).
Our work distinguishes itself from previous reports because it relies on more contemporary and detailed AS definitions. Specifically, Katz et al5 relied on the SEER database from 2004 to 2013 and reported no differences in upgrading rates (7.0% vs 7.0%) between Black (n=1,009) and White patients (n=6,346). However, in their analyses, PSA and cT stage were not considered. Consequently, they included patients with a PSA level and cT stage outside of the allowed NCCN Guidelines AS scope. Moreover, they focused on patients upgrading to GGG 3, 4, or 5 but not those upgrading to GGG 2. Based on these limitations, it is difficult if not impossible to interpret their results in the context of the NCCN AS recommendations.1 Moreover, it is impossible to compare the Katz et al5 findings with institutional databases that strictly adhere to the NCCN AS recommendations. Indeed, important differences distinguish the Katz et al5 findings from those of other institutional reports that focused on upgrading and/or upstaging. For example, Faisal et al12 reported on their institutional experience (1992–2013) with patients with very low risk and low risk, including 1,634 Black and 15,993 White patients. In their report, upgrading rates for Black and White patients with very low risk were 29.3% versus 15.4%, respectively, and 30.8% versus 24.9%, respectively, for those with low risk.12
Taken together, we recorded unexpectedly high rates of misclassification between biopsy and RP tumor grade and stage in AS candidates within this large contemporary cohort of Black and White patients. Consequently, AS definitions may require a recalibration to reduce misclassification rates. Moreover, Black race does not predispose to higher upgrading or upstaging rates, except in a very modest fashion in patients with PCa with very low risk. Finally, our data seem to indicate that Black patients may be given the same consideration as White patients when AS is an option. However, further validations should ideally follow.
Our work has limitations and should be interpreted in the context of its retrospective and population-based design. First, it does not allow adjustment or specific analyses that focus on earlier cancer control endpoints, such as biochemical recurrence rates. Second, no information regarding PSA density and the percentage of cancer per core was available in the SEER database. Moreover, we identified 2.1% (n=101) of cT1a and 1.8% (n=86) of cT1b tumors within patients with very low risk. We analyzed the data after deletion of these cases and arrived at exactly the same results. Therefore, cT1 stage can be considered as T1c for practical purposes. Third, we could not account for selection biases related to primary treatment assignment to RP. This surgical cohort may be not reflective of all patients with newly diagnosed PCa according to age and comorbidity profile. Fourth, we could not account for the use of additional staging tools, such as MRI. It is possible that the rates of prostate MRI have increased over time and approximate MRI use in prospective trials. However, it is unlikely that the follow-up of included patients in the SEER database approximates any formal and structured AS protocol.1 Similar considerations apply to the use of follow-up biopsies that are not recorded in the SEER database. However, these limitations as well as those related to the retrospective nature of the SEER database apply to all other population-based analyses that have been derived from the SEER or other similar large-scale data repositories.4
Conclusions
Rates of misclassification were particularly elevated in patients with very low risk and low risk regardless of Black or white race and ranged from 33.8% to 48.6%. Recalibration of very-low-risk, low-risk, and, to a lesser extent, favorable intermediate-risk AS criteria may be required. Finally, our data seem to indicate that Black patients may be given the same consideration as White patients when AS is an option.
References
- 2.↑
Epstein JI, Walsh PC, Carmichael M, et al.. Pathologic and clinical findings to predict tumor extent of nonpalpable (stage T1c) prostate cancer. JAMA 1994;271:368–374.
- 3.↑
Bastian PJ, Mangold LA, Epstein JI, et al.. Characteristics of insignificant clinical T1c prostate tumors. Cancer 2004;101:2001–2005.
- 4.↑
National Cancer Institute. Overview of the SEER program. Accessed April 30, 2020. Available at: https://seer.cancer.gov/about/overview.html
- 5.↑
Katz JE, Chinea FM, Patel VN, et al.. Disparities in Hispanic/Latino and non-Hispanic black men with low-risk prostate cancer and eligible for active surveillance: a population-based study. Prostate Cancer Prostatic Dis 2018;21:533–538.
- 6.↑
Leapman MS, Freedland SJ, Aronson WJ, et al.. Pathological and biochemical outcomes among African-American and Caucasian men with low risk prostate cancer in the SEARCH Database: implications for active surveillance candidacy. J Urol 2016;196:1408–1414.
- 7.↑
Dess RT, Hartman HE, Mahal BA, et al.. Association of black race with prostate cancer–specific and other-cause mortality. JAMA Oncol 2019;5:975–983.
- 8.↑
Pompe RS, Davis‐Bondarenko H, Zaffuto E, et al.. Population-based validation of the 2014 ISUP Gleason grade groups in patients treated with radical prostatectomy, brachytherapy, external beam radiation, or no local treatment. Prostate 2017;77:686–693.
- 9.↑
Hamdy FC, Donovan JL, Lane JA, et al.. 10-year outcomes after monitoring, surgery, or radiotherapy for localized prostate cancer. N Engl J Med 2016;375:1415–1424.
- 10.↑
Tosoian JJ, Mamawala M, Epstein JI, et al.. Intermediate and longer-term outcomes from a prospective active-surveillance program for favorable-risk prostate cancer. J Clin Oncol 2015;33:3379–3385.
- 11.↑
Gandaglia G, Ploussard G, Isbarn H, et al.. What is the optimal definition of misclassification in patients with very low-risk prostate cancer eligible for active surveillance? Results from a multi-institutional series. Urol Oncol 2015;33:164.e1–9.
- 12.↑
Faisal FA, Sundi D, Cooper JL, et al.. Racial disparities in oncologic outcomes after radical prostatectomy: long-term follow-up. Urology 2014;84:1434–1441.