|
|
||||||||
Full Scientific Reports |
Correspondence: 1Corresponding Author: Franco Guscetti, Institut für Veterinärpathologie, Vetsuisse-Fakultät, Universität Zürich, Winterthurerstrasse 268, CH-8057 Zürich, Switzerland, e-mail: gufo{at}vetpath.uzh.ch
| Abstract |
|---|
|
|
|---|
0.6 = 0.79;
1.2 = 0.91). The present study provides a basis for the use of TMAs in future high-throughput immunohistochemical investigations of selected markers in canine lymphomas. The statistical model presented can be used to determine an optimal TMA design depending on a desired accuracy.
Key Words: Dogs immunohistochemistry lymphoma tissue microarray (TMA) validation
| Introduction |
|---|
|
|
|---|
Although TMAs have been widely used for a number of years in human medicine, a lack of consensus still exists on how many core biopsies should be collected to determine a reliable value of percentage positivity.5 Suggested numbers of core biopsies range from 21,7,13 to 33,4,8,9 to 4.5,6 Positivity of a given marker can be assessed using a binary (e.g., positive or negative) or ordinary scoring system (division into categories, e.g., 0–25%, 25–50%, 50–75%, 50–100%) or can be expressed on a continuous scale (percentage values between 0% and 100%). Most studies rely on binary1,4,7–9,13 or ordinary6,17 scoring systems that presumably require a smaller number of core biopsies but are also less accurate than if a continuous scale3–5,7,16 is applied. Furthermore, little attention has been paid to the optimal size of core biopsies. Most TMA studies use a standard diameter of 0.6 mm, and few data are available on whether the use of larger cores would result in better agreement between whole sections and TMAs. A detailed quantitative assessment on the accuracy of TMAs depending on different core sizes, as well as core numbers, has to the authors' knowledge not been done in the veterinary medicine field, where the use of TMAs is still limited.
The first objective of the present study was to determine the degree of accuracy by which various combinations of core sizes and numbers of TMAs represent whole sections in immunohistochemical analyses of canine lymphoma. Furthermore, the suitability of TMAs to accurately determine the immunophenotype was assessed for 2 specific combinations.
| Material and Methods |
|---|
|
|
|---|
Artificial Tma Core Biopsies
First, whole sections of archival biopsies from 40 canine lymphomas were immunolabeled for Ki-67 and active caspase-3 as described below. Tumor regions were manually delineated on each whole section, and 12 artificial 0.6-mm and 2 artificial 1.2-mm core biopsies were randomly collected. This was achieved by generating digital imagesa of the maximum number of adjacent, nonoverlapping fields of vision (FOVs) fitting in a 0.6-mm or 1.2-mm core biopsy (1 and 4 images, respectively), as depicted in Figure 1. The images were acquired at 20x magnification, and each FOV measured 83,200 µm2. The area evaluated using artificial core biopsies was smaller than that of a corresponding physical core biopsy; however, the ratio of the analyzed area between artificial 0.6 mm and 1.2 mm core biopsies was identical to the ratio of the area of physical core biopsies (i.e., 1:4).
|
Immunohistochemistry
Prior to cutting, the TMA blocks were immersed in water and kept at –20°C for 15 min while the original tissue blocks were cooled without the use of water. Sections of 2.5-µm (whole sections) or 1.5-µm (TMAs) thickness were cut, transferred to positively charged glass slides,d dried overnight at 37°C, dewaxed, and rehydrated by routine methods. The following primary antibodies and dilutions were used: polyclonal anti-human pan T-cell (CD3),e 1:250; monoclonal anti-human pan B-cell (CD79
),e 1:300; monoclonal anti-human Ki-67,e 1:100; polyclonal anti-human/mouse active caspase-3,f 1:150. The last antibody detects the cleaved form of caspase-3. Whole sections were labeled with all 4 antibodies, TMA sections with CD3, and CD79
antibodies. Antigen retrieval was carried out by boiling the slides in Tris/EDTA buffere (CD3, CD79a) or citrate buffere (Ki-67, active caspase-3) in a conventional microwave oven at 750W for 20 min. The sections were allowed to cool for 10 min and were then incubated with the primary antibody for 1 hr at room temperature. Endogenous peroxidase was inactivated after antigen retrieval by immersing the slides in peroxidase blocking solutione for 10 min at room temperature. For detection of the primary antibody, the Detection Kite was applied according to the manufacturer's instructions. Finally, the reaction was visualized by an AEC chromogen.e The slides were counterstained with hemalum. A canine lymph node showing reactive hyperplasia was used as positive controls for CD3, CD79
, and Ki-67. Normal canine-haired skin was used as a positive and negative control for Ki-67. Formalin-fixed and paraffin wax–embedded ultraviolet-irradiated and nonirradiated canine keratinocytes were used as positive and negative controls for active caspase-3 as previously described.10 Additional negative controls involved omission of the primary antibody.
Cell Counting
The total number of cells for each FOV was assessed for Ki-67 and active caspase-3 immunostains. This was done counting positive and negative cells separately. The results were expressed as the percentage of positive cells within each FOV. Semiautomated cell counting was performed using morphometric software.g Each image was modified to enhance nuclear (Ki-67) or cellular (caspase-3) outlines, to allow automatic counting. Color thresholds for hue, saturation, and intensity were set to select positive cells, and the original color image was converted into a black and white image (binarized). Then edges were enhanced twice and particles were separated using the corresponding functions of the software. The same routine was performed to count negative cells after adjusting color thresholds. For detection of active caspase-3–positive cells, adjacent apoptotic bodies likely derived from a single cell were additionally merged by dilation (8 times) followed by a fusion of particles. The detection threshold was set at 100 pixels to avoid counting of too small particles. Each resulting image was compared visually to the original image and checked for plausibility before commencing automatic counting of particles.
Immunophenotypic Classification
In whole-tissue sections, tumors were classified as T- or B-cell lymphoma if at least 50% of the neoplastic cells showed cytoplasmic labeling for either CD3 or CD79
, respectively. Tumors were classified as double negative (DN) if neoplastic cells reacted with neither of the antibodies. In TMAs, a cutoff of 10% was chosen to avoid misclassification of tumors due to the presence of non-neoplastic T or B cells, which in some cases are difficult to differentiate from neoplastic cells in core biopsies. This cutoff was determined empirically in preliminary trials on tissues included in the present study. Tumors were classified as T- or B-cell lymphoma if more than 10% of neoplastic cells were labeled for either CD3 or CD79
in at least half of the core biopsies of one case (i.e., 1 and 2 core biopsies for 1.2-mm and 0.6-mm core biopsies, respectively). Tumors were classified as DN if less than 10% of the neoplastic cells were labeled. Tumors were defined as nonclassifiable if the minimum number of core biopsies could not be evaluated because cores were missing or originated from non-neoplastic or necrotic tissue or if labeled cells approximated 10% with both antibodies. Immunostains of the whole sections and both TMAs were evaluated blindly.
Statistical Analysis
Accuracy of Various Combinations of Core Sizes and Numbers
Each of the 40 tumors was represented by 2 artificial 1.2-mm core biopsies and 12 artificial 0.6-mm core biopsies. In total, 20 FOVs were analyzed per tumor. For both types of core biopsies, the number of cells positive for Ki-67 and active caspase-3 and the total number of cells counted was recorded. The data from the different core biopsy sizes were analyzed separately, and it was assumed that the number of positive cells posij in tumor i and core biopsy j follows the binomial distribution: posij
bin(nij,
i). Here, nij is the total number of investigated cells of tumor i and core biopsy j and
i is the probability that a cell in tumor i is positive. A generalized linear model14 with the binomial distribution and a logit-link
was fitted to the data of each core biopsy size. The linear predictor
ij is given by
ij = ti + bij, where ti is the fixed effect of tumor i and bij is a random error for each core biopsy j in tumor i, which allows for overdispersion. The random effects bij were assumed to be independent and normally distributed with mean 0 and variance
. The main interest of the study lies in the variance estimate of core biopsies within a tumor,
, which is a key to the accuracy of the estimate for positivity. Because the variance of the random effect bij was estimated on a linear scale, data were simulated according to the linear predictor and each single observation was transformed into a percentage scale. The variance per tumor was determined on this scale. The data were simulated as follows: 100 virtual tumors were generated with different levels of positivity ti and a random effect for each core biopsy that was normally distributed with 0 mean and variance
. Each tumor was represented by 106 virtual samples. Based on the variance estimates on the percentage scale, mean percentage positivity of core biopsies within a tumor was calculated for different numbers of core biopsies in 1 tumor. This procedure was done separately for the 0.6-mm and 1.2-mm cores for Ki-67 and active caspase-3. A reading example provided in the footnote of Table 1 and comments on Figure 1 in the results section illustrate how to interpret the results.
|
).2 The data analysis was generated using SAS/STAT software, Version 9 of the SAS System.h | Results |
|---|
|
|
|---|
|
|
|
= 0.91, 95% CI: 0.79–1.00) than with the 0.6-mm core biopsies (
= 0.79, 95% CI: 0.62–0.96). All T-cell lymphomas and DN lymphomas were classified correctly with both core sizes. A number of classification errors, however, affected B-cell lymphomas. Three B-cell tumors were incorrectly classified as T-cell lymphomas, 1 with both core sizes, 1 with 0.6 mm, and 1 with 1.2 mm. In all cases, these misclassifications were due to a high content of non-neoplastic T lymphocytes. Other 3 B-cell lymphomas were erroneously classified as DN with 0.6-mm cores. In the whole sections of these cases, the overall staining intensity of neoplastic cells was weak, while in the small core biopsies, no staining was observed. An additional B-cell lymphoma could not be classified with the 0.6-mm core biopsies because the cores contained a similar number of T and B cells (
10%). The same tumor was incorrectly classified as a T-cell lymphoma with the 1.2-mm cores because, in this sample, T cells predominated over B cells. | Discussion |
|---|
|
|
|---|
The model indicates that a given number of 1.2-mm cores provide a clearly higher degree of accuracy (as indicated by a smaller 95% CI) than an equal number of 0.6-mm cores. This accuracy is a result of the larger total area available from the larger cores, which increases the number of FOVs for analysis. In contrast, when an equal number of FOVs is analyzed, 0.6-mm cores are superior to 1.2-mm cores. This is because the FOVs of the 0.6-mm cores are spread over a larger area of the specimen than those of 1.2-mm cores. As a general rule based on the results of this study, the number of 1.2-mm cores needed to attain a given degree of accuracy will always be lower than that of 0.6-mm cores. However, the total number of FOVs analyzed using 1.2-mm cores will be higher. Therefore, selection of 1.2-mm cores requires more effort for evaluation (i.e., counting a larger number of cells), whereas the use of 0.6-mm cores necessitates more work to assemble the array. However, cell counting is likely to be the more labor-intensive step, especially if multiple markers are evaluated. Based on these considerations, it can be concluded that in most situations in which scoring is performed on a continuous scale, it is more convenient to use 0.6-mm than 1.2-mm core biopsies.
As expected, the model predicts that an increase in core number is associated with an increase in accuracy, which is expressed as a decrease in the 95% CI width of the mean percentage positivity of core biopsies within a tumor. It is important to stress, however, that the amplitude of this decrease also progressively declines, as illustrated by an increased proximity of the corresponding curves in Figure 3. A similar observation has been reported in a previous TMA study that focused on human colorectal cancers.5 At some point, the increased accuracy obtained by evaluating an additional core does not justify the extra effort, and other factors (e.g., interobserver variations, precision of cell counting) are likely to be a greater source of imprecision. Additional factors that influence the degree of accuracy include whether positive cells are scored in a binary or ordinary fashion or counted on a continuous scale and whether a marker is distributed homogeneously within a tumor.
In cancer research, the prognostic or predictive significance of a particular protein, or its potential role in pathogenesis, is often determined by immunohistochemical staining of tumors. Often it is not until immunohistochemical results are correlated to other parameters (e.g., patient data, tumor grade) that the degree of required accuracy can be defined. If the data are too imprecise, correlations might be overlooked. In this study, an unexpected and striking similarity was seen between the predictions computed for the 2 antigens examined, particularly considering their unequal range and distribution of positive percentage values. This similarity might simply be a coincidence, but it suggests that the degree of accuracy is only partially dependent on the marker analyzed.
Based on the model, it appears to the authors that collecting at least 4 0.6-mm or 2 1.2-mm core biopsies would adequately represent the whole sections without extending time and effort beyond acceptable limits. Compared with previous studies, this number of suggested cores is rather high. However, most studies scored positivity in a binary1,4,7–9,13 or ordinary6,17 way, which is less precise than using a continuous scale and is likely to require fewer cores. Furthermore, most validation studies only focus on a predetermined number of cores,3,4,8,9,12,13,16,17 and comparison between studies is difficult due to variations in several parameters, such as antigen examined, scoring method or statistical analysis used, interinstitutional variability among observers, and immunohistochemical staining methods.17
Two combinations of core sizes and numbers were used to assemble physical arrays from the case material, which were tested in a practical application (i.e., to immunophenotype the tumors). In general, comparison of core biopsies with the whole sections revealed a high degree of agreement, but this practical approach also unveiled some limitations of the TMA technique. When evaluating whole sections, the tissue architecture can be taken into account to select representative areas, whereas this advantage is reduced or lacking when core biopsies are analyzed. This observation is compatible with the fact that 1.2-mm cores showed a better agreement with the whole sections than 0.6-mm cores in the present study.
Tumors incorrectly classified with the TMAs either displayed a weak overall staining intensity or presented with core biopsies containing a high number of non-neoplastic lymphocytes. Additional possible sources of discrepancies include loss of individual core biopsies during preparation, heterogeneous staining of specimens and variations of staining intensity in different runs, and the antibody applied. For instance, all T-cell tumors identified on the TMAs using an additional monoclonal CD3 antibody were correctly classified, but this antibody yielded a higher number of DN or nonclassifiable tumors in the core biopsies than the polyclonal antibody (data not shown). This was attributed to the lower reactivity of the monoclonal versus polyclonal antibody. To avoid misclassification of nondistinctive cases, the authors suggest that labeling of the whole section be performed if tumors are classified as DN or are not classifiable based on core biopsies. Alternatively, if the immunophenotype must be determined with the highest possible accuracy, the whole sections of all tumors should be labeled. At this point, TMA data quality could further be improved by additional immunolabeling of the TMAs. This would facilitate recognition and exclusion of nonrepresentative cores (i.e., cores with divergent immunolabeling), which might otherwise be challenging or even impossible in some subtypes of lymphoma.
In conclusion, the model presented in this study provides a general insight into the degree of accuracy by which tissue arrays may represent whole sections. The statistical approach developed might be applied to other antigens and tumor types. In a practical application, TMAs appeared to be suited for immunophenotyping of canine lymphomas, despite the occurrence of known technical limitations. The study provides a basis for the use of TMAs in future high-throughput immunohistochemical investigations of selected markers in canine lymphomas and should encourage the use of this technique in veterinary medicine.
| Acknowledgments |
|---|
| Sources and manufacturers |
|---|
|
|
|---|
a. JVC KY-F70, Yokohama, Japan. ![]()
b. Beecher instruments, Sun Prairie, WI. ![]()
c. Polysciences Inc., Warrington, PA. ![]()
d. Erie Scientific Company, Portsmouth, NH. ![]()
e. Dako Cytomation, Zug, Switzerland. ![]()
f. R&D Systems Europe Ltd., Abingdon, United Kingdom. ![]()
g. Soft Imaging System, Muenster, Germany. ![]()
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |