| |
Section I:
Expanded Methods
RNA extraction, labeling, hybridization, and data
analysis
Mononuclear cell suspensions from diagnostic BM aspirates or peripheral
blood (PB) samples were prepared from each patient and an aliquot cryopreserved.
RNA was extracted using the Trizol reagent (Gibco BRL Life Technologies,
Gaithersburg, MD) following the manufacture´s recommended protocol.
RNA integrity was assessed by electrophoresis on the Agilent 2100 Bioanalyzer
(Agilent, Palo Alto, CA).
First and second strand cDNA were synthesized from 5-15 µg of total
RNA using the SuperScript Double-Stranded cDNA Synthesis Kit (Gibco Life
Technologies) and the oligo-dT24-T7 (5´-GGC CAG TGA ATT
GTA ATA CGA CTC ACT ATA GGG AGG CGG-3´) primer according to the
manufacturer´s instructions. cRNA was synthesized and labeled with
biotinylated UTP and CTP by in vitro transcription using the T7 promoter
coupled double stranded cDNA as template and the T7 RNA Transcript Labeling
Kit (ENZO Diagnostics Inc., Farmingdale NY). Briefly, double stranded
cDNA synthesized from the previous steps was washed twice with 70% ethanol
and resuspended in 22 µl RNase-free H2O. The cDNA was
incubated with 4 µl of 10X each reaction buffer, 1µl of biotin
labeled ribonucleotides, 2 µl of DTT, 1µl of RNase inhibitor
mix and 2 µl 20X T7 RNA Polymerase for 5 hr at 37°C. The labeled
cRNA was separated from unincorporated ribonucleotides by passing through
a CHROMA SPIN-100 column (Clontech, Palo Alto, CA) and precipitated at
-20°C for 1 hr to overnight.
The cRNA pellet was resuspended in 10 ml Rnase-free H2O. and
10.0 mg was fragmented by heat and ion-mediated hydrolysis at 95°C
for 35 minutes in 200 mM Tris-acetate, pH 8.1, 500 mM KOAc, 150 mM MgOAc.
The fragmented cRNA was hybridized for 16 hr at 45°C to HG_U95Av2
oligonucleotide arrays (Affymetrix, Santa Clara, CA) containing 12,600
probe sets from full length annotated genes together with additional
probe sets designed to represent EST sequences. Arrays were washed at
25°C with 6 X SSPE (0.9M NaCl, 60 mM NaH2PO4,
6 mM EDTA + 0.01% Tween 20) followed by a stringent wash at 50°C
with 100 mM MES, 0.1M NaCl2, 0.01% Tween 20. The arrays were
then stained with phycoerythrin conjugated streptavidin (Molecular Probes,
Eugene, OR). Arrays were scanned using a laser confoCal scanner (Agilent,
Palo Alto, CA) and the expression value for each gene was calculated
using Affymetrix Microarray software (MAS 4.0). The signal intensity
for each gene was calculated as the average intensity difference (AID),
represented by [Σ(PM - MM)/(number of probe pairs)], where PM and
MM denote perfect-match and mismatch probes, respectively. Expression
values were normalized across the sample set by scaling the average of
the fluorescent intensities of all genes on an array to a constant target
intensity of 2500, then any AID over 45,000 was capped to a value of
45,000. All AID´s less than 100, including negative values and absent
calls were converted to a value of 1. In addition, a variation filter
was used that eliminated any probe set in which less than 1% of the samples
had a present call, or if the Max AID - Min AID across the sample set
was less than 100. The average intensity differences for each of the
remaining genes were analyzed. For some metrics the data was log transformed
prior to analysis. The minimum quality control values required for inclusion
of a sample´s hybridization data in the study were 10% or greater
present calls, a GAPDH/Actin 3´ /5´ ´ ratio <5, and
use of a scaling factor that was within 3 standard deviations from the
mean of the scaling values of all chips analyzed. The average percent
present calls for our overall dataset was 29.7%, and for each of the
genetic subgroups was BCR-ABL (31.1%), E2A-PBX1 (28.9%), Hyper >50
(31%), MLL (29.8%), T-ALL (29.1%), TEL-AML1 (28.5%), Novel (30.2%), others
(31.1%). In addition, we required each sample to have >75% blasts.
The average percentage blasts for the overall dataset used to define
the genetic subtypes was 93%, and for each genetic subtype was BCR-ABL
(92%), E2A-PBX1 (96%), Hyper >50 (93%), MLL (93%), T-ALL (91%), TEL-AML1
(92%), Novel (95%), and others (94%).
The data was analyzed using the MAS 4.0 software. Recently, a new version
of the Affymetrix software became available (Affymetrix MAS 5.0). Affymetrix
reports an overall 94% correlation between data analyzed using these
two software programs. We have performed a limited analysis to directly
compare the results of our data analyzed with these two different software
programs. We took 74 randomly selected cases and reanalyzed the primary
data using the MAS 5.0 software. Using the default parameters set by
Affymetrix, the average percent present call was lower for the MAS 5.0
analyzed data as compare to the values obtained using MAS 4.0; however,
with exceedingly rare exception, all of the 271 Chi-squared genes selected
as discriminators of the seven genetic subtypes (see below) remained
within the new dataset. Moreover, when we used these 271 in a 2-D hierarchical
clustering analysis (as described below), the data looked identical to
the results obtained using the MAS 4.0 dataset (data not shown).
Reproducibility of microarray data
The reproducibility of the Affymetrix microarray system was assessed
by comparing the gene expression profiles of the following samples:
(i) RNA extracted from duplicate cryopreserved diagnostic leukemic
samples from 23 patients; (ii) Single RNA samples from 13 patients
analyzed on two separate arrays. Results from these analyses are presented
in Table 2. As shown, the mean number of probe sets that displayed
a >2-fold difference in expression between separately extracted
but paired RNA samples was 144, and for single RNA samples analyzed
on two separate occasions was 133. Moreover, very few probe sets were
found to have a >3-fold difference in expression levels
between replicate samples. The observed number of probe sets showing
a difference in expression values represents <2% of the total number
of probe sets on the microarray, and thus these data suggest that the
Affymetrix microarray system has a very high degree of reproducibility.
Table 2. Comparison data from replicate bone marrow
sample
|
|
#>2
fold
|
2.1-3.0
|
3.1-4.0
|
>4
|
Chip |
Extraction |
|
|
|
42
|
38
|
4
|
0
|
T-ALL-#3
|
Different
|
|
45
|
43
|
2
|
0
|
BCR-ABL-#7
|
Different
|
|
59
|
53
|
1
|
5
|
E2A-PBX-#7
|
Different
|
|
61
|
55
|
2
|
4
|
T-ALL C25
|
Different
|
|
68
|
61
|
6
|
1
|
Hyperdip47-50-C17
|
Different
|
|
69
|
65
|
3
|
1
|
MLL-C6
|
Different
|
|
84
|
81
|
1
|
2
|
Hyperdip47-50-C18
|
Different
|
|
103
|
99
|
4
|
0
|
Pseudodip-C10
|
Different
|
|
107
|
93
|
11
|
3
|
Pseudodip-C12
|
Different
|
|
109
|
106
|
3
|
0
|
Normal-R1
|
Different
|
|
116
|
106
|
7
|
3
|
TEL-AML1-C33
|
Different
|
|
117
|
109
|
4
|
4
|
TEL-AML1-2M#2
|
Different
|
|
126
|
117
|
8
|
1
|
T-ALL-C3 replicate #1
|
Different
|
|
133
|
122
|
7
|
4
|
Normal-#3
|
Different
|
|
153
|
146
|
5
|
2
|
Hypodip-#4 replicate #1
|
Different
|
|
159
|
117
|
26
|
16
|
Hyperdip47-50-C13
|
Different
|
|
163
|
121
|
15
|
27
|
TEL-AML1-#10
|
Different
|
|
166
|
154
|
12
|
0
|
T-ALL-C3 replicate #2
|
Different
|
|
183
|
168
|
12
|
3
|
Pseudodip-#6
|
Different
|
|
187
|
155
|
23
|
9
|
T-ALL-RR1
|
Different
|
|
239
|
200
|
19
|
20
|
Hypodip-#4 replicate #2
|
Different
|
|
256
|
204
|
29
|
23
|
Hypodip-#4 replicate #3
|
Different
|
|
568
|
295
|
137
|
136
|
Hyperdip>50-R2M1
|
Different
|
|
21
|
19
|
2
|
0
|
T-ALL-#7
|
Same
|
|
24
|
22
|
2
|
0
|
T-ALL-R2
|
Same
|
|
34
|
33
|
1
|
0
|
T-ALL-C3 replicate #3
|
Same
|
|
52
|
52
|
0
|
0
|
TEL-AML1-C29
|
Same
|
|
64
|
58
|
2
|
4
|
Hyperdip>50-C34
|
Same
|
|
64
|
56
|
5
|
3
|
Hyperdip>50-R2M2
|
Same
|
|
76
|
74
|
1
|
1
|
T-ALL-R2M1
|
Same
|
|
132
|
129
|
1
|
2
|
Pseudodip-#7
|
Same
|
|
158
|
147
|
9
|
2
|
Hyperdip>50-C1
|
Same
|
|
161
|
139
|
19
|
3
|
TEL-AML1-C20
|
Same
|
|
197
|
189
|
6
|
2
|
T-ALL-#8
|
Same
|
|
210
|
169
|
25
|
16
|
Normal-C30-N
|
Same
|
|
539
|
139
|
123
|
277
|
Pseudodip-C4
|
Same
|
| |
|
|
Mean
|
Median
|
Stdev
|
|
| Different Extraction |
|
144.0
|
117.0
|
109.0
|
|
| Same Extraction |
|
133.2
|
76.0
|
138.4
|
|
Comparison of microarray data between
PB and BM leukemia samples
Matched BM and PB samples that contained >80% leukemic blasts
were obtained from 10 patients and the RNA extracted and assessed by
microarray analysis. As shown in Table 3, a very high level of correlation
was observed for the expression profiles of BM and PB, with only 189
probe sets having a >2-fold difference in expression. More importantly,
no genes were found to be consistently over- or under-expressed in one
sample type. These data suggest that there are minimal differences in
the gene expression profiles of leukemic blasts obtained from BM or PB,
and raise the possibility that diagnostic gene expression profiling might
be possible on samples obtained from the PB.
Table 3. Comparison of peripheral blood
and bone marrow samples
|
|
#>2
fold
|
2.1-3.0
|
3.1-4.0
|
>4
|
Peripheral Blood
|
Bone Marrow
|
|
|
|
98
|
90
|
6
|
2
|
E2A-PBX1-C7 PB
|
E2A-PBX1-C7
|
|
|
108
|
93
|
9
|
6
|
Pseudodip-NHR1 PB
|
|
|
|
113
|
105
|
4
|
4
|
BCR-ABL-#1 PB
|
BCR-ABL-#1
|
|
|
139
|
117
|
18
|
4
|
Hyperdip>50-C8 PB
|
Hyperdip>50-C8
|
|
|
141
|
135
|
6
|
0
|
E2A-PBX1-C3 PB
|
E2A-PBX1-C3
|
|
|
159
|
139
|
16
|
4
|
T-ALL-C9 PB
|
T-ALL-C9
|
|
|
244
|
223
|
14
|
7
|
Hyperdip>50-C6 PB
|
Hyperdip>50-C6
|
|
|
245
|
194
|
33
|
18
|
MLL-C3 PB
|
MLL-C3
|
|
|
272
|
246
|
20
|
6
|
MLL-RR3 PB
|
MLL-RR3
|
|
|
377
|
300
|
50
|
27
|
E2A-PBX1-C2 PB
|
E2A-PBX1-C2
|
|
| |
|
|
Mean
|
Median
|
Stdev
|
|
| Differences between PB
vs BM |
189.6
|
150.0
|
91.1
|
|
| |
|
133.2
|
76.0
|
138.4
|
|
Real-time RT-PCR results
Real-time RT-PCR assays (Taqman; Perkin-Elmer/Applied Biosystems, Foster
City, CA) were developed to independently determine the level of mRNA
for five genes that were found by microarray analysis to be predictive
of either T-lineage ALL (CD3δ, CD3D antigen delta polypeptide
TiT3 complex; MAL, mal T-Cell differentiation protein; and PRKCQ,
protein kinase C theta) or E2A-PBX1 expressing ALL (MERTK, c-MER proto-oncogene
tyrosine kinase and KIAA802). RNA samples analyzed included four samples
each of E2A-PBX1 and T-ALL, and two samples each from the remaining
subtypes (BCR-ABL, MLL, TEL-AML1, Hyperdiploid >50,
Hyperdiploid 47-50, Hypodiploid, Pseudodiploid, and Normal). Whenever
possible, the forward and reverse primers were designed in different
exons so that DNA contamination would not be a concern. In the case
of MAL where this was not clear, the RNA was treated for 15
minutes at room temperature with 1.0 unit of DNase I (Invitrogen, Carlsbad,
California) using the Invitrogen protocol to remove any contaminating
DNA.
Thirty-three ng of RNA from each sample was reverse transcribed using
random hexamers and Multiscribe Reverse Transcriptase (Perkin-Elmer/Applied
Biosystems) in a total volume of 10 µl. Real Time PCR was performed
on a PE Applied Biosystems 7700 prism using oligonucleotide primers and
probes sequences designed using Primer Express. The following primers
and probes were used: (1) MERTK, 5´ -GGC GTG CTA ACT GTT
CCA GG-3´ (forward primer), 5´ -CCT TTG TCA TTG TGG GCC TC-3´(reverse
primer), and 5´-CAA CTG AAG ACC GCC ATC TCC GTC AG-3´(probe);
(2) KIAA802, 5´- CCA AGC TGA AGG AGT CGG AC-3´(forward), 5´-AGA
TTT CCT TTG TGA TTT TCT TCT TCC-3´(reverse), and 5´-TGC TCG
GCC AGT GAG AAT CTC TAC CTG-3´(probe); (3) PRKCQ, 5´-AAG
CTG CCA CAA GTT CGA CC-3´(forward), 5´-ATG AAG GAA CTG CAG
ACC AAG AA-3´ (reverse), and 5´-CCA GAG CGA CGT TTT ATG CTG
CTG AAA TC-3´(probe); (4) MAL, 5´-CCA GTG GCT TCT CGG
TCT TC-3´ (forward), 5´-CAG GAG CCA CTC ACA AAC TCA A-3´(reverse),
and 5´-CCA CCT TGC CCG ACT TGC TCT TCA-3´(probe); and (5) CD3δ ,
5´-TCA ACA GAG CTT GTG TGT CGG-3´(forward), 5´-CAT TGC
CAC TCT GCT CCT TG-3(reverse), and 5´-TGT CCA GCA AAG CAG AAG ACT
CCC AAA-3´(probe). All probes were labeled at the 5´ end with
FAM (6-carboxy-fluroescein) and at the 3´ end with TAMRA (6-carboxy-tetramethyl-rhodamine).
The PCR reactions were performed in a total volume of 50 µl containing
10 µl of the RT product, 300 nm each of the forward and reverse
primers, 100 nM of probe, 1X master mix (PE Applied Biosystems) and 1 µl
of Taq Gold. Following a 10 minute incubation at 95°C to activate
the Taq Gold, samples were denatured at 95°C for 15 seconds, then
annealed and extended at 60°C for 1 minute, for a total of 40 cycles.
The RNA from each sample was also amplified using primers and probes
to RNase P (PE Applied Biosystems) for use in normalization. Appropriate
negative controls were included in each run. Standard curves were generated
for T-cell markers and RNase P using MOLT4 RNA, a T-cell leukemia cell
line, and for the E2A-PBX1 markers and RNase P using a leukemia
cell line, 697, that contains an E2A-PBX1 fusion. The expression level
of the predictive genes and RNase P were determined in each of the 24
ALL samples (Figures 7-11). A ratio was then calculated by taking the
expression value for the specific gene and dividing it by the expression
level of RNase P in the sample. These ratios were then compared to the
values obtained from the Affymetrix data from the same RNA sample. The
raw Affymetrix data were scaled as described and then normalized using
the 3´GAPDH value for each sample, yielding a normalized ratio.
The Taqman and Affymetrix ratios were then log transformed and compared.
Since the markers selected for Taqman analysis were predictors for either E2A-PBX1 or
T-ALLs, each gene was expected to have four RNA samples with high and
20 samples with low expression. For each gene evaluated, an average expression
value for both the Taqman and Affymetrix data was calculated for all
samples in the up-regulated group, and similarly, for the samples in
the down-regulated group. These values were then compared against each
other as shown in Figure 12.
Analysis of CD3δ
 |
| Figure 7. The standard
curves of critical threshold values (Ct) versus log
of the concentration of RNA in picograms (pg). |
Table 4. Results
of Taqman assay for CD3δ and RNase P on twenty-four
selected ALL diagnostic bone marrow samples representing
the various leukemia subgroups.
|
| |
Ct CD3δ @
|
Ct
RNP +
|
Input
Log CD3δ ++
|
Input
CD3δ%
|
Input
Log RNP*
|
Input
RNP**
|
Input CD3δ /RNP &
|
|
|
|
E2A-PBX1-#1
|
28.26
|
17.05
|
2.394
|
247.825
|
4.454
|
28452.773
|
0.0087
|
|
|
E2A-PBX1-#3
|
30
|
16.69
|
1.868
|
73.772
|
4.585
|
38490.487
|
0.0019
|
|
|
E2A-PBX1-#7
|
28.94
|
17.36
|
2.188
|
154.342
|
4.341
|
21934.21
|
0.0070
|
|
|
E2A-PBX1-C10
|
27.53
|
16.27
|
2.615
|
412.030
|
4.738
|
54758.77
|
0.0075
|
|
|
T-ALL-R3
|
23.47
|
19.43
|
3.843
|
6963.863
|
3.587
|
3859.725
|
1.8042
|
|
|
T-ALL-C20
|
21.76
|
17.14
|
4.360
|
22910.208
|
4.421
|
26382.588
|
0.8684
|
|
|
T-ALL-R1
|
24.79
|
20.06
|
3.444
|
2777.311
|
3.357
|
2274.605
|
1.2210
|
|
|
T-ALL-#3
|
22.39
|
18.24
|
4.170
|
14773.707
|
4.020
|
10479.498
|
1.4098
|
|
|
BCR-ABL-R3
|
29.68
|
20.38
|
1.965
|
92.188
|
3.240
|
1738.834
|
0.0530
|
|
|
BCR-ABL-#7
|
25.8
|
17.51
|
3.138
|
1374.537
|
4.286
|
19339.415
|
0.0711
|
|
|
Hyperdip>50-C36
|
27.65
|
17.44
|
2.579
|
378.997
|
4.312
|
20509.735
|
0.0185
|
|
|
Hyperdip>50-#5
|
30.04
|
17.36
|
1.856
|
71.7455
|
4.341
|
21934.21
|
0.0033
|
|
|
Hyperdip47-50-C17
|
30.18
|
20.08
|
1.813
|
65.081
|
3.350
|
2236.74
|
0.0291
|
|
|
Hyperdip47-50-C8
|
30.49
|
17.95
|
1.7197
|
52.444
|
4.126
|
13367.573
|
0.0039
|
|
|
Hypodip-#1
|
28.75
|
18.09
|
2.246
|
176.176
|
4.075
|
11885.546
|
0.0148
|
|
|
Hypodip-C4
|
29.91
|
18.28
|
1.895
|
78.544
|
4.006
|
10133.5
|
0.0078
|
|
|
MLL-C2
|
28.53
|
17.26
|
2.312
|
205.345
|
4.378
|
23854.727
|
0.0086
|
|
|
MLL-2M#2
|
29.88
|
17.11
|
1.904
|
80.202
|
4.432
|
27055.348
|
0.0030
|
|
|
Normal-C5
|
27.34
|
17.58
|
2.672
|
470.319
|
4.261
|
18235.876
|
0.0258
|
|
|
Normal-C6
|
31.51
|
18.79
|
1.411
|
25.77523
|
3.820
|
6604.684
|
0.0040
|
|
|
Pseudodip-#4
|
28.49
|
18.73
|
2.325
|
211.146
|
3.842
|
6945.82
|
0.0304
|
|
|
Pseudodip-C16-N
|
28.21
|
18.18
|
2.409
|
256.606
|
4.042
|
11020.77
|
0.0233
|
|
|
TEL-AML1-#7
|
29.05
|
18.08
|
2.155
|
142.960
|
4.079
|
11985.727
|
0.0119
|
|
|
TEL-AML1-C39
|
30.89
|
19.6
|
1.599
|
39.693
|
3.525
|
3346.472
|
0.0119
|
|
@Critical
threshold value for CD3δ
+ Critical threshold value for RNase P
++ Log of the pg CD3δ in patient
sample calculated from standard curve
% Exponentiated value of log CD3δ
* Log of the pg RNase P in patient sample calculated
from standard curve
** Exponentiated value of log RNase P
& Ratio of CD3δ to RNase
P
Analysis of MAL
 |
| Figure 8. The standard
curves of critical threshold values (Ct) versus log
of the concentration of RNA in picograms (pg). |
Table 5. Results of Taqman assay for MAL and RNase P on twenty-four
selected ALL diagnostic bone marrow samples representing the various
leukemia subgroups.
|
|
| |
Ct MAL@
|
Ct
RNP+
|
Input
Log MAL++
|
Input
MAL%
|
Input
Log RNP*
|
Inp
RNP**
|
Input MAL/RNP&
|
|
|
|
|
E2A-PBX1-#1
|
30.11
|
16.69
|
3.392
|
2466.058
|
4.551
|
35588.466
|
0.0693
|
|
|
E2A-PBX1-#3
|
32.63
|
16.28
|
2.653
|
449.533
|
4.696
|
49703.922
|
0.0090
|
|
|
E2A-PBX1-#7
|
32.01
|
16.46
|
2.835
|
683.343
|
4.633
|
42923.625
|
0.0159
|
|
|
E2A-PBX1-C10
|
32
|
18.69
|
2.838
|
| |