APPLICATIONS / DATABASESSUPPLEMENTAL DATAFACULTYMIRROR SITESCOLLABORATIVE PROJECTSDEPARTMENTS

 
 


Section I: Expanded Methods


RNA extraction, labeling, hybridization, and data analysis
Mononuclear cell suspensions from diagnostic BM aspirates or peripheral blood (PB) samples were prepared from each patient and an aliquot cryopreserved. RNA was extracted using the Trizol reagent (Gibco BRL Life Technologies, Gaithersburg, MD) following the manufacture´s recommended protocol. RNA integrity was assessed by electrophoresis on the Agilent 2100 Bioanalyzer (Agilent, Palo Alto, CA).

First and second strand cDNA were synthesized from 5-15 µg of total RNA using the SuperScript Double-Stranded cDNA Synthesis Kit (Gibco Life Technologies) and the oligo-dT24-T7 (5´-GGC CAG TGA ATT GTA ATA CGA CTC ACT ATA GGG AGG CGG-3´) primer according to the manufacturer´s instructions. cRNA was synthesized and labeled with biotinylated UTP and CTP by in vitro transcription using the T7 promoter coupled double stranded cDNA as template and the T7 RNA Transcript Labeling Kit (ENZO Diagnostics Inc., Farmingdale NY). Briefly, double stranded cDNA synthesized from the previous steps was washed twice with 70% ethanol and resuspended in 22 µl RNase-free H2O. The cDNA was incubated with 4 µl of 10X each reaction buffer, 1µl of biotin labeled ribonucleotides, 2 µl of DTT, 1µl of RNase inhibitor mix and 2 µl 20X T7 RNA Polymerase for 5 hr at 37°C. The labeled cRNA was separated from unincorporated ribonucleotides by passing through a CHROMA SPIN-100 column (Clontech, Palo Alto, CA) and precipitated at -20°C for 1 hr to overnight.

The cRNA pellet was resuspended in 10 ml Rnase-free H2O. and 10.0 mg was fragmented by heat and ion-mediated hydrolysis at 95°C for 35 minutes in 200 mM Tris-acetate, pH 8.1, 500 mM KOAc, 150 mM MgOAc. The fragmented cRNA was hybridized for 16 hr at 45°C to HG_U95Av2 oligonucleotide arrays (Affymetrix, Santa Clara, CA) containing 12,600 probe sets from full length annotated genes together with additional probe sets designed to represent EST sequences. Arrays were washed at 25°C with 6 X SSPE (0.9M NaCl, 60 mM NaH2PO4, 6 mM EDTA + 0.01% Tween 20) followed by a stringent wash at 50°C with 100 mM MES, 0.1M NaCl2, 0.01% Tween 20. The arrays were then stained with phycoerythrin conjugated streptavidin (Molecular Probes, Eugene, OR). Arrays were scanned using a laser confoCal scanner (Agilent, Palo Alto, CA) and the expression value for each gene was calculated using Affymetrix Microarray software (MAS 4.0). The signal intensity for each gene was calculated as the average intensity difference (AID), represented by [Σ(PM - MM)/(number of probe pairs)], where PM and MM denote perfect-match and mismatch probes, respectively. Expression values were normalized across the sample set by scaling the average of the fluorescent intensities of all genes on an array to a constant target intensity of 2500, then any AID over 45,000 was capped to a value of 45,000. All AID´s less than 100, including negative values and absent calls were converted to a value of 1. In addition, a variation filter was used that eliminated any probe set in which less than 1% of the samples had a present call, or if the Max AID - Min AID across the sample set was less than 100. The average intensity differences for each of the remaining genes were analyzed. For some metrics the data was log transformed prior to analysis. The minimum quality control values required for inclusion of a sample´s hybridization data in the study were 10% or greater present calls, a GAPDH/Actin 3´ /5´ ´ ratio <5, and use of a scaling factor that was within 3 standard deviations from the mean of the scaling values of all chips analyzed. The average percent present calls for our overall dataset was 29.7%, and for each of the genetic subgroups was BCR-ABL (31.1%), E2A-PBX1 (28.9%), Hyper >50 (31%), MLL (29.8%), T-ALL (29.1%), TEL-AML1 (28.5%), Novel (30.2%), others (31.1%). In addition, we required each sample to have >75% blasts. The average percentage blasts for the overall dataset used to define the genetic subtypes was 93%, and for each genetic subtype was BCR-ABL (92%), E2A-PBX1 (96%), Hyper >50 (93%), MLL (93%), T-ALL (91%), TEL-AML1 (92%), Novel (95%), and others (94%).

The data was analyzed using the MAS 4.0 software. Recently, a new version of the Affymetrix software became available (Affymetrix MAS 5.0). Affymetrix reports an overall 94% correlation between data analyzed using these two software programs. We have performed a limited analysis to directly compare the results of our data analyzed with these two different software programs. We took 74 randomly selected cases and reanalyzed the primary data using the MAS 5.0 software. Using the default parameters set by Affymetrix, the average percent present call was lower for the MAS 5.0 analyzed data as compare to the values obtained using MAS 4.0; however, with exceedingly rare exception, all of the 271 Chi-squared genes selected as discriminators of the seven genetic subtypes (see below) remained within the new dataset. Moreover, when we used these 271 in a 2-D hierarchical clustering analysis (as described below), the data looked identical to the results obtained using the MAS 4.0 dataset (data not shown).

Reproducibility of microarray data
The reproducibility of the Affymetrix microarray system was assessed by comparing the gene expression profiles of the following samples: (i) RNA extracted from duplicate cryopreserved diagnostic leukemic samples from 23 patients; (ii) Single RNA samples from 13 patients analyzed on two separate arrays. Results from these analyses are presented in Table 2. As shown, the mean number of probe sets that displayed a >2-fold difference in expression between separately extracted but paired RNA samples was 144, and for single RNA samples analyzed on two separate occasions was 133. Moreover, very few probe sets were found to have a >3-fold difference in expression levels between replicate samples. The observed number of probe sets showing a difference in expression values represents <2% of the total number of probe sets on the microarray, and thus these data suggest that the Affymetrix microarray system has a very high degree of reproducibility.



Table 2. Comparison data from replicate bone marrow sample


#>2 fold

2.1-3.0

3.1-4.0

>4
Chip Extraction  

42

38

4

0

T-ALL-#3

Different

45

43

2

0

BCR-ABL-#7

Different

59

53

1

5

E2A-PBX-#7

Different

61

55

2

4

T-ALL C25

Different

68

61

6

1

Hyperdip47-50-C17

Different

69

65

3

1

MLL-C6

Different

84

81

1

2

Hyperdip47-50-C18

Different

103

99

4

0

Pseudodip-C10

Different

107

93

11

3

Pseudodip-C12

Different

109

106

3

0

Normal-R1

Different

116

106

7

3

TEL-AML1-C33

Different

117

109

4

4

TEL-AML1-2M#2

Different

126

117

8

1

T-ALL-C3  replicate #1

Different

133

122

7

4

Normal-#3

Different

153

146

5

2

Hypodip-#4 replicate #1

Different

159

117

26

16

Hyperdip47-50-C13

Different

163

121

15

27

TEL-AML1-#10

Different

166

154

12

0

T-ALL-C3  replicate #2

Different

183

168

12

3

Pseudodip-#6

Different

187

155

23

9

T-ALL-RR1

Different

239

200

19

20

Hypodip-#4 replicate #2

Different

256

204

29

23

Hypodip-#4 replicate #3

Different

568

295

137

136

Hyperdip>50-R2M1

Different

21

19

2

0

T-ALL-#7

Same

24

22

2

0

T-ALL-R2

Same

34

33

1

0

T-ALL-C3 replicate #3

Same

52

52

0

0

TEL-AML1-C29

Same

64

58

2

4

Hyperdip>50-C34

Same

64

56

5

3

Hyperdip>50-R2M2

Same

76

74

1

1

T-ALL-R2M1

Same

132

129

1

2

Pseudodip-#7

Same

158

147

9

2

Hyperdip>50-C1

Same

161

139

19

3

TEL-AML1-C20

Same

197

189

6

2

T-ALL-#8

Same

210

169

25

16

Normal-C30-N

Same

539

139

123

277

Pseudodip-C4

Same


     
Mean
Median
Stdev
 
Different Extraction  
144.0
117.0
109.0
 
Same Extraction  
133.2
76.0
138.4
 


Comparison of microarray data between PB and BM leukemia samples
Matched BM and PB samples that contained >80% leukemic blasts were obtained from 10 patients and the RNA extracted and assessed by microarray analysis. As shown in Table 3, a very high level of correlation was observed for the expression profiles of BM and PB, with only 189 probe sets having a >2-fold difference in expression. More importantly, no genes were found to be consistently over- or under-expressed in one sample type. These data suggest that there are minimal differences in the gene expression profiles of leukemic blasts obtained from BM or PB, and raise the possibility that diagnostic gene expression profiling might be possible on samples obtained from the PB.

Table 3. Comparison of peripheral blood and bone marrow samples


#>2 fold

2.1-3.0

3.1-4.0

>4

Peripheral Blood

Bone Marrow

 

98

90

6

2

E2A-PBX1-C7 PB

E2A-PBX1-C7

 

108

93

9

6

Pseudodip-NHR1 PB

Pseudodip-NHR1

 

113

105

4

4

BCR-ABL-#1 PB

BCR-ABL-#1

 

139

117

18

4

Hyperdip>50-C8 PB

Hyperdip>50-C8

 

141

135

6

0

E2A-PBX1-C3 PB

E2A-PBX1-C3

 

159

139

16

4

T-ALL-C9 PB

T-ALL-C9

 

244

223

14

7

Hyperdip>50-C6 PB

Hyperdip>50-C6

 

245

194

33

18

MLL-C3 PB

MLL-C3

 

272

246

20

6

MLL-RR3 PB

MLL-RR3

 

377

300

50

27

E2A-PBX1-C2 PB

E2A-PBX1-C2

 

     
Mean
Median
Stdev
 
Differences between PB vs BM
189.6
150.0
91.1
 
   
133.2
76.0
138.4
 

Real-time RT-PCR results
Real-time RT-PCR assays (Taqman; Perkin-Elmer/Applied Biosystems, Foster City, CA) were developed to independently determine the level of mRNA for five genes that were found by microarray analysis to be predictive of either T-lineage ALL (CD3δ, CD3D antigen delta polypeptide TiT3 complex; MAL, mal T-Cell differentiation protein; and PRKCQ, protein kinase C theta) or E2A-PBX1 expressing ALL (MERTK, c-MER proto-oncogene tyrosine kinase and KIAA802). RNA samples analyzed included four samples each of E2A-PBX1 and T-ALL, and two samples each from the remaining subtypes (BCR-ABL, MLL, TEL-AML1, Hyperdiploid >50, Hyperdiploid 47-50, Hypodiploid, Pseudodiploid, and Normal). Whenever possible, the forward and reverse primers were designed in different exons so that DNA contamination would not be a concern. In the case of MAL where this was not clear, the RNA was treated for 15 minutes at room temperature with 1.0 unit of DNase I (Invitrogen, Carlsbad, California) using the Invitrogen protocol to remove any contaminating DNA.

Thirty-three ng of RNA from each sample was reverse transcribed using random hexamers and Multiscribe Reverse Transcriptase (Perkin-Elmer/Applied Biosystems) in a total volume of 10 µl. Real Time PCR was performed on a PE Applied Biosystems 7700 prism using oligonucleotide primers and probes sequences designed using Primer Express. The following primers and probes were used: (1) MERTK, 5´ -GGC GTG CTA ACT GTT CCA GG-3´ (forward primer), 5´ -CCT TTG TCA TTG TGG GCC TC-3´(reverse primer), and 5´-CAA CTG AAG ACC GCC ATC TCC GTC AG-3´(probe); (2) KIAA802, 5´- CCA AGC TGA AGG AGT CGG AC-3´(forward), 5´-AGA TTT CCT TTG TGA TTT TCT TCT TCC-3´(reverse), and 5´-TGC TCG GCC AGT GAG AAT CTC TAC CTG-3´(probe); (3) PRKCQ, 5´-AAG CTG CCA CAA GTT CGA CC-3´(forward), 5´-ATG AAG GAA CTG CAG ACC AAG AA-3´ (reverse), and 5´-CCA GAG CGA CGT TTT ATG CTG CTG AAA TC-3´(probe); (4) MAL, 5´-CCA GTG GCT TCT CGG TCT TC-3´ (forward), 5´-CAG GAG CCA CTC ACA AAC TCA A-3´(reverse), and 5´-CCA CCT TGC CCG ACT TGC TCT TCA-3´(probe); and (5) CD3δ , 5´-TCA ACA GAG CTT GTG TGT CGG-3´(forward), 5´-CAT TGC CAC TCT GCT CCT TG-3(reverse), and 5´-TGT CCA GCA AAG CAG AAG ACT CCC AAA-3´(probe). All probes were labeled at the 5´ end with FAM (6-carboxy-fluroescein) and at the 3´ end with TAMRA (6-carboxy-tetramethyl-rhodamine).

The PCR reactions were performed in a total volume of 50 µl containing 10 µl of the RT product, 300 nm each of the forward and reverse primers, 100 nM of probe, 1X master mix (PE Applied Biosystems) and 1 µl of Taq Gold. Following a 10 minute incubation at 95°C to activate the Taq Gold, samples were denatured at 95°C for 15 seconds, then annealed and extended at 60°C for 1 minute, for a total of 40 cycles. The RNA from each sample was also amplified using primers and probes to RNase P (PE Applied Biosystems) for use in normalization. Appropriate negative controls were included in each run. Standard curves were generated for T-cell markers and RNase P using MOLT4 RNA, a T-cell leukemia cell line, and for the E2A-PBX1 markers and RNase P using a leukemia cell line, 697, that contains an E2A-PBX1 fusion. The expression level of the predictive genes and RNase P were determined in each of the 24 ALL samples (Figures 7-11). A ratio was then calculated by taking the expression value for the specific gene and dividing it by the expression level of RNase P in the sample. These ratios were then compared to the values obtained from the Affymetrix data from the same RNA sample. The raw Affymetrix data were scaled as described and then normalized using the 3´GAPDH value for each sample, yielding a normalized ratio. The Taqman and Affymetrix ratios were then log transformed and compared. Since the markers selected for Taqman analysis were predictors for either E2A-PBX1 or T-ALLs, each gene was expected to have four RNA samples with high and 20 samples with low expression. For each gene evaluated, an average expression value for both the Taqman and Affymetrix data was calculated for all samples in the up-regulated group, and similarly, for the samples in the down-regulated group. These values were then compared against each other as shown in Figure 12.


Analysis of CD3δ

Figure 7. The standard curves of critical threshold values (Ct) versus log of the concentration of RNA in picograms (pg).

Table 4. Results of Taqman assay for CD3δ and RNase P on twenty-four selected ALL diagnostic bone marrow samples representing the various leukemia subgroups.


 

Ct CD3δ @

Ct RNP +

Input Log CD3δ ++

Input
CD3δ%

Input Log RNP*

Input
RNP**

Input CD3δ /RNP &

 


E2A-PBX1-#1  

28.26

17.05

2.394

247.825

4.454

28452.773

0.0087

 

E2A-PBX1-#3

30

16.69

1.868

73.772

4.585

38490.487

0.0019

 

E2A-PBX1-#7

28.94

17.36

2.188

154.342

4.341

21934.21

0.0070

 

E2A-PBX1-C10

27.53

16.27

2.615

412.030

4.738

54758.77

0.0075

 

T-ALL-R3

23.47

19.43

3.843

6963.863

3.587

3859.725

1.8042

 

T-ALL-C20

21.76

17.14

4.360

22910.208

4.421

26382.588

0.8684

 

T-ALL-R1

24.79

20.06

3.444

2777.311

3.357

2274.605

1.2210

 

T-ALL-#3

22.39

18.24

4.170

14773.707

4.020

10479.498

1.4098

 

BCR-ABL-R3

29.68

20.38

1.965

92.188

3.240

1738.834

0.0530

 

BCR-ABL-#7

25.8

17.51

3.138

1374.537

4.286

19339.415

0.0711

 

Hyperdip>50-C36

27.65

17.44

2.579

378.997

4.312

20509.735

0.0185

 

Hyperdip>50-#5

30.04

17.36

1.856

71.7455

4.341

21934.21

0.0033

 

Hyperdip47-50-C17

30.18

20.08

1.813

65.081

3.350

2236.74

0.0291

 

Hyperdip47-50-C8

30.49

17.95

1.7197

52.444

4.126

13367.573

0.0039

 

Hypodip-#1

28.75

18.09

2.246

176.176

4.075

11885.546

0.0148

 

Hypodip-C4

29.91

18.28

1.895

78.544

4.006

10133.5

0.0078

 

MLL-C2

28.53

17.26

2.312

205.345

4.378

23854.727

0.0086

 

MLL-2M#2

29.88

17.11

1.904

80.202

4.432

27055.348

0.0030

 

Normal-C5

27.34

17.58

2.672

470.319

4.261

18235.876

0.0258

 

Normal-C6

31.51

18.79

1.411

25.77523

3.820

6604.684

0.0040

 

Pseudodip-#4

28.49

18.73

2.325

211.146

3.842

6945.82

0.0304

 

Pseudodip-C16-N

28.21

18.18

2.409

256.606

4.042

11020.77

0.0233

 

TEL-AML1-#7

29.05

18.08

2.155

142.960

4.079

11985.727

0.0119

 

TEL-AML1-C39

30.89

19.6

1.599

39.693

3.525

3346.472

0.0119

 

@Critical threshold value for CD3δ                         
+ Critical threshold value for RNase P                  
++ Log of the pg CD3δ   in patient sample calculated from standard curve
% Exponentiated value of log CD3δ
* Log of the pg RNase P in patient sample calculated from standard curve
** Exponentiated value of log RNase P
& Ratio of CD3δ  to RNase P


Analysis of MAL

Figure 8. The standard curves of critical threshold values (Ct) versus log of the concentration of RNA in picograms (pg).


Table 5.
Results of Taqman assay for MAL and RNase P on twenty-four selected ALL diagnostic bone marrow samples representing the various leukemia subgroups.


 
 

Ct MAL@

Ct RNP+

Input Log MAL++

Input
MAL%

Input Log RNP*

Inp
RNP**

Input MAL/RNP&

 

 


E2A-PBX1-#1  

30.11

16.69

3.392

2466.058

4.551

35588.466

0.0693

 

E2A-PBX1-#3

32.63

16.28

2.653

449.533

4.696

49703.922

0.0090

 

E2A-PBX1-#7

32.01

16.46

2.835

683.343

4.633

42923.625

0.0159

 

E2A-PBX1-C10

32

18.69

2.838