Brief Introduction
EMS scans conserved cis-regulatory
elements in human/mouse based on cross-species comparison. Sequences
and alignments of all human and mouse orthlogous gene pairs are stored
for searching. The list of human/mouse orthlogous genes was generated
based on NCBI Homologene (version:
May 2004).
From the UCSC goldenpath database,
(version: human, July, 2003 (hg16), 21231 Refseq genes; mouse, Feb.
2003 (mm4), 17724 Refseq genes), only full-length genes, which
transcriptional start sites are ahead of the transnational start sites
are kept. Finally, 11836 pairs of ortholog
genes appear in both datasets, Then we extracted their 10k (up or down-stream) RepeatMasked upstream
regions and aligned them using the standalone AVID tool. (See
the Flowchart) When searching user-defined matrices (or in other
formats like Fasta), which may generate from a
cluster of genes with a same expression patter in Microarray analyses,
EMS first scans them on all 11836 up-stream regions of human genes
for significant candidates, then matches the mouse corresponding regions,
and finally reports conserved hits (See the paper).
EMS website at St. Jude uses the Hartwell Center's 280-CPU Linux cluster for performing motif scans in parallel. With this computational power, a full search takes only around 2 minutes. And a human chromosome viewer is also available to show the genome location of hits.
|
|
Contact Us
Tao.Xie@StJude.org
(Dr. Tao Xie)
Reference
EMS has not been published yet, but has been presented at a conference:
Xie T, Lin B, Hood L, and Naeve C, EMS: Evolution-based Motif Search, Poster,
Molecular Medicine Tri-Conference, San Francisco, California USA, Mar. 23-26, 2004.
Thanks
TFBS for
drawing sequence logos!
Additional Links
ConSite Footprinter rVISTA
|