Big Data Analysis Capabilities
IEG has 20 computer servers (3 Windows, 4 Linux, 13 Unix) that together provide 546 CPU cores, 6.26 TB of memory, and 344 TB of hard disk space. The lab has constructed databases to store experimental data and has developed software tools for data preparation, along with analysis pipelines for sequencing data preprocessing, high-throughput microarray (GeoChip) data analysis, network analysis, and database management. IEG has also established a server cluster at the OU Supercomputing Center for Education & Research (OSCER) for big data analysis and intensive computation.
Servers
Supercomputer Servers
One PowerEdge R930 (64-core CPU, 3 TB memory), 10 PowerEdge R430 (24-core CPU, 64-128 GB memory), and 2 PowerEdge T630 (24-core CPU, 128 GB memory, 128 TB hard disk) servers are available for metagenome and metatranscriptome sequence assembly (about 100-500 Gb data per sample) and other computational needs. These IEG-dedicated servers are hosted at OSCER.
PowerEdge R910 Windows Server
This is one of the latest Windows servers on the market, equipped with four Intel® Xeon® E7-4870 processors (2.40 GHz), 1.0 TB of memory, and a 16.0 TB hard disk with a RAID controller. It is used to design oligonucleotide probes for large data sets (e.g., ~2000 genomes and metagenomes) and for intensive and parallel computation, data analysis, and data storage. It may replace the oldest Windows server as the host of the web site and as the integration point for all computational components.
PowerEdge R900 Windows Server
This server is equipped with four quad-core Intel Xeon E7420 processors (2.13 GHz), 64 GB of memory, and a 1.0 TB hard disk with a RAID controller. It is mainly used for online data analyses, including analysis of pyrosequencing data of 16S rRNA genes and other functional genes.
Dell PowerEdge 2850 Windows Server
This is one of the oldest servers in the lab, equipped with a dual-core Intel® Xeon™ processor (2.8 GHz) and 4 GB of memory. It hosts our web site, the GeoChip data analysis pipeline, and a variety of bioinformatics tools and statistical software. It is connected to a RAID storage system built from 12 hard drives (1.0 TB each).
PowerEdge R910 Linux Server
This is a recent purchase and one of the newest Linux servers on the market, equipped with four Intel® Xeon® E7-4870 processors (2.40 GHz), 1.0 TB of memory, and a 16.0 TB hard disk with a RAID controller. It is used primarily for metagenome sequencing data analyses, as well as for intensive and parallel computation and data storage.
Two PowerEdge R910 Linux Servers
These servers were purchased in 2011, primarily for metagenome sequence analysis. They are equipped with four Intel® Xeon® E7-4870 processors (2.40 GHz), 500 GB of memory, and a 16.0 TB hard disk with a RAID controller. They are used for metagenome sequencing data analyses, intensive and parallel computation, and data storage.
Dell PowerEdge 6850 Linux Server
This Linux server is equipped with four quad-core Intel Xeon E7420 processors (2.13 GHz), 64 GB of memory, and a 1.0 TB hard disk with a RAID controller. It is mainly used for analysis of pyrosequencing data of 16S rRNA genes and other functional genes. It is connected to a RAID storage system built from 12 hard drives (3.0 TB).
Other Computational Resources
OU's supercomputing facility (OSCER) provides over 10,000 CPU cores, 23 TB of RAM, and 450 TB of usable hard disk space.