RPA: fully scalable preprocessing method for short oligonucleotide microarray atlases

How to preprocess 20,000 CEL files (or more) on an ordinary desktop computer in a few hours? Our new Online-RPA algorithm – developed in collaboration with the EBI functional genomics group and recently published in Nucleic Acids Research (2013) –  enables full utilization of the most comprehensive microarray data collections available to date. We hope this will be widely adopted by the microarray community, and welcome feedback on the implementation.

Transcriptome-wide profiling data sets are now available on standardized microarray platforms (such as the Affymetrix HG-U133Plus2 array) for tens of thousands of samples, covering thousands of body sites and disease conditions through ArrayExpress and other genomic data repositories. The lack of scalable probe-level preprocessing techniques for very large gene expression atlas collections has formed a bottleneck for full utilization of these data resources.

The new online-version of RPA (Robust Probabilistic Averaging) now allows fully scalable analysis of contemporary (Affymetrix and other) short oligonucleotide microarray atlases of any size, up to arbitrarily large collections involving hundreds of thousands of samples. The scalability is achieved by sequential hyperparameter updates, circumventing the extensive memory requirements of standard approaches. Unlike fRMA, our method is readily applicable to all short oligonucleotide platforms. It also outperforms the standard RMA (a special case of the general RPA model) already in moderately sized standard data sets and can be used as the default preprocessing method for short oligo microarrays.

Software

Online-RPA is freely available as a R/Bioconductor package. The wiki site provides installation instructions and usage examples. For feedback, issues, bug tracking, and pull requests, see the Github development version.

References

Advertisements

About antagomir

http://www.iki.fi/Leo.Lahti
This entry was posted in Uncategorized. Bookmark the permalink.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s