{"product_id":"parallel-r-data-analysis-in-the-distributed-world-9781449309923","title":"Parallel R: Data Analysis in the Distributed World","description":"\u003cp\u003eIt's tough to argue with R as a high-quality, cross-platform, open source statistical software product--unless you're in the business of crunching Big Data. This concise book introduces you to several strategies for using R to analyze large datasets, including three chapters on using R and Hadoop together. You'll learn the basics of Snow, Multicore, Parallel, Segue, RHIPE, and Hadoop Streaming, including how to find them, how to use them, when they work well, and when they don't. \u003c\/p\u003e\u003cp\u003eWith these packages, you can overcome R's single-threaded nature by spreading work across multiple CPUs, or offloading work to multiple machines to address R's memory barrier. \u003c\/p\u003e\u003cul\u003e \u003cli\u003e\n\u003cb\u003eSnow: \u003c\/b\u003e works well in a traditional cluster environment \u003c\/li\u003e\n\u003cli\u003e\n\u003cb\u003eMulticore: \u003c\/b\u003e popular for multiprocessor and multicore computers \u003c\/li\u003e\n\u003cli\u003e\n\u003cb\u003eParallel: \u003c\/b\u003e part of the upcoming R 2.14.0 release \u003c\/li\u003e\n\u003cli\u003e\n\u003cb\u003eR+Hadoop: \u003c\/b\u003e provides low-level access to a popular form of cluster computing \u003c\/li\u003e\n\u003cli\u003e\n\u003cb\u003eRHIPE: \u003c\/b\u003e uses Hadoop's power with R's language and interactive shell \u003c\/li\u003e\n\u003cli\u003e\n\u003cb\u003eSegue: \u003c\/b\u003e lets you use Elastic MapReduce as a backend for lapply-style operations \u003c\/li\u003e\n\u003c\/ul\u003e\u003cbr\u003e\u003cbr\u003e\u003cb\u003eAuthor:\u003c\/b\u003e Q. McCallum,Stephen Weston\u003cbr\u003e\u003cb\u003ePublisher:\u003c\/b\u003e O'Reilly Media\u003cbr\u003e\u003cb\u003ePublished:\u003c\/b\u003e 11\/29\/2011\u003cbr\u003e\u003cb\u003ePages:\u003c\/b\u003e 120\u003cbr\u003e\u003cb\u003eBinding Type:\u003c\/b\u003e Paperback\u003cbr\u003e\u003cb\u003eWeight:\u003c\/b\u003e 0.47lbs\u003cbr\u003e\u003cb\u003eSize:\u003c\/b\u003e 9.19h x 7.00w x 0.27d\u003cbr\u003e\u003cb\u003eISBN:\u003c\/b\u003e 9781449309923\u003cbr\u003e\u003cp\u003e\u003cb\u003eAbout the Author\u003c\/b\u003e\u003cbr\u003e\u003c\/p\u003e\u003cp\u003eQ Ethan McCallum is a consultant, writer, and technology enthusiast, though perhaps not in that order. His work has appeared online on The O'Reilly Network and Java.net, and also in print publications such as C\/C++ Users Journal, Doctor Dobb's Journal, and Linux Magazine. In his professional roles, he helps companies to make smart decisions about data and technology.\u003c\/p\u003e\u003cp\u003eStephen Weston has been working in high performance and parallelcomputing for over 25 years. He was employed at Scientific Computing Associates in the 90's, working on the Linda programming system, invented by David Gelernter. He was also a founder of Revolution Computing, leading the development of parallel computing packages for R, including nws, foreach, doSNOW, and doMC. He works at Yale University as an HPC Specialist.\u003c\/p\u003e\u003cbr\u003e","brand":"O'Reilly Media","offers":[{"title":"Paperback","offer_id":44907014291571,"sku":"9781449309923","price":35.26,"currency_code":"USD","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0555\/9255\/0515\/files\/img_9d882b2a-8c28-4199-931b-9ba05f418981.jpg?v=1777482060","url":"https:\/\/bookstorenmore.com\/products\/parallel-r-data-analysis-in-the-distributed-world-9781449309923","provider":"Bookstore N More","version":"1.0","type":"link"}