{"product_id":"bad-data-handbook-9781449321888","title":"Bad Data Handbook","description":"\u003cp\u003eWhat is bad data? Some people consider it a technical phenomenon, like missing values or malformed records, but bad data includes a lot more. In this handbook, data expert Q. Ethan McCallum has gathered 19 colleagues from every corner of the data arena to reveal how they've recovered from nasty data problems. \u003c\/p\u003e\u003cp\u003eFrom cranky storage to poor representation to misguided policy, there are many paths to bad data. Bottom line? Bad data is \u003ci\u003edata that gets in the way\u003c\/i\u003e. This book explains effective ways to get around it. \u003c\/p\u003e\u003cp\u003eAmong the many topics covered, you'll discover how to: \u003c\/p\u003e\u003cul\u003e \u003cli\u003eTest drive your data to see if it's ready for analysis \u003c\/li\u003e\n\u003cli\u003eWork spreadsheet data into a usable form \u003c\/li\u003e\n\u003cli\u003eHandle encoding problems that lurk in text data \u003c\/li\u003e\n\u003cli\u003eDevelop a successful web-scraping effort \u003c\/li\u003e\n\u003cli\u003eUse NLP tools to reveal the real sentiment of online reviews \u003c\/li\u003e\n\u003cli\u003eAddress cloud computing issues that can impact your analysis effort \u003c\/li\u003e\n\u003cli\u003eAvoid policies that create data analysis roadblocks \u003c\/li\u003e\n\u003cli\u003eTake a systematic approach to data quality analysis \u003c\/li\u003e\n\u003c\/ul\u003e\u003cbr\u003e\u003cbr\u003e\u003cb\u003eAuthor:\u003c\/b\u003e Q. 
McCallum\u003cbr\u003e\u003cb\u003ePublisher:\u003c\/b\u003e O'Reilly Media\u003cbr\u003e\u003cb\u003ePublished:\u003c\/b\u003e 12\/18\/2012\u003cbr\u003e\u003cb\u003ePages:\u003c\/b\u003e 262\u003cbr\u003e\u003cb\u003eBinding Type:\u003c\/b\u003e Paperback\u003cbr\u003e\u003cb\u003eWeight:\u003c\/b\u003e 0.94lbs\u003cbr\u003e\u003cb\u003eSize:\u003c\/b\u003e 9.05h x 7.00w x 0.57d\u003cbr\u003e\u003cb\u003eISBN:\u003c\/b\u003e 9781449321888\u003cbr\u003e\u003cp\u003e\u003cb\u003eAbout the Author\u003c\/b\u003e\u003cbr\u003e\u003c\/p\u003e\u003cp\u003eQ. Ethan McCallum is a consultant, writer, and technology enthusiast, though perhaps not in that order. His work has appeared online on the O'Reilly Network and Java.net, and also in print publications such as C\/C++ Users Journal, Doctor Dobb's Journal, and Linux Magazine. In his professional roles, he helps companies to make smart decisions about data and technology.\u003c\/p\u003e\u003cbr\u003e","brand":"O'Reilly Media","offers":[{"title":"Paperback","offer_id":44907017568371,"sku":"9781449321888","price":64.13,"currency_code":"USD","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0555\/9255\/0515\/files\/img_c93bfab9-6c2c-4210-9be3-f159e7f2a2b8.jpg?v=1777482155","url":"https:\/\/bookstorenmore.com\/products\/bad-data-handbook-9781449321888","provider":"Bookstore N More","version":"1.0","type":"link"}