Bad Data Handbook

Cleaning Up The Data So You Can Get Back To Work

Nonfiction, Computers, Database Management
Cover of the book Bad Data Handbook by Q. Ethan McCallum, O'Reilly Media
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: Q. Ethan McCallum ISBN: 9781449324971
Publisher: O'Reilly Media Publication: November 7, 2012
Imprint: O'Reilly Media Language: English
Author: Q. Ethan McCallum
ISBN: 9781449324971
Publisher: O'Reilly Media
Publication: November 7, 2012
Imprint: O'Reilly Media
Language: English

What is bad data? Some people consider it a technical phenomenon, like missing values or malformed records, but bad data includes a lot more. In this handbook, data expert Q. Ethan McCallum has gathered 19 colleagues from every corner of the data arena to reveal how they’ve recovered from nasty data problems.

From cranky storage to poor representation to misguided policy, there are many paths to bad data. Bottom line? Bad data is data that gets in the way. This book explains effective ways to get around it.

Among the many topics covered, you’ll discover how to:

  • Test drive your data to see if it’s ready for analysis
  • Work spreadsheet data into a usable form
  • Handle encoding problems that lurk in text data
  • Develop a successful web-scraping effort
  • Use NLP tools to reveal the real sentiment of online reviews
  • Address cloud computing issues that can impact your analysis effort
  • Avoid policies that create data analysis roadblocks
  • Take a systematic approach to data quality analysis
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

What is bad data? Some people consider it a technical phenomenon, like missing values or malformed records, but bad data includes a lot more. In this handbook, data expert Q. Ethan McCallum has gathered 19 colleagues from every corner of the data arena to reveal how they’ve recovered from nasty data problems.

From cranky storage to poor representation to misguided policy, there are many paths to bad data. Bottom line? Bad data is data that gets in the way. This book explains effective ways to get around it.

Among the many topics covered, you’ll discover how to:

More books from O'Reilly Media

Cover of the book Search Engine Optimization by Q. Ethan McCallum
Cover of the book Head First Android Development by Q. Ethan McCallum
Cover of the book APIs: A Strategy Guide by Q. Ethan McCallum
Cover of the book Subject To Change: Creating Great Products & Services for an Uncertain World by Q. Ethan McCallum
Cover of the book Modern Java Recipes by Q. Ethan McCallum
Cover of the book Programming Amazon Web Services by Q. Ethan McCallum
Cover of the book Typo3 Kochbuch by Q. Ethan McCallum
Cover of the book Vagrant: Up and Running by Q. Ethan McCallum
Cover of the book Building Web Apps for Google TV by Q. Ethan McCallum
Cover of the book Think Data Structures by Q. Ethan McCallum
Cover of the book Mapping Hacks by Q. Ethan McCallum
Cover of the book Learning Visual Basic .NET by Q. Ethan McCallum
Cover of the book Sendmail by Q. Ethan McCallum
Cover of the book WordPress: The Missing Manual by Q. Ethan McCallum
Cover of the book PC Hardware in a Nutshell by Q. Ethan McCallum
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy