Skip to main content
For credit card payments please select in store pick up/pay at store and you will be contacted for payment/shipping info!
Close this alert
Data Analysis with Open Source Tools: A Hands-On Guide for Programmers and Data Scientists

Data Analysis with Open Source Tools: A Hands-On Guide for Programmers and Data Scientists

Current price: $39.99
Publication Date: December 28th, 2010
Publisher:
O'Reilly Media
ISBN:
9780596802356
Pages:
530
Usually Ships in 1 to 5 Days

Description

Collecting data is relatively easy, but turning raw information into something useful requires that you know how to extract precisely what you need. With this insightful book, intermediate to experienced programmers interested in data analysis will learn techniques for working with data in a business environment. You'll learn how to look at data to discover what it contains, how to capture those ideas in conceptual models, and then feed your understanding back into the organization through business plans, metrics dashboards, and other applications.

Along the way, you'll experiment with concepts through hands-on workshops at the end of each chapter. Above all, you'll learn how to think about the results you want to achieve -- rather than rely on tools to think for you.

  • Use graphics to describe data with one, two, or dozens of variables
  • Develop conceptual models using back-of-the-envelope calculations, as well as scaling and probability arguments
  • Mine data with computationally intensive methods such as simulation and clustering
  • Make your conclusions understandable through reports, dashboards, and other metrics programs
  • Understand financial calculations, including the time-value of money
  • Use dimensionality reduction techniques or predictive analytics to conquer challenging data analysis situations
  • Become familiar with different open source programming environments for data analysis

"Finally, a concise reference for understanding how to conquer piles of data." --Austin King, Senior Web Developer, Mozilla

"An indispensable text for aspiring data scientists." --Michael E. Driscoll, CEO/Founder, Dataspora.

About the Author

After previous careers in physics and softwaredevelopment, Philipp K. Janert currentlyprovides consulting services for data analysis, algorithm development, and mathematical modeling.He has worked for small start-ups and in largecorporate environments, both in the U.S. andoverseas. He prefers simple solutions that workto complicated ones that don't, and thinks thatpurpose is more important than process. Philippis the author of Gnuplot in Action - UnderstandingData with Graphs (Manning Publications), and haswritten for the O'Reilly Network, IBM developerWorks, and IEEE Software. He is named inventor on a handfulof patents, and is an occasional contributor to CPAN.He holds a Ph.D. in theoretical physics from theUniversity of Washington. Visit his company websiteat www.principal-value.com.