Data Science at the Command Line

Data Science at the Command Line
Author :
Publisher : "O'Reilly Media, Inc."
Total Pages : 251
Release :
ISBN-10 : 9781491947807
ISBN-13 : 1491947802
Rating : 4/5 (802 Downloads)

Book Synopsis Data Science at the Command Line by : Jeroen Janssens

Download or read book Data Science at the Command Line written by Jeroen Janssens and published by "O'Reilly Media, Inc.". This book was released on 2014-09-25 with total page 251 pages. Available in PDF, EPUB and Kindle. Book excerpt: This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You’ll learn how to combine small, yet powerful, command-line tools to quickly obtain, scrub, explore, and model your data. To get you started—whether you’re on Windows, OS X, or Linux—author Jeroen Janssens introduces the Data Science Toolbox, an easy-to-install virtual environment packed with over 80 command-line tools. Discover why the command line is an agile, scalable, and extensible technology. Even if you’re already comfortable processing data with, say, Python or R, you’ll greatly improve your data science workflow by also leveraging the power of the command line. Obtain data from websites, APIs, databases, and spreadsheets Perform scrub operations on plain text, CSV, HTML/XML, and JSON Explore data, compute descriptive statistics, and create visualizations Manage your data science workflow using Drake Create reusable tools from one-liners and existing Python or R code Parallelize and distribute data-intensive pipelines using GNU Parallel Model data with dimensionality reduction, clustering, regression, and classification algorithms


Data Science at the Command Line Related Books

Data Science at the Command Line
Language: en
Pages: 251
Authors: Jeroen Janssens
Categories: Computers
Type: BOOK - Published: 2014-09-25 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

This hands-on guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You’ll learn how
Data Science at the Command Line
Language: en
Pages: 283
Authors: Jeroen Janssens
Categories: Computers
Type: BOOK - Published: 2021-08-17 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll le
Python Data Science Handbook
Language: en
Pages: 743
Authors: Jake VanderPlas
Categories: Computers
Type: BOOK - Published: 2016-11-21 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources e
Doing Data Science
Language: en
Pages: 408
Authors: Cathy O'Neil
Categories: Computers
Type: BOOK - Published: 2013-10-09 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

Now that people are aware that data can make the difference in an election or a business model, data science as an occupation is gaining ground. But how can you
Cleaning Data for Effective Data Science
Language: en
Pages: 499
Authors: David Mertz
Categories: Mathematics
Type: BOOK - Published: 2021-03-31 - Publisher: Packt Publishing Ltd

DOWNLOAD EBOOK

Think about your data intelligently and ask the right questions Key FeaturesMaster data cleaning techniques necessary to perform real-world data science and mac