
Python Data Science Handbook - GitHub Pages
This website contains the full text of the Python Data Science Handbook by Jake VanderPlas; the content is available on GitHub in the form of Jupyter notebooks.
Preface | Python Data Science Handbook - GitHub Pages
Mar 26, 2013 · This is a book about doing data science with Python, which immediately begs the question: what is data science? It's a surprisingly hard definition to nail down, especially given how …
Machine Learning | Python Data Science Handbook - GitHub Pages
This chapter will dive into practical aspects of machine learning, primarily using Python's Scikit-Learn package. This is not meant to be a comprehensive introduction to the field of machine learning; that …
Introduction to NumPy | Python Data Science Handbook
NumPy arrays form the core of nearly the entire ecosystem of data science tools in Python, so time spent learning to use NumPy effectively will be valuable no matter what aspect of data science …
Introducing Scikit-Learn | Python Data Science Handbook
We will start by covering data representation in Scikit-Learn, followed by covering the Estimator API, and finally go through a more interesting example of using these tools for exploring a set of images of …
Feature Engineering | Python Data Science Handbook
This is an excerpt from the Python Data Science Handbook by Jake VanderPlas; Jupyter notebooks are available on GitHub. The text is released under the CC-BY-NC-ND license, and code is released …
Data Manipulation with Pandas | Python Data Science Handbook
Here we'll build on this knowledge by looking in detail at the data structures provided by the Pandas library. Pandas is a newer package built on top of NumPy, and provides an efficient implementation …
Help and Documentation in IPython | Python Data Science Handbook
This is an excerpt from the Python Data Science Handbook by Jake VanderPlas; Jupyter notebooks are available on GitHub. The text is released under the CC-BY-NC-ND license, and code is released …
Introducing Pandas Objects | Python Data Science Handbook
As we will see during the course of this chapter, Pandas provides a host of useful tools, methods, and functionality on top of the basic data structures, but nearly everything that follows will require an …
Computation on Arrays: Broadcasting | Python Data Science Handbook
Broadcasting extends this ability. One commonly seen example is when centering an array of data. Imagine you have an array of 10 observations, each of which consists of 3 values. Using the …