Probably the best-curated list of data science books in Python

Probably the best curated list of data science books in Python.

- Statistics
- Data Analysis
- Data Intuition
- Feature Engineering
- Machine Learning
- Time Series
- Natural Language Processing
- Deep Learning
- Code Optimization
- Scraping
- Career in Data Science

Learn how to apply various statistical methods to data science and how to avoid their misuse. Understand what statistical concept is important and what is not.

Learn approximate inference algorithms that permit fast approximate answers in situations where exact answers are not feasible. Familiarity with multivariate calculus and basic linear algebra is required

Learn how to solve statistical problems with Python code instead of mathematical notations. Learn how to work with problems involving estimation, prediction, decision analysis, evidence, and hypothesis testing.

Learn Bayesian inference from a computational/understanding-first, and mathematics-second, point of view.

Learn key topics in statistical learning. This book is perfect for those who want a gentle introduction all popular machine learning algorithms.

Learn how to determine the appropriate type of graph for your situation, eliminate irrelevant information, and direct your audience's attention to the most important parts of your data.

Learn data science libraries, frameworks, modules, tools and algorithms by implementing them from scratch.

Learn how to determine which data sources to use for collecting information, distinguish signal from noise, cope with ambiguous information, design experiments to test hypothesis, organize your data using segmentation, and communicate the results of your analysis.

Learn how to harness the newest data mining methods and techniques to prepare data for analysis and create the necessary infrastructure for data mining at your company. Learn core data mining techniques, including decision trees, neural networks, collaborative filtering, association rules, link analysis, survival analysis.

Learn how to clarify the business question, lay out a hypothesis-driven plan, convert relevant data to insights, and make decisions that make an impact.

Learn how to explore the world that is and the worlds that could have been by understanding causality. Learn to answer hard questions, like whether a drug cured an illness.

Learn how the models being used today reinforce discrimination, prop up the lucky and punish the downtrodden. The book empower us to ask tough questions, uncover the truth, and demand change.

Learn the foundations of data science and components of analytics such as descriptive, predictive and prescriptive analytics topics using examples from several industries, as well as nine analytics case studies. The book gives equal importance to theory and practice with examples across industries and the case studies provide a deeper understanding of analytics techniques and deployment of analytics-driven solutions.

Learn techniques for extracting and transforming features into formats for machine-learning models through practical application with exercises using tools such as numpy, Pandas, Scikit-learn, and Matplotlib.

Learn how to manipulate, transform, and clean data; visualize different types of data; and use data to build statistical or machine learning models using IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and other related tools.

Learn how to manipulate, process, clean, and crunch datasets in Python and how to work with time series data through real-world problems using Jupyter Notebook, Numpy, pandas, matplotlib.

Learn everything you really need to know in Machine Learning in a hundred page.

Learn all the essential machine learning techniques in depth. Learn how to use scikit-learn for machine learning and TensorFlow for deep learning.

Learn end-to-end machine learning for the trading workflow, from the idea and feature engineering to model optimization, strategy design, and backtesting.

Learn a range of techniques, starting with simple linear regression and progressing to deep neural networks using concrete examples, minimal theory, and two production-ready Python frameworks—Scikit-Learn and TensorFlow.

Learn the skills necessary to design, build, and deploy applications powered by machine learning. Learn the tools, best practices, and challenges involved in building a real-world ML application.

Learn how to align on ML strategies in a team setting, as well as how to set up development (dev) sets and test sets.

Learn how and what you should use to solve machine learning and deep learning problems. Appropriate for those who have some theoretical knowledge of machine learning and deep learning.

Learn best practices and design patterns of building reliable machine learning solutions tha scale.

Learn the concepts of interpretability, interpretable models, and general methods for interpreting black box models. Learn in depth the strengths and weaknesses of each method and how their outputs can be interpreted.

Learn the steps of automating a machine learning pipeline using the TensorFlow ecosystem.

Learn to create a successful machine-learning application with Python and the scikit-learn library.

Learn how to load and prepare data, evaluate model skill, and implement forecasting models for time series data.This book cuts through the math and specialized methods for time series forecasting.

Learn to solve the most common data engineering and analysis challenges in time series, using both traditional statistical and modern machine learning techniques.

Learn how to predict text, filter email to automatic summarization and translation, and learn how to write Python programs that work with large collections of unstructured text.

Learn how to adapt your solutions for different industry verticals such as healthcare, social media, and retail. Understand tasks and solution approaches within NLP and best practices around deployment for NLP systems.

Learn the basics of the PyTorch, traditional NLP concepts and methods, neural networks, embeddings, sequence prediction, and design patterns for building production NLP systems.

Learn in detail the deep learning for machine translations, speech-to-text, text-to-speech, language modeling, question answering, and many more NLP domains with transformers.

Learn how to train a model on a wide range of tasks in deep learning with little math background and minimal code using fastai and Pytorch. Written by the creators of fastai.

Learn mathematical and conceptual background, deep learning techniques used by practitioners in industry, including deep feedforward networks, regularization, optimization algorithms, convolutional networks, sequence modeling, and practical methodology, and other theoretical topics.

Learn how to create deep learning and neural network systems with PyTorch and learn best practices for the entire deep learning pipeline for advanced projects.

Learn what LSTMs are, and how to develop a suite of LSTM models using Keras and TensorFlow 2. This book cuts through the math, research papers and patchwork descriptions about LSTMs.

Learn how to build practical computer vision based deep learning applications that can be deployed on the cloud, mobile, browsers, or edge devices using a hands-on approach.

Learn essential concepts in deep learning through visualization with little math.

Learn how to choose the most efficient and effective way to accomplish key tasks when multiple options exist, and how to write Python code that's easier to understand, maintain, and improve.

Learn best practices and little-known tricks to round out your Python knowledge.

Learn how to identify and sove the bottlenecks in your applications, write efficient numerical code in NumPy and Cython, and adapt your programs to run on multiple processors with parallel programming.

Learn the core Python language as well as tasks common to a wide variety of application domains such as data structures and algorithms, classes and objects, metaprogramming, modules and packages, testing, debugging, and exceptions.

Learn how to query web servers, request data, and parse it to extract the information you need using tools such as requests, BeautifulSoup, Scrapy, APIs and how to store, read, and clean the data you scrape.

Learn how to how to land your first job to the lifecycle of a data science project, and how to become a manager.

Contributions are always welcome! If you know some interesting books or other categories that should be here but are not, feel free to contribute! To contribute, follow four steps below:

- Fork the repo
- Add new resources using the same markdown format.
- Start the book summary with “Learn…”
- Submit the pull request

That’s it. As soon as I review your pull request, your resources will be added to this page.

Alternatively, you can create an issue with book recommendation.