Data Science: A First Introduction with Python focuses on using the Python programming language in Jupyter notebooks to perform data manipulation and cleaning, create effective visualizations, and extract insights from data using classification, regression, clustering, and inference. It emphasizes workflows that are clear, reproducible, and shareable, and includes coverage of the basics of version control. Based on educational research and active learning principles, the book uses a modern approach to Python and includes accompanying autograded Jupyter worksheets for interactive, self-directed learning. The text will leave readers well-prepared for data science projects. It is designed for learners from all disciplines with minimal prior knowledge of mathematics and programming. The authors have honed the material through years of experience teaching thousands of undergraduates at the University of British Columbia. Key Features: Includes autograded worksheets for interactive, self-directed learning. - Introduces readers to modern data analysis and workflow tools such as Jupyter notebooks and GitHub, and covers cutting-edge data analysis and manipulation Python libraries such as pandas, scikit-learn, and altair. - Is designed for a broad audience of learners from all backgrounds and disciplines. "This book offers a clear, thoughtful, and systematic treatment of the fundamentals of data science, with accompanying Python code. As its name implies, it is truly an introduction, and is suitable for those who wish to self-teach Python and data science, as well as to college instructors teaching a first course in data science. With a diverse set of topics that includes (among others) getting data from the web, visualization, cross-validation, clustering, and version control, this book is a one-stop shop that will be a valuable resource for years to come." - Daniela Witten , University of Washington. "The authors of this new textbook are expert teachers as well as data scientists, and that expertise is reflected in each chapter and every exercise. Topics are introduced in a digestible order, examples are approachable and well-motivated, and all the code is presented in digestible, carefully-explained pieces. If you are using Python to introduce students to reproducible quantitative analysis, this "First Introduction" should be your first choice." - Greg Wilson , Third-Bit Inc. "This book provides a sophisticated first introduction to the field of data science and provides a balanced mix of practical skills along with generalizable principles. As we continue to introduce students to data science and train them to confront an expanding array of data science problems, they will be well-served by the ideas presented here." - Roger Peng , John Hopkins Unviersity - From the Foreword "… The authors provide a friendly, effective on-ramp to programmatic data analysis with Python and key packages for data analysis (e.g., pandas, altair, and scikit-learn). I appreciate the coverage of critical practical matters, which are often neglected or written off as “out of scope”, such as navigating the file system, developing a sustainable workflow, and using version control..." - Jenny Bryan , Posit "This book is a comprehensive introduction to data science … In addition to data wrangling and visualization with pandas and altair, the book also provides a deep dive into statistical modeling and inference with the scikit-learn framework, which makes this book an incredibly valuable addition to the landscape of introductory data science books. " - Mine Çetinkaya-Rundel, Professor of the Practice at Duke University and Educator at Posit " … This book starts off by working with data and visualizing it, then levels up quickly to high impact topics like predictive modeling, inference, and collaboration with version control … These are topics that are tricky to squeeze into an intro text. But this book is not intimidating … This book has also been field-tested by a highly respected data science education program at the University of British Columbia… making it an ideal resource that educators can trust and rely on to freshen up their own materials and workflows." - Alison Hill, IBM "I made it 10 per cent of the way through Timbers et al before I learnt something new. Frankly I was surprised I made it so far. Data science pedagogy has been so disjoint and so many of us are self-taught that it is refreshing to have a class-room-tested textbook that is focused on workflows and reproducibility. The approaches are rigorous and opinionated, and the text is filled with kindness and warmth. It is the book that I wish I had when I first came to learn this material. The book is unashamedly focused on the newest innovations, and the use of Python makes it especially widely applicable. Going through the book I found myself learning things, on average, at roughly one-thing-per-page, which was an exciting experience for someone who spends his d