Zum Hauptinhalt springen
Dekorationsartikel gehören nicht zum Leistungsumfang.
Thinking in Pandas
How to Use the Python Data Analysis Library the Right Way
Taschenbuch von Hannah Stepanek
Sprache: Englisch

48,14 €*

inkl. MwSt.

Versandkostenfrei per Post / DHL

Aktuell nicht verfügbar

Kategorien:
Beschreibung
Understand and implement big data analysis solutions in pandas with an emphasis on performance. This book strengthens your intuition for working with pandas, the Python data analysis library, by exploring its underlying implementation and data structures.
Thinking in Pandas introduces the topic of big data and demonstrates concepts by looking at exciting and impactful projects that pandas helped to solve. From there, you will learn to assess your own projects by size and type to see if pandas is the appropriate library for your needs. Author Hannah Stepanek explains how to load and normalize data in pandas efficiently, and reviews some of the most commonly used loaders and several of their most powerful options. You will then learn how to access and transform data efficiently, what methods to avoid, and when to employ more advanced performance techniques. You will also go over basic data access and munging in pandas and the intuitive dictionary syntax. Choosing the right DataFrame format, working with multi-level DataFrames, and how pandas might be improved upon in the future are also covered.
By the end of the book, you will have a solid understanding of how the pandas library works under the hood. Get ready to make confident decisions in your own projects by utilizing pandas¿the right way.

What You Will Learn
Understand the underlying data structure of pandas and why it performs the way it does under certain circumstances
Discover how to use pandas to extract, transform, and load data correctly with an emphasis on performance
Choose the right DataFrame so that the data analysis is simple and efficient.
Improve performance of pandas operations with other Python libraries

Who This Book Is For
Software engineers with basic programming skills in Python keen on using pandas for a big data analysis project. Python software developers interested in big data.
Understand and implement big data analysis solutions in pandas with an emphasis on performance. This book strengthens your intuition for working with pandas, the Python data analysis library, by exploring its underlying implementation and data structures.
Thinking in Pandas introduces the topic of big data and demonstrates concepts by looking at exciting and impactful projects that pandas helped to solve. From there, you will learn to assess your own projects by size and type to see if pandas is the appropriate library for your needs. Author Hannah Stepanek explains how to load and normalize data in pandas efficiently, and reviews some of the most commonly used loaders and several of their most powerful options. You will then learn how to access and transform data efficiently, what methods to avoid, and when to employ more advanced performance techniques. You will also go over basic data access and munging in pandas and the intuitive dictionary syntax. Choosing the right DataFrame format, working with multi-level DataFrames, and how pandas might be improved upon in the future are also covered.
By the end of the book, you will have a solid understanding of how the pandas library works under the hood. Get ready to make confident decisions in your own projects by utilizing pandas¿the right way.

What You Will Learn
Understand the underlying data structure of pandas and why it performs the way it does under certain circumstances
Discover how to use pandas to extract, transform, and load data correctly with an emphasis on performance
Choose the right DataFrame so that the data analysis is simple and efficient.
Improve performance of pandas operations with other Python libraries

Who This Book Is For
Software engineers with basic programming skills in Python keen on using pandas for a big data analysis project. Python software developers interested in big data.
Über den Autor

Hannah Stepanek is a software developer with a passion for performance and is an open source advocate. She has over seven years of industry experience programming in Python and spent about two of those years implementing a data analysis project using pandas.

Hannah was born and raised in Corvallis, OR, and graduated from Oregon State University with a major in Electrical Computer Engineering. She enjoys engaging with the software community, often giving talks at local meetups as well as larger conferences. In early 2019, she spoke at PyCon US about the pandas library and at OpenCon Cascadia about the benefits of open source software. In her spare time she enjoys riding her horse Sophie and playing board games.

Zusammenfassung

Establishes a foundation of understanding by exploring the underlying data structures that pandas is built on

Guides the reader through architecting a pandas based solution by emphasizing performance

Uses simple, practical, and exploratory examples to empower the reader to recognize when to use a given pandas feature

Inhaltsverzeichnis

Chapter 1: Introduction.- Chapter 2: Basic Data Access and Merging.- Chapter 3: How Pandas Works Under the Hood.- Chapter 4: Loading and Normalizing Data in pandas.- Chapter 5: Basic Data Transformation in pandas.- Chapter 6: The Apply Method.- Chapter 7: Groupby.- Chapter 8: Performance Improvements Beyond pandas.- Chapter 9: The Future of Pandas.- Appendix.-

Details
Erscheinungsjahr: 2020
Fachbereich: Programmiersprachen
Genre: Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
Inhalt: xi
186 S.
27 s/w Illustr.
186 p. 27 illus.
ISBN-13: 9781484258385
ISBN-10: 148425838X
Sprache: Englisch
Ausstattung / Beilage: Paperback
Einband: Kartoniert / Broschiert
Autor: Stepanek, Hannah
Auflage: 1st ed.
Hersteller: Apress
Apress L.P.
Maße: 235 x 155 x 12 mm
Von/Mit: Hannah Stepanek
Erscheinungsdatum: 06.06.2020
Gewicht: 0,312 kg
Artikel-ID: 118008730
Über den Autor

Hannah Stepanek is a software developer with a passion for performance and is an open source advocate. She has over seven years of industry experience programming in Python and spent about two of those years implementing a data analysis project using pandas.

Hannah was born and raised in Corvallis, OR, and graduated from Oregon State University with a major in Electrical Computer Engineering. She enjoys engaging with the software community, often giving talks at local meetups as well as larger conferences. In early 2019, she spoke at PyCon US about the pandas library and at OpenCon Cascadia about the benefits of open source software. In her spare time she enjoys riding her horse Sophie and playing board games.

Zusammenfassung

Establishes a foundation of understanding by exploring the underlying data structures that pandas is built on

Guides the reader through architecting a pandas based solution by emphasizing performance

Uses simple, practical, and exploratory examples to empower the reader to recognize when to use a given pandas feature

Inhaltsverzeichnis

Chapter 1: Introduction.- Chapter 2: Basic Data Access and Merging.- Chapter 3: How Pandas Works Under the Hood.- Chapter 4: Loading and Normalizing Data in pandas.- Chapter 5: Basic Data Transformation in pandas.- Chapter 6: The Apply Method.- Chapter 7: Groupby.- Chapter 8: Performance Improvements Beyond pandas.- Chapter 9: The Future of Pandas.- Appendix.-

Details
Erscheinungsjahr: 2020
Fachbereich: Programmiersprachen
Genre: Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
Inhalt: xi
186 S.
27 s/w Illustr.
186 p. 27 illus.
ISBN-13: 9781484258385
ISBN-10: 148425838X
Sprache: Englisch
Ausstattung / Beilage: Paperback
Einband: Kartoniert / Broschiert
Autor: Stepanek, Hannah
Auflage: 1st ed.
Hersteller: Apress
Apress L.P.
Maße: 235 x 155 x 12 mm
Von/Mit: Hannah Stepanek
Erscheinungsdatum: 06.06.2020
Gewicht: 0,312 kg
Artikel-ID: 118008730
Warnhinweis

Ähnliche Produkte

Ähnliche Produkte