November 3, 2020

chiphuyen/just-pandas-things

An ongoing list of pandas quirks

repo name chiphuyen/just-pandas-things
repo link https://github.com/chiphuyen/just-pandas-things
homepage
language Jupyter Notebook
size (curr.) 3842 kB
stars (curr.) 465
created 2020-06-29
license

just-pandas-things

This repo contains a few peculiar things I’ve learned about pandas that have made my life easier and my code faster. It isn’t a friendly tutorial for beginners, but a friendly introduction to pandas weirdness.

What’s in this repo?

  1. pandas is column-major, which is why row-based operations are slow
  2. SettingWithCopyWarning, or why we can’t have nice things
  3. Indexing and slicing
  4. Accessors
  5. Data exploration
  6. Common pitfalls
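
To give a flavor of the first two items, here is a minimal sketch. It is not taken from the repo's notebooks: the toy DataFrame and its column names `a` and `b` are made up for illustration. It contrasts a column-wise computation with a row-wise loop, and shows a single `.loc` assignment, which is the usual way to avoid the chained indexing that triggers SettingWithCopyWarning.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data; the column names "a" and "b" are assumptions for this sketch.
df = pd.DataFrame({"a": np.arange(5), "b": np.arange(5) * 2.0})

# Column-wise: each column is stored as one typed array,
# so this addition runs as a single vectorized operation.
col_total = float((df["a"] + df["b"]).sum())

# Row-wise: iterrows() builds a new Series for every row. Because a row
# spans columns of different dtypes, the values are upcast to a common dtype.
row_total = 0.0
row_dtypes = set()
for _, row in df.iterrows():
    row_dtypes.add(str(row.dtype))
    row_total += row["a"] + row["b"]

# One .loc call selects rows and column together and assigns on df itself,
# instead of chaining two indexing steps such as df[df["a"] > 2]["b"] = 0.0.
df.loc[df["a"] > 2, "b"] = 0.0
```

Both totals agree, but on a large frame the row-wise loop is typically far slower, since it creates one Python object per row instead of working on whole columns.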

I’ll continue updating this repo as I have more time. As I’m still learning pandas quirks, feedback is much appreciated.

Thanks Luke Metz, Vikram Tiwari, and Karson Elmgren for reviewing!
