The Real Python Podcast

Speeding Up Your DataFrames With Polars

19 snips
Jan 13, 2023
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ADVICE

Use Fetch & Collect Strategically

  • Use fetch in lazy mode to limit output rows during development.
  • Use collect to execute full optimized queries in Polars.
INSIGHT

Apache Arrow: Memory Format Standard

  • Apache Arrow is a memory format specification and libraries optimized for tabular data.
  • It ensures fast memory access, uniform missing data handling, and zero-copy between processes.
INSIGHT

Arrow Enables Efficient Data Sharing

  • Arrow's immutability and zero-copy sharing enable efficient multi-threading and process sharing.
  • Polars and other tools can share Arrow data without costly copying.
Get the Snipd Podcast app to discover more snips from this episode
Get the app