๐ Today’s Picks
Faster Table Joins with Polars Multi-Threading
Problem
pandas processes joins on a single CPU core, leaving other cores idle during large table operations.
Solution
Polars distributes join operations across all available CPU cores, achieving significantly faster joins than pandas on large datasets.
What makes Polars fast:
- Processes rows in parallel batches
- Uses all available CPU cores
- Zero configuration required
๐ Worth Revisiting
Faster Polars Queries with Programmatic Expressions
Problem
When you want to use for loops to apply similar transformations, each Polars with_columns() call processes sequentially.
This prevents the optimizer from seeing the full computation plan.
Solution
Instead, generate all Polars expressions programmatically before applying them together.
This enables Polars to:
- See the complete computation plan upfront
- Optimize across all expressions simultaneously
- Parallelize operations across CPU cores
โ๏ธ Weekly Finds
Mole [Python Utils] – Deep clean and optimize your Mac with a simple command-line tool.
marker [LLM] – Convert PDF, DOCX, PPTX, and other documents to markdown with high speed and accuracy.
pathway [Data Engineer] – Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Looking for a specific tool? Explore 70+ Python tools โ
Stay Current with CodeCut
Actionable Python tips, curated for busy data pros. Skim in under 2 minutes, three times a week.


