Feature Engineer

add_datepart: Add Relevant DateTime Features in One Line of Code

When working with time series, features such as the year, month, week, day of the week, day of the year, and whether a date falls at the end of the year can be really helpful for predicting future events. Is there a way to get all of those features in one line of code?
Fastai’s add_datepart function allows you to do exactly that: pass it a DataFrame and the name of a date column, and it adds these features as new columns.
Link to Fastai.
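
To make the idea concrete, here is a minimal pandas-only sketch of the kind of columns add_datepart derives. It is not fastai's implementation: the helper name add_date_features, the sample data, and the exact column names are assumptions for illustration, and fastai's real output includes more columns than shown here.

```python
import pandas as pd

# With fastai installed, the one-liner would look like this:
#   from fastai.tabular.core import add_datepart
#   df = add_datepart(df, "date")


def add_date_features(df: pd.DataFrame, field_name: str, drop: bool = True) -> pd.DataFrame:
    """Hypothetical helper: approximate a few add_datepart-style features."""
    out = df.copy()
    dates = pd.to_datetime(out[field_name])
    out["Year"] = dates.dt.year
    out["Month"] = dates.dt.month
    out["Week"] = dates.dt.isocalendar().week.astype(int)  # ISO week number
    out["Day"] = dates.dt.day
    out["Dayofweek"] = dates.dt.dayofweek  # Monday is 0
    out["Dayofyear"] = dates.dt.dayofyear
    out["Is_year_end"] = dates.dt.is_year_end
    if drop:
        # Drop the original date column once the features are extracted
        out = out.drop(columns=[field_name])
    return out


# Made-up sample data
df = pd.DataFrame({"date": ["2020-02-05", "2020-02-06", "2020-12-31"], "val": [1, 2, 3]})
features = add_date_features(df, "date")
print(features)
```

The sketch drops the original date column by default, which mirrors the drop=True default of add_datepart.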


Geopy: Extract Location Based on Python String

If you work with location data, you might want to visualize it on a map. Geopy makes it easy to look up the coordinates of addresses across the globe based on a Python string.
After defining the app name and passing in the location, all you need to extract information about the location is location.address.
To extract the latitude and longitude, use location.latitude and location.longitude.
Try it with any address that you can think of. Geopy will usually figure it out.
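
The steps above could look roughly like the sketch below. It assumes the Nominatim geocoder (the post does not say which geocoder it used), and the function names locate and summarize as well as the app name "my_request" are hypothetical. The lookup needs network access and returns None when the service finds no match.

```python
def summarize(location):
    """Collect the attributes mentioned above into a plain dict."""
    return {
        "address": location.address,
        "latitude": location.latitude,
        "longitude": location.longitude,
    }


def locate(query, app_name="my_request"):
    """Look up a free-form address string; returns None if nothing matches."""
    # Imported here so the module still loads where geopy is not installed.
    from geopy.geocoders import Nominatim

    geolocator = Nominatim(user_agent=app_name)  # the "app name"
    location = geolocator.geocode(query)         # the location string
    if location is None:
        return None
    return summarize(location)


# Example call (performs a network request):
#   info = locate("30 North Circle Drive")
#   print(info["address"], info["latitude"], info["longitude"])
```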


Decompose high dimensional data into two or three dimensions

If you want to decompose high-dimensional data into two or three dimensions to visualize it, what should you do? A common technique is PCA. Even though PCA is useful, I always found it complicated to create a PCA plot until I found this doc in Yellowbrick.
I really recommend using this tool if you want to visualize PCA in a few lines of code.
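
As a rough illustration of those few lines, the sketch below wraps Yellowbrick's PCA visualizer in a small helper. The helper name plot_pca and its arguments are assumptions, not something from the original post; X is a feature matrix, y the labels, and classes the label names shown in the legend.

```python
def plot_pca(X, y, classes, dims=2):
    """Hypothetical helper: project X onto 2 or 3 principal components and plot."""
    if dims not in (2, 3):
        raise ValueError("dims must be 2 or 3")

    # Imported here so the module still loads where yellowbrick is not installed.
    from yellowbrick.features import PCA

    # scale=True standardizes the features before the decomposition
    visualizer = PCA(scale=True, projection=dims, classes=classes)
    visualizer.fit_transform(X, y)
    visualizer.show()
    return visualizer


# Example call, assuming scikit-learn is available for the sample data:
#   from sklearn.datasets import load_iris
#   data = load_iris()
#   plot_pca(data.data, data.target, list(data.target_names), dims=3)
```

Passing dims=3 switches to a three-dimensional scatter plot; everything else stays the same.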

