About Article Archives

5 Steps to Transform Messy Functions into Production-Ready Code

Leave a Comment / About Article / Khuyen Tran

In a data science project, writing poorly designed functions can introduce maintenance hurdles and diminish the code’s readability.

In this article, you will learn how to create a function that:

Perform a single, well-defined task

Can be extended without modifying the original code

Are capable of handling inputs with unexpected variations

By following these principles, you’ll be able to create functions that are not only effective but also easy to maintain and understand.

Link to the article.

5 Steps to Transform Messy Functions into Production-Ready Code Read More »

How to Structure an ML Project for Reproducibility and Maintainability

Leave a Comment / About Article / Khuyen Tran

Getting started is often the most challenging part when building ML projects.

This article will show you how to use a template to create a maintainable and reproducible project, consolidating the best practices I’ve learned over the years.

Article.

Template.

How to Structure an ML Project for Reproducibility and Maintainability Read More »

SHAP: Explain Any Machine Learning Model in Python

Leave a Comment / About Article / Khuyen Tran

If you want to explain the output of your machine learning model, use SHAP.

In the code above, I use SHAP’s summary plot to visualize the overall impact of features in a DataFrame.

My full article about SHAP.

Link to SHAP.

SHAP: Explain Any Machine Learning Model in Python Read More »

Create Observable and Reproducible Notebooks with Hex

Leave a Comment / About Article / Khuyen Tran

Jupyter Notebook is not ideal for interpretability, reproducibility, and versioning for numerous reasons. Hex notebooks solve these issues with a graph-based execution model.

Hex links cells through their dependencies and executes only the cells whose dependencies change. The GIF above demonstrates this.

In my latest article, you will learn some useful features of Hex and how to integrate Hex into your data pipeline with Prefect.

Link to the article.

Create Observable and Reproducible Notebooks with Hex Read More »

Caching Your Python Functions with Prefect

Leave a Comment / About Article / Khuyen Tran

Caching allows you to efficiently reuse the results of tasks that may be expensive to run without actually running the code that defines the task.

In this short video, you will learn how to cache your Python functions with Prefect.

View the video on YouTube.

Caching Your Python Functions with Prefect Read More »

Check Inputs and Outputs of Your Python Function in a Data Science Project

Leave a Comment / About Article / Khuyen Tran

Pandera is a Python library to validate your pandas DataFrame. In this video, you will learn how to use Pandera to test the inputs and outputs of your functions.

View the video on YouTube.

Pandera basics.

Check Inputs and Outputs of Your Python Function in a Data Science Project Read More »

How to Validate Your pandas DataFrame with Pandera

Leave a Comment / About Article / Khuyen Tran

In a data science project, it is important to test your data to make sure they work as you expected.

In this video, you will learn how to test your pandas DataFrame with Pandera.

View the video on YouTube.

Source code.

How to Validate Your pandas DataFrame with Pandera Read More »

collections.Counter: Count The Occurrences of Items in a List

Leave a Comment / About Article / Khuyen Tran

collections is a built-in Python library to deal with Python dictionaries efficiently. This video will show you how to efficiently count the occurrences of each item in a list using collections Counter.

Watch the video on YouTube.

collections.Counter: Count The Occurrences of Items in a List Read More »

How to Create Mathematical Animations like 3Blue1Brown Using Python

Leave a Comment / About Article / Khuyen Tran

3Blue1Brown is a famous math YouTube channel created by Grant Sanderson. Wouldn’t it be cool if you can create animations like 3Blue1Brown in Python?

If so, try manim. In my article, you will learn how to create mathematical animations like below using manim.

Link to the article.

Link to the source code.

How to Create Mathematical Animations like 3Blue1Brown Using Python Read More »

atoti — Build a BI Platform in Python

Leave a Comment / About Article, Dashboard / Khuyen Tran

Have you ever taken 15 minutes or so just to manipulate the data and create a plot in Python? Wouldn’t it be nice if you can quickly extract insights from data by simply clicking and dragging like below?

That is when atoti comes in handy. With atoti, you can quickly:

Create different scenarios and compare them side by side

Create and gain insights from a multi-dimensional dataset

Create interactive visualization on Jupyter lab without coding

In my latest article, you will learn how to quickly create a dashboard in Python and share it with others using atoti.

Link to the article.

atoti — Build a BI Platform in Python Read More »

About Article

5 Steps to Transform Messy Functions into Production-Ready Code

How to Structure an ML Project for Reproducibility and Maintainability

SHAP: Explain Any Machine Learning Model in Python

Create Observable and Reproducible Notebooks with Hex

Caching Your Python Functions with Prefect

Check Inputs and Outputs of Your Python Function in a Data Science Project

How to Validate Your pandas DataFrame with Pandera

collections.Counter: Count The Occurrences of Items in a List

How to Create Mathematical Animations like 3Blue1Brown Using Python

atoti — Build a BI Platform in Python

Get in touch

Join the Newsletter

Follow Us on Social Media

About Article

Work with Khuyen Tran

Work with Khuyen Tran