
Machine Learning

Covalent: Pythonic Tool to Iterate Quickly on Large ML Models

It is challenging to iterate quickly on large ML models in a local environment.

Advanced computing hardware can help, but it may be expensive if only needed for a subset of the code.

With Covalent, you can:

Assign resource-intensive functions to advanced hardware.

Test these functions on local servers before deploying them to expensive hardware.


MLEM: Capture Your Machine Learning Model’s Metadata

The metadata of a machine learning model provides important information about the model such as:

Hash value

Model methods

Input data schema

Python requirements used to train the model.

This information enables others to reproduce the model and its results.

With MLEM, you can save both the model and its metadata in a single line of code.

Link to MLEM.

Deploy your model with MLEM.


Evaluate Your ML Model Performance with Simple Model Comparison

How do you check if your ML model is trained properly? One approach is to use a simple model for comparison.

A simple model establishes a minimum performance benchmark for the given task. A model that scores lower than, or only similar to, the simple model indicates a possible problem with the model.

Deepchecks' simple model comparison check automates this evaluation.
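As a plain scikit-learn sketch of the same idea (not Deepchecks' API), you can compare your model against a trivial most-frequent-class baseline:

```python
from sklearn.datasets import make_classification
from sklearn.dummy import DummyClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Baseline: always predict the most frequent class
baseline = DummyClassifier(strategy="most_frequent").fit(X_train, y_train)
model = LogisticRegression().fit(X_train, y_train)

baseline_score = baseline.score(X_test, y_test)
model_score = model.score(X_test, y_test)
# A properly trained model should clearly beat the baseline
```

If `model_score` is at or below `baseline_score`, the model has likely learned nothing useful from the features.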

Link to Deepchecks.

My previous tips on testing.


Validation Curve: Determine if an Estimator Is Underfitting or Overfitting

To find the hyperparameter where the estimator is neither underfitting nor overfitting, use Yellowbrick’s validation curve.

On the resulting plot, max_depth > 2 yields a higher training score but a lower cross-validation score. This indicates that the model is overfitting.

Thus, the sweet spot is where the cross-validation score neither increases nor decreases: max_depth = 2.
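The same curve can be computed with plain scikit-learn's `validation_curve` (a sketch of the underlying computation, not Yellowbrick's API):

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import validation_curve
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=300, random_state=0)
param_range = [1, 2, 3, 5, 8]

# One row of scores per max_depth value, one column per CV fold
train_scores, cv_scores = validation_curve(
    DecisionTreeClassifier(random_state=0), X, y,
    param_name="max_depth", param_range=param_range, cv=5,
)

train_mean = train_scores.mean(axis=1)
cv_mean = cv_scores.mean(axis=1)
# Pick the depth where cv_mean peaks; a growing gap between
# train_mean and cv_mean signals overfitting
```

Plotting `train_mean` and `cv_mean` against `param_range` reproduces the validation curve described above.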

Link to Yellowbrick.

My full article about Yellowbrick.


River: Online Machine Learning in Python

Batch learning trains an ML model on the entire dataset at once. As the data grows, retraining the model takes more time and resources.

In online learning, the model learns incrementally on a small group of observations instead of an entire dataset. 

Thus, each learning step is fast and cheap, which makes it ideal:

For applications that change rapidly

For companies with limited computing resources.
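To illustrate the incremental-learning idea with scikit-learn rather than River's own API, `SGDClassifier.partial_fit` updates a model one small batch at a time:

```python
import numpy as np
from sklearn.linear_model import SGDClassifier

rng = np.random.default_rng(0)
model = SGDClassifier(random_state=0)
classes = np.array([0, 1])

# Simulate a stream: the model never sees the full dataset at once
for _ in range(50):
    X_batch = rng.normal(size=(10, 4))
    y_batch = (X_batch[:, 0] + X_batch[:, 1] > 0).astype(int)
    # Each update is fast and cheap
    model.partial_fit(X_batch, y_batch, classes=classes)

pred = model.predict(rng.normal(size=(5, 4)))
```

River offers the same incremental workflow with a richer set of online algorithms and metrics.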

In my latest article, you will learn how to use River to do machine learning on streaming data.

Article.

Code.


Check Conflicting Labels with Deepchecks

Sometimes, your data might have identical samples with different labels. This might be because the data was mislabeled.

It is good to identify these conflicting labels in your data before using the data to train your ML model. To check conflicting labels in your data, use deepchecks. 
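The check itself can be sketched in plain pandas (an illustration of the idea, not deepchecks' implementation): group by the feature columns and flag rows whose feature combination maps to more than one label.

```python
import pandas as pd

df = pd.DataFrame({
    "feature_1": [1, 1, 2, 3],
    "feature_2": ["a", "a", "b", "c"],
    "label":     [0, 1, 0, 1],
})

features = ["feature_1", "feature_2"]
# Number of distinct labels per identical feature combination
n_labels = df.groupby(features)["label"].transform("nunique")
conflicts = df[n_labels > 1]
```

Here `conflicts` contains samples 0 and 1, which share the same features but carry different labels.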

In this example, deepchecks identified that samples 0 and 1 have the same features but different labels.

Link to deepchecks.

My previous tips on testing.


Deepchecks + Weights & Biases: Test and Track Your ML Model and Data

Weights & Biases is a tool to track and monitor your ML experiments. Deepchecks is a tool that lets you easily create test suites for your ML models and data.

The checks in a suite include:

🔎 model performance

🔎 data integrity

🔎 distribution mismatches

and more.

You can now track a deepchecks suite's results with Weights & Biases.

Here is how to create and track a test suite.

