
Visualization

Drag-and-Drop Visualizations with PyGWalker

Exploratory data analysis (EDA) is a crucial step in any data science project, but it can be time-consuming for large datasets.

PyGWalker simplifies the process by letting you drag and drop variables to build charts without writing much code.

You can use PyGWalker without changing your existing workflow. For example, load a DataFrame with pandas as usual:

import pygwalker as pyg
import pandas as pd

# Load the bike-sharing dataset and parse the date column as datetimes
df = pd.read_csv(
    "https://kanaries-app.s3.ap-northeast-1.amazonaws.com/public-datasets/bike_sharing_dc.csv",
    parse_dates=["date"],
)
df.head(10)

Output:

         date  month  season  hour  year holiday  temperature  feeling_temp  \
0  2011-01-01      1  winter     0  2011      no         3.28        3.0014
1  2011-01-01      1  winter     1  2011      no         2.34        1.9982
2  2011-01-01      1  winter     2  2011      no         2.34        1.9982
3  2011-01-01      1  winter     3  2011      no         3.28        3.0014
4  2011-01-01      1  winter     4  2011      no         3.28        3.0014

   humidity  winspeed  casual  registered  count  work yes or not am or pm  \
0      81.0       0.0       3          13     16                0       am
1      80.0       0.0       8          32     40                0       am
2      80.0       0.0       5          27     32                0       am
3      75.0       0.0       3          10     13                0       am
4      75.0       0.0       0           1      1                0       am

   Day of the week
0                6
1                6
2                6
3                6
4                6

And then just walk around!
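
A single call then renders the drag-and-drop interface right inside the notebook (pyg.walk is PyGWalker's entry point):

walker = pyg.walk(df)

Drag fields from the sidebar onto the chart shelves to build visualizations, much as you would in a BI tool.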

Link to PyGWalker.

Run in Google Colab.

Phoenix: Visualize High-Dimensional Data to Identify Performance Issues

When system performance degrades, pinpointing the underlying cause can be challenging, especially with datasets that contain many features.

Phoenix uses UMAP to project high-dimensional data from those degraded periods into a low-dimensional view, making it easier to identify clusters of problematic data.
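
As a rough sketch of how this can look in code (the column names here are hypothetical, and Phoenix's API has changed across versions, so treat this as an outline rather than exact usage): describe your columns with a schema, wrap the production and baseline DataFrames as datasets, and launch the app to explore the UMAP embedding view.

import phoenix as px

# prod_df and train_df are pandas DataFrames you have already loaded;
# the column names below are hypothetical placeholders.
schema = px.Schema(
    prediction_label_column_name="predicted_label",
    actual_label_column_name="actual_label",
    embedding_feature_column_names={
        "text_embedding": px.EmbeddingColumnNames(vector_column_name="embedding"),
    },
)

# Compare data from the degraded period against a training baseline.
prod_ds = px.Dataset(dataframe=prod_df, schema=schema, name="production")
train_ds = px.Dataset(dataframe=train_df, schema=schema, name="training")

session = px.launch_app(primary=prod_ds, reference=train_ds)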

Link to Phoenix.

Uniplot: Terminal-Based Plotting for Enhanced Data Science Pipelines

Uniplot is a lightweight library that draws plots directly in the terminal. Because it does not depend on Jupyter Notebook, you can use it anywhere text is rendered, such as inside a data science CI/CD pipeline.

As a result, when a problem occurs, you get not only the traceback but also plots in the logs that help you pinpoint the issue.
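
Here is a minimal sketch of what that can look like in a pipeline or test script (the sine-wave data is just a placeholder):

from uniplot import plot
import numpy as np

# Draw a line plot of a sine wave directly in the terminal / CI log output.
x = np.linspace(0, 2 * np.pi, 100)
plot(xs=x, ys=np.sin(x), lines=True, title="sin(x)")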

Link to uniplot.
