About Article

Detect Change Points with Bayesian Inference and PyMC3

After looking at the views of your website, you suspect there might be a sudden change in the number of views after a certain date.

If there is indeed a date when the number of views changes suddenly, how do you find that date?

Wouldn’t it be nice if you could combine the observed data with some prior beliefs to estimate the change point with high certainty? That is when Bayesian inference comes in handy.

In my latest article, you will learn what Bayesian inference is and how to use PyMC3 to perform Bayesian analysis.
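To make the idea concrete, here is a minimal sketch that does not use PyMC3 and is not the article's method. It assumes daily views follow a Poisson distribution with one rate before the change and another after, puts a Gamma prior on each rate, and puts a uniform prior on the change point. Under those assumptions the posterior over the change point can be computed exactly by enumeration. The view counts, the prior parameters, and the function names are all made up for illustration.

```python
import math


def log_marginal(counts, shape=1.0, rate=0.1):
    """Log marginal likelihood of Poisson counts with a Gamma(shape, rate) prior on the rate.

    The factorial terms depend only on the data, not on the change point,
    so they are dropped: they cancel when the posterior is normalized.
    """
    n, total = len(counts), sum(counts)
    return (
        shape * math.log(rate)
        - math.lgamma(shape)
        + math.lgamma(shape + total)
        - (shape + total) * math.log(rate + n)
    )


def change_point_posterior(counts):
    """Posterior over tau, the number of observations before the change.

    Assumes a uniform prior on tau = 1 .. len(counts) - 1.
    """
    taus = range(1, len(counts))
    log_post = [log_marginal(counts[:t]) + log_marginal(counts[t:]) for t in taus]
    peak = max(log_post)  # subtract the maximum for numerical stability
    weights = [math.exp(lp - peak) for lp in log_post]
    norm = sum(weights)
    return {t: w / norm for t, w in zip(taus, weights)}


# Hypothetical daily views: about 20 per day for 10 days, then about 40 per day.
views = [18, 22, 19, 21, 20, 23, 17, 20, 21, 19,
         41, 38, 43, 40, 39, 42, 37, 44, 40, 41]

posterior = change_point_posterior(views)
most_likely = max(posterior, key=posterior.get)
print(f"Most likely change after day {most_likely}")
```

A probabilistic programming library such as PyMC3 handles models where this kind of closed-form calculation is not available, by sampling from the posterior instead.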

Link to the article.

Link to the source code.

Kedro — A Python Framework for Reproducible Data Science Project

Have you ever passed your data through a list of functions and classes without knowing for sure what the output looks like?
Another common issue is that it’s hard to understand the relationships between functions when a single Python script contains both the code that defines the functions and the code that executes them.
Wouldn’t it be nicer if you could visualize how the inputs and outputs of different functions are connected as shown in the image above?
That is when Kedro comes in handy. In my latest article, you will learn how to create reproducible, maintainable, and modular data science code with Kedro.
Link to the article.
Link to the source code.

Simulate Real-life Events in Python Using SimPy

As a restaurant manager, you want to approximate how much food your restaurant needs to prepare for tomorrow. You know how many customers come to your restaurant per day and the average time it takes to serve one customer.
However, it is challenging to put all of these variables into one calculation. Wouldn’t it be great if you could simulate this event using Python?
That is when SimPy comes in handy. In my latest article, you will learn how to use SimPy to simulate a restaurant.
Link to the article.
Link to the source code.

How to Create Bindings and Conditions Between Multiple Plots Using Altair

Have you ever wanted to see one plot change when you interact with another plot like above? That is when Altair comes in handy.
Altair is a Python library that provides a concise visualization grammar for quickly building statistical graphics.
In my latest article, I show how you can create bindings and conditions between multiple plots using Altair.
Link to the article about Altair.
Link to the source code.

Pytest for Data Scientists

As a data scientist, you put your code into functions and classes to make it reproducible, but the code may still break because of some of those functions. Even if your code doesn’t break, how do you know that a function works as you expect?
In general, you should use testing for your data science projects because it allows you to:

Make sure the code works as expected
Detect edge cases
Feel confident swapping your existing code for improved code without being afraid of breaking the entire pipeline

There are many Python tools available for testing, but the easiest one to pick up is pytest. I like pytest because it helps me write tests with minimal code. If you are not familiar with testing, pytest is a great tool to get started with.
In this article, I provide some simple examples and short explanations to get you started.
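As a rough illustration of how little code a pytest test needs, here is a minimal sketch. The function `fill_missing_with_mean` and its tests are hypothetical and are not taken from the article. They cover one expected case, one no-op case, and one edge case.

```python
def fill_missing_with_mean(values):
    """Replace None entries with the mean of the non-missing values."""
    present = [v for v in values if v is not None]
    if not present:
        # Nothing to average, so return a copy unchanged.
        return list(values)
    mean = sum(present) / len(present)
    return [mean if v is None else v for v in values]


def test_fills_missing_with_mean():
    assert fill_missing_with_mean([1.0, None, 3.0]) == [1.0, 2.0, 3.0]


def test_leaves_complete_data_unchanged():
    assert fill_missing_with_mean([4.0, 5.0]) == [4.0, 5.0]


def test_empty_input_is_an_edge_case():
    assert fill_missing_with_mean([]) == []
```

Saved in a file whose name starts with `test_`, for example `test_fill.py`, these functions are collected and run by the `pytest` command. Each test is a plain function with a bare `assert`, with no test class or setup code needed.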
