Workflow Automation

Knock Knock: Get Alerts When Your Models Finish Training

Training a model can be a time-consuming process, often taking hours or days, and you may not always be at your computer when it completes. Wouldn’t it be nice to get an email notification once your code has finished executing?

Knock Knock is a lightweight library that sends notifications to your email, Slack, Microsoft Teams, text messages, Discord, and other channels. All it takes is a single decorator on your main function.
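To make the decorator idea concrete, here is a minimal sketch of the pattern in plain Python. It is not Knock Knock's own code or API: `notify_when_done`, `train_model`, and the `sent_messages` list are hypothetical names used only for illustration, and the list stands in for a real email or Slack sender.

```python
import functools
import time
from typing import Any, Callable


def notify_when_done(send: Callable[[str], None]) -> Callable:
    """Build a decorator that reports a function's outcome through `send`."""

    def decorator(func: Callable[..., Any]) -> Callable[..., Any]:
        @functools.wraps(func)
        def wrapper(*args: Any, **kwargs: Any) -> Any:
            start = time.perf_counter()
            try:
                result = func(*args, **kwargs)
            except Exception as exc:
                # Report the crash, then re-raise so the caller still sees it.
                elapsed = time.perf_counter() - start
                send(f"{func.__name__} crashed after {elapsed:.1f}s: {exc!r}")
                raise
            elapsed = time.perf_counter() - start
            send(f"{func.__name__} finished in {elapsed:.1f}s, returned {result!r}")
            return result

        return wrapper

    return decorator


# Hypothetical stand-in for an email or Slack sender: collect messages in a list.
sent_messages = []


@notify_when_done(sent_messages.append)
def train_model(epochs: int) -> float:
    # Placeholder for a long-running training loop; returns a dummy score.
    return epochs / 4


score = train_model(epochs=3)
```

The training function itself stays untouched. Swapping the notification channel only means passing a different `send` callable, which is the convenience a decorator-based library offers.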

Link to Knock Knock.

Taipy: Scenario Management in Data Pipeline Development

As a data user, you may want to explore various scenarios by providing different inputs and observing how the metrics evolve.

Taipy enables you to prototype the pipeline locally on your machine, then turn it into an intuitive user interface that is ready to use and share with your colleagues.

Link to Taipy.

Simplify Your Workflows with Kestra’s User-Friendly Interface

Kestra is an ideal choice for an open-source orchestrator if you’re looking for the following features:

Language-agnostic workflows.

Seamless integration with your existing tool stack.

An embedded code editor that permits writing code directly from the UI.

Comprehensive dashboard for monitoring workflows.

Link to Kestra.

Streamline dbt Testing with DataPilot’s Power User VSCode Extension

Ensuring data quality and reliability is crucial in dbt projects.

DataPilot’s Power User VSCode extension provides a seamless testing experience in dbt, enabling you to catch data issues early in the development process. With the extension, you can:

Validate your dbt code against best practices

Troubleshoot using real-time column lineage

Generate dbt tests with ease

Link to DataPilot VSCode extension.

Streamlining Code Review with Sourcery

Manually reviewing code changes in pull requests (PRs) can be time-consuming and error-prone, especially in large projects or teams. Sourcery can streamline this process by reviewing PRs automatically.

After submitting a PR, Sourcery quickly reviews the code, checking for bugs and code quality, allowing developers to focus on more complex tasks.

Link to Sourcery.

Automate Weekly Data Monitoring and Sharing with Kestra

Suppose you need to query a CSV file and share the results in a Slack channel every Monday so that your team can monitor the data.

Completing this task manually each week can be inefficient and repetitive.

However, with Kestra, you can streamline this process by automating it with just a few lines of YAML code.

Link to Kestra.

Streamline Anomaly Detection and Notification with Kestra

Timely detection and notification of data anomalies are crucial for stakeholders to address potential issues promptly. 

Kestra, an open-source orchestrator, simplifies this process by enabling you to create a workflow using a YAML file.

In the given example, a DuckDB query is used to identify anomalies, and if any are detected, an email with the anomalous rows in a CSV file is sent to relevant parties.
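The detect-then-notify logic can be sketched in plain Python. This is not a Kestra flow, and it swaps the DuckDB query for a simple threshold filter. The `orders` rows, the `amount` column, the limit of 500, and the `outbox` list (standing in for an email sender) are all assumptions made for illustration.

```python
import csv
import io
from typing import Callable, Dict, List

Row = Dict[str, str]


def find_anomalies(rows: List[Row], column: str, limit: float) -> List[Row]:
    """Return the rows whose `column` value exceeds `limit`."""
    return [row for row in rows if float(row[column]) > limit]


def rows_to_csv(rows: List[Row]) -> str:
    """Serialize the rows to CSV text, using the first row's keys as header."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=list(rows[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def check_and_notify(
    rows: List[Row], column: str, limit: float, send: Callable[[str], None]
) -> bool:
    """Send the anomalous rows as CSV only if at least one row is anomalous."""
    anomalies = find_anomalies(rows, column, limit)
    if not anomalies:
        return False
    send(rows_to_csv(anomalies))
    return True


# Hypothetical data: one order amount is far above the others.
orders = [
    {"order_id": "1", "amount": "20.0"},
    {"order_id": "2", "amount": "950.0"},
    {"order_id": "3", "amount": "35.5"},
]

outbox = []  # stands in for an email sender
notified = check_and_notify(orders, "amount", 500.0, outbox.append)
```

An orchestrator adds scheduling, retries, and the actual email delivery around these two steps. The conditional send is the part worth noticing: nothing goes out when the data looks normal.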

Link to Kestra.

Transform PDFs to Markdown with Marker

Markdown files are more lightweight than PDFs, integrate seamlessly with version control systems like Git, and are easier to edit.

To convert PDFs to Markdown, use Marker, which provides the following features:

Support for a range of PDF documents (optimized for books and scientific papers)

Removal of headers/footers/other artifacts

Conversion of most equations into LaTeX

Formatting of code blocks and tables

Link to Marker.