Testing Archives

Check Conflicting Labels with Deepchecks

Leave a Comment / Machine Learning, Testing / Khuyen Tran

Sometimes, your data might have identical samples with different labels. This might be because the data was mislabeled.

It is good to identify these conflicting labels in your data before using the data to train your ML model. To check conflicting labels in your data, use deepchecks.

In the example above, deepchecks identified that samples 0 and 1 have the same features but different labels.

Link to deepchecks.

My previous tips on testing.
Favorite

Check Conflicting Labels with Deepchecks Read More »

Pytest skipif: Skip a Test When a Condition is Not Met

Leave a Comment / Testing / Khuyen Tran

If you want to skip a test when a condition is not met, use pytest skipif. For example, in the code above, I use skipif to skip a test if the python version is less than 3.9.

My previous tips on testing in Python.
Favorite

Pytest skipif: Skip a Test When a Condition is Not Met Read More »

pytest-steps: Share Data Between Tests

Leave a Comment / Testing / Khuyen Tran

Have you ever wanted to use the result of one test for another test? That is when pytest_steps comes in handy.

In the code above, I use the result of sum_test as the input of average_2_nums. The argument steps_data allows me to share the data between 2 tests.

Link to pytest_steps.

My previous tips on testing.
Favorite

pytest-steps: Share Data Between Tests Read More »

Assign IDs to Pytest Parametrize

Leave a Comment / Testing / Khuyen Tran

When using pytest parametrize, it can be difficult to understand the role of each test case. You can add the ids parameter to pytest parametrize to assign names to test cases.

In the code above, the first test case is shown as neg-neg instead of [-1–2]. This makes it easier for others to understand the roles of your test cases.

My previous tips on testing in Python.
Favorite

Assign IDs to Pytest Parametrize Read More »

Deepchecks + Weights & Biases: Test and Track Your ML Model and Data

Leave a Comment / Machine Learning, Testing / Khuyen Tran

Weight and Biases is a tool to track and monitor your ML experiments. deepchecks is a tool that allows you to create test suites for your ML models & data with ease.

The checks in a suite includes:

🔎 model performance

🔎 data integrity

🔎 distribution mismatches

and more.

Now you can track deepchecks suite’s results with Weights & Biases as shown above.

Here is how to create and track a test suite.
Favorite

Deepchecks + Weights & Biases: Test and Track Your ML Model and Data Read More »

pytest parametrize twice: Test All Possible Combinations of Two Sets of Parameters

Leave a Comment / Testing / Khuyen Tran

If you want to test the combinations of two sets of parameters, writing all possible combinations can be time-consuming and is difficult to read.

You can save your time by using pytest.mark.parametrize twice instead. From the output of pytest, we can see that all possible combinations of the given functions and inputs are tested.

My previous tips on testing.
Favorite

pytest parametrize twice: Test All Possible Combinations of Two Sets of Parameters Read More »

Checklist: Create Data to Test Your NLP Model

Leave a Comment / Natural Language Processing, Testing / Khuyen Tran

It can be time-consuming to create data to test edge cases of your NLP model. If you want to quickly create data to test your NLP models, use Checklist.

In the code above, I use Checklist’s Editor to create multiple examples of negation in one line of code.

My full article on Checklist.

Link to Checklist.
Favorite

Checklist: Create Data to Test Your NLP Model Read More »

ipytest: Unit Tests in IPython Notebooks

Leave a Comment / Jupyter Notebook, Testing / Khuyen Tran

It is important to create unit tests for your functions to make sure they work as you expected, even the experimental code in your Jupyter Notebook. However, it can be difficult to create unit tests in a notebook.

Luckily, ipytest allows you to run pytest inside the notebook environment. To use ipytest, simply add %%ipytest -qq inside the cell you want to run pytest.

Link to ipytest.

My previous tips on Jupyter Notebook.
Favorite

ipytest: Unit Tests in IPython Notebooks Read More »

Deepchecks: Check Category Mismatch Between Train and Test Set

Leave a Comment / Testing / Khuyen Tran

Sometimes, it is important to know if your test set contains the same categories in the train set. If you want to check the category mismatch between the train and test set, use Deepchecks’s CategoryMismatchTrainTest.

In the example above, the result shows that there are 2 new categories in the test set. They are ‘d’ and ‘e’.

Link to Deepchecks.

My previous tips on testing.
Favorite

Deepchecks: Check Category Mismatch Between Train and Test Set Read More »

hypothesis: Property-based Testing in Python

Leave a Comment / Testing / Khuyen Tran

If you want to test some properties or assumptions, it can be cumbersome to write a wide range of scenarios.

To automatically run your tests against a wide range of scenarios and find edge cases in your code that you would otherwise have missed, use hypothesis.

In the code above, I test if the addition of two floats is commutative. The test fails when either x or y is NaN. Now I can rewrite my code to make it more robust against these edge cases.

Learn more about hypothesis here.

Link to my previous tips about testing.
Favorite

hypothesis: Property-based Testing in Python Read More »

Testing

Check Conflicting Labels with Deepchecks

Pytest skipif: Skip a Test When a Condition is Not Met

pytest-steps: Share Data Between Tests

Assign IDs to Pytest Parametrize

Deepchecks + Weights & Biases: Test and Track Your ML Model and Data

pytest parametrize twice: Test All Possible Combinations of Two Sets of Parameters

Checklist: Create Data to Test Your NLP Model

ipytest: Unit Tests in IPython Notebooks

Deepchecks: Check Category Mismatch Between Train and Test Set

hypothesis: Property-based Testing in Python

Stay up-to-date with
data skills using
CodeCut

Drop a line

Get in touch

Follow Us on Social Media

Testing

Stay up-to-date with data skills using CodeCut

Follow Us on Social Media

Work with Khuyen Tran

Work with Khuyen Tran

Stay up-to-date with
data skills using
CodeCut