By default, scikit-learn transformers return a NumPy array. This can pose a challenge if a pandas DataFrame is required for subsequent data processing steps.
Fortunately, since scikit-learn version 1.2, you can call the set_output method with transform="pandas" to obtain the results as a pandas DataFrame.
This method is not limited to individual transformers: calling it on a scikit-learn pipeline configures the transformer steps inside that pipeline as well.