❓FAQs

Frequently Asked Questions!

Why PyCaret?

The short answer is it's an open-source, low-code machine learning library built on top of your favorite libraries and frameworks like scikit-learn, xgboost, lightgbm, etc. Machine Learning experments take a lot of iterations and the primary goal of PyCaret is to give you the ability to iterate with lightning speed. In comparison with the other awesome open-source machine learning libraries, PyCaret is an alternate low-code library that can be used to replace hundreds of lines of code with a few lines only. Give it a try!

Does PyCaret work with all OS and Python versions?

PyCaret is tested and supported on 64-bit systems:

Python 3.7, 3.8, 3.9, and 3.10
Ubuntu 16.04 or later
Windows 7 or later

PyCaret also works on Mac OS but we do not guarantee the performance as the releases are not tested for Mac. To learn more about our testing workflows, click here.

Can I use PyCaret on Google Colab or Kaggle Notebooks?

Absolutely. Just do pip install pycaret

Does PyCaret support model training on GPU?

Yes. We have integrated PyCaret with the amazing Rapids.AI project. To use GPU instead of CPU, just pass use_gpu=True in the setup function.

This will use CPU for model training:

from pycaret.classification import *
s = setup(data, target = 'target_name')

This will use GPU for model training:

from pycaret.classification import *
s = setup(data, target = 'target_name', use_gpu = True)

There is no change in the use of the API, however, in some cases, additional libraries have to be installed as they are not installed with the default version or the full version of PyCaret. You can learn more about this here.

Can I use PyCaret with parallel processing frameworks like Spark?

Absolutely Yes. PyCaret 3.0 has integration with Fugue. You can now distribute compare_models function using the parallel parameter on your choice of framework. The current supported frameworks are Ray, Dask, and Spark. To learn more about this, check out this link.

How can I contribute to PyCaret?

Thank you for choosing to contribute to PyCaret. There are a ton of great open-source projects out there, so we appreciate your interest in contributing to PyCaret. Please check out our Contribution Guidelines.

Does PyCaret support Deep Learning or Reinforcement Learning?

Not yet. In the future, maybe.

Can I integrate PyCaret with BI tools like Power BI, Tableau, Qlik, etc.?

Yes, any tool that supports the Python environment. You can use PyCaret within Power BI, Tableau, SQL, Alteryx, KNIME.

How can I change verbosity in PyCaret?

Most functions in PyCaret has verbose parameter. Simply set verbose=False in the function.

Example:

lr = create_model('lr', verbose = False)

How can can I silent the logger?

We have noticed in some situations that the logger of PyCaret can conflict with other libraries in the environment causing an abnormal behavior resulting in logs being printed on the screen (Notebook or CLI) as the code is running. While in the next major release (3.0), we are planning to make the logger more configurable, allowing you to totally disable it if you want. In the meantime, there is a way around using environment variables. Run the following code on the top of your Notebook:

import os
os.environ["PYCARET_CUSTOM_LOGGING_LEVEL"] = "CRITICAL"

NOTE: This command will set an environment variable that is used by PyCaret's logger. Setting it to CRITICAL means that only critical messages will be logged and there aren't many critical messages in PyCaret.

I am having issues in installing PyCaret, what can I do?

Search on our GitHub if others may have faced the same issue. If you are still stuck feel free to open a new issue.

Can I add my own custom models in PyCaret?

Absolutely. PyCaret's vision is to give you full control of your ML pipeline. To add custom models, there is only one rule. They must be compatible with standard sklearn API. To learn how to do it, you can read the following tutorials by Fahad Akbar:

Can I add custom metrics for cross-validation in PyCaret?

Absolutely. PyCaret aim's to balance the abstraction with flexibility and so far we are doing a pretty good job. You can use PyCaret's add_metric and remove_metric functions to add or remove metrics used for cross-validation. Learn More.

Can I just use PyCaret for data preprocessing?

Yes if you would like. You can run the setup function which handles all the data preprocessing and after that you can access the transformed train set and test set using the get_config function.

Example:

from pycaret.classification import *
s = setup(data, target = 'target_name')

X_train, y_train = get_config('X_train_transformed'), get_config('y_train_transformed')
X_test, y_test = get_config('X_test_transformed'), get_config('y_test_transformed')

Can I export models from PyCaret and work on them outside of PyCaret?

Absolutely. You can use the save_model function of PyCaret to export the entire Pipeline as a pkl file. Learn more about this function.

Can I deploy ML pipelines on cloud using PyCaret?

Absolutely. PyCaret is an end-to-end library with a lot of deployment functionalities. There are many official tutorials on deployment on different cloud platforms such as Azure, AWS, and GCP. You can check out these tutorials here.

Can I install and run PyCaret on an Apple M1 MacBook?

It's not straightforward due to some issues in the underlying dependencies of PyCaret. However, if you have tried everything and still can't find a solution, this article by Pareekshith Katti may help you.

Do I need a powerful computer to use PyCaret?

No, as long as your data can fit in the memory, you can use PyCaret. No super computer is needed.

Why is my pull request not getting any attention?

The review process may take some time. You should not be discouraged by delays in review on your pull request. We have many features that are requested by the community and only limited time from our maintainers to review and approve these pull requests. Since every feature comes at a cost of lifetime maintenance, we care a lot about getting things right the first time.

Is PyCaret comparable to scikit-learn and ML libraries and framework?

Well, PyCaret is built on top of common ML libraries and frameworks such as scikit-learn, LightGBM, XGBoost, etc. The benefit of using PyCaret is that you don't have to write a lot of code. The underlying models and evaluation framework are the same as what you are used to.

PreviousCheat sheet

Last updated 2 years ago

Was this helpful?