Evaluate multiple models

LazyPredict

LazyPredict is a Python library that trains, tests, and evaluates many ML models at once in a few lines of code, which makes it useful for getting quick baselines before tuning any single model. It supports both regression and classification.

https://pypi.org/project/lazypredict/
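
As a rough illustration of the "few lines of code" claim, the sketch below fits LazyPredict's classifiers on one train/test split and returns the resulting score table. It assumes lazypredict and scikit-learn are installed. The helper name `compare_classifiers` and its default arguments are hypothetical and chosen only for this example. The exact columns of the returned table may differ between library versions.

```python
def compare_classifiers(X, y, test_size=0.3, random_state=42):
    """Fit LazyPredict's classifiers on one split and return the score table.

    Hypothetical helper for illustration; not part of lazypredict itself.
    """
    # Third-party imports are kept inside the function so this sketch can be
    # loaded even where the optional dependencies are not installed.
    from lazypredict.Supervised import LazyClassifier
    from sklearn.model_selection import train_test_split

    # Hold out part of the data so every model is scored on unseen samples.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, random_state=random_state, stratify=y
    )

    # ignore_warnings=True skips models that fail to fit instead of raising.
    clf = LazyClassifier(verbose=0, ignore_warnings=True, custom_metric=None)

    # fit() trains each model with default settings and returns two objects;
    # the first is a table of metrics with one row per model.
    scores, _ = clf.fit(X_train, X_test, y_train, y_test)
    return scores
```

For example, loading a dataset with `sklearn.datasets.load_breast_cancer()` and calling `compare_classifiers(data.data, data.target)` should give a table that can be sorted or printed to compare models. For regression, `LazyRegressor` from the same module is used in the same way.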

CometML Opik

Opik is an open-source, end-to-end LLM evaluation platform from Comet. Developers use it to test and evaluate their LLM applications during development and to monitor them in production.
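
A minimal sketch of how tracing with the Opik Python SDK might look, assuming the `opik` package is installed and has already been configured to point at an Opik instance. The `track` decorator comes from the SDK. The function names `build_traced_function` and `answer_question` are hypothetical, and the body is a placeholder rather than a real LLM call.

```python
def build_traced_function():
    """Return a function whose calls are logged as traces by Opik.

    Hypothetical helper for illustration only.
    """
    # Imported here so the sketch can be loaded without opik installed.
    from opik import track

    @track
    def answer_question(question: str) -> str:
        # Placeholder standing in for a real LLM call (assumption).
        return f"Echo: {question}"

    return answer_question
```

Calling the returned function, for example `build_traced_function()("What is Opik?")`, should record the input and output as a trace that can then be inspected in the Opik UI.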