Create evaluation jobs to test model performance against datasets
Choose an evaluator from your account to run. The evaluator will be used to assess the performance of your model against a dataset.
Evaluators help you understand how well your model performs on specific tasks or criteria. Make sure to select an evaluator that matches your assessment needs.
An evaluator is a Python function that scores your model's outputs against specific criteria. It helps you measure performance, quality, and correctness of your model's responses.