The overall process of building an evaluation pipeline looks like this:
You now have a pipeline. Preview mode lets you iterate with live feedback and adjust steps in real time.
Click ‘Add Step’ to start building your pipeline, with each column representing a step in the evaluation process.
Steps execute in order from left to right. If a column depends on another column, place it to the right of that dependency.
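The ordering rule can be pictured with a small sketch. This is not the product's implementation; `run_row`, the step tuple layout, and the column names (`answer`, `expected`, `answer_clean`, `is_match`) are all hypothetical, chosen only to show why a dependent column has to sit to the right of what it reads.

```python
from typing import Callable

# Hypothetical step shape: (column name, columns it reads, function over the row's cells).
Step = tuple[str, list[str], Callable[[dict], object]]

def run_row(row: dict, steps: list[Step]) -> dict:
    """Run steps left to right, adding one new cell per step."""
    cells = dict(row)  # start from the dataset columns
    for name, depends_on, fn in steps:
        # A step can only read columns that already exist to its left.
        missing = [d for d in depends_on if d not in cells]
        if missing:
            raise ValueError(f"step {name!r} runs before its dependencies: {missing}")
        cells[name] = fn(cells)
    return cells

steps = [
    ("answer_clean", ["answer"], lambda c: c["answer"].strip().lower()),
    ("is_match", ["answer_clean", "expected"], lambda c: c["answer_clean"] == c["expected"]),
]
result = run_row({"answer": " Paris ", "expected": "paris"}, steps)
```

Reversing the two steps in this sketch raises a `ValueError`, because `is_match` would run before `answer_clean` exists.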
If the last step of your evaluation pipeline contains only boolean values or only numeric values, that value is considered the score for the row. Your full evaluation report will include a scorecard showing the average of this last step.
NOTE: All cells in the last column must be boolean, or all must be numeric. If any cell deviates, the score will not be calculated.
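As a rough illustration of the scoring rule, the sketch below averages the last column only when every cell is boolean or every cell is numeric. The function name `scorecard` is hypothetical, and treating the boolean average as the fraction of `True` cells is an assumption; the actual report may present it differently.

```python
from numbers import Real
from typing import Optional

def scorecard(last_column: list) -> Optional[float]:
    """Return the average of the last column, or None if no score can be calculated."""
    if not last_column:
        return None
    # All booleans: assumed here to average as the fraction of True cells.
    if all(isinstance(v, bool) for v in last_column):
        return sum(last_column) / len(last_column)
    # All numeric: bool is excluded explicitly because it is a subclass of int in Python.
    if all(isinstance(v, Real) and not isinstance(v, bool) for v in last_column):
        return sum(last_column) / len(last_column)
    # Mixed types or any non-scoreable cell: no score.
    return None
```

For example, `scorecard([True, False, True, True])` gives `0.75`, while a column that mixes types, such as `[1, "n/a"]`, gives `None`.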
Switch from the pipeline preview to a full batch run to apply your pipeline across the entire dataset for a comprehensive evaluation.