MetricsRunResult

MetricsRunResult(
    output_csv,
    report_file,
    columns_to_compare,
    metrics_by_column,
    reports,
    total_inference_time,
    avg_inference_time,
    input_chars,
    output_chars,
    energy_consumed,
    emissions,
    cpu_model,
    gpu_model,
    summary_text,
    run_id=None,
    runs_csv=None,
    n_queries=None,
)

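Result returned by run_metrics. It bundles the output file paths, the per-column metrics and reports, and the timing, size, and energy measurements for the run.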

Attributes

Name                  Type                       Description
output_csv            Path                       Filesystem path to the aggregate metrics CSV that was updated.
report_file           Path                       Filesystem path to the per-column classification report text file.
columns_to_compare    list[str]                  Annotation columns included in the evaluation.
metrics_by_column     dict[str, dict[str, Any]]  Nested dictionary of computed metrics keyed by column name.
reports               dict[str, str]             Human-readable report text keyed by annotation column name.
total_inference_time  float | None               Total model inference time in seconds, if available.
avg_inference_time    float | None               Mean inference time per annotation request in seconds, if available.
input_chars           int | None                 Total prompt characters sent to the model, if available.
output_chars          int | None                 Total response characters returned by the model, if available.
energy_consumed       float | None               Energy consumption in kilowatt-hours, if available.
emissions             float | None               Emissions estimate in kilograms of CO2 equivalent, if available.
cpu_model             str | None                 CPU metadata recorded by CodeCarbon, if available.
gpu_model             str | None                 GPU metadata recorded by CodeCarbon, if available.
summary_text          str                        Plain-text summary of the main evaluation metrics.
run_id                str | None                 Identifier linking this run’s rows across the runs and metrics tables.
runs_csv              Path | None                Filesystem path to the per-run efficiency/config table, if written.
n_queries             int | None                 Number of model inference calls in the annotation run, if available.
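
Example

Because many attributes may be None, code that consumes a result usually needs to guard its arithmetic. The sketch below illustrates one way to derive per-query figures. It is not the library's own code: the dataclass is a local stand-in that only mirrors the documented fields, and the helper efficiency_summary, the file names, the run identifier, and the "accuracy" metric key are all hypothetical.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Any, Optional


@dataclass
class MetricsRunResult:
    """Local stand-in mirroring the documented fields (not the real class)."""

    output_csv: Path
    report_file: Path
    columns_to_compare: list[str]
    metrics_by_column: dict[str, dict[str, Any]]
    reports: dict[str, str]
    total_inference_time: Optional[float]
    avg_inference_time: Optional[float]
    input_chars: Optional[int]
    output_chars: Optional[int]
    energy_consumed: Optional[float]
    emissions: Optional[float]
    cpu_model: Optional[str]
    gpu_model: Optional[str]
    summary_text: str
    run_id: Optional[str] = None
    runs_csv: Optional[Path] = None
    n_queries: Optional[int] = None


def efficiency_summary(result: MetricsRunResult) -> dict[str, Any]:
    """Hypothetical helper: per-query figures, None where inputs are missing."""
    n = result.n_queries

    def per_query(value: Optional[float]) -> Optional[float]:
        # Skip the division when the value or the query count is
        # unavailable, or when the query count is zero.
        if value is None or not n:
            return None
        return value / n

    return {
        "run_id": result.run_id,
        "seconds_per_query": per_query(result.total_inference_time),
        "kwh_per_query": per_query(result.energy_consumed),
        "kg_co2e_per_query": per_query(result.emissions),
    }


# All values below are made up for illustration.
example = MetricsRunResult(
    output_csv=Path("metrics.csv"),
    report_file=Path("report.txt"),
    columns_to_compare=["label"],
    metrics_by_column={"label": {"accuracy": 0.9}},  # hypothetical metric key
    reports={"label": "report text for the label column"},
    total_inference_time=12.0,
    avg_inference_time=3.0,
    input_chars=4000,
    output_chars=800,
    energy_consumed=None,  # e.g. energy tracking was not available
    emissions=0.002,
    cpu_model=None,
    gpu_model=None,
    summary_text="label: accuracy 0.90",
    run_id="run-001",
    n_queries=4,
)

summary = efficiency_summary(example)
print(summary)
```

Returning None for an unavailable figure, rather than 0, keeps "not measured" distinct from "measured as zero" when the summary is later written to a table.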