franzi2505 committed
Commit 9cf5bb9 · 1 Parent(s): 802ca0e

update det-metrics for class-specific computation

Files changed (2)
  1. README.md +7 -2
  2. det-metrics.py +60 -4
README.md CHANGED
@@ -61,10 +61,11 @@ results = module.compute()
print(results)
```

- This will output the following dictionary containing metrics for the detection model. The key of the dictionary will be the model name or "custom" if no model names are available like in this case.
+ This will output the following dictionary containing metrics for the detection model. The key of the dictionary is the model name, or "custom" if no model names are available, as in this case. Additionally, there is a single key "classes" that maps the class labels to the indices of the per-class results. If the results are class agnostic, the value of "classes" is None.

```json
{
+   "classes": ...,
  "custom": {
    "metrics": ...,
    "eval": ...,
@@ -137,6 +138,7 @@ Customize your evaluation by specifying various parameters when loading SEA-AI/d
- **bbox_format**: Set the bounding box format (e.g., `"xywh"`).
- **iou_threshold**: Choose the IOU threshold for determining correct detections.
- **class_agnostic**: Specify whether to calculate metrics disregarding class labels.
+ - **label_mapping**: Provide an optional mapping of string labels to numeric labels as a dictionary (e.g., `{"SHIP": 0, "BOAT": 1}`). Defaults to the label mapping defined by the SEA.AI label merging map.

```python
area_ranges_tuples = [
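As a usage sketch, class-specific evaluation could be enabled at load time; this assumes `evaluate.load` forwards these keyword arguments to the metric constructor, and the custom mapping shown is illustrative:

```python
import evaluate

# Load det-metrics with per-class results; omitting label_mapping
# falls back to the default SEA.AI label merging map.
module = evaluate.load(
    "SEA-AI/det-metrics",
    class_agnostic=False,
    label_mapping={"SHIP": 0, "BOAT": 1},
)
```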
@@ -191,6 +193,8 @@ SEA-AI/det-metrics metrics dictionary provides a detailed breakdown of performan
- **fpi**: Number of images with predictions but no ground truths.
- **nImgs**: Total number of images evaluated.

+ If det-metrics is computed with `class_agnostic=False`, all counts (`tp/fp/fn/duplicates/support/fpi`) and scores (`precision/recall/f1`) are arrays instead of single numbers. For a label mapping of `{"SHIP": 0, "BOAT": 1}`, an example array could be `tp=np.array([10, 4])`, meaning there are 10 true positive ships and 4 true positive boats.
+ 

### Eval

The SEA-AI/det-metrics evaluation dictionary provides details about evaluation metrics and results. Below is a description of each field:
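To make the array layout concrete, a short sketch with illustrative counts for the mapping `{"SHIP": 0, "BOAT": 1}`; index 0 holds the SHIP values and index 1 the BOAT values:

```python
import numpy as np

# Per-class counts (illustrative numbers only).
tp = np.array([10, 4])  # 10 true positive ships, 4 true positive boats
fp = np.array([2, 1])
fn = np.array([3, 0])

precision = tp / (tp + fp)  # array([0.8333..., 0.8])
recall = tp / (tp + fn)     # array([0.7692..., 1.])
```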
@@ -244,7 +248,8 @@ The params return value of the COCO evaluation parameters in PyCOCO represents a
- **areaRng**: Object area ranges for evaluation. This parameter defines the sizes of objects to evaluate. It is specified as a list of tuples, where each tuple represents a range of area in square pixels.
- **maxDets**: List of thresholds on maximum detections per image for evaluation. By default, it evaluates with thresholds of 1, 10, and 100 detections per image.
- **iouType**: Type of IoU calculation used for evaluation. It can be ‘segm’ (segmentation), ‘bbox’ (bounding box), or ‘keypoints’.
- - **useCats**: Boolean flag indicating whether to use category labels for evaluation (default is 1, meaning true).
+ - **class_agnostic**: Boolean flag indicating whether to ignore category labels during evaluation (default is True).
+ - **label_mapping**: Dictionary of `str: int` pairs mapping string labels to numeric labels (default is the label mapping defined by the class merging structure). Should be provided only if `class_agnostic=False`.

> Note:
> If useCats=0 category labels are ignored as in proposal scoring.
 
det-metrics.py CHANGED
@@ -13,7 +13,7 @@
# limitations under the License.
"""TODO: Add a description here."""

- from typing import List, Literal, Tuple
+ from typing import List, Literal, Tuple, Dict

import datasets
import evaluate
@@ -23,6 +23,43 @@ from seametrics.detection import PrecisionRecallF1Support
from seametrics.detection.utils import payload_to_det_metric
from seametrics.payload import Payload

+ LABEL_MAPPING = {
+     'SHIP': 0,
+     'BATTLE_SHIP': 0,
+     'FISHING_SHIP': 0,
+     'CONTAINER_SHIP': 0,
+     'CRUISE_SHIP': 0,
+     'BOAT_WITHOUT_SAILS': 1,
+     'MOTORBOAT': 1,
+     'MARITIME_VEHICLE': 1,
+     'BOAT': 1,
+     'SAILING_BOAT': 2,
+     'SAILING_BOAT_WITH_CLOSED_SAILS': 2,
+     'SAILING_BOAT_WITH_OPEN_SAILS': 2,
+     'LEISURE_VEHICLE': 3,
+     'WATER_SKI': 3,
+     'BUOY': 4,
+     'CONSTRUCTION': 4,
+     'FISHING_BUOY': 4,
+     'HARBOUR_BUOY': 4,
+     'FLOTSAM': 5,
+     'CONTAINER': 5,
+     'SEA_MINE': 5,
+     'WOODEN_LOG': 5,
+     'UNKNOWN': 5,
+     'HUMAN_IN_WATER': 5,
+     'FAR_AWAY_OBJECT': 6,
+     'MARITIME_ANIMAL': 7,
+     'ANIMAL': 7,
+     'FISH': 7,
+     'DOLPHIN': 7,
+     'MAMMAL': 7,
+     'WHALE': 7,
+     'AERIAL_ANIMAL': 8,
+     'SEAGULL': 8,
+     'BIRD': 8,
+ }
+ 

_CITATION = """\
@InProceedings{coco:2020,
    title = {Microsoft {COCO:} Common Objects in Context},
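The dictionary above merges fine-grained payload labels into coarse class indices, so several string labels share one index. A small sketch of the effect:

```python
# All ship subtypes map to class 0, boats to 1, buoys to 4, birds to 8, etc.
fine_labels = ["CRUISE_SHIP", "MOTORBOAT", "HARBOUR_BUOY", "SEAGULL"]
class_ids = [LABEL_MAPPING[label] for label in fine_labels]
print(class_ids)  # [0, 1, 4, 8]
```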
@@ -124,6 +161,7 @@ class DetectionMetric(evaluate.Metric):
        bbox_format: str = "xywh",
        iou_type: Literal["bbox", "segm"] = "bbox",
        payload: Payload = None,
+         label_mapping: Dict[str, int] = None,
        **kwargs,
    ):
        super().__init__(**kwargs)
@@ -136,6 +174,12 @@
        self.class_agnostic = class_agnostic
        self.iou_type = iou_type
        self.bbox_format = bbox_format
+         self.label_mapping = LABEL_MAPPING if not self.class_agnostic else None
+         if not class_agnostic:
+             if label_mapping:
+                 print("WARNING: overwriting the default label mapping with the "
+                       "custom label mapping provided via `label_mapping`.")
+                 self.label_mapping = label_mapping

        # postprocess parameters
        self.iou_thresholds = (
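The override logic above yields one of three mappings. A hypothetical helper (not part of the module) that mirrors the constructor's behaviour:

```python
# Hypothetical stand-in for the constructor logic, for illustration only.
def resolve_label_mapping(class_agnostic: bool, label_mapping: dict = None):
    if class_agnostic:
        return None  # per-class results disabled; "classes" will be None
    # a custom mapping, if given, wins over the default LABEL_MAPPING
    return label_mapping if label_mapping else LABEL_MAPPING

assert resolve_label_mapping(True) is None
assert resolve_label_mapping(False)["SHIP"] == 0
assert resolve_label_mapping(False, {"SHIP": 9})["SHIP"] == 9
```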
@@ -143,7 +187,7 @@
        )
        self.area_ranges = [v for _, v in area_ranges_tuples]
        self.area_ranges_labels = [k for k, _ in area_ranges_tuples]
- 
+ 
        # initialize coco_metrics
        self.coco_metric = PrecisionRecallF1Support(
            iou_thresholds=self.iou_thresholds,
@@ -152,6 +196,7 @@
            class_agnostic=self.class_agnostic,
            iou_type=self.iou_type,
            box_format=self.bbox_format,
+             labels=sorted(set(self.label_mapping.values())) if self.label_mapping else None,
        )

        # initialize evaluation metric
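The `labels` argument is derived from the mapping's values; deduplication matters because the 34 string labels share only nine indices. A quick sketch with the default mapping:

```python
# Collapse the mapping's values to the sorted, unique class indices.
labels = sorted(set(LABEL_MAPPING.values()))
print(labels)  # [0, 1, 2, 3, 4, 5, 6, 7, 8]
```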
@@ -237,6 +282,7 @@
        """Called within the evaluate.Metric.compute() method"""

        results = {}
+         results["classes"] = self.label_mapping
        for model_name in self.model_names:
            print(f"\n##### {model_name} #####")
            # add payload if available (otherwise predictions and references must be added with add function)
@@ -260,7 +306,7 @@
        """Converts the payload to the format expected by the metric"""
        # import only if needed since fiftyone is not a direct dependency

-         predictions, references = payload_to_det_metric(payload, model_name)
+         predictions, references = payload_to_det_metric(payload, model_name, class_agnostic=self.class_agnostic, label_mapping=self.label_mapping)
        self.add(prediction=predictions, reference=references)

        return self
@@ -312,6 +358,11 @@
        import plotly.graph_objects as go
        from seametrics.detection.utils import get_confidence_metric_vals

+         if not self.class_agnostic:
+             raise ValueError(
+                 "This method is not yet implemented for `self.class_agnostic=False`."
+             )
+ 
        # Create traces
        fig = go.Figure()
        metrics = ["precision", "recall", "f1"]
@@ -377,6 +428,11 @@
            wandb: To interact with the Weights and Biases platform.
            datetime: To generate a timestamp for run names.
        """
+         if not self.class_agnostic:
+             raise ValueError(
+                 "This method is not yet implemented for `self.class_agnostic=False`."
+             )
+ 
        import os
        import wandb
        import datetime
@@ -448,7 +504,7 @@
        Note:
        - If the metric does not support area ranges, the metric should store the results under the `all` key.
        - If a range area is provided it will be displayed in the output. if area_ranges_tuples is None, then all the area ranges will be displayed
-         """
+         """
        results = {}

        for model_name in payload.models:
 