NeerajCodz committed
Commit d3a3e0d · 1 Parent(s): e581bf6

Sync from GitHub

README.md CHANGED
@@ -44,7 +44,7 @@ Follow these steps to set up the application locally:
44
  #### Clone the Repository
45
 
46
  ```bash
47
- git clone https://github.com/NeerajCodz/ObjectDetection
48
  cd ObjectDetection
49
  ```
50
 
@@ -136,11 +136,13 @@ The `app.py` script supports the following command-line arguments:
136
  - Example: `python app.py --enable-fastapi`
137
  - `--fastapi-port <port>`: Specify the port for the FastAPI server (default: 8000).
138
  - Example: `python app.py --enable-fastapi --fastapi-port 8001`
 
 
139
 
140
  You can combine arguments:
141
 
142
  ```bash
143
- python app.py --gradio-port 7870 --enable-fastapi --fastapi-port 8001
144
  ```
145
 
146
  Alternatively, set the `GRADIO_SERVER_PORT` environment variable:
@@ -186,16 +188,16 @@ Access the Swagger UI at `http://localhost:8000/docs` for interactive testing.
186
  #### Using `curl` with an Image URL
187
 
188
  ```bash
189
- curl -X POST "http://localhost:8000/detect" \
190
- -H "Content-Type: application/json" \
191
  -d '{"image_url": "https://example.com/image.jpg", "model_name": "facebook/detr-resnet-50"}'
192
  ```
193
 
194
  #### Using `curl` with an Image File
195
 
196
  ```bash
197
- curl -X POST "http://localhost:8000/detect" \
198
- -F "file=@/path/to/image.jpg" \
199
  -F "model_name=facebook/detr-resnet-50"
200
  ```
201
 
@@ -226,7 +228,7 @@ To contribute or modify the application:
226
  1. Clone the repository:
227
 
228
  ```bash
229
- git clone https://github.com/NeerajCodz/ObjectDetection
230
  cd ObjectDetection
231
  ```
232
 
@@ -265,8 +267,10 @@ Please include tests and documentation for new features. Report issues via GitHub.
265
  ## Troubleshooting
266
 
267
  - **Port Conflicts**: If port 7860 is in use, specify a different port with `--gradio-port` or set `GRADIO_SERVER_PORT`.
268
- - **Colab Issues**: Use the `--gradio-port` argument or environment variable to avoid port conflicts in Google Colab.
 
269
  - **Panoptic Model Bugs**: Avoid `detr-resnet-*-panoptic` models until stability issues are resolved.
270
  - **API Instability**: Test with smaller images and object detection models first.
 
271
 
272
  For further assistance, open an issue on the [GitHub repository](https://github.com/NeerajCodz/ObjectDetection).
 
44
  #### Clone the Repository
45
 
46
  ```bash
47
+ git clone https://github.com/NeerajCodz/ObjectDetection.git
48
  cd ObjectDetection
49
  ```
50
 
 
136
  - Example: `python app.py --enable-fastapi`
137
  - `--fastapi-port <port>`: Specify the port for the FastAPI server (default: 8000).
138
  - Example: `python app.py --enable-fastapi --fastapi-port 8001`
139
+ - `--confidence-threshold <float>`: Set the confidence threshold for detections (range: 0 to 1; default: 0.5).
140
+ - Example: `python app.py --confidence-threshold 0.75`
141
 
142
  You can combine arguments:
143
 
144
  ```bash
145
+ python app.py --gradio-port 7870 --enable-fastapi --fastapi-port 8001 --confidence-threshold 0.75
146
  ```
147
 
148
  Alternatively, set the `GRADIO_SERVER_PORT` environment variable:
 
188
  #### Using `curl` with an Image URL
189
 
190
  ```bash
191
+ curl -X POST "http://localhost:8000/detect" \
192
+ -F "image_url=https://example.com/image.jpg" \
193
+ -F "model_name=facebook/detr-resnet-50"
194
  ```
195
 
196
  #### Using `curl` with an Image File
197
 
198
  ```bash
199
+ curl -X POST "http://localhost:8000/detect" \
200
+ -F "file=@/path/to/image.jpg" \
201
  -F "model_name=facebook/detr-resnet-50"
202
  ```
203
 
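For reference, here is a minimal Python client sketch (illustrative, not part of the repository; it assumes the `requests` package and uses the optional `confidence_threshold` form field added in this commit):

```python
# Hypothetical client for the /detect endpoint shown above.
import requests

with open("image.jpg", "rb") as f:
    resp = requests.post(
        "http://localhost:8000/detect",
        files={"file": f},
        data={"model_name": "facebook/detr-resnet-50", "confidence_threshold": "0.75"},
        timeout=60,
    )
resp.raise_for_status()
print(resp.json()["unique_objects"])  # distinct labels above the threshold
```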
 
228
  1. Clone the repository:
229
 
230
  ```bash
231
+ git clone https://github.com/NeerajCodz/ObjectDetection.git
232
  cd ObjectDetection
233
  ```
234
 
 
267
  ## Troubleshooting
268
 
269
  - **Port Conflicts**: If port 7860 is in use, specify a different port with `--gradio-port` or set `GRADIO_SERVER_PORT`.
270
+ - Example: `python app.py --gradio-port 7870`
271
+ - **Colab Asyncio Error**: If you encounter `RuntimeError: asyncio.run() cannot be called from a running event loop` in Colab, the application now uses `nest_asyncio` to handle this. Ensure `nest_asyncio` is installed (`pip install nest_asyncio`); see the sketch after this list.
272
  - **Panoptic Model Bugs**: Avoid `detr-resnet-*-panoptic` models until stability issues are resolved.
273
  - **API Instability**: Test with smaller images and object detection models first.
274
+ - **FastAPI Not Starting**: Ensure `--enable-fastapi` is used, and check that the specified `--fastapi-port` (default: 8000) is available.
275
 
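As a minimal sketch of the Colab workaround described above (assuming `nest_asyncio` is installed):

```python
# Patch the already-running Colab event loop so nested asyncio.run() calls work.
import nest_asyncio

nest_asyncio.apply()

# ...then launch the app as usual, e.g. by running app.py
```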
276
  For further assistance, open an issue on the [GitHub repository](https://github.com/NeerajCodz/ObjectDetection).
app.py CHANGED
@@ -3,11 +3,10 @@ import base64
3
  import logging
4
  import os
5
  import sys
6
  import traceback
7
  import threading
8
  from collections import Counter
9
  from io import BytesIO
10
- from typing import Dict, List, Optional, Tuple
11
 
12
  import gradio as gr
13
  import pandas as pd
@@ -30,15 +29,12 @@ import nest_asyncio
30
  # Configuration
31
  # ------------------------------
32
 
33
- # Logging configuration
34
- logging.basicConfig(
35
- level=logging.INFO,
36
- format="%(asctime)s - %(levelname)s - %(message)s",
37
- )
38
  logger = logging.getLogger(__name__)
39
 
40
- # Model and processing constants
41
- CONFIDENCE_THRESHOLD: float = 0.5
42
  VALID_MODELS: List[str] = [
43
  "facebook/detr-resnet-50",
44
  "facebook/detr-resnet-101",
@@ -48,128 +44,109 @@ VALID_MODELS: List[str] = [
48
  "hustvl/yolos-base",
49
  ]
50
  MODEL_DESCRIPTIONS: Dict[str, str] = {
51
- "facebook/detr-resnet-50": (
52
- "DETR with ResNet-50 backbone for object detection. Fast and accurate for general use."
53
- ),
54
- "facebook/detr-resnet-101": (
55
- "DETR with ResNet-101 backbone for object detection. More accurate but slower than ResNet-50."
56
- ),
57
- "facebook/detr-resnet-50-panoptic": (
58
- "DETR with ResNet-50 for panoptic segmentation. Detects objects and segments scenes."
59
- ),
60
- "facebook/detr-resnet-101-panoptic": (
61
- "DETR with ResNet-101 for panoptic segmentation. High accuracy for complex scenes."
62
- ),
63
- "hustvl/yolos-tiny": (
64
- "YOLOS Tiny model. Lightweight and fast, ideal for resource-constrained environments."
65
- ),
66
- "hustvl/yolos-base": (
67
- "YOLOS Base model. Balances speed and accuracy for object detection."
68
- ),
69
  }
70
-
71
- # Port configuration
72
- DEFAULT_GRADIO_PORT: int = 7860
73
- DEFAULT_FASTAPI_PORT: int = 8000
74
- PORT_RANGE: range = range(7860, 7870) # Try ports 7860-7869
75
- MAX_PORT_ATTEMPTS: int = 10
76
 
77
  # Thread-safe storage for lazy-loaded models and processors
78
  models: Dict[str, any] = {}
79
  processors: Dict[str, any] = {}
80
  model_lock = threading.Lock()
81
 
82
- # ------------------------------
83
- # Model Loading
84
- # ------------------------------
85
-
86
- def load_model_and_processor(model_name: str) -> Tuple[any, any]:
87
- """
88
- Load and cache the specified model and processor thread-safely.
89
-
90
- Args:
91
- model_name: Name of the model to load (must be in VALID_MODELS).
92
-
93
- Returns:
94
- Tuple containing the loaded model and processor.
95
-
96
- Raises:
97
- ValueError: If the model_name is invalid or loading fails.
98
- """
99
- with model_lock:
100
- if model_name not in models:
101
- logger.info(f"Loading model: {model_name}")
102
- try:
103
- if "yolos" in model_name:
104
- models[model_name] = YolosForObjectDetection.from_pretrained(model_name)
105
- processors[model_name] = YolosImageProcessor.from_pretrained(model_name)
106
- elif "panoptic" in model_name:
107
- models[model_name] = DetrForSegmentation.from_pretrained(model_name)
108
- processors[model_name] = DetrImageProcessor.from_pretrained(model_name)
109
- else:
110
- models[model_name] = DetrForObjectDetection.from_pretrained(model_name)
111
- processors[model_name] = DetrImageProcessor.from_pretrained(model_name)
112
- logger.debug(f"Model {model_name} loaded successfully")
113
- except Exception as e:
114
- logger.error(f"Failed to load model {model_name}: {str(e)}")
115
- raise ValueError(f"Failed to load model: {str(e)}")
116
- return models[model_name], processors[model_name]
117
-
118
  # ------------------------------
119
  # Image Processing
120
  # ------------------------------
121
 
122
- def process(image: Image.Image, model_name: str) -> Tuple[Image.Image, List[str], List[float], List[str], List[float], Dict[str, str]]:
123
  """
124
- Process an image for object detection or panoptic segmentation.
125
 
126
  Args:
127
- image: PIL Image to process.
 
128
  model_name: Name of the model to use (must be in VALID_MODELS).
 
 
129
 
130
  Returns:
131
- Tuple containing:
132
- - Annotated image (PIL Image).
133
- - List of detected object names.
134
- - List of confidence scores for detected objects.
135
- - List of unique object names.
136
- - List of confidence scores for unique objects.
137
- - Dictionary of image properties (format, size, etc.).
138
-
139
- Raises:
140
- ValueError: If the model_name is invalid.
141
- RuntimeError: If processing fails due to model or image issues.
142
  """
143
- if model_name not in VALID_MODELS:
144
- raise ValueError(f"Invalid model: {model_name}. Choose from: {VALID_MODELS}")
145
-
146
  try:
147
- # Load model and processor
148
- model, processor = load_model_and_processor(model_name)
149
- logger.debug(f"Processing image with model: {model_name}")
150
 
151
- # Prepare image for processing
152
  inputs = processor(images=image, return_tensors="pt")
153
  with torch.no_grad():
154
  outputs = model(**inputs)
155
 
156
- # Initialize drawing context
157
  draw = ImageDraw.Draw(image)
158
  object_names: List[str] = []
159
  confidence_scores: List[float] = []
160
  object_counter = Counter()
161
  target_sizes = torch.tensor([image.size[::-1]])
162
 
163
- # Process panoptic segmentation or object detection
164
  if "panoptic" in model_name:
 
165
  processed_sizes = torch.tensor([[inputs["pixel_values"].shape[2], inputs["pixel_values"].shape[3]]])
166
  results = processor.post_process_panoptic(outputs, target_sizes=target_sizes, processed_sizes=processed_sizes)[0]
167
-
168
  for segment in results["segments_info"]:
169
  label = segment["label_id"]
170
  label_name = model.config.id2label.get(label, "Unknown")
171
  score = segment.get("score", 1.0)
172
-
173
  # Apply segmentation mask if available
174
  if "masks" in results and segment["id"] < len(results["masks"]):
175
  mask = results["masks"][segment["id"]].cpu().numpy()
@@ -181,67 +158,92 @@ def process(image: Image.Image, model_name: str) -> Tuple[Image.Image, List[str]
181
  mask_draw.bitmap((0, 0), mask_image, fill=(r, g, b, 128))
182
  image = Image.alpha_composite(image.convert("RGBA"), colored_mask).convert("RGB")
183
  draw = ImageDraw.Draw(image)
184
-
185
- if score > CONFIDENCE_THRESHOLD:
186
  object_names.append(label_name)
187
  confidence_scores.append(float(score))
188
  object_counter[label_name] = float(score)
189
  else:
 
190
  results = processor.post_process_object_detection(outputs, target_sizes=target_sizes)[0]
191
-
192
  for score, label, box in zip(results["scores"], results["labels"], results["boxes"]):
193
- if score > CONFIDENCE_THRESHOLD:
194
  x, y, x2, y2 = box.tolist()
195
- draw.rectangle([x, y, x2, y2], outline="#32CD32", width=2)
196
  label_name = model.config.id2label.get(label.item(), "Unknown")
197
  text = f"{label_name}: {score:.2f}"
198
  text_bbox = draw.textbbox((0, 0), text)
199
  text_width, text_height = text_bbox[2] - text_bbox[0], text_bbox[3] - text_bbox[1]
200
- draw.text((x2 - text_width - 2, y - text_height - 2), text, fill="#32CD32")
 
 
 
201
  object_names.append(label_name)
202
  confidence_scores.append(float(score))
203
  object_counter[label_name] = float(score)
204
 
205
- # Compile unique objects and confidences
206
  unique_objects = list(object_counter.keys())
207
  unique_confidences = [object_counter[obj] for obj in unique_objects]
208
 
209
- # Calculate image properties
210
  properties: Dict[str, str] = {
211
  "Format": image.format if hasattr(image, "format") and image.format else "Unknown",
212
  "Size": f"{image.width}x{image.height}",
213
  "Width": f"{image.width} px",
214
  "Height": f"{image.height} px",
215
  "Mode": image.mode,
216
- "Aspect Ratio": (
217
- f"{round(image.width / image.height, 2)}" if image.height != 0 else "Undefined"
218
- ),
219
  "File Size": "Unknown",
220
  "Mean (R,G,B)": "Unknown",
221
  "StdDev (R,G,B)": "Unknown",
222
  }
223
-
224
- # Compute file size
225
  try:
 
226
  buffered = BytesIO()
227
  image.save(buffered, format="PNG")
228
  properties["File Size"] = f"{len(buffered.getvalue()) / 1024:.2f} KB"
229
- except Exception as e:
230
- logger.error(f"Error calculating file size: {str(e)}")
231
-
232
- # Compute color statistics
233
- try:
234
  stat = ImageStat.Stat(image)
235
  properties["Mean (R,G,B)"] = ", ".join(f"{m:.2f}" for m in stat.mean)
236
  properties["StdDev (R,G,B)"] = ", ".join(f"{s:.2f}" for s in stat.stddev)
237
  except Exception as e:
238
- logger.error(f"Error calculating color statistics: {str(e)}")
239
 
240
- return image, object_names, confidence_scores, unique_objects, unique_confidences, properties
241
 
242
  except Exception as e:
243
- logger.error(f"Error in process: {str(e)}\n{traceback.format_exc()}")
244
- raise RuntimeError(f"Failed to process image: {str(e)}")
 
 
245
 
246
  # ------------------------------
247
  # FastAPI Setup
@@ -254,6 +256,7 @@ async def detect_objects_endpoint(
254
  file: Optional[UploadFile] = File(None),
255
  image_url: Optional[str] = Form(None),
256
  model_name: str = Form(VALID_MODELS[0]),
 
257
  ) -> JSONResponse:
258
  """
259
  FastAPI endpoint to detect objects in an image from file upload or URL.
@@ -262,62 +265,35 @@ async def detect_objects_endpoint(
262
  file: Uploaded image file (optional).
263
  image_url: URL of the image (optional).
264
  model_name: Model to use for detection (default: first VALID_MODELS entry).
 
265
 
266
  Returns:
267
- JSONResponse containing the processed image (base64), detected objects, and confidences.
268
 
269
  Raises:
270
- HTTPException: If input validation fails or processing errors occur.
271
  """
272
  try:
273
- # Validate input
274
  if (file is None and not image_url) or (file is not None and image_url):
275
- raise HTTPException(
276
- status_code=400,
277
- detail="Provide either an image file or an image URL, not both.",
278
- )
279
-
280
- # Load image
281
  if file:
282
  if not file.content_type.startswith("image/"):
283
  raise HTTPException(status_code=400, detail="File must be an image")
284
  contents = await file.read()
285
  image = Image.open(BytesIO(contents)).convert("RGB")
286
- else:
287
- response = requests.get(image_url, timeout=10)
288
- response.raise_for_status()
289
- image = Image.open(BytesIO(response.content)).convert("RGB")
290
-
291
- if model_name not in VALID_MODELS:
292
- raise HTTPException(
293
- status_code=400,
294
- detail=f"Invalid model. Choose from: {VALID_MODELS}",
295
- )
296
-
297
- # Process image
298
- detected_image, detected_objects, detected_confidences, unique_objects, unique_confidences, _ = process(
299
- image, model_name
300
- )
301
-
302
- # Encode image as base64
303
- buffered = BytesIO()
304
- detected_image.save(buffered, format="PNG")
305
- img_base64 = base64.b64encode(buffered.getvalue()).decode("utf-8")
306
- img_url = f"data:image/png;base64,{img_base64}"
307
-
308
- return JSONResponse(
309
- content={
310
- "image_url": img_url,
311
- "detected_objects": detected_objects,
312
- "confidence_scores": detected_confidences,
313
- "unique_objects": unique_objects,
314
- "unique_confidence_scores": unique_confidences,
315
- }
316
- )
317
-
318
- except requests.RequestException as e:
319
- logger.error(f"Error fetching image from URL: {str(e)}")
320
- raise HTTPException(status_code=400, detail=f"Failed to fetch image: {str(e)}")
321
  except Exception as e:
322
  logger.error(f"Error in FastAPI endpoint: {str(e)}\n{traceback.format_exc()}")
323
  raise HTTPException(status_code=500, detail=f"Error processing image: {str(e)}")
@@ -328,7 +304,7 @@ async def detect_objects_endpoint(
328
 
329
  def create_gradio_ui() -> gr.Blocks:
330
  """
331
- Create and configure the Gradio UI for object detection.
332
 
333
  Returns:
334
  Gradio Blocks object representing the UI.
@@ -337,257 +313,126 @@ def create_gradio_ui() -> gr.Blocks:
337
  RuntimeError: If UI creation fails.
338
  """
339
  try:
340
- with gr.Blocks(theme=gr.themes.Default(primary_hue="blue", secondary_hue="gray")) as app:
 
 
341
  gr.Markdown(
342
  f"""
343
  # 🚀 Object Detection App
344
- Upload an image or provide a URL to detect objects using state-of-the-art transformer models (DETR, YOLOS).
345
  Running on port: {os.getenv('GRADIO_SERVER_PORT', 'auto-selected')}
346
  """
347
  )
348
 
 
349
  with gr.Tabs():
350
- with gr.Tab("📷 Image Upload"):
 
351
  with gr.Row():
 
352
  with gr.Column(scale=1):
353
  gr.Markdown("### Input")
354
- model_choice = gr.Dropdown(
355
- choices=VALID_MODELS,
356
- value=VALID_MODELS[0],
357
- label="🔎 Select Model",
358
- info="Choose a model for object detection or panoptic segmentation.",
359
- )
360
- model_info = gr.Markdown(
361
- f"**Model Info**: {MODEL_DESCRIPTIONS[VALID_MODELS[0]]}",
362
- visible=True,
363
- )
364
  image_input = gr.Image(type="pil", label="📷 Upload Image")
365
- image_url_input = gr.Textbox(
366
- label="🔗 Image URL",
367
- placeholder="https://example.com/image.jpg",
368
- )
369
  with gr.Row():
370
  submit_btn = gr.Button("✨ Detect", variant="primary")
371
  clear_btn = gr.Button("🗑️ Clear", variant="secondary")
372
 
 
373
  model_choice.change(
374
- fn=lambda model_name: (
375
- f"**Model Info**: {MODEL_DESCRIPTIONS.get(model_name, 'No description available.')}"
376
- ),
377
  inputs=model_choice,
378
  outputs=model_info,
379
  )
380
 
 
381
  with gr.Column(scale=2):
382
  gr.Markdown("### Results")
383
- error_output = gr.Textbox(
384
- label="⚠️ Errors",
385
- visible=False,
386
- lines=3,
387
- max_lines=5,
388
- )
389
- output_image = gr.Image(
390
- type="pil",
391
- label="🎯 Detected Image",
392
- interactive=False,
393
- )
394
  with gr.Row():
395
- objects_output = gr.DataFrame(
396
- label="📋 Detected Objects",
397
- interactive=False,
398
- value=None,
399
- )
400
- unique_objects_output = gr.DataFrame(
401
- label="🔍 Unique Objects",
402
- interactive=False,
403
- value=None,
404
- )
405
- properties_output = gr.DataFrame(
406
- label="📄 Image Properties",
407
- interactive=False,
408
- value=None,
409
- )
410
-
411
- def process_for_gradio(image: Optional[Image.Image], url: Optional[str], model_name: str) -> Tuple[
412
- Optional[Image.Image], Optional[pd.DataFrame], Optional[pd.DataFrame], Optional[pd.DataFrame], str
413
- ]:
414
- """
415
- Process image for Gradio UI and return results.
416
-
417
- Args:
418
- image: Uploaded PIL Image (optional).
419
- url: Image URL (optional).
420
- model_name: Model to use for detection.
421
-
422
- Returns:
423
- Tuple of detected image, objects DataFrame, unique objects DataFrame, properties DataFrame, and error message.
424
- """
425
- try:
426
- if image is None and not url:
427
- return None, None, None, None, "Please provide an image or URL"
428
- if image and url:
429
- return None, None, None, None, "Please provide either an image or URL, not both"
430
-
431
- if url:
432
- response = requests.get(url, timeout=10)
433
- response.raise_for_status()
434
- image = Image.open(BytesIO(response.content)).convert("RGB")
435
-
436
- detected_image, objects, scores, unique_objects, unique_scores, properties = process(
437
- image, model_name
438
- )
439
- objects_df = (
440
- pd.DataFrame(
441
- {
442
- "Object": objects,
443
- "Confidence Score": [f"{score:.2f}" for score in scores],
444
- }
445
- )
446
- if objects
447
- else pd.DataFrame(columns=["Object", "Confidence Score"])
448
- )
449
- unique_objects_df = (
450
- pd.DataFrame(
451
- {
452
- "Unique Object": unique_objects,
453
- "Confidence Score": [f"{score:.2f}" for score in unique_scores],
454
- }
455
- )
456
- if unique_objects
457
- else pd.DataFrame(columns=["Unique Object", "Confidence Score"])
458
- )
459
- properties_df = (
460
- pd.DataFrame([properties])
461
- if properties
462
- else pd.DataFrame(columns=properties.keys())
463
- )
464
- return detected_image, objects_df, unique_objects_df, properties_df, ""
465
-
466
- except requests.RequestException as e:
467
- error_msg = f"Error fetching image from URL: {str(e)}"
468
- logger.error(f"{error_msg}\n{traceback.format_exc()}")
469
- return None, None, None, None, error_msg
470
- except Exception as e:
471
- error_msg = f"Error processing image: {str(e)}"
472
- logger.error(f"{error_msg}\n{traceback.format_exc()}")
473
- return None, None, None, None, error_msg
474
 
 
475
  submit_btn.click(
476
- fn=process_for_gradio,
477
  inputs=[image_input, image_url_input, model_choice],
478
  outputs=[output_image, objects_output, unique_objects_output, properties_output, error_output],
479
  )
480
 
 
481
  clear_btn.click(
482
  fn=lambda: [None, "", None, None, None, None, ""],
483
  inputs=None,
484
- outputs=[
485
- image_input,
486
- image_url_input,
487
- output_image,
488
- objects_output,
489
- unique_objects_output,
490
- properties_output,
491
- error_output,
492
- ],
493
  )
494
 
495
- with gr.Tab("🔗 JSON Output"):
496
- gr.Markdown("### Process Image for JSON Output")
497
- image_input_json = gr.Image(type="pil", label="📷 Upload Image")
498
- image_url_input_json = gr.Textbox(
499
- label="🔗 Image URL",
500
- placeholder="https://example.com/image.jpg",
501
- )
502
- url_model_choice = gr.Dropdown(
503
- choices=VALID_MODELS,
504
- value=VALID_MODELS[0],
505
- label="🔎 Select Model",
506
- )
507
- url_model_info = gr.Markdown(
508
- f"**Model Info**: {MODEL_DESCRIPTIONS[VALID_MODELS[0]]}",
509
- visible=True,
510
- )
511
- url_submit_btn = gr.Button("🔄 Process", variant="primary")
512
- url_output = gr.JSON(label="API Response")
513
-
514
- url_model_choice.change(
515
- fn=lambda model_name: (
516
- f"**Model Info**: {MODEL_DESCRIPTIONS.get(model_name, 'No description available.')}"
517
- ),
518
- inputs=url_model_choice,
519
- outputs=url_model_info,
520
- )
521
-
522
- def process_url_for_gradio(image: Optional[Image.Image], url: Optional[str], model_name: str) -> Dict:
523
- """
524
- Process image from file or URL for Gradio UI and return JSON response.
525
-
526
- Args:
527
- image: Uploaded PIL Image (optional).
528
- url: Image URL (optional).
529
- model_name: Model to use for detection.
530
-
531
- Returns:
532
- Dictionary with processed image (base64), detected objects, and confidences.
533
- """
534
- try:
535
- if image is None and not url:
536
- return {"error": "Please provide an image or URL"}
537
- if image and url:
538
- return {"error": "Please provide either an image or URL, not both"}
539
-
540
- if url:
541
- response = requests.get(url, timeout=10)
542
- response.raise_for_status()
543
- image = Image.open(BytesIO(response.content)).convert("RGB")
544
-
545
- detected_image, objects, scores, unique_objects, unique_scores, _ = process(
546
- image, model_name
547
  )
548
- buffered = BytesIO()
549
- detected_image.save(buffered, format="PNG")
550
- img_base64 = base64.b64encode(buffered.getvalue()).decode("utf-8")
551
- return {
552
- "image_url": f"data:image/png;base64,{img_base64}",
553
- "detected_objects": objects,
554
- "confidence_scores": scores,
555
- "unique_objects": unique_objects,
556
- "unique_confidence_scores": unique_scores,
557
- }
558
- except requests.RequestException as e:
559
- error_msg = f"Error fetching image from URL: {str(e)}"
560
- logger.error(f"{error_msg}\n{traceback.format_exc()}")
561
- return {"error": error_msg}
562
- except Exception as e:
563
- error_msg = f"Error processing image: {str(e)}"
564
- logger.error(f"{error_msg}\n{traceback.format_exc()}")
565
- return {"error": error_msg}
566
 
 
567
  url_submit_btn.click(
568
- fn=process_url_for_gradio,
569
  inputs=[image_input_json, image_url_input_json, url_model_choice],
570
  outputs=[url_output],
571
  )
572
 
 
573
  with gr.Tab("ℹ️ Help"):
574
  gr.Markdown(
575
  """
576
  ## How to Use
577
- - **Image Upload**: Select a model, upload an image or provide a URL, and click "Detect" to see detected objects and image properties.
578
- - **JSON Output**: Upload an image or enter a URL, select a model, and click "Process" to get results in JSON format.
579
- - **Models**: Choose from DETR (object detection or panoptic segmentation) or YOLOS (lightweight detection).
580
- - **Clear**: Reset all inputs and outputs using the "Clear" button in the Image Upload tab.
581
- - **Errors**: Check the error box (Image Upload) or JSON response (JSON Output) for issues.
582
-
583
  ## Tips
584
- - Use high-quality images for better detection results.
585
- - Panoptic models (e.g., DETR-ResNet-50-panoptic) provide segmentation masks for complex scenes.
586
- - For faster processing, try YOLOS-Tiny on resource-constrained devices.
587
  """
588
  )
589
 
590
- return app
591
 
592
  except Exception as e:
593
  logger.error(f"Error creating Gradio UI: {str(e)}\n{traceback.format_exc()}")
@@ -599,38 +444,25 @@ def create_gradio_ui() -> gr.Blocks:
599
 
600
  def parse_args() -> argparse.Namespace:
601
  """
602
- Parse command-line arguments with defaults and ignore unrecognized arguments.
603
 
604
  Returns:
605
  Parsed arguments as a Namespace object.
606
-
607
- Raises:
608
- SystemExit: If argument parsing fails (handled by argparse).
609
  """
610
- parser = argparse.ArgumentParser(
611
- description="Launcher for Object Detection App with Gradio UI and optional FastAPI server."
612
- )
613
- parser.add_argument(
614
- "--gradio-port",
615
- type=int,
616
- default=DEFAULT_GRADIO_PORT,
617
- help=f"Port for the Gradio UI (default: {DEFAULT_GRADIO_PORT}).",
618
- )
619
- parser.add_argument(
620
- "--enable-fastapi",
621
- action="store_true",
622
- default=False,
623
- help="Enable the FastAPI server (disabled by default).",
624
- )
625
- parser.add_argument(
626
- "--fastapi-port",
627
- type=int,
628
- default=DEFAULT_FASTAPI_PORT,
629
- help=f"Port for the FastAPI server if enabled (default: {DEFAULT_FASTAPI_PORT}).",
630
- )
631
-
632
- # Parse known arguments and ignore unrecognized ones (e.g., Jupyter kernel args)
633
  args, _ = parser.parse_known_args()
 
 
 
634
  return args
635
 
636
  def find_available_port(start_port: int, port_range: range, max_attempts: int) -> Optional[int]:
@@ -638,30 +470,21 @@ def find_available_port(start_port: int, port_range: range, max_attempts: int) -> Optional[int]:
638
  Find an available port within the specified range.
639
 
640
  Args:
641
- start_port: Initial port to try (e.g., from args or environment).
642
  port_range: Range of ports to attempt.
643
  max_attempts: Maximum number of ports to try.
644
 
645
  Returns:
646
  Available port number, or None if no port is found.
647
-
648
- Raises:
649
- OSError: If port binding fails for reasons other than port in use.
650
  """
651
  import socket
652
-
653
- port = start_port
654
  attempts = 0
655
-
656
- # Check environment variable GRADIO_SERVER_PORT
657
- env_port = os.getenv("GRADIO_SERVER_PORT")
658
- if env_port and env_port.isdigit():
659
- port = int(env_port)
660
- logger.info(f"Using GRADIO_SERVER_PORT from environment: {port}")
661
-
662
  while attempts < max_attempts:
663
  with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
664
  try:
 
665
  s.bind(("0.0.0.0", port))
666
  logger.debug(f"Port {port} is available")
667
  return port
@@ -672,70 +495,47 @@ def find_available_port(start_port: int, port_range: range, max_attempts: int) -> Optional[int]:
672
  attempts += 1
673
  else:
674
  raise
675
- except Exception as e:
676
- logger.error(f"Error checking port {port}: {str(e)}")
677
- raise
678
- logger.error(f"No available port found in range {min(port_range)}-{max(port_range)} after {max_attempts} attempts")
679
  return None
680
 
681
- def run_fastapi_server(host: str, port: int) -> None:
682
- """
683
- Run the FastAPI server using Uvicorn.
684
-
685
- Args:
686
- host: Host address for the FastAPI server.
687
- port: Port for the FastAPI server.
688
- """
689
- try:
690
- uvicorn.run(app, host=host, port=port)
691
- except Exception as e:
692
- logger.error(f"Error running FastAPI server: {str(e)}\n{traceback.format_exc()}")
693
- sys.exit(1)
694
-
695
  def main() -> None:
696
  """
697
- Main function to launch Gradio UI and optional FastAPI server.
698
 
699
  Raises:
700
- SystemExit: If the application is interrupted or encounters an error.
701
  """
702
  try:
703
- # Apply nest_asyncio to allow nested event loops in Jupyter/Colab
704
  nest_asyncio.apply()
705
-
706
  # Parse command-line arguments
707
  args = parse_args()
708
  logger.info(f"Parsed arguments: {args}")
709
-
710
  # Find available port for Gradio
711
  gradio_port = find_available_port(args.gradio_port, PORT_RANGE, MAX_PORT_ATTEMPTS)
712
  if gradio_port is None:
713
  logger.error("Failed to find an available port for Gradio UI")
714
  sys.exit(1)
715
 
716
- # Launch FastAPI server in a separate thread if enabled
717
  if args.enable_fastapi:
718
- logger.info(f"Starting FastAPI server on port {args.fastapi_port}")
719
  fastapi_thread = threading.Thread(
720
- target=run_fastapi_server,
721
- args=("0.0.0.0", args.fastapi_port),
722
  daemon=True
723
  )
724
  fastapi_thread.start()
725
 
726
  # Launch Gradio UI
727
  logger.info(f"Starting Gradio UI on port {gradio_port}")
728
- app = create_gradio_ui()
729
- app.launch(server_port=gradio_port, server_name="0.0.0.0")
730
 
731
  except KeyboardInterrupt:
732
  logger.info("Application terminated by user.")
733
  sys.exit(0)
734
- except OSError as e:
735
- logger.error(f"Port binding error: {str(e)}")
736
- sys.exit(1)
737
  except Exception as e:
738
- logger.error(f"Error running application: {str(e)}\n{traceback.format_exc()}")
739
  sys.exit(1)
740
 
741
  if __name__ == "__main__":
 
3
  import logging
4
  import os
5
  import sys
  import traceback
6
  import threading
7
  from collections import Counter
8
  from io import BytesIO
9
+ from typing import Dict, List, Optional, Tuple, Union
10
 
11
  import gradio as gr
12
  import pandas as pd
 
29
  # Configuration
30
  # ------------------------------
31
 
32
+ # Configure logging for debugging and monitoring
33
+ logging.basicConfig(level=logging.INFO, format="%(asctime)s - %(levelname)s - %(message)s")
 
 
 
34
  logger = logging.getLogger(__name__)
35
 
36
+ # Define constants for model and server configuration
37
+ CONFIDENCE_THRESHOLD: float = 0.5 # Default threshold for object detection confidence
38
  VALID_MODELS: List[str] = [
39
  "facebook/detr-resnet-50",
40
  "facebook/detr-resnet-101",
 
44
  "hustvl/yolos-base",
45
  ]
46
  MODEL_DESCRIPTIONS: Dict[str, str] = {
47
+ "facebook/detr-resnet-50": "DETR with ResNet-50 for object detection. Fast and accurate.",
48
+ "facebook/detr-resnet-101": "DETR with ResNet-101 for object detection. More accurate, slower.",
49
+ "facebook/detr-resnet-50-panoptic": "DETR with ResNet-50 for panoptic segmentation.",
50
+ "facebook/detr-resnet-101-panoptic": "DETR with ResNet-101 for panoptic segmentation.",
51
+ "hustvl/yolos-tiny": "YOLOS Tiny. Lightweight and fast.",
52
+ "hustvl/yolos-base": "YOLOS Base. Balances speed and accuracy."
53
  }
54
+ DEFAULT_GRADIO_PORT: int = 7860 # Default port for Gradio UI
55
+ DEFAULT_FASTAPI_PORT: int = 8000 # Default port for FastAPI server
56
+ PORT_RANGE: range = range(7860, 7870) # Range of ports to try for Gradio
57
+ MAX_PORT_ATTEMPTS: int = 10 # Maximum attempts to find an available port
 
 
58
 
59
  # Thread-safe storage for lazy-loaded models and processors
60
  models: Dict[str, any] = {}
61
  processors: Dict[str, any] = {}
62
  model_lock = threading.Lock()
63
 
64
  # ------------------------------
65
  # Image Processing
66
  # ------------------------------
67
 
68
+ def process_image(
69
+ image: Optional[Image.Image],
70
+ url: Optional[str],
71
+ model_name: str,
72
+ for_json: bool = False,
73
+ confidence_threshold: float = CONFIDENCE_THRESHOLD
74
+ ) -> Union[Dict, Tuple[Optional[Image.Image], Optional[pd.DataFrame], Optional[pd.DataFrame], Optional[pd.DataFrame], str]]:
75
  """
76
+ Process an image for object detection or panoptic segmentation, handling Gradio and FastAPI inputs.
77
 
78
  Args:
79
+ image: PIL Image object from file upload (optional).
80
+ url: URL of the image to process (optional).
81
  model_name: Name of the model to use (must be in VALID_MODELS).
82
+ for_json: If True, return JSON dict for API/JSON tab; else, return tuple for Gradio Home tab.
83
+ confidence_threshold: Minimum confidence score for detection (default: 0.5).
84
 
85
  Returns:
86
+ For JSON: Dict with base64-encoded image, detected objects, and confidence scores.
87
+ For Gradio: Tuple of (annotated image, objects DataFrame, unique objects DataFrame, properties DataFrame, error message).
88
  """
 
 
 
89
  try:
90
+ # Validate input: ensure exactly one of image or URL is provided
91
+ if image is None and not url:
92
+ return {"error": "Please provide an image or URL"} if for_json else (None, None, None, None, "Please provide an image or URL")
93
+ if image and url:
94
+ return {"error": "Provide either an image or URL, not both"} if for_json else (None, None, None, None, "Provide either an image or URL, not both")
95
+ if model_name not in VALID_MODELS:
96
+ error_msg = f"Invalid model: {model_name}. Choose from: {VALID_MODELS}"
97
+ return {"error": error_msg} if for_json else (None, None, None, None, error_msg)
98
 
99
+ # Calculate margin threshold: (1 - confidence_threshold) / 2 + confidence_threshold
100
+ margin_threshold = (1 - confidence_threshold) / 2 + confidence_threshold
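+ # e.g. confidence_threshold=0.5 gives margin_threshold=0.75: detections scoring
+ # in [0.5, 0.75) are treated as lower confidence and drawn in yellow below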
101
+
102
+ # Load image from URL if provided
103
+ if url:
104
+ response = requests.get(url, timeout=10)
105
+ response.raise_for_status()
106
+ image = Image.open(BytesIO(response.content)).convert("RGB")
107
+
108
+ # Load model and processor thread-safely
109
+ with model_lock:
110
+ if model_name not in models:
111
+ logger.info(f"Loading model: {model_name}")
112
+ try:
113
+ # Select appropriate model and processor based on model name
114
+ if "yolos" in model_name:
115
+ models[model_name] = YolosForObjectDetection.from_pretrained(model_name)
116
+ processors[model_name] = YolosImageProcessor.from_pretrained(model_name)
117
+ elif "panoptic" in model_name:
118
+ models[model_name] = DetrForSegmentation.from_pretrained(model_name)
119
+ processors[model_name] = DetrImageProcessor.from_pretrained(model_name)
120
+ else:
121
+ models[model_name] = DetrForObjectDetection.from_pretrained(model_name)
122
+ processors[model_name] = DetrImageProcessor.from_pretrained(model_name)
123
+ except Exception as e:
124
+ error_msg = f"Failed to load model: {str(e)}"
125
+ logger.error(error_msg)
126
+ return {"error": error_msg} if for_json else (None, None, None, None, error_msg)
127
+ model, processor = models[model_name], processors[model_name]
128
+
129
+ # Prepare image for model processing
130
  inputs = processor(images=image, return_tensors="pt")
131
  with torch.no_grad():
132
  outputs = model(**inputs)
133
 
134
+ # Initialize drawing context for annotations
135
  draw = ImageDraw.Draw(image)
136
  object_names: List[str] = []
137
  confidence_scores: List[float] = []
138
  object_counter = Counter()
139
  target_sizes = torch.tensor([image.size[::-1]])
140
 
141
+ # Process results based on model type (panoptic or object detection)
142
  if "panoptic" in model_name:
143
+ # Handle panoptic segmentation
144
  processed_sizes = torch.tensor([[inputs["pixel_values"].shape[2], inputs["pixel_values"].shape[3]]])
145
  results = processor.post_process_panoptic(outputs, target_sizes=target_sizes, processed_sizes=processed_sizes)[0]
 
146
  for segment in results["segments_info"]:
147
  label = segment["label_id"]
148
  label_name = model.config.id2label.get(label, "Unknown")
149
  score = segment.get("score", 1.0)
 
150
  # Apply segmentation mask if available
151
  if "masks" in results and segment["id"] < len(results["masks"]):
152
  mask = results["masks"][segment["id"]].cpu().numpy()
 
158
  mask_draw.bitmap((0, 0), mask_image, fill=(r, g, b, 128))
159
  image = Image.alpha_composite(image.convert("RGBA"), colored_mask).convert("RGB")
160
  draw = ImageDraw.Draw(image)
161
+ if score > confidence_threshold:
 
162
  object_names.append(label_name)
163
  confidence_scores.append(float(score))
164
  object_counter[label_name] = float(score)
165
  else:
166
+ # Handle object detection
167
  results = processor.post_process_object_detection(outputs, target_sizes=target_sizes)[0]
 
168
  for score, label, box in zip(results["scores"], results["labels"], results["boxes"]):
169
+ if score > confidence_threshold:
170
  x, y, x2, y2 = box.tolist()
 
171
  label_name = model.config.id2label.get(label.item(), "Unknown")
172
  text = f"{label_name}: {score:.2f}"
173
  text_bbox = draw.textbbox((0, 0), text)
174
  text_width, text_height = text_bbox[2] - text_bbox[0], text_bbox[3] - text_bbox[1]
175
+ # Use yellow for confidence_threshold <= score < margin_threshold, green for >= margin_threshold
176
+ color = "#FFFF00" if score < margin_threshold else "#32CD32"
177
+ draw.rectangle([x, y, x2, y2], outline=color, width=2)
178
+ draw.text((x2 - text_width - 2, y - text_height - 2), text, fill=color)
179
  object_names.append(label_name)
180
  confidence_scores.append(float(score))
181
  object_counter[label_name] = float(score)
182
 
183
+ # Compile unique objects and their highest confidence scores
184
  unique_objects = list(object_counter.keys())
185
  unique_confidences = [object_counter[obj] for obj in unique_objects]
186
 
187
+ # Calculate image properties (metadata)
188
  properties: Dict[str, str] = {
189
  "Format": image.format if hasattr(image, "format") and image.format else "Unknown",
190
  "Size": f"{image.width}x{image.height}",
191
  "Width": f"{image.width} px",
192
  "Height": f"{image.height} px",
193
  "Mode": image.mode,
194
+ "Aspect Ratio": f"{round(image.width / image.height, 2)}" if image.height != 0 else "Undefined",
 
 
195
  "File Size": "Unknown",
196
  "Mean (R,G,B)": "Unknown",
197
  "StdDev (R,G,B)": "Unknown",
198
  }
 
 
199
  try:
200
+ # Compute file size
201
  buffered = BytesIO()
202
  image.save(buffered, format="PNG")
203
  properties["File Size"] = f"{len(buffered.getvalue()) / 1024:.2f} KB"
204
+ # Compute color statistics
 
 
 
 
205
  stat = ImageStat.Stat(image)
206
  properties["Mean (R,G,B)"] = ", ".join(f"{m:.2f}" for m in stat.mean)
207
  properties["StdDev (R,G,B)"] = ", ".join(f"{s:.2f}" for s in stat.stddev)
208
  except Exception as e:
209
+ logger.error(f"Error calculating image stats: {str(e)}")
210
 
211
+ # Prepare output based on request type
212
+ if for_json:
213
+ # Return JSON with base64-encoded image
214
+ buffered = BytesIO()
215
+ image.save(buffered, format="PNG")
216
+ img_base64 = base64.b64encode(buffered.getvalue()).decode("utf-8")
217
+ return {
218
+ "image_url": f"data:image/png;base64,{img_base64}",
219
+ "detected_objects": object_names,
220
+ "confidence_scores": confidence_scores,
221
+ "unique_objects": unique_objects,
222
+ "unique_confidence_scores": unique_confidences,
223
+ }
224
+ else:
225
+ # Return tuple for Gradio Home tab with DataFrames
226
+ objects_df = (
227
+ pd.DataFrame({"Object": object_names, "Confidence Score": [f"{score:.2f}" for score in confidence_scores]})
228
+ if object_names else pd.DataFrame(columns=["Object", "Confidence Score"])
229
+ )
230
+ unique_objects_df = (
231
+ pd.DataFrame({"Unique Object": unique_objects, "Confidence Score": [f"{score:.2f}" for score in unique_confidences]})
232
+ if unique_objects else pd.DataFrame(columns=["Unique Object", "Confidence Score"])
233
+ )
234
+ properties_df = pd.DataFrame([properties]) if properties else pd.DataFrame(columns=properties.keys())
235
+ return image, objects_df, unique_objects_df, properties_df, ""
236
 
237
+ except requests.RequestException as e:
238
+ # Handle URL fetch errors
239
+ error_msg = f"Error fetching image from URL: {str(e)}"
240
+ logger.error(f"{error_msg}\n{traceback.format_exc()}")
241
+ return {"error": error_msg} if for_json else (None, None, None, None, error_msg)
242
  except Exception as e:
243
+ # Handle general processing errors
244
+ error_msg = f"Error processing image: {str(e)}"
245
+ logger.error(f"{error_msg}\n{traceback.format_exc()}")
246
+ return {"error": error_msg} if for_json else (None, None, None, None, error_msg)
247
 
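+ # Example (hypothetical) of calling process_image directly, e.g. from a notebook:
+ #   img = Image.open("sample.jpg").convert("RGB")
+ #   result = process_image(img, None, VALID_MODELS[0], for_json=True)
+ #   result["unique_objects"]  # distinct labels above the threshold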
248
  # ------------------------------
249
  # FastAPI Setup
 
256
  file: Optional[UploadFile] = File(None),
257
  image_url: Optional[str] = Form(None),
258
  model_name: str = Form(VALID_MODELS[0]),
259
+ confidence_threshold: float = Form(CONFIDENCE_THRESHOLD),
260
  ) -> JSONResponse:
261
  """
262
  FastAPI endpoint to detect objects in an image from file upload or URL.
 
265
  file: Uploaded image file (optional).
266
  image_url: URL of the image (optional).
267
  model_name: Model to use for detection (default: first VALID_MODELS entry).
268
+ confidence_threshold: Confidence threshold for detection (default: 0.5).
269
 
270
  Returns:
271
+ JSONResponse with base64-encoded image, detected objects, and confidence scores.
272
 
273
  Raises:
274
+ HTTPException: For invalid inputs or processing errors.
275
  """
276
  try:
277
+ # Validate input: ensure exactly one of file or URL
278
  if (file is None and not image_url) or (file is not None and image_url):
279
+ raise HTTPException(status_code=400, detail="Provide either an image file or an image URL, not both.")
280
+ # Validate confidence threshold
281
+ if not 0 <= confidence_threshold <= 1:
282
+ raise HTTPException(status_code=400, detail="Confidence threshold must be between 0 and 1.")
283
+ # Load image from file if provided
284
+ image = None
285
  if file:
286
  if not file.content_type.startswith("image/"):
287
  raise HTTPException(status_code=400, detail="File must be an image")
288
  contents = await file.read()
289
  image = Image.open(BytesIO(contents)).convert("RGB")
290
+ # Process image with specified parameters
291
+ result = process_image(image, image_url, model_name, for_json=True, confidence_threshold=confidence_threshold)
292
+ if "error" in result:
293
+ raise HTTPException(status_code=400, detail=result["error"])
294
+ return JSONResponse(content=result)
295
+ except HTTPException:
296
+ raise
 
297
  except Exception as e:
298
  logger.error(f"Error in FastAPI endpoint: {str(e)}\n{traceback.format_exc()}")
299
  raise HTTPException(status_code=500, detail=f"Error processing image: {str(e)}")
 
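+ # Example request (hypothetical values) exercising the new form field:
+ #   curl -X POST "http://localhost:8000/detect" \
+ #     -F "file=@/path/to/image.jpg" \
+ #     -F "model_name=facebook/detr-resnet-50" \
+ #     -F "confidence_threshold=0.75"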
304
 
305
  def create_gradio_ui() -> gr.Blocks:
306
  """
307
+ Create and configure the Gradio UI for object detection with Home, JSON, and Help tabs.
308
 
309
  Returns:
310
  Gradio Blocks object representing the UI.
 
313
  RuntimeError: If UI creation fails.
314
  """
315
  try:
316
+ # Initialize Gradio Blocks with a custom theme
317
+ with gr.Blocks(theme=gr.themes.Default(primary_hue="blue", secondary_hue="gray")) as demo:
318
+ # Display app header
319
  gr.Markdown(
320
  f"""
321
  # 🚀 Object Detection App
322
+ Upload an image or provide a URL to detect objects using transformer models (DETR, YOLOS).
323
  Running on port: {os.getenv('GRADIO_SERVER_PORT', 'auto-selected')}
324
  """
325
  )
326
 
327
+ # Create tabbed interface
328
  with gr.Tabs():
329
+ # Home tab (formerly Image Upload)
330
+ with gr.Tab("🏠 Home"):
331
  with gr.Row():
332
+ # Left column for inputs
333
  with gr.Column(scale=1):
334
  gr.Markdown("### Input")
335
+ # Model selection dropdown
336
+ model_choice = gr.Dropdown(choices=VALID_MODELS, value=VALID_MODELS[0], label="🔎 Select Model")
337
+ model_info = gr.Markdown(f"**Model Info**: {MODEL_DESCRIPTIONS[VALID_MODELS[0]]}")
338
+ # Image upload input
339
  image_input = gr.Image(type="pil", label="📷 Upload Image")
340
+ # Image URL input
341
+ image_url_input = gr.Textbox(label="🔗 Image URL", placeholder="https://example.com/image.jpg")
342
+ # Buttons for submission and clearing
 
343
  with gr.Row():
344
  submit_btn = gr.Button("✨ Detect", variant="primary")
345
  clear_btn = gr.Button("🗑️ Clear", variant="secondary")
346
 
347
+ # Update model info when model changes
348
  model_choice.change(
349
+ fn=lambda model_name: f"**Model Info**: {MODEL_DESCRIPTIONS.get(model_name, 'No description available.')}",
 
 
350
  inputs=model_choice,
351
  outputs=model_info,
352
  )
353
 
354
+ # Right column for results
355
  with gr.Column(scale=2):
356
  gr.Markdown("### Results")
357
+ # Error display (hidden by default)
358
+ error_output = gr.Textbox(label="⚠️ Errors", visible=False, lines=3, max_lines=5)
359
+ # Annotated image output
360
+ output_image = gr.Image(type="pil", label="🎯 Detected Image", interactive=False)
361
+ # Detected and unique objects tables
362
  with gr.Row():
363
+ objects_output = gr.DataFrame(label="📋 Detected Objects", interactive=False)
364
+ unique_objects_output = gr.DataFrame(label="🔍 Unique Objects", interactive=False)
365
+ # Image properties table
366
+ properties_output = gr.DataFrame(label="📄 Image Properties", interactive=False)
367
 
368
+ # Process image when Detect button is clicked
369
  submit_btn.click(
370
+ fn=process_image,
371
  inputs=[image_input, image_url_input, model_choice],
372
  outputs=[output_image, objects_output, unique_objects_output, properties_output, error_output],
373
  )
374
 
375
+ # Clear all inputs and outputs
376
  clear_btn.click(
377
  fn=lambda: [None, "", None, None, None, None, ""],
378
  inputs=None,
379
+ outputs=[image_input, image_url_input, output_image, objects_output, unique_objects_output, properties_output, error_output],
380
  )
381
 
382
+ # JSON tab for API-like output
383
+ with gr.Tab("🔗 JSON"):
384
+ with gr.Row():
385
+ # Left column for inputs
386
+ with gr.Column(scale=1):
387
+ gr.Markdown("### Process Image for JSON")
388
+ # Model selection dropdown
389
+ url_model_choice = gr.Dropdown(choices=VALID_MODELS, value=VALID_MODELS[0], label="🔎 Select Model")
390
+ url_model_info = gr.Markdown(f"**Model Info**: {MODEL_DESCRIPTIONS[VALID_MODELS[0]]}")
391
+ # Image upload input
392
+ image_input_json = gr.Image(type="pil", label="📷 Upload Image")
393
+ # Image URL input
394
+ image_url_input_json = gr.Textbox(label="🔗 Image URL", placeholder="https://example.com/image.jpg")
395
+ # Process button
396
+ url_submit_btn = gr.Button("🔄 Process", variant="primary")
397
+
398
+ # Update model info when model changes
399
+ url_model_choice.change(
400
+ fn=lambda model_name: f"**Model Info**: {MODEL_DESCRIPTIONS.get(model_name, 'No description available.')}",
401
+ inputs=url_model_choice,
402
+ outputs=url_model_info,
403
  )
404
 
405
+ # Right column for JSON output
406
+ with gr.Column(scale=1):
407
+ # JSON output display
408
+ url_output = gr.JSON(label="API Response")
409
+
410
+ # Process image and return JSON when Process button is clicked
411
  url_submit_btn.click(
412
+ fn=lambda img, url, model: process_image(img, url, model, for_json=True),
413
  inputs=[image_input_json, image_url_input_json, url_model_choice],
414
  outputs=[url_output],
415
  )
416
 
417
+ # Help tab with usage instructions
418
  with gr.Tab("ℹ️ Help"):
419
  gr.Markdown(
420
  """
421
  ## How to Use
422
+ - **Home**: Select a model, upload an image or provide a URL, click "Detect" to see results.
423
+ - **JSON**: Select a model, upload an image or enter a URL, click "Process" for JSON output.
424
+ - **Models**: Choose DETR (detection or panoptic) or YOLOS (lightweight detection).
425
+ - **Clear**: Reset inputs/outputs in Home tab.
426
+ - **Errors**: Check error box (Home) or JSON response (JSON) for issues.
427
+
428
  ## Tips
429
+ - Use high-quality images for better results.
430
+ - Panoptic models provide segmentation masks for complex scenes.
431
+ - YOLOS-Tiny is faster for resource-constrained devices.
432
  """
433
  )
434
 
435
+ return demo
436
 
437
  except Exception as e:
438
  logger.error(f"Error creating Gradio UI: {str(e)}\n{traceback.format_exc()}")
 
444
 
445
  def parse_args() -> argparse.Namespace:
446
  """
447
+ Parse command-line arguments for configuring the application.
448
 
449
  Returns:
450
  Parsed arguments as a Namespace object.
 
 
 
451
  """
452
+ parser = argparse.ArgumentParser(description="Object Detection App with Gradio and FastAPI.")
453
+ # Gradio port argument
454
+ parser.add_argument("--gradio-port", type=int, default=DEFAULT_GRADIO_PORT, help=f"Gradio port (default: {DEFAULT_GRADIO_PORT}).")
455
+ # FastAPI enable flag
456
+ parser.add_argument("--enable-fastapi", action="store_true", help="Enable FastAPI server.")
457
+ # FastAPI port argument
458
+ parser.add_argument("--fastapi-port", type=int, default=DEFAULT_FASTAPI_PORT, help=f"FastAPI port (default: {DEFAULT_FASTAPI_PORT}).")
459
+ # Confidence threshold argument
460
+ parser.add_argument("--confidence-threshold", type=float, default=CONFIDENCE_THRESHOLD, help="Confidence threshold for detection (default: 0.5).")
461
+ # Parse known arguments, ignoring unrecognized ones
 
462
  args, _ = parser.parse_known_args()
463
+ # Validate confidence threshold
464
+ if not 0 <= args.confidence_threshold <= 1:
465
+ parser.error("Confidence threshold must be between 0 and 1.")
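+ # e.g. `python app.py --confidence-threshold 1.5` exits here with:
+ # "error: Confidence threshold must be between 0 and 1."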
466
  return args
467
 
468
  def find_available_port(start_port: int, port_range: range, max_attempts: int) -> Optional[int]:
 
470
  Find an available port within the specified range.
471
 
472
  Args:
473
+ start_port: Initial port to try.
474
  port_range: Range of ports to attempt.
475
  max_attempts: Maximum number of ports to try.
476
 
477
  Returns:
478
  Available port number, or None if no port is found.
 
 
 
479
  """
480
  import socket
481
+ # Check environment variable for port override
482
+ env_port = os.getenv("GRADIO_SERVER_PORT", "")
+ port = int(env_port) if env_port.isdigit() else start_port
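+ # e.g. `GRADIO_SERVER_PORT=7865 python app.py` makes 7865 the first port tried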
483
  attempts = 0
484
  while attempts < max_attempts:
485
  with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
486
  try:
487
+ # Attempt to bind to the port
488
  s.bind(("0.0.0.0", port))
489
  logger.debug(f"Port {port} is available")
490
  return port
 
495
  attempts += 1
496
  else:
497
  raise
498
+ logger.error(f"No available port in range {min(port_range)}-{max(port_range)}")
 
 
 
499
  return None
500
 
501
  def main() -> None:
502
  """
503
+ Launch the Gradio UI and optional FastAPI server.
504
 
505
  Raises:
506
+ SystemExit: On interruption or critical errors.
507
  """
508
  try:
509
+ # Apply nest_asyncio for compatibility with Jupyter/Colab
510
  nest_asyncio.apply()
 
511
  # Parse command-line arguments
512
  args = parse_args()
513
  logger.info(f"Parsed arguments: {args}")
 
514
  # Find available port for Gradio
515
  gradio_port = find_available_port(args.gradio_port, PORT_RANGE, MAX_PORT_ATTEMPTS)
516
  if gradio_port is None:
517
  logger.error("Failed to find an available port for Gradio UI")
518
  sys.exit(1)
519
 
520
+ # Start FastAPI server in a thread if enabled
521
  if args.enable_fastapi:
522
+ logger.info(f"Starting FastAPI on port {args.fastapi_port}")
523
  fastapi_thread = threading.Thread(
524
+ target=lambda: uvicorn.run(app, host="0.0.0.0", port=args.fastapi_port),
 
525
  daemon=True
526
  )
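+ # daemon=True ties the FastAPI thread's lifetime to the main (Gradio) process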
527
  fastapi_thread.start()
528
 
529
  # Launch Gradio UI
530
  logger.info(f"Starting Gradio UI on port {gradio_port}")
531
+ demo = create_gradio_ui()
532
+ demo.launch(server_port=gradio_port, server_name="0.0.0.0")
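+ # launch() keeps the main thread serving the UI until interrupted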
533
 
534
  except KeyboardInterrupt:
535
  logger.info("Application terminated by user.")
536
  sys.exit(0)
 
 
 
537
  except Exception as e:
538
+ logger.error(f"Error: {str(e)}\n{traceback.format_exc()}")
539
  sys.exit(1)
540
 
541
  if __name__ == "__main__":
hf_space/app.py CHANGED
@@ -1,79 +1,166 @@
1
- import gradio as gr
2
- import torch
3
- from transformers import DetrImageProcessor, DetrForObjectDetection
4
- from transformers import YolosImageProcessor, YolosForObjectDetection
5
- from transformers import DetrForSegmentation
6
- from PIL import Image, ImageDraw, ImageStat
7
- import requests
8
- from io import BytesIO
9
  import base64
10
- from collections import Counter
11
  import logging
12
- from fastapi import FastAPI, File, UploadFile, HTTPException, Form
13
- from fastapi.responses import JSONResponse
14
- import uvicorn
15
- import pandas as pd
16
- import traceback
17
  import os
18
 
19
- # Set up logging
20
- logging.basicConfig(level=logging.INFO, format="%(asctime)s - %(levelname)s - %(message)s")
 
 
 
21
  logger = logging.getLogger(__name__)
22
 
23
- # Constants
24
- CONFIDENCE_THRESHOLD = 0.5
25
- VALID_MODELS = [
26
  "facebook/detr-resnet-50",
27
  "facebook/detr-resnet-101",
28
  "facebook/detr-resnet-50-panoptic",
29
  "facebook/detr-resnet-101-panoptic",
30
  "hustvl/yolos-tiny",
31
- "hustvl/yolos-base"
32
  ]
33
- MODEL_DESCRIPTIONS = {
34
- "facebook/detr-resnet-50": "DETR with ResNet-50 backbone for object detection. Fast and accurate for general use.",
35
- "facebook/detr-resnet-101": "DETR with ResNet-101 backbone for object detection. More accurate but slower than ResNet-50.",
36
- "facebook/detr-resnet-50-panoptic": "DETR with ResNet-50 for panoptic segmentation. Detects objects and segments scenes.",
37
- "facebook/detr-resnet-101-panoptic": "DETR with ResNet-101 for panoptic segmentation. High accuracy for complex scenes.",
38
- "hustvl/yolos-tiny": "YOLOS Tiny model. Lightweight and fast, ideal for resource-constrained environments.",
39
- "hustvl/yolos-base": "YOLOS Base model. Balances speed and accuracy for object detection."
40
  }
41
 
42
- # Lazy model loading
43
- models = {}
44
- processors = {}
 
 
45
 
46
- def process(image, model_name):
47
- """Process an image and return detected image, objects, confidences, unique objects, unique confidences, and properties."""
48
- try:
49
- if model_name not in VALID_MODELS:
50
- raise ValueError(f"Invalid model: {model_name}. Choose from: {VALID_MODELS}")
51
 
52
- # Load model and processor
53
  if model_name not in models:
54
  logger.info(f"Loading model: {model_name}")
55
- if "yolos" in model_name:
56
- models[model_name] = YolosForObjectDetection.from_pretrained(model_name)
57
- processors[model_name] = YolosImageProcessor.from_pretrained(model_name)
58
- elif "panoptic" in model_name:
59
- models[model_name] = DetrForSegmentation.from_pretrained(model_name)
60
- processors[model_name] = DetrImageProcessor.from_pretrained(model_name)
61
- else:
62
- models[model_name] = DetrForObjectDetection.from_pretrained(model_name)
63
- processors[model_name] = DetrImageProcessor.from_pretrained(model_name)
64
-
65
- model, processor = models[model_name], processors[model_name]
66
- inputs = processor(images=image, return_tensors="pt")
67
 
68
  with torch.no_grad():
69
  outputs = model(**inputs)
70
 
71
- target_sizes = torch.tensor([image.size[::-1]])
72
  draw = ImageDraw.Draw(image)
73
- object_names = []
74
- confidence_scores = []
75
  object_counter = Counter()
 
76
 
 
77
  if "panoptic" in model_name:
78
  processed_sizes = torch.tensor([[inputs["pixel_values"].shape[2], inputs["pixel_values"].shape[3]]])
79
  results = processor.post_process_panoptic(outputs, target_sizes=target_sizes, processed_sizes=processed_sizes)[0]
@@ -83,6 +170,7 @@ def process(image, model_name):
83
  label_name = model.config.id2label.get(label, "Unknown")
84
  score = segment.get("score", 1.0)
85
 
 
86
  if "masks" in results and segment["id"] < len(results["masks"]):
87
  mask = results["masks"][segment["id"]].cpu().numpy()
88
  if mask.shape[0] > 0 and mask.shape[1] > 0:
@@ -106,7 +194,6 @@ def process(image, model_name):
106
  x, y, x2, y2 = box.tolist()
107
  draw.rectangle([x, y, x2, y2], outline="#32CD32", width=2)
108
  label_name = model.config.id2label.get(label.item(), "Unknown")
109
- # Place text at top-right corner, outside the box, with smaller size
110
  text = f"{label_name}: {score:.2f}"
111
  text_bbox = draw.textbbox((0, 0), text)
112
  text_width, text_height = text_bbox[2] - text_bbox[0], text_bbox[3] - text_bbox[1]
@@ -115,58 +202,82 @@ def process(image, model_name):
115
  confidence_scores.append(float(score))
116
  object_counter[label_name] = float(score)
117
 
118
  unique_objects = list(object_counter.keys())
119
  unique_confidences = [object_counter[obj] for obj in unique_objects]
120
 
121
- # Image properties
122
- file_size = "Unknown"
123
- if hasattr(image, "fp") and image.fp is not None:
124
- buffered = BytesIO()
125
- image.save(buffered, format="PNG")
126
- file_size = f"{len(buffered.getvalue()) / 1024:.2f} KB"
127
-
128
- # Color statistics
129
- try:
130
- stat = ImageStat.Stat(image)
131
- color_stats = {
132
- "mean": [f"{m:.2f}" for m in stat.mean],
133
- "stddev": [f"{s:.2f}" for s in stat.stddev]
134
- }
135
- except Exception as e:
136
- logger.error(f"Error calculating color statistics: {str(e)}")
137
- color_stats = {"mean": "Error", "stddev": "Error"}
138
-
139
- properties = {
140
  "Format": image.format if hasattr(image, "format") and image.format else "Unknown",
141
  "Size": f"{image.width}x{image.height}",
142
  "Width": f"{image.width} px",
143
  "Height": f"{image.height} px",
144
  "Mode": image.mode,
145
- "Aspect Ratio": f"{round(image.width / image.height, 2) if image.height != 0 else 'Undefined'}",
146
- "File Size": file_size,
147
- "Mean (R,G,B)": ", ".join(color_stats["mean"]) if isinstance(color_stats["mean"], list) else color_stats["mean"],
148
- "StdDev (R,G,B)": ", ".join(color_stats["stddev"]) if isinstance(color_stats["stddev"], list) else color_stats["stddev"]
149
  }
150
 
151
  return image, object_names, confidence_scores, unique_objects, unique_confidences, properties
152
  except Exception as e:
153
  logger.error(f"Error in process: {str(e)}\n{traceback.format_exc()}")
154
- raise
155
 
156
  # FastAPI Setup
157
  app = FastAPI(title="Object Detection API")
158
 
159
  @app.post("/detect")
160
  async def detect_objects_endpoint(
161
- file: UploadFile = File(None),
162
- image_url: str = Form(None),
163
- model_name: str = Form(VALID_MODELS[0])
164
- ):
165
- """FastAPI endpoint to detect objects in an image from file or URL."""
166
  try:
167
  if (file is None and not image_url) or (file is not None and image_url):
168
- raise HTTPException(status_code=400, detail="Provide either an image file or an image URL, but not both.")
169
 
170
  if file:
171
  if not file.content_type.startswith("image/"):
172
  raise HTTPException(status_code=400, detail="File must be an image")
@@ -178,207 +289,454 @@ async def detect_objects_endpoint(
178
  image = Image.open(BytesIO(response.content)).convert("RGB")
179
 
180
  if model_name not in VALID_MODELS:
181
- raise HTTPException(status_code=400, detail=f"Invalid model. Choose from: {VALID_MODELS}")
182
 
183
- detected_image, detected_objects, detected_confidences, unique_objects, unique_confidences, _ = process(image, model_name)
184
 
185
  buffered = BytesIO()
186
  detected_image.save(buffered, format="PNG")
187
  img_base64 = base64.b64encode(buffered.getvalue()).decode("utf-8")
188
  img_url = f"data:image/png;base64,{img_base64}"
189
 
190
- return JSONResponse(content={
191
- "image_url": img_url,
192
- "detected_objects": detected_objects,
193
- "confidence_scores": detected_confidences,
194
- "unique_objects": unique_objects,
195
- "unique_confidence_scores": unique_confidences
196
- })
197
  except Exception as e:
198
  logger.error(f"Error in FastAPI endpoint: {str(e)}\n{traceback.format_exc()}")
199
  raise HTTPException(status_code=500, detail=f"Error processing image: {str(e)}")
200
 
201
- # Gradio UI
202
- def create_gradio_ui():
203
- with gr.Blocks(theme=gr.themes.Default(primary_hue="blue", secondary_hue="gray")) as demo:
204
- gr.Markdown(
205
- """
206
- # 🚀 Object Detection App
207
- Upload an image or provide a URL to detect objects using state-of-the-art transformer models (DETR, YOLOS).
208
- """
209
- )
210
-
211
- with gr.Tabs():
212
- with gr.Tab("📷 Image Upload"):
213
- with gr.Row():
214
- with gr.Column(scale=1):
215
- gr.Markdown("### Input")
216
- model_choice = gr.Dropdown(
217
- choices=VALID_MODELS,
218
- value=VALID_MODELS[0],
219
- label="🔎 Select Model",
220
- info="Choose a model for object detection or panoptic segmentation."
221
- )
222
- model_info = gr.Markdown(
223
- f"**Model Info**: {MODEL_DESCRIPTIONS[VALID_MODELS[0]]}",
224
- visible=True
225
- )
226
- image_input = gr.Image(type="pil", label="📷 Upload Image")
227
- image_url_input = gr.Textbox(
228
- label="🔗 Image URL",
229
- placeholder="https://example.com/image.jpg"
230
- )
231
- with gr.Row():
232
- submit_btn = gr.Button("✨ Detect", variant="primary")
233
- clear_btn = gr.Button("🗑️ Clear", variant="secondary")
234
-
235
- model_choice.change(
236
- fn=lambda model_name: f"**Model Info**: {MODEL_DESCRIPTIONS.get(model_name, 'No description available.')}",
237
- inputs=model_choice,
238
- outputs=model_info
239
- )
240
-
241
- with gr.Column(scale=2):
242
- gr.Markdown("### Results")
243
- error_output = gr.Textbox(
244
- label="⚠️ Errors",
245
- visible=False,
246
- lines=3,
247
- max_lines=5
248
- )
249
- output_image = gr.Image(
250
- type="pil",
251
- label="🎯 Detected Image",
252
- interactive=False
253
- )
254
- with gr.Row():
255
- objects_output = gr.DataFrame(
256
- label="📋 Detected Objects",
257
  interactive=False,
258
- value=None
259
  )
260
- unique_objects_output = gr.DataFrame(
261
- label="🔍 Unique Objects",
262
  interactive=False,
263
- value=None
264
  )
265
- properties_output = gr.DataFrame(
266
- label="📄 Image Properties",
267
- interactive=False,
268
- value=None
269
- )
270
-
271
- def process_for_gradio(image, url, model_name):
272
- try:
273
- if image is None and not url:
274
- return None, None, None, None, "Please provide an image or URL"
275
- if image and url:
276
- return None, None, None, None, "Please provide either an image or URL, not both"
277
-
278
- if url:
279
- response = requests.get(url, timeout=10)
280
- response.raise_for_status()
281
- image = Image.open(BytesIO(response.content)).convert("RGB")
282
 
283
- detected_image, objects, scores, unique_objects, unique_scores, properties = process(image, model_name)
284
- objects_df = pd.DataFrame({
285
- "Object": objects,
286
- "Confidence Score": [f"{score:.2f}" for score in scores]
287
- }) if objects else pd.DataFrame(columns=["Object", "Confidence Score"])
288
- unique_objects_df = pd.DataFrame({
289
- "Unique Object": unique_objects,
290
- "Confidence Score": [f"{score:.2f}" for score in unique_scores]
291
- }) if unique_objects else pd.DataFrame(columns=["Unique Object", "Confidence Score"])
292
- properties_df = pd.DataFrame([properties]) if properties else pd.DataFrame(columns=properties.keys())
293
- return detected_image, objects_df, unique_objects_df, properties_df, ""
294
- except Exception as e:
295
- error_msg = f"Error processing image: {str(e)}"
296
- logger.error(f"{error_msg}\n{traceback.format_exc()}")
297
- return None, None, None, None, error_msg
298
-
299
- submit_btn.click(
300
- fn=process_for_gradio,
301
- inputs=[image_input, image_url_input, model_choice],
302
- outputs=[output_image, objects_output, unique_objects_output, properties_output, error_output]
303
- )
304
-
305
- clear_btn.click(
306
- fn=lambda: [None, "", None, None, None, None],
307
- inputs=None,
308
- outputs=[image_input, image_url_input, output_image, objects_output, unique_objects_output, properties_output, error_output]
309
- )
310
-
311
- with gr.Tab("🔗 URL Input"):
312
- gr.Markdown("### Process Image from URL")
313
- image_url_input = gr.Textbox(
314
- label="🔗 Image URL",
315
- placeholder="https://example.com/image.jpg"
316
- )
317
- url_model_choice = gr.Dropdown(
318
- choices=VALID_MODELS,
319
- value=VALID_MODELS[0],
320
- label="🔎 Select Model"
321
- )
322
- url_model_info = gr.Markdown(
323
- f"**Model Info**: {MODEL_DESCRIPTIONS[VALID_MODELS[0]]}",
324
- visible=True
325
- )
326
- url_submit_btn = gr.Button("🔄 Process URL", variant="primary")
327
- url_output = gr.JSON(label="API Response")
328
-
329
- url_model_choice.change(
330
- fn=lambda model_name: f"**Model Info**: {MODEL_DESCRIPTIONS.get(model_name, 'No description available.')}",
331
- inputs=url_model_choice,
332
- outputs=url_model_info
333
- )
334
-
335
- def process_url_for_gradio(url, model_name):
336
- try:
337
- response = requests.get(url, timeout=10)
338
- response.raise_for_status()
339
- image = Image.open(BytesIO(response.content)).convert("RGB")
340
- detected_image, objects, scores, unique_objects, unique_scores, _ = process(image, model_name)
341
- buffered = BytesIO()
342
- detected_image.save(buffered, format="PNG")
343
- img_base64 = base64.b64encode(buffered.getvalue()).decode("utf-8")
344
- return {
345
- "image_url": f"data:image/png;base64,{img_base64}",
346
- "detected_objects": objects,
347
- "confidence_scores": scores,
348
- "unique_objects": unique_objects,
349
- "unique_confidence_scores": unique_scores
350
- }
351
- except Exception as e:
352
- error_msg = f"Error processing URL: {str(e)}"
353
- logger.error(f"{error_msg}\n{traceback.format_exc()}")
354
- return {"error": error_msg}
355
-
356
- url_submit_btn.click(
357
- fn=process_url_for_gradio,
358
- inputs=[image_url_input, url_model_choice],
359
- outputs=[url_output]
360
- )
361
-
362
- with gr.Tab("ℹ️ Help"):
363
- gr.Markdown(
364
- """
365
- ## How to Use
366
- - **Image Upload**: Select a model, upload an image or provide a URL, and click "Detect" to see detected objects and image properties.
367
- - **URL Input**: Enter an image URL, select a model, and click "Process URL" to get results in JSON format.
368
- - **Models**: Choose from DETR (object detection or panoptic segmentation) or YOLOS (lightweight detection).
369
- - **Clear**: Reset all inputs and outputs using the "Clear" button.
370
- - **Errors**: Check the error box for any processing issues.
371
-
372
- ## Tips
373
- - Use high-quality images for better detection results.
374
- - Panoptic models (e.g., DETR-ResNet-50-panoptic) provide segmentation masks for complex scenes.
375
- - For faster processing, try YOLOS-Tiny on resource-constrained devices.
376
- """
377
- )
378
-
379
- return demo
380
 
381
  if __name__ == "__main__":
382
- demo = create_gradio_ui()
383
- demo.launch()
384
- # To run FastAPI, use: uvicorn object_detection:app --host 0.0.0.0 --port 8000
1
+ import argparse
2
  import base64
3
  import logging
4
  import os
5
+ import sys
6
+ import traceback
7
+ import threading
8
+ from collections import Counter
9
+ from io import BytesIO
10
+ from typing import Any, Dict, List, Optional, Tuple
11
+
12
+ import gradio as gr
13
+ import pandas as pd
14
+ import requests
15
+ import torch
16
+ import uvicorn
17
+ from fastapi import FastAPI, File, Form, HTTPException, UploadFile
18
+ from fastapi.responses import JSONResponse
19
+ from PIL import Image, ImageDraw, ImageStat
20
+ from transformers import (
21
+ DetrForObjectDetection,
22
+ DetrForSegmentation,
23
+ DetrImageProcessor,
24
+ YolosForObjectDetection,
25
+ YolosImageProcessor,
26
+ )
27
+ import nest_asyncio
28
+
29
+ # ------------------------------
30
+ # Configuration
31
+ # ------------------------------
32
 
33
+ # Logging configuration
34
+ logging.basicConfig(
35
+ level=logging.INFO,
36
+ format="%(asctime)s - %(levelname)s - %(message)s",
37
+ )
38
  logger = logging.getLogger(__name__)
39
 
40
+ # Model and processing constants
41
+ CONFIDENCE_THRESHOLD: float = 0.5
42
+ VALID_MODELS: List[str] = [
43
  "facebook/detr-resnet-50",
44
  "facebook/detr-resnet-101",
45
  "facebook/detr-resnet-50-panoptic",
46
  "facebook/detr-resnet-101-panoptic",
47
  "hustvl/yolos-tiny",
48
+ "hustvl/yolos-base",
49
  ]
50
+ MODEL_DESCRIPTIONS: Dict[str, str] = {
51
+ "facebook/detr-resnet-50": (
52
+ "DETR with ResNet-50 backbone for object detection. Fast and accurate for general use."
53
+ ),
54
+ "facebook/detr-resnet-101": (
55
+ "DETR with ResNet-101 backbone for object detection. More accurate but slower than ResNet-50."
56
+ ),
57
+ "facebook/detr-resnet-50-panoptic": (
58
+ "DETR with ResNet-50 for panoptic segmentation. Detects objects and segments scenes."
59
+ ),
60
+ "facebook/detr-resnet-101-panoptic": (
61
+ "DETR with ResNet-101 for panoptic segmentation. High accuracy for complex scenes."
62
+ ),
63
+ "hustvl/yolos-tiny": (
64
+ "YOLOS Tiny model. Lightweight and fast, ideal for resource-constrained environments."
65
+ ),
66
+ "hustvl/yolos-base": (
67
+ "YOLOS Base model. Balances speed and accuracy for object detection."
68
+ ),
69
  }
70
 
71
+ # Port configuration
72
+ DEFAULT_GRADIO_PORT: int = 7860
73
+ DEFAULT_FASTAPI_PORT: int = 8000
74
+ PORT_RANGE: range = range(7860, 7870) # Try ports 7860-7869
75
+ MAX_PORT_ATTEMPTS: int = 10
76
 
77
+ # Thread-safe storage for lazy-loaded models and processors
78
+ models: Dict[str, Any] = {}
79
+ processors: Dict[str, Any] = {}
80
+ model_lock = threading.Lock()
81
 
82
+ # ------------------------------
83
+ # Model Loading
84
+ # ------------------------------
85
+
86
+ def load_model_and_processor(model_name: str) -> Tuple[Any, Any]:
87
+ """
88
+ Load and cache the specified model and processor thread-safely.
89
+
90
+ Args:
91
+ model_name: Name of the model to load (must be in VALID_MODELS).
92
+
93
+ Returns:
94
+ Tuple containing the loaded model and processor.
95
+
96
+ Raises:
97
+ ValueError: If the model_name is invalid or loading fails.
98
+ """
99
+ with model_lock:
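+ # NOTE: the lock is held for the entire load, so concurrent first requests
+ # for the same model wait for a single download instead of duplicating it.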
100
  if model_name not in models:
101
  logger.info(f"Loading model: {model_name}")
102
+ try:
103
+ if "yolos" in model_name:
104
+ models[model_name] = YolosForObjectDetection.from_pretrained(model_name)
105
+ processors[model_name] = YolosImageProcessor.from_pretrained(model_name)
106
+ elif "panoptic" in model_name:
107
+ models[model_name] = DetrForSegmentation.from_pretrained(model_name)
108
+ processors[model_name] = DetrImageProcessor.from_pretrained(model_name)
109
+ else:
110
+ models[model_name] = DetrForObjectDetection.from_pretrained(model_name)
111
+ processors[model_name] = DetrImageProcessor.from_pretrained(model_name)
112
+ logger.debug(f"Model {model_name} loaded successfully")
113
+ except Exception as e:
114
+ logger.error(f"Failed to load model {model_name}: {str(e)}")
115
+ raise ValueError(f"Failed to load model: {str(e)}")
116
+ return models[model_name], processors[model_name]
117
+
118
+ # ------------------------------
119
+ # Image Processing
120
+ # ------------------------------
121
+
122
+ def process(image: Image.Image, model_name: str) -> Tuple[Image.Image, List[str], List[float], List[str], List[float], Dict[str, str]]:
123
+ """
124
+ Process an image for object detection or panoptic segmentation.
125
+
126
+ Args:
127
+ image: PIL Image to process.
128
+ model_name: Name of the model to use (must be in VALID_MODELS).
129
+
130
+ Returns:
131
+ Tuple containing:
132
+ - Annotated image (PIL Image).
133
+ - List of detected object names.
134
+ - List of confidence scores for detected objects.
135
+ - List of unique object names.
136
+ - List of confidence scores for unique objects.
137
+ - Dictionary of image properties (format, size, etc.).
138
 
139
+ Raises:
140
+ ValueError: If the model_name is invalid.
141
+ RuntimeError: If processing fails due to model or image issues.
142
+ """
143
+ if model_name not in VALID_MODELS:
144
+ raise ValueError(f"Invalid model: {model_name}. Choose from: {VALID_MODELS}")
145
+
146
+ try:
147
+ # Load model and processor
148
+ model, processor = load_model_and_processor(model_name)
149
+ logger.debug(f"Processing image with model: {model_name}")
150
+
151
+ # Prepare image for processing
152
+ inputs = processor(images=image, return_tensors="pt")
153
  with torch.no_grad():
154
  outputs = model(**inputs)
155
 
156
+ # Initialize drawing context
157
  draw = ImageDraw.Draw(image)
158
+ object_names: List[str] = []
159
+ confidence_scores: List[float] = []
160
  object_counter = Counter()
161
+ target_sizes = torch.tensor([image.size[::-1]])
162
 
163
+ # Process panoptic segmentation or object detection
164
  if "panoptic" in model_name:
165
  processed_sizes = torch.tensor([[inputs["pixel_values"].shape[2], inputs["pixel_values"].shape[3]]])
166
  results = processor.post_process_panoptic(outputs, target_sizes=target_sizes, processed_sizes=processed_sizes)[0]
170
  label_name = model.config.id2label.get(label, "Unknown")
171
  score = segment.get("score", 1.0)
172
 
173
+ # Apply segmentation mask if available
174
  if "masks" in results and segment["id"] < len(results["masks"]):
175
  mask = results["masks"][segment["id"]].cpu().numpy()
176
  if mask.shape[0] > 0 and mask.shape[1] > 0:
194
  x, y, x2, y2 = box.tolist()
195
  draw.rectangle([x, y, x2, y2], outline="#32CD32", width=2)
196
  label_name = model.config.id2label.get(label.item(), "Unknown")
197
  text = f"{label_name}: {score:.2f}"
198
  text_bbox = draw.textbbox((0, 0), text)
199
  text_width, text_height = text_bbox[2] - text_bbox[0], text_bbox[3] - text_bbox[1]
202
  confidence_scores.append(float(score))
203
  object_counter[label_name] = float(score)
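  # Counter is used as a plain dict here: each label keeps the score of its
  # most recent detection, and its keys feed the "unique objects" outputs.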
204
 
205
+ # Compile unique objects and confidences
206
  unique_objects = list(object_counter.keys())
207
  unique_confidences = [object_counter[obj] for obj in unique_objects]
208
 
209
+ # Calculate image properties
210
+ properties: Dict[str, str] = {
 
211
  "Format": image.format if hasattr(image, "format") and image.format else "Unknown",
212
  "Size": f"{image.width}x{image.height}",
213
  "Width": f"{image.width} px",
214
  "Height": f"{image.height} px",
215
  "Mode": image.mode,
216
+ "Aspect Ratio": (
217
+ f"{round(image.width / image.height, 2)}" if image.height != 0 else "Undefined"
218
+ ),
219
+ "File Size": "Unknown",
220
+ "Mean (R,G,B)": "Unknown",
221
+ "StdDev (R,G,B)": "Unknown",
222
  }
223
 
224
+ # Compute file size
225
+ try:
226
+ buffered = BytesIO()
227
+ image.save(buffered, format="PNG")
228
+ properties["File Size"] = f"{len(buffered.getvalue()) / 1024:.2f} KB"
229
+ except Exception as e:
230
+ logger.error(f"Error calculating file size: {str(e)}")
231
+
232
+ # Compute color statistics
233
+ try:
234
+ stat = ImageStat.Stat(image)
235
+ properties["Mean (R,G,B)"] = ", ".join(f"{m:.2f}" for m in stat.mean)
236
+ properties["StdDev (R,G,B)"] = ", ".join(f"{s:.2f}" for s in stat.stddev)
237
+ except Exception as e:
238
+ logger.error(f"Error calculating color statistics: {str(e)}")
239
+
240
  return image, object_names, confidence_scores, unique_objects, unique_confidences, properties
241
+
242
  except Exception as e:
243
  logger.error(f"Error in process: {str(e)}\n{traceback.format_exc()}")
244
+ raise RuntimeError(f"Failed to process image: {str(e)}")
245
 
246
+ # ------------------------------
247
  # FastAPI Setup
248
+ # ------------------------------
249
+
250
  app = FastAPI(title="Object Detection API")
251
 
252
  @app.post("/detect")
253
  async def detect_objects_endpoint(
254
+ file: Optional[UploadFile] = File(None),
255
+ image_url: Optional[str] = Form(None),
256
+ model_name: str = Form(VALID_MODELS[0]),
257
+ ) -> JSONResponse:
258
+ """
259
+ FastAPI endpoint to detect objects in an image from file upload or URL.
260
+
261
+ Args:
262
+ file: Uploaded image file (optional).
263
+ image_url: URL of the image (optional).
264
+ model_name: Model to use for detection (default: first VALID_MODELS entry).
265
+
266
+ Returns:
267
+ JSONResponse containing the processed image (base64), detected objects, and confidences.
268
+
269
+ Raises:
270
+ HTTPException: If input validation fails or processing errors occur.
271
+ """
272
  try:
273
+ # Validate input
274
  if (file is None and not image_url) or (file is not None and image_url):
275
+ raise HTTPException(
276
+ status_code=400,
277
+ detail="Provide either an image file or an image URL, not both.",
278
+ )
279
 
280
+ # Load image
281
  if file:
282
  if not file.content_type.startswith("image/"):
283
  raise HTTPException(status_code=400, detail="File must be an image")
289
  image = Image.open(BytesIO(response.content)).convert("RGB")
290
 
291
  if model_name not in VALID_MODELS:
292
+ raise HTTPException(
293
+ status_code=400,
294
+ detail=f"Invalid model. Choose from: {VALID_MODELS}",
295
+ )
296
 
297
+ # Process image
298
+ detected_image, detected_objects, detected_confidences, unique_objects, unique_confidences, _ = process(
299
+ image, model_name
300
+ )
301
 
302
+ # Encode image as base64
303
  buffered = BytesIO()
304
  detected_image.save(buffered, format="PNG")
305
  img_base64 = base64.b64encode(buffered.getvalue()).decode("utf-8")
306
  img_url = f"data:image/png;base64,{img_base64}"
307
 
308
+ return JSONResponse(
309
+ content={
310
+ "image_url": img_url,
311
+ "detected_objects": detected_objects,
312
+ "confidence_scores": detected_confidences,
313
+ "unique_objects": unique_objects,
314
+ "unique_confidence_scores": unique_confidences,
315
+ }
316
+ )
317
+
318
+ except requests.RequestException as e:
319
+ logger.error(f"Error fetching image from URL: {str(e)}")
320
+ raise HTTPException(status_code=400, detail=f"Failed to fetch image: {str(e)}")
321
  except Exception as e:
322
  logger.error(f"Error in FastAPI endpoint: {str(e)}\n{traceback.format_exc()}")
323
  raise HTTPException(status_code=500, detail=f"Error processing image: {str(e)}")
324
 
325
+ # ------------------------------
326
+ # Gradio UI Setup
327
+ # ------------------------------
328
+
329
+ def create_gradio_ui() -> gr.Blocks:
330
+ """
331
+ Create and configure the Gradio UI for object detection.
332
+
333
+ Returns:
334
+ Gradio Blocks object representing the UI.
335
+
336
+ Raises:
337
+ RuntimeError: If UI creation fails.
338
+ """
339
+ try:
340
+ with gr.Blocks(theme=gr.themes.Default(primary_hue="blue", secondary_hue="gray")) as demo:
341
+ gr.Markdown(
342
+ f"""
343
+ # 🚀 Object Detection App
344
+ Upload an image or provide a URL to detect objects using state-of-the-art transformer models (DETR, YOLOS).
345
+ Running on port: {os.getenv('GRADIO_SERVER_PORT', 'auto-selected')}
346
+ """
347
+ )
348
+
349
+ with gr.Tabs():
350
+ with gr.Tab("📷 Image Upload"):
351
+ with gr.Row():
352
+ with gr.Column(scale=1):
353
+ gr.Markdown("### Input")
354
+ model_choice = gr.Dropdown(
355
+ choices=VALID_MODELS,
356
+ value=VALID_MODELS[0],
357
+ label="🔎 Select Model",
358
+ info="Choose a model for object detection or panoptic segmentation.",
359
+ )
360
+ model_info = gr.Markdown(
361
+ f"**Model Info**: {MODEL_DESCRIPTIONS[VALID_MODELS[0]]}",
362
+ visible=True,
363
+ )
364
+ image_input = gr.Image(type="pil", label="📷 Upload Image")
365
+ image_url_input = gr.Textbox(
366
+ label="🔗 Image URL",
367
+ placeholder="https://example.com/image.jpg",
368
+ )
369
+ with gr.Row():
370
+ submit_btn = gr.Button("✨ Detect", variant="primary")
371
+ clear_btn = gr.Button("🗑️ Clear", variant="secondary")
372
+
373
+ model_choice.change(
374
+ fn=lambda model_name: (
375
+ f"**Model Info**: {MODEL_DESCRIPTIONS.get(model_name, 'No description available.')}"
376
+ ),
377
+ inputs=model_choice,
378
+ outputs=model_info,
379
+ )
380
+
381
+ with gr.Column(scale=2):
382
+ gr.Markdown("### Results")
383
+ error_output = gr.Textbox(
384
+ label="⚠️ Errors",
385
+ visible=False,
386
+ lines=3,
387
+ max_lines=5,
388
+ )
389
+ output_image = gr.Image(
390
+ type="pil",
391
+ label="🎯 Detected Image",
392
  interactive=False,
393
  )
394
+ with gr.Row():
395
+ objects_output = gr.DataFrame(
396
+ label="📋 Detected Objects",
397
+ interactive=False,
398
+ value=None,
399
+ )
400
+ unique_objects_output = gr.DataFrame(
401
+ label="🔍 Unique Objects",
402
+ interactive=False,
403
+ value=None,
404
+ )
405
+ properties_output = gr.DataFrame(
406
+ label="📄 Image Properties",
407
  interactive=False,
408
+ value=None,
409
  )
410
+
411
+ def process_for_gradio(image: Optional[Image.Image], url: Optional[str], model_name: str) -> Tuple[
412
+ Optional[Image.Image], Optional[pd.DataFrame], Optional[pd.DataFrame], Optional[pd.DataFrame], str
413
+ ]:
414
+ """
415
+ Process image for Gradio UI and return results.
416
+
417
+ Args:
418
+ image: Uploaded PIL Image (optional).
419
+ url: Image URL (optional).
420
+ model_name: Model to use for detection.
421
+
422
+ Returns:
423
+ Tuple of detected image, objects DataFrame, unique objects DataFrame, properties DataFrame, and error message.
424
+ """
425
+ try:
426
+ if image is None and not url:
427
+ return None, None, None, None, "Please provide an image or URL"
428
+ if image and url:
429
+ return None, None, None, None, "Please provide either an image or URL, not both"
430
+
431
+ if url:
432
+ response = requests.get(url, timeout=10)
433
+ response.raise_for_status()
434
+ image = Image.open(BytesIO(response.content)).convert("RGB")
435
+
436
+ detected_image, objects, scores, unique_objects, unique_scores, properties = process(
437
+ image, model_name
438
+ )
439
+ objects_df = (
440
+ pd.DataFrame(
441
+ {
442
+ "Object": objects,
443
+ "Confidence Score": [f"{score:.2f}" for score in scores],
444
+ }
445
+ )
446
+ if objects
447
+ else pd.DataFrame(columns=["Object", "Confidence Score"])
448
+ )
449
+ unique_objects_df = (
450
+ pd.DataFrame(
451
+ {
452
+ "Unique Object": unique_objects,
453
+ "Confidence Score": [f"{score:.2f}" for score in unique_scores],
454
+ }
455
+ )
456
+ if unique_objects
457
+ else pd.DataFrame(columns=["Unique Object", "Confidence Score"])
458
+ )
459
+ properties_df = (
460
+ pd.DataFrame([properties])
461
+ if properties
462
+ else pd.DataFrame(columns=properties.keys())
463
+ )
464
+ return detected_image, objects_df, unique_objects_df, properties_df, ""
465
+
466
+ except requests.RequestException as e:
467
+ error_msg = f"Error fetching image from URL: {str(e)}"
468
+ logger.error(f"{error_msg}\n{traceback.format_exc()}")
469
+ return None, None, None, None, error_msg
470
+ except Exception as e:
471
+ error_msg = f"Error processing image: {str(e)}"
472
+ logger.error(f"{error_msg}\n{traceback.format_exc()}")
473
+ return None, None, None, None, error_msg
474
+
475
+ submit_btn.click(
476
+ fn=process_for_gradio,
477
+ inputs=[image_input, image_url_input, model_choice],
478
+ outputs=[output_image, objects_output, unique_objects_output, properties_output, error_output],
479
+ )
480
+
481
+ clear_btn.click(
482
+ fn=lambda: [None, "", None, None, None, None, ""],
483
+ inputs=None,
484
+ outputs=[
485
+ image_input,
486
+ image_url_input,
487
+ output_image,
488
+ objects_output,
489
+ unique_objects_output,
490
+ properties_output,
491
+ error_output,
492
+ ],
493
+ )
494
+
495
+ with gr.Tab("🔗 JSON Output"):
496
+ gr.Markdown("### Process Image for JSON Output")
497
+ image_input_json = gr.Image(type="pil", label="📷 Upload Image")
498
+ image_url_input_json = gr.Textbox(
499
+ label="🔗 Image URL",
500
+ placeholder="https://example.com/image.jpg",
501
+ )
502
+ url_model_choice = gr.Dropdown(
503
+ choices=VALID_MODELS,
504
+ value=VALID_MODELS[0],
505
+ label="🔎 Select Model",
506
+ )
507
+ url_model_info = gr.Markdown(
508
+ f"**Model Info**: {MODEL_DESCRIPTIONS[VALID_MODELS[0]]}",
509
+ visible=True,
510
+ )
511
+ url_submit_btn = gr.Button("🔄 Process", variant="primary")
512
+ url_output = gr.JSON(label="API Response")
513
+
514
+ url_model_choice.change(
515
+ fn=lambda model_name: (
516
+ f"**Model Info**: {MODEL_DESCRIPTIONS.get(model_name, 'No description available.')}"
517
+ ),
518
+ inputs=url_model_choice,
519
+ outputs=url_model_info,
520
+ )
521
+
522
+ def process_url_for_gradio(image: Optional[Image.Image], url: Optional[str], model_name: str) -> Dict:
523
+ """
524
+ Process image from file or URL for Gradio UI and return JSON response.
525
+
526
+ Args:
527
+ image: Uploaded PIL Image (optional).
528
+ url: Image URL (optional).
529
+ model_name: Model to use for detection.
530
+
531
+ Returns:
532
+ Dictionary with processed image (base64), detected objects, and confidences.
533
+ """
534
+ try:
535
+ if image is None and not url:
536
+ return {"error": "Please provide an image or URL"}
537
+ if image and url:
538
+ return {"error": "Please provide either an image or URL, not both"}
539
+
540
+ if url:
541
+ response = requests.get(url, timeout=10)
542
+ response.raise_for_status()
543
+ image = Image.open(BytesIO(response.content)).convert("RGB")
544
+
545
+ detected_image, objects, scores, unique_objects, unique_scores, _ = process(
546
+ image, model_name
547
+ )
548
+ buffered = BytesIO()
549
+ detected_image.save(buffered, format="PNG")
550
+ img_base64 = base64.b64encode(buffered.getvalue()).decode("utf-8")
551
+ return {
552
+ "image_url": f"data:image/png;base64,{img_base64}",
553
+ "detected_objects": objects,
554
+ "confidence_scores": scores,
555
+ "unique_objects": unique_objects,
556
+ "unique_confidence_scores": unique_scores,
557
+ }
558
+ except requests.RequestException as e:
559
+ error_msg = f"Error fetching image from URL: {str(e)}"
560
+ logger.error(f"{error_msg}\n{traceback.format_exc()}")
561
+ return {"error": error_msg}
562
+ except Exception as e:
563
+ error_msg = f"Error processing image: {str(e)}"
564
+ logger.error(f"{error_msg}\n{traceback.format_exc()}")
565
+ return {"error": error_msg}
566
+
567
+ url_submit_btn.click(
568
+ fn=process_url_for_gradio,
569
+ inputs=[image_input_json, image_url_input_json, url_model_choice],
570
+ outputs=[url_output],
571
+ )
572
+
573
+ with gr.Tab("ℹ️ Help"):
574
+ gr.Markdown(
575
+ """
576
+ ## How to Use
577
+ - **Image Upload**: Select a model, upload an image or provide a URL, and click "Detect" to see detected objects and image properties.
578
+ - **JSON Output**: Upload an image or enter a URL, select a model, and click "Process" to get results in JSON format.
579
+ - **Models**: Choose from DETR (object detection or panoptic segmentation) or YOLOS (lightweight detection).
580
+ - **Clear**: Reset all inputs and outputs using the "Clear" button in the Image Upload tab.
581
+ - **Errors**: Check the error box (Image Upload) or JSON response (JSON Output) for issues.
582
 
583
+ ## Tips
584
+ - Use high-quality images for better detection results.
585
+ - Panoptic models (e.g., DETR-ResNet-50-panoptic) provide segmentation masks for complex scenes.
586
+ - For faster processing, try YOLOS-Tiny on resource-constrained devices.
587
+ """
588
+ )
589
+
590
+ return demo
591
+
592
+ except Exception as e:
593
+ logger.error(f"Error creating Gradio UI: {str(e)}\n{traceback.format_exc()}")
594
+ raise RuntimeError(f"Failed to create Gradio UI: {str(e)}")
595
+
596
+ # ------------------------------
597
+ # Launcher
598
+ # ------------------------------
599
+
600
+ def parse_args() -> argparse.Namespace:
601
+ """
602
+ Parse command-line arguments with defaults and ignore unrecognized arguments.
603
+
604
+ Returns:
605
+ Parsed arguments as a Namespace object.
606
+
607
+ Raises:
608
+ SystemExit: If argument parsing fails (handled by argparse).
609
+ """
610
+ parser = argparse.ArgumentParser(
611
+ description="Launcher for Object Detection App with Gradio UI and optional FastAPI server."
612
+ )
613
+ parser.add_argument(
614
+ "--gradio-port",
615
+ type=int,
616
+ default=DEFAULT_GRADIO_PORT,
617
+ help=f"Port for the Gradio UI (default: {DEFAULT_GRADIO_PORT}).",
618
+ )
619
+ parser.add_argument(
620
+ "--enable-fastapi",
621
+ action="store_true",
622
+ default=False,
623
+ help="Enable the FastAPI server (disabled by default).",
624
+ )
625
+ parser.add_argument(
626
+ "--fastapi-port",
627
+ type=int,
628
+ default=DEFAULT_FASTAPI_PORT,
629
+ help=f"Port for the FastAPI server if enabled (default: {DEFAULT_FASTAPI_PORT}).",
630
+ )
631
+
632
+ # Parse known arguments and ignore unrecognized ones (e.g., Jupyter kernel args)
633
+ args, _ = parser.parse_known_args()
634
+ return args
635
+
636
+ def find_available_port(start_port: int, port_range: range, max_attempts: int) -> Optional[int]:
637
+ """
638
+ Find an available port within the specified range.
639
+
640
+ Args:
641
+ start_port: Initial port to try (e.g., from args or environment).
642
+ port_range: Range of ports to attempt.
643
+ max_attempts: Maximum number of ports to try.
644
+
645
+ Returns:
646
+ Available port number, or None if no port is found.
647
+
648
+ Raises:
649
+ OSError: If port binding fails for reasons other than port in use.
650
+ """
651
+ import errno
+ import socket
652
+
653
+ port = start_port
654
+ attempts = 0
655
+
656
+ # Check environment variable GRADIO_SERVER_PORT
657
+ env_port = os.getenv("GRADIO_SERVER_PORT")
658
+ if env_port and env_port.isdigit():
659
+ port = int(env_port)
660
+ logger.info(f"Using GRADIO_SERVER_PORT from environment: {port}")
661
+
662
+ while attempts < max_attempts:
663
+ with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
664
+ try:
665
+ s.bind(("0.0.0.0", port))
666
+ logger.debug(f"Port {port} is available")
667
+ return port
668
+ except OSError as e:
669
+ if e.errno == errno.EADDRINUSE:  # Address already in use (cross-platform)
670
+ logger.debug(f"Port {port} is in use")
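+ # Try the next candidate, wrapping back to the bottom of PORT_RANGE
+ # once the top of the range has been passed.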
671
+ port = port + 1 if port < max(port_range) else min(port_range)
672
+ attempts += 1
673
+ else:
674
+ raise
675
+ except Exception as e:
676
+ logger.error(f"Error checking port {port}: {str(e)}")
677
+ raise
678
+ logger.error(f"No available port found in range {min(port_range)}-{max(port_range)} after {max_attempts} attempts")
679
+ return None
680
+
681
+ def run_fastapi_server(host: str, port: int) -> None:
682
+ """
683
+ Run the FastAPI server using Uvicorn.
684
+
685
+ Args:
686
+ host: Host address for the FastAPI server.
687
+ port: Port for the FastAPI server.
688
+ """
689
+ try:
690
+ uvicorn.run(app, host=host, port=port)
691
+ except Exception as e:
692
+ logger.error(f"Error running FastAPI server: {str(e)}\n{traceback.format_exc()}")
693
+ sys.exit(1)
694
+
695
+ def main() -> None:
696
+ """
697
+ Main function to launch Gradio UI and optional FastAPI server.
698
+
699
+ Raises:
700
+ SystemExit: If the application is interrupted or encounters an error.
701
+ """
702
+ try:
703
+ # Apply nest_asyncio to allow nested event loops in Jupyter/Colab
704
+ nest_asyncio.apply()
705
+
706
+ # Parse command-line arguments
707
+ args = parse_args()
708
+ logger.info(f"Parsed arguments: {args}")
709
+
710
+ # Find available port for Gradio
711
+ gradio_port = find_available_port(args.gradio_port, PORT_RANGE, MAX_PORT_ATTEMPTS)
712
+ if gradio_port is None:
713
+ logger.error("Failed to find an available port for Gradio UI")
714
+ sys.exit(1)
715
+
716
+ # Launch FastAPI server in a separate thread if enabled
717
+ if args.enable_fastapi:
718
+ logger.info(f"Starting FastAPI server on port {args.fastapi_port}")
719
+ fastapi_thread = threading.Thread(
720
+ target=run_fastapi_server,
721
+ args=("0.0.0.0", args.fastapi_port),
722
+ daemon=True
723
+ )
724
+ fastapi_thread.start()
725
+
726
+ # Launch Gradio UI
727
+ logger.info(f"Starting Gradio UI on port {gradio_port}")
728
+ demo = create_gradio_ui()
729
+ demo.launch(server_port=gradio_port, server_name="0.0.0.0")
730
+
731
+ except KeyboardInterrupt:
732
+ logger.info("Application terminated by user.")
733
+ sys.exit(0)
734
+ except OSError as e:
735
+ logger.error(f"Port binding error: {str(e)}")
736
+ sys.exit(1)
737
+ except Exception as e:
738
+ logger.error(f"Error running application: {str(e)}\n{traceback.format_exc()}")
739
+ sys.exit(1)
740
 
741
  if __name__ == "__main__":
742
+ main()
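
Because `parse_args` relies on `parse_known_args` and `main()` applies `nest_asyncio`, the rewritten launcher can also be driven from a notebook. A minimal sketch, assuming the file above is saved as `app.py` on the notebook's import path:

```python
# Hypothetical notebook usage: unrecognized kernel arguments are ignored
import app

app.main()  # serves the Gradio UI on the first free port in 7860-7869
```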
hf_space/hf_space/README.md CHANGED
@@ -1,54 +1,56 @@
1
  # 🚀 Object Detection with Transformer Models
2
 
3
- This project provides an object detection system using state-of-the-art transformer models, such as **DETR (DEtection TRansformer)** and **YOLOS (You Only Look One-level Series)**. The system can detect objects from uploaded images or image URLs, and it supports different models for detection and segmentation tasks. It includes a Gradio-based web interface and a FastAPI-based API for programmatic access.
4
 
5
- You can try the demo online on Hugging Face: [Demo Link](https://huggingface.co/spaces/NeerajCodz/ObjectDetection).
6
 
7
  ## Models Supported
8
 
9
- The following models are supported, as defined in the application:
10
 
11
  - **DETR (DEtection TRansformer)**:
12
- - `facebook/detr-resnet-50`: DETR with ResNet-50 backbone for object detection. Fast and accurate for general use.
13
- - `facebook/detr-resnet-101`: DETR with ResNet-101 backbone for object detection. More accurate but slower than ResNet-50.
14
- - `facebook/detr-resnet-50-panoptic`(currently has bugs): DETR with ResNet-50 for panoptic segmentation. Detects objects and segments scenes.
15
- - `facebook/detr-resnet-101-panoptic`(currently has bugs): DETR with ResNet-101 for panoptic segmentation. High accuracy for complex scenes.
16
 
17
  - **YOLOS (You Only Look One-level Series)**:
18
- - `hustvl/yolos-tiny`: YOLOS Tiny model. Lightweight and fast, ideal for resource-constrained environments.
19
- - `hustvl/yolos-base`: YOLOS Base model. Balances speed and accuracy for object detection.
20
 
21
  ## Features
22
 
23
- - **Image Upload**: Upload images from your device for object detection via the Gradio interface.
24
- - **URL Input**: Input an image URL for detection through the Gradio interface or API.
25
  - **Model Selection**: Choose between DETR and YOLOS models for detection or panoptic segmentation.
26
- - **Object Detection**: Detects objects and highlights them with bounding boxes and confidence scores.
27
- - **Panoptic Segmentation**: Some models (e.g., DETR panoptic variants) support detailed scene segmentation with colored masks.
28
- - **Image Properties**: Displays image metadata such as format, size, aspect ratio, file size, and color statistics.
29
- - **API Access**: Use the FastAPI endpoint `/detect` to programmatically process images and retrieve detection results.
30
 
31
  ## How to Use
32
 
33
- ### 1. **Normal Git Clone Method**
34
 
35
  Follow these steps to set up the application locally:
36
 
37
  #### Prerequisites
38
 
39
  - Python 3.8 or higher
40
- - Install dependencies using `pip`
41
 
42
  #### Clone the Repository
43
 
44
  ```bash
45
- git clone https://github.com/NeerajCodz/ObjectDetection.git
46
  cd ObjectDetection
47
  ```
48
 
49
  #### Install Dependencies
50
 
51
- Install the required dependencies from `requirements.txt`:
52
 
53
  ```bash
54
  pip install -r requirements.txt
@@ -56,88 +58,150 @@ pip install -r requirements.txt
56
 
57
  #### Run the Application
58
 
59
- Start the FastAPI server using uvicorn:
60
 
61
  ```bash
62
- uvicorn objectdetection:app --reload
63
  ```
64
 
65
- Alternatively, launch the Gradio interface by running the main script:
66
 
67
  ```bash
68
- python app.py
69
  ```
70
 
71
  #### Access the Application
72
 
73
- - For FastAPI: Open your browser and navigate to `http://localhost:8000` to use the API or view the Swagger UI.
74
- - For Gradio: The Gradio interface URL will be displayed in the console (typically `http://127.0.0.1:7860`).
75
 
76
  ### 2. **Running with Docker**
77
 
78
- If you prefer to use Docker to set up and run the application, follow these steps:
79
 
80
  #### Prerequisites
81
 
82
- - Docker installed on your machine. If you don’t have Docker, download and install it from [here](https://www.docker.com/get-started).
83
 
84
- #### Build the Docker Image
85
 
86
- First, clone the repository (if you haven't already):
87
 
88
  ```bash
89
- git clone https://github.com/NeerajCodz/ObjectDetection.git
90
- cd ObjectDetection
91
  ```
92
 
93
- Now, build the Docker image:
94
 
95
  ```bash
96
- docker build -t objectdetection:latest .
97
  ```
98
 
99
- #### Run the Docker Container
100
 
101
- Once the image is built, run the application using this command:
 
103
  ```bash
104
- docker run -p 5000:5000 objectdetection:latest
105
  ```
106
 
107
- This will start the application on port 5000. Open your browser and go to `http://localhost:5000` to access the FastAPI interface.
108
 
109
  ### 3. **Demo**
110
 
111
- You can try the demo directly online through Hugging Face's Spaces:
112
 
113
  [Object Detection Demo](https://huggingface.co/spaces/NeerajCodz/ObjectDetection)
114
 
115
  ## Using the API
116
 
117
- You can interact with the application via the FastAPI `/detect` endpoint to send images and get detection results.
118
 
119
- **Endpoint**: `/detect`
120
 
121
- **POST**: `/detect`
122
 
123
- **Parameters**:
124
 
125
- - `file`: (optional) Image file (must be of type `image/*`).
126
- - `image_url`: (optional) URL of the image.
127
- - `model_name`: (optional) Choose from `facebook/detr-resnet-50`, `hustvl/yolos-tiny`, etc.
128
 
129
- **Example Request Body**:
130
 
131
- ```json
132
- {
133
- "image_url": "https://example.com/image.jpg",
134
- "model_name": "facebook/detr-resnet-50"
135
- }
136
  ```
137
 
138
- **Response**:
139
 
140
- The response includes a base64-encoded image with detections, detected objects, confidence scores, and unique objects with their scores.
141
 
142
  ```json
143
  {
@@ -149,14 +213,20 @@ The response includes a base64-encoded image with detections, detected objects,
149
  }
150
  ```
151
 
152
  ## Development Setup
153
 
154
- If you'd like to contribute or modify the application:
155
 
156
  1. Clone the repository:
157
 
158
  ```bash
159
- git clone https://github.com/NeerajCodz/ObjectDetection.git
160
  cd ObjectDetection
161
  ```
162
 
@@ -166,20 +236,37 @@ cd ObjectDetection
166
  pip install -r requirements.txt
167
  ```
168
 
169
- 3. Run the FastAPI server or Gradio interface:
170
 
171
  ```bash
172
- uvicorn objectdetection:app --reload
173
  ```
174
 
175
- or
176
 
177
  ```bash
178
- python app.py
179
  ```
180
 
181
- 4. Open your browser and navigate to `http://localhost:8000` (FastAPI) or the Gradio URL (typically `http://127.0.0.1:7860`).
182
 
183
  ## Contributing
184
 
185
- Contributions are welcome! Feel free to open issues or submit pull requests for bug fixes or new features on the [GitHub repository](https://github.com/NeerajCodz/ObjectDetection).
1
  # 🚀 Object Detection with Transformer Models
2
 
3
+ This project provides a robust object detection system leveraging state-of-the-art transformer models, including **DETR (DEtection TRansformer)** and **YOLOS (You Only Look One-level Series)**. The system supports object detection and panoptic segmentation from uploaded images or image URLs. It features a user-friendly **Gradio** web interface for interactive use and a **FastAPI** endpoint for programmatic access.
4
 
5
+ Try the online demo on Hugging Face Spaces: [Object Detection Demo](https://huggingface.co/spaces/NeerajCodz/ObjectDetection).
6
 
7
  ## Models Supported
8
 
9
+ The application supports the following models, each tailored for specific detection or segmentation tasks:
10
 
11
  - **DETR (DEtection TRansformer)**:
12
+ - `facebook/detr-resnet-50`: Fast and accurate object detection with a ResNet-50 backbone.
13
+ - `facebook/detr-resnet-101`: Higher accuracy object detection with a ResNet-101 backbone, slower than ResNet-50.
14
+ - `facebook/detr-resnet-50-panoptic`: Panoptic segmentation with ResNet-50 (note: may have stability issues).
15
+ - `facebook/detr-resnet-101-panoptic`: Panoptic segmentation with ResNet-101 (note: may have stability issues).
16
 
17
  - **YOLOS (You Only Look One-level Series)**:
18
+ - `hustvl/yolos-tiny`: Lightweight and fast, ideal for resource-constrained environments.
19
+ - `hustvl/yolos-base`: Balances speed and accuracy for object detection.
20
 
21
  ## Features
22
 
23
+ - **Image Upload**: Upload images via the Gradio interface for object detection.
24
+ - **URL Input**: Provide image URLs for detection through the Gradio interface or API.
25
  - **Model Selection**: Choose between DETR and YOLOS models for detection or panoptic segmentation.
26
+ - **Object Detection**: Highlights detected objects with bounding boxes and confidence scores.
27
+ - **Panoptic Segmentation**: Supports scene segmentation with colored masks (DETR panoptic models).
28
+ - **Image Properties**: Displays metadata like format, size, aspect ratio, file size, and color statistics.
29
+ - **API Access**: Programmatically process images via the FastAPI `/detect` endpoint.
30
+ - **Flexible Deployment**: Run locally, in Docker, or in cloud environments like Google Colab.
31
 
32
  ## How to Use
33
 
34
+ ### 1. **Local Setup (Git Clone)**
35
 
36
  Follow these steps to set up the application locally:
37
 
38
  #### Prerequisites
39
 
40
  - Python 3.8 or higher
41
+ - `pip` for installing dependencies
42
+ - Git for cloning the repository
43
 
44
  #### Clone the Repository
45
 
46
  ```bash
47
+ git clone https://github.com/NeerajCodz/ObjectDetection
48
  cd ObjectDetection
49
  ```
50
 
51
  #### Install Dependencies
52
 
53
+ Install required packages from `requirements.txt`:
54
 
55
  ```bash
56
  pip install -r requirements.txt
58
 
59
  #### Run the Application
60
 
61
+ Launch the Gradio interface:
62
 
63
  ```bash
64
+ python app.py
65
  ```
66
 
67
+ To enable the FastAPI server:
68
 
69
  ```bash
70
+ python app.py --enable-fastapi
71
  ```
72
 
73
  #### Access the Application
74
 
75
+ - **Gradio**: Open the URL displayed in the console (typically `http://127.0.0.1:7860`).
76
+ - **FastAPI**: Navigate to `http://localhost:8000` for the API or Swagger UI (if enabled).
77
 
78
  ### 2. **Running with Docker**
79
 
80
+ Use Docker for a containerized setup.
81
 
82
  #### Prerequisites
83
 
84
+ - Docker installed on your machine. Download from [Docker's official site](https://www.docker.com/get-started).
85
 
86
+ #### Pull the Docker Image
87
 
88
+ Pull the pre-built image from Docker Hub:
89
 
90
  ```bash
91
+ docker pull neerajcodz/objectdetection:latest
92
  ```
93
 
94
+ #### Run the Docker Container
95
+
96
+ Run the application on port 8080:
97
 
98
  ```bash
99
+ docker run -d -p 8080:80 neerajcodz/objectdetection:latest
100
  ```
101
 
102
+ Access the interface at `http://localhost:8080`.
103
+
104
+ #### Build and Run the Docker Image
105
 
106
+ To build the Docker image locally:
107
+
108
+ 1. Ensure you have a `Dockerfile` in the repository root (example provided in the repository).
109
+ 2. Build the image:
110
+
111
+ ```bash
112
+ docker build -t objectdetection:local .
113
+ ```
114
+
115
+ 3. Run the container:
116
 
117
  ```bash
118
+ docker run -d -p 8080:80 objectdetection:local
119
  ```
120
 
121
+ Access the interface at `http://localhost:8080`.
122
 
123
  ### 3. **Demo**
124
 
125
+ Try the demo on Hugging Face Spaces:
126
 
127
  [Object Detection Demo](https://huggingface.co/spaces/NeerajCodz/ObjectDetection)
128
 
129
+ ## Command-Line Arguments
130
+
131
+ The `app.py` script supports the following command-line arguments:
132
+
133
+ - `--gradio-port <port>`: Specify the port for the Gradio UI (default: 7860).
134
+ - Example: `python app.py --gradio-port 7870`
135
+ - `--enable-fastapi`: Enable the FastAPI server (disabled by default).
136
+ - Example: `python app.py --enable-fastapi`
137
+ - `--fastapi-port <port>`: Specify the port for the FastAPI server (default: 8000).
138
+ - Example: `python app.py --enable-fastapi --fastapi-port 8001`
139
+
140
+ You can combine arguments:
141
+
142
+ ```bash
143
+ python app.py --gradio-port 7870 --enable-fastapi --fastapi-port 8001
144
+ ```
145
+
146
+ Alternatively, set the `GRADIO_SERVER_PORT` environment variable:
147
+
148
+ ```bash
149
+ export GRADIO_SERVER_PORT=7870
150
+ python app.py
151
+ ```
152
+
153
  ## Using the API
154
 
155
+ **Note**: The FastAPI API is currently unstable and may require additional configuration for production use.
156
 
157
+ The `/detect` endpoint allows programmatic image processing.
158
 
159
+ ### Running the FastAPI Server
160
 
161
+ Enable FastAPI when launching the script:
162
 
163
+ ```bash
164
+ python app.py --enable-fastapi
165
+ ```
166
 
167
+ Or run FastAPI separately with Uvicorn:
168
 
169
+ ```bash
170
+ uvicorn objectdetection:app --host 0.0.0.0 --port 8000
171
  ```
172
 
173
+ Access the Swagger UI at `http://localhost:8000/docs` for interactive testing.
174
+
175
+ ### Endpoint Details
176
+
177
+ - **Endpoint**: `POST /detect`
178
+ - **Parameters**:
179
+ - `file`: (optional) Image file (must be `image/*` type).
180
+ - `image_url`: (optional) URL of the image.
181
+ - `model_name`: (optional) Model name (e.g., `facebook/detr-resnet-50`, `hustvl/yolos-tiny`).
182
+ - **Content-Type**: `multipart/form-data`; `file`, `image_url`, and `model_name` are all read as form fields by the endpoint.
183
+
184
+ ### Example Requests
185
 
186
+ #### Using `curl` with an Image URL
187
+
188
+ ```bash
189
+ curl -X POST "http://localhost:8000/detect" \
190
+ -F "image_url=https://example.com/image.jpg" \
191
+ -F "model_name=facebook/detr-resnet-50"
192
+ ```
193
+
194
+ #### Using `curl` with an Image File
195
+
196
+ ```bash
197
+ curl -X POST "http://localhost:8000/detect" \
198
+ -F "file=@/path/to/image.jpg" \
199
+ -F "model_name=facebook/detr-resnet-50"
200
+ ```
201
+
202
+ ### Response Format
203
+
204
+ The response includes a base64-encoded image with detections and detection details:
205
 
206
  ```json
207
  {
213
  }
214
  ```
215
 
216
+ ### Notes
217
+
218
+ - Ensure only one of `file` or `image_url` is provided.
219
+ - The API may experience instability with panoptic models; use object detection models for reliability.
220
+ - Test the API using the Swagger UI for easier debugging.
221
+
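+ For use outside the shell, the endpoint can be called with any HTTP client. A minimal Python sketch (not part of the repository; the output filename is arbitrary):
+
+ ```python
+ import base64
+ import requests
+
+ resp = requests.post(
+     "http://localhost:8000/detect",
+     # Sent as form fields, matching the endpoint's Form(...) parameters
+     data={
+         "image_url": "https://example.com/image.jpg",
+         "model_name": "facebook/detr-resnet-50",
+     },
+     timeout=30,
+ )
+ resp.raise_for_status()
+ payload = resp.json()
+
+ # "image_url" in the response is a base64 data URL; decode the part after the comma
+ png_bytes = base64.b64decode(payload["image_url"].split(",", 1)[1])
+ with open("detected.png", "wb") as f:
+     f.write(png_bytes)
+ print(payload["unique_objects"], payload["unique_confidence_scores"])
+ ```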
222
  ## Development Setup
223
 
224
+ To contribute or modify the application:
225
 
226
  1. Clone the repository:
227
 
228
  ```bash
229
+ git clone https://github.com/NeerajCodz/ObjectDetection
230
  cd ObjectDetection
231
  ```
232
 
236
  pip install -r requirements.txt
237
  ```
238
 
239
+ 3. Run the application:
240
 
241
  ```bash
242
+ python app.py
243
  ```
244
 
245
+ Or run FastAPI:
246
 
247
  ```bash
248
+ uvicorn objectdetection:app --host 0.0.0.0 --port 8000
249
  ```
250
 
251
+ 4. Access at `http://localhost:7860` (Gradio) or `http://localhost:8000` (FastAPI).
252
 
253
  ## Contributing
254
 
255
+ Contributions are welcome! To contribute:
256
+
257
+ 1. Fork the repository.
258
+ 2. Create a feature or bugfix branch (`git checkout -b feature/your-feature`).
259
+ 3. Commit changes (`git commit -m "Add your feature"`).
260
+ 4. Push to the branch (`git push origin feature/your-feature`).
261
+ 5. Open a pull request on the [GitHub repository](https://github.com/NeerajCodz/ObjectDetection).
262
+
263
+ Please include tests and documentation for new features. Report issues via GitHub Issues.
264
+
265
+ ## Troubleshooting
266
+
267
+ - **Port Conflicts**: If port 7860 is in use, specify a different port with `--gradio-port` or set `GRADIO_SERVER_PORT`.
268
+ - **Colab Issues**: Use the `--gradio-port` argument or environment variable to avoid port conflicts in Google Colab.
269
+ - **Panoptic Model Bugs**: Avoid `detr-resnet-*-panoptic` models until stability issues are resolved.
270
+ - **API Instability**: Test with smaller images and object detection models first.
271
+
272
+ For further assistance, open an issue on the [GitHub repository](https://github.com/NeerajCodz/ObjectDetection).
hf_space/hf_space/hf_space/README.md CHANGED
@@ -1,12 +1,185 @@
1
- ---
2
- title: ObjectDetection
3
- emoji: 🦀
4
- colorFrom: green
5
- colorTo: yellow
6
- sdk: gradio
7
- sdk_version: 5.29.0
8
- app_file: app.py
9
- pinned: false
10
- ---
11
-
12
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
1
+ # 🚀 Object Detection with Transformer Models
2
+
3
+ This project provides an object detection system using state-of-the-art transformer models, such as **DETR (DEtection TRansformer)** and **YOLOS (You Only Look One-level Series)**. The system can detect objects from uploaded images or image URLs, and it supports different models for detection and segmentation tasks. It includes a Gradio-based web interface and a FastAPI-based API for programmatic access.
4
+
5
+ You can try the demo online on Hugging Face: [Demo Link](https://huggingface.co/spaces/NeerajCodz/ObjectDetection).
6
+
7
+ ## Models Supported
8
+
9
+ The following models are supported, as defined in the application:
10
+
11
+ - **DETR (DEtection TRansformer)**:
12
+ - `facebook/detr-resnet-50`: DETR with ResNet-50 backbone for object detection. Fast and accurate for general use.
13
+ - `facebook/detr-resnet-101`: DETR with ResNet-101 backbone for object detection. More accurate but slower than ResNet-50.
14
+ - `facebook/detr-resnet-50-panoptic` (currently has bugs): DETR with ResNet-50 for panoptic segmentation. Detects objects and segments scenes.
15
+ - `facebook/detr-resnet-101-panoptic` (currently has bugs): DETR with ResNet-101 for panoptic segmentation. High accuracy for complex scenes.
16
+
17
+ - **YOLOS (You Only Look One-level Series)**:
18
+ - `hustvl/yolos-tiny`: YOLOS Tiny model. Lightweight and fast, ideal for resource-constrained environments.
19
+ - `hustvl/yolos-base`: YOLOS Base model. Balances speed and accuracy for object detection.
20
+
21
+ ## Features
22
+
23
+ - **Image Upload**: Upload images from your device for object detection via the Gradio interface.
24
+ - **URL Input**: Input an image URL for detection through the Gradio interface or API.
25
+ - **Model Selection**: Choose between DETR and YOLOS models for detection or panoptic segmentation.
26
+ - **Object Detection**: Detects objects and highlights them with bounding boxes and confidence scores.
27
+ - **Panoptic Segmentation**: Some models (e.g., DETR panoptic variants) support detailed scene segmentation with colored masks.
28
+ - **Image Properties**: Displays image metadata such as format, size, aspect ratio, file size, and color statistics.
29
+ - **API Access**: Use the FastAPI endpoint `/detect` to programmatically process images and retrieve detection results.
30
+
31
+ ## How to Use
32
+
33
+ ### 1. **Normal Git Clone Method**
34
+
35
+ Follow these steps to set up the application locally:
36
+
37
+ #### Prerequisites
38
+
39
+ - Python 3.8 or higher
40
+ - Install dependencies using `pip`
41
+
42
+ #### Clone the Repository
43
+
44
+ ```bash
45
+ git clone https://github.com/NeerajCodz/ObjectDetection.git
46
+ cd ObjectDetection
47
+ ```
48
+
49
+ #### Install Dependencies
50
+
51
+ Install the required dependencies from `requirements.txt`:
52
+
53
+ ```bash
54
+ pip install -r requirements.txt
55
+ ```
56
+
57
+ #### Run the Application
58
+
59
+ Start the FastAPI server using uvicorn:
60
+
61
+ ```bash
62
+ uvicorn objectdetection:app --reload
63
+ ```
64
+
65
+ Alternatively, launch the Gradio interface by running the main script:
66
+
67
+ ```bash
68
+ python app.py
69
+ ```
70
+
71
+ #### Access the Application
72
+
73
+ - For FastAPI: Open your browser and navigate to `http://localhost:8000` to use the API or view the Swagger UI.
74
+ - For Gradio: The Gradio interface URL will be displayed in the console (typically `http://127.0.0.1:7860`).
75
+
76
+ ### 2. **Running with Docker**
77
+
78
+ If you prefer to use Docker to set up and run the application, follow these steps:
79
+
80
+ #### Prerequisites
81
+
82
+ - Docker installed on your machine. If you don’t have Docker, download and install it from [here](https://www.docker.com/get-started).
83
+
84
+ #### Build the Docker Image
85
+
86
+ First, clone the repository (if you haven't already):
87
+
88
+ ```bash
89
+ git clone https://github.com/NeerajCodz/ObjectDetection.git
90
+ cd ObjectDetection
91
+ ```
92
+
93
+ Now, build the Docker image:
94
+
95
+ ```bash
96
+ docker build -t objectdetection:latest .
97
+ ```
98
+
99
+ #### Run the Docker Container
100
+
101
+ Once the image is built, run the application using this command:
102
+
103
+ ```bash
104
+ docker run -p 5000:5000 -e GRADIO_SERVER_NAME=0.0.0.0 -e GRADIO_SERVER_PORT=5000 objectdetection:latest
105
+ ```
106
+
107
+ This starts the Gradio app on port 5000 (the environment variables make Gradio listen on the container's exposed port). Open your browser and go to `http://localhost:5000` to access the interface.
108
+
109
+ ### 3. **Demo**
110
+
111
+ You can try the demo directly online through Hugging Face's Spaces:
112
+
113
+ [Object Detection Demo](https://huggingface.co/spaces/NeerajCodz/ObjectDetection)
114
+
115
+ ## Using the API
116
+
117
+ You can interact with the application via the FastAPI `/detect` endpoint to send images and get detection results.
118
+
119
+ **Endpoint**: `POST /detect`
122
+
123
+ **Parameters**:
124
+
125
+ - `file`: (optional) Image file (must be of type `image/*`).
126
+ - `image_url`: (optional) URL of the image.
127
+ - `model_name`: (optional) Model to use, e.g. `facebook/detr-resnet-50` or `hustvl/yolos-tiny` (default: `facebook/detr-resnet-50`).
128
+
129
+ **Example Request Body** (the endpoint accepts these as form fields; provide either `file` or `image_url`, not both):
130
+
131
+ ```json
132
+ {
133
+ "image_url": "https://example.com/image.jpg",
134
+ "model_name": "facebook/detr-resnet-50"
135
+ }
136
+ ```
137
+
138
+ **Response**:
139
+
140
+ The response contains the annotated image as a base64 data URL, the detected object labels with their confidence scores, and a deduplicated list of unique objects with their scores.
141
+
142
+ ```json
143
+ {
144
+ "image_url": "data:image/png;base64,...",
145
+ "detected_objects": ["person", "car"],
146
+ "confidence_scores": [0.95, 0.87],
147
+ "unique_objects": ["person", "car"],
148
+ "unique_confidence_scores": [0.95, 0.87]
149
+ }
150
+ ```
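+
+ For programmatic use, here is a minimal Python client sketch (it assumes the server is running locally on port 8000; the endpoint reads form fields, so the request sends form data rather than a JSON body):
+
+ ```python
+ import base64
+ import requests
+
+ # Detect objects in an image referenced by URL
+ resp = requests.post(
+     "http://localhost:8000/detect",
+     data={
+         "image_url": "https://example.com/image.jpg",
+         "model_name": "facebook/detr-resnet-50",
+     },
+     timeout=60,
+ )
+ resp.raise_for_status()
+ result = resp.json()
+ print(result["detected_objects"], result["confidence_scores"])
+
+ # The annotated image comes back as a base64 data URL; decode and save it
+ _, b64_data = result["image_url"].split(",", 1)
+ with open("detected.png", "wb") as f:
+     f.write(base64.b64decode(b64_data))
+
+ # To send a local file instead of a URL, use multipart/form-data:
+ # requests.post("http://localhost:8000/detect",
+ #               files={"file": open("image.jpg", "rb")},
+ #               data={"model_name": "facebook/detr-resnet-50"})
+ ```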
151
+
152
+ ## Development Setup
153
+
154
+ If you'd like to contribute or modify the application:
155
+
156
+ 1. Clone the repository:
157
+
158
+ ```bash
159
+ git clone https://github.com/NeerajCodz/ObjectDetection.git
160
+ cd ObjectDetection
161
+ ```
162
+
163
+ 2. Install dependencies:
164
+
165
+ ```bash
166
+ pip install -r requirements.txt
167
+ ```
168
+
169
+ 3. Run the FastAPI server or Gradio interface:
170
+
171
+ ```bash
172
+ uvicorn app:app --reload
173
+ ```
174
+
175
+ or
176
+
177
+ ```bash
178
+ python app.py
179
+ ```
180
+
181
+ 4. Open your browser and navigate to `http://localhost:8000` (FastAPI) or the Gradio URL (typically `http://127.0.0.1:7860`).
182
+
183
+ ## Contributing
184
+
185
+ Contributions are welcome! Feel free to open issues or submit pull requests for bug fixes or new features on the [GitHub repository](https://github.com/NeerajCodz/ObjectDetection).
hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/.huggingface.yaml ADDED
@@ -0,0 +1,7 @@
1
+ sdk: gradio
2
+ python_version: "3.10"
3
+ app_file: app.py
4
+ title: Object Detection App
5
+ subtitle: Real-time object detection in images using Gradio
6
+ hardware: cpu-basic
7
+ license: mit
hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/.github/workflows/docker-build-push.yml ADDED
@@ -0,0 +1,26 @@
1
+ name: Build and Push Docker Image to Docker Hub
2
+
3
+ on:
4
+ push:
5
+ branches:
6
+ - main
7
+
8
+ jobs:
9
+ build-and-push:
10
+ runs-on: ubuntu-latest
11
+ steps:
12
+ - name: Checkout code
13
+ uses: actions/checkout@v4
14
+
15
+ - name: Log in to Docker Hub
16
+ uses: docker/login-action@v3
17
+ with:
18
+ username: ${{ secrets.DOCKER_USERNAME }}
19
+ password: ${{ secrets.DOCKER_PAT }}
20
+
21
+ - name: Build and push Docker image
22
+ uses: docker/build-push-action@v6
23
+ with:
24
+ context: .
25
+ push: true
26
+ tags: ${{ secrets.DOCKER_USERNAME }}/objectdetection:latest
hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/.github/workflows/hf-space-sync.yml ADDED
@@ -0,0 +1,36 @@
1
+ name: Sync to Hugging Face Space
2
+
3
+ on:
4
+ push:
5
+ branches: [ main ]
6
+
7
+ jobs:
8
+ deploy-to-hf-space:
9
+ runs-on: ubuntu-latest
10
+
11
+ steps:
12
+ - name: Checkout Repository
13
+ uses: actions/checkout@v3
14
+
15
+ - name: Install Git
16
+ run: sudo apt-get install git
17
+
18
+ - name: Push to Hugging Face Space
19
+ env:
20
+ HF_TOKEN: ${{ secrets.HF_TOKEN }}
21
+ HF_USERNAME: ${{ secrets.HF_USERNAME }}
22
+ EMAIL: ${{ secrets.EMAIL }}
23
+ run: |
24
+ git config --global user.email $EMAIL
25
+ git config --global user.name $HF_USERNAME
26
+
27
+ git clone https://$HF_USERNAME:[email protected]/spaces/$HF_USERNAME/ObjectDetection hf_space
28
+ rsync -av --exclude='.git' ./ hf_space/
29
+ cd hf_space
30
+ git add .
31
+ if git diff --cached --quiet; then
32
+ echo "✅ No changes to commit."
33
+ else
34
+ git commit -m "Sync from GitHub"
35
+ git push
36
+ fi
hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/.gitignore ADDED
@@ -0,0 +1,5 @@
1
+ __pycache__/
2
+ venv/
3
+ *.pyc
4
+ .DS_Store
5
+ .env
hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/Dockerfile ADDED
@@ -0,0 +1,13 @@
1
+ FROM python:3.11-slim
2
+
3
+ WORKDIR /app
4
+
5
+ COPY requirements.txt .
6
+
7
+ RUN pip install --no-cache-dir -r requirements.txt
8
+
9
+ COPY app.py .
10
+
11
+ # app.py launches the Gradio UI by default; make it listen on all interfaces
+ # at the exposed port so the container is reachable from the host.
+ ENV GRADIO_SERVER_NAME=0.0.0.0 GRADIO_SERVER_PORT=5000
+
+ EXPOSE 5000
12
+
13
+ CMD ["python", "app.py"]
hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/LICENSE ADDED
@@ -0,0 +1,21 @@
1
+ MIT License
2
+
3
+ Copyright (c) 2025 Neeraj Sathish Kumar
4
+
5
+ Permission is hereby granted, free of charge, to any person obtaining a copy
6
+ of this software and associated documentation files (the "Software"), to deal
7
+ in the Software without restriction, including without limitation the rights
8
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9
+ copies of the Software, and to permit persons to whom the Software is
10
+ furnished to do so, subject to the following conditions:
11
+
12
+ The above copyright notice and this permission notice shall be included in all
13
+ copies or substantial portions of the Software.
14
+
15
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21
+ SOFTWARE.
hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/app.py ADDED
@@ -0,0 +1,384 @@
1
+ import gradio as gr
2
+ import torch
3
+ from transformers import DetrImageProcessor, DetrForObjectDetection
4
+ from transformers import YolosImageProcessor, YolosForObjectDetection
5
+ from transformers import DetrForSegmentation
6
+ from PIL import Image, ImageDraw, ImageStat
7
+ import requests
8
+ from io import BytesIO
9
+ import base64
10
+ from collections import Counter
11
+ import logging
12
+ from fastapi import FastAPI, File, UploadFile, HTTPException, Form
13
+ from fastapi.responses import JSONResponse
14
+ import uvicorn
15
+ import pandas as pd
16
+ import traceback
17
+ import os
18
+
19
+ # Set up logging
20
+ logging.basicConfig(level=logging.INFO, format="%(asctime)s - %(levelname)s - %(message)s")
21
+ logger = logging.getLogger(__name__)
22
+
23
+ # Constants
24
+ CONFIDENCE_THRESHOLD = 0.5
25
+ VALID_MODELS = [
26
+ "facebook/detr-resnet-50",
27
+ "facebook/detr-resnet-101",
28
+ "facebook/detr-resnet-50-panoptic",
29
+ "facebook/detr-resnet-101-panoptic",
30
+ "hustvl/yolos-tiny",
31
+ "hustvl/yolos-base"
32
+ ]
33
+ MODEL_DESCRIPTIONS = {
34
+ "facebook/detr-resnet-50": "DETR with ResNet-50 backbone for object detection. Fast and accurate for general use.",
35
+ "facebook/detr-resnet-101": "DETR with ResNet-101 backbone for object detection. More accurate but slower than ResNet-50.",
36
+ "facebook/detr-resnet-50-panoptic": "DETR with ResNet-50 for panoptic segmentation. Detects objects and segments scenes.",
37
+ "facebook/detr-resnet-101-panoptic": "DETR with ResNet-101 for panoptic segmentation. High accuracy for complex scenes.",
38
+ "hustvl/yolos-tiny": "YOLOS Tiny model. Lightweight and fast, ideal for resource-constrained environments.",
39
+ "hustvl/yolos-base": "YOLOS Base model. Balances speed and accuracy for object detection."
40
+ }
41
+
42
+ # Lazy model loading
43
+ models = {}
44
+ processors = {}
45
+
46
+ def process(image, model_name):
47
+ """Process an image and return detected image, objects, confidences, unique objects, unique confidences, and properties."""
48
+ try:
49
+ if model_name not in VALID_MODELS:
50
+ raise ValueError(f"Invalid model: {model_name}. Choose from: {VALID_MODELS}")
51
+
52
+ # Load model and processor
53
+ if model_name not in models:
54
+ logger.info(f"Loading model: {model_name}")
55
+ if "yolos" in model_name:
56
+ models[model_name] = YolosForObjectDetection.from_pretrained(model_name)
57
+ processors[model_name] = YolosImageProcessor.from_pretrained(model_name)
58
+ elif "panoptic" in model_name:
59
+ models[model_name] = DetrForSegmentation.from_pretrained(model_name)
60
+ processors[model_name] = DetrImageProcessor.from_pretrained(model_name)
61
+ else:
62
+ models[model_name] = DetrForObjectDetection.from_pretrained(model_name)
63
+ processors[model_name] = DetrImageProcessor.from_pretrained(model_name)
64
+
65
+ model, processor = models[model_name], processors[model_name]
66
+ inputs = processor(images=image, return_tensors="pt")
67
+
68
+ with torch.no_grad():
69
+ outputs = model(**inputs)
70
+
71
+ target_sizes = torch.tensor([image.size[::-1]])
72
+ draw = ImageDraw.Draw(image)
73
+ object_names = []
74
+ confidence_scores = []
75
+ object_counter = Counter()
76
+
77
+ if "panoptic" in model_name:
78
+ processed_sizes = torch.tensor([[inputs["pixel_values"].shape[2], inputs["pixel_values"].shape[3]]])
79
+ results = processor.post_process_panoptic(outputs, target_sizes=target_sizes, processed_sizes=processed_sizes)[0]
80
+
81
+ for segment in results["segments_info"]:
82
+ label = segment["label_id"]
83
+ label_name = model.config.id2label.get(label, "Unknown")
84
+ score = segment.get("score", 1.0)
85
+
86
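+ # NOTE: post_process_panoptic typically encodes masks in a "png_string" rather than
+ # exposing a "masks" list, so this branch may be skipped silently; this is likely
+ # one source of the known panoptic-model bugs.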
+ if "masks" in results and segment["id"] < len(results["masks"]):
87
+ mask = results["masks"][segment["id"]].cpu().numpy()
88
+ if mask.shape[0] > 0 and mask.shape[1] > 0:
89
+ mask_image = Image.fromarray((mask * 255).astype("uint8"))
90
+ colored_mask = Image.new("RGBA", image.size, (0, 0, 0, 0))
91
+ mask_draw = ImageDraw.Draw(colored_mask)
92
+ r, g, b = (segment["id"] * 50) % 255, (segment["id"] * 100) % 255, (segment["id"] * 150) % 255
93
+ mask_draw.bitmap((0, 0), mask_image, fill=(r, g, b, 128))
94
+ image = Image.alpha_composite(image.convert("RGBA"), colored_mask).convert("RGB")
95
+ draw = ImageDraw.Draw(image)
96
+
97
+ if score > CONFIDENCE_THRESHOLD:
98
+ object_names.append(label_name)
99
+ confidence_scores.append(float(score))
100
+ object_counter[label_name] = max(object_counter[label_name], float(score))  # keep the highest score per label
101
+ else:
102
+ results = processor.post_process_object_detection(outputs, target_sizes=target_sizes)[0]
103
+
104
+ for score, label, box in zip(results["scores"], results["labels"], results["boxes"]):
105
+ if score > CONFIDENCE_THRESHOLD:
106
+ x, y, x2, y2 = box.tolist()
107
+ draw.rectangle([x, y, x2, y2], outline="#32CD32", width=2)
108
+ label_name = model.config.id2label.get(label.item(), "Unknown")
109
+ # Place text at top-right corner, outside the box, with smaller size
110
+ text = f"{label_name}: {score:.2f}"
111
+ text_bbox = draw.textbbox((0, 0), text)
112
+ text_width, text_height = text_bbox[2] - text_bbox[0], text_bbox[3] - text_bbox[1]
113
+ draw.text((x2 - text_width - 2, y - text_height - 2), text, fill="#32CD32")
114
+ object_names.append(label_name)
115
+ confidence_scores.append(float(score))
116
+ object_counter[label_name] = max(object_counter[label_name], float(score))  # keep the highest score per label
117
+
118
+ unique_objects = list(object_counter.keys())
119
+ unique_confidences = [object_counter[obj] for obj in unique_objects]
120
+
121
+ # Image properties
122
+ file_size = "Unknown"
123
+ if hasattr(image, "fp") and image.fp is not None:
124
+ buffered = BytesIO()
125
+ image.save(buffered, format="PNG")
126
+ file_size = f"{len(buffered.getvalue()) / 1024:.2f} KB"
127
+
128
+ # Color statistics
129
+ try:
130
+ stat = ImageStat.Stat(image)
131
+ color_stats = {
132
+ "mean": [f"{m:.2f}" for m in stat.mean],
133
+ "stddev": [f"{s:.2f}" for s in stat.stddev]
134
+ }
135
+ except Exception as e:
136
+ logger.error(f"Error calculating color statistics: {str(e)}")
137
+ color_stats = {"mean": "Error", "stddev": "Error"}
138
+
139
+ properties = {
140
+ "Format": image.format if hasattr(image, "format") and image.format else "Unknown",
141
+ "Size": f"{image.width}x{image.height}",
142
+ "Width": f"{image.width} px",
143
+ "Height": f"{image.height} px",
144
+ "Mode": image.mode,
145
+ "Aspect Ratio": f"{round(image.width / image.height, 2) if image.height != 0 else 'Undefined'}",
146
+ "File Size": file_size,
147
+ "Mean (R,G,B)": ", ".join(color_stats["mean"]) if isinstance(color_stats["mean"], list) else color_stats["mean"],
148
+ "StdDev (R,G,B)": ", ".join(color_stats["stddev"]) if isinstance(color_stats["stddev"], list) else color_stats["stddev"]
149
+ }
150
+
151
+ return image, object_names, confidence_scores, unique_objects, unique_confidences, properties
152
+ except Exception as e:
153
+ logger.error(f"Error in process: {str(e)}\n{traceback.format_exc()}")
154
+ raise
155
+
156
+ # FastAPI Setup
157
+ app = FastAPI(title="Object Detection API")
158
+
159
+ @app.post("/detect")
160
+ async def detect_objects_endpoint(
161
+ file: UploadFile = File(None),
162
+ image_url: str = Form(None),
163
+ model_name: str = Form(VALID_MODELS[0])
164
+ ):
165
+ """FastAPI endpoint to detect objects in an image from file or URL."""
166
+ try:
167
+ if (file is None and not image_url) or (file is not None and image_url):
168
+ raise HTTPException(status_code=400, detail="Provide either an image file or an image URL, but not both.")
169
+
170
+ if file:
171
+ if not file.content_type.startswith("image/"):
172
+ raise HTTPException(status_code=400, detail="File must be an image")
173
+ contents = await file.read()
174
+ image = Image.open(BytesIO(contents)).convert("RGB")
175
+ else:
176
+ response = requests.get(image_url, timeout=10)
177
+ response.raise_for_status()
178
+ image = Image.open(BytesIO(response.content)).convert("RGB")
179
+
180
+ if model_name not in VALID_MODELS:
181
+ raise HTTPException(status_code=400, detail=f"Invalid model. Choose from: {VALID_MODELS}")
182
+
183
+ detected_image, detected_objects, detected_confidences, unique_objects, unique_confidences, _ = process(image, model_name)
184
+
185
+ buffered = BytesIO()
186
+ detected_image.save(buffered, format="PNG")
187
+ img_base64 = base64.b64encode(buffered.getvalue()).decode("utf-8")
188
+ img_url = f"data:image/png;base64,{img_base64}"
189
+
190
+ return JSONResponse(content={
191
+ "image_url": img_url,
192
+ "detected_objects": detected_objects,
193
+ "confidence_scores": detected_confidences,
194
+ "unique_objects": unique_objects,
195
+ "unique_confidence_scores": unique_confidences
196
+ })
197
+ except Exception as e:
198
+ logger.error(f"Error in FastAPI endpoint: {str(e)}\n{traceback.format_exc()}")
199
+ raise HTTPException(status_code=500, detail=f"Error processing image: {str(e)}")
200
+
201
+ # Gradio UI
202
+ def create_gradio_ui():
203
+ with gr.Blocks(theme=gr.themes.Default(primary_hue="blue", secondary_hue="gray")) as demo:
204
+ gr.Markdown(
205
+ """
206
+ # 🚀 Object Detection App
207
+ Upload an image or provide a URL to detect objects using state-of-the-art transformer models (DETR, YOLOS).
208
+ """
209
+ )
210
+
211
+ with gr.Tabs():
212
+ with gr.Tab("📷 Image Upload"):
213
+ with gr.Row():
214
+ with gr.Column(scale=1):
215
+ gr.Markdown("### Input")
216
+ model_choice = gr.Dropdown(
217
+ choices=VALID_MODELS,
218
+ value=VALID_MODELS[0],
219
+ label="🔎 Select Model",
220
+ info="Choose a model for object detection or panoptic segmentation."
221
+ )
222
+ model_info = gr.Markdown(
223
+ f"**Model Info**: {MODEL_DESCRIPTIONS[VALID_MODELS[0]]}",
224
+ visible=True
225
+ )
226
+ image_input = gr.Image(type="pil", label="📷 Upload Image")
227
+ image_url_input = gr.Textbox(
228
+ label="🔗 Image URL",
229
+ placeholder="https://example.com/image.jpg"
230
+ )
231
+ with gr.Row():
232
+ submit_btn = gr.Button("✨ Detect", variant="primary")
233
+ clear_btn = gr.Button("🗑️ Clear", variant="secondary")
234
+
235
+ model_choice.change(
236
+ fn=lambda model_name: f"**Model Info**: {MODEL_DESCRIPTIONS.get(model_name, 'No description available.')}",
237
+ inputs=model_choice,
238
+ outputs=model_info
239
+ )
240
+
241
+ with gr.Column(scale=2):
242
+ gr.Markdown("### Results")
243
+ error_output = gr.Textbox(
244
+ label="⚠️ Errors",
245
+ visible=False,
246
+ lines=3,
247
+ max_lines=5
248
+ )
249
+ output_image = gr.Image(
250
+ type="pil",
251
+ label="🎯 Detected Image",
252
+ interactive=False
253
+ )
254
+ with gr.Row():
255
+ objects_output = gr.DataFrame(
256
+ label="📋 Detected Objects",
257
+ interactive=False,
258
+ value=None
259
+ )
260
+ unique_objects_output = gr.DataFrame(
261
+ label="🔍 Unique Objects",
262
+ interactive=False,
263
+ value=None
264
+ )
265
+ properties_output = gr.DataFrame(
266
+ label="📄 Image Properties",
267
+ interactive=False,
268
+ value=None
269
+ )
270
+
271
+ def process_for_gradio(image, url, model_name):
272
+ try:
273
+ if image is None and not url:
274
+ return None, None, None, None, "Please provide an image or URL"
275
+ if image and url:
276
+ return None, None, None, None, "Please provide either an image or URL, not both"
277
+
278
+ if url:
279
+ response = requests.get(url, timeout=10)
280
+ response.raise_for_status()
281
+ image = Image.open(BytesIO(response.content)).convert("RGB")
282
+
283
+ detected_image, objects, scores, unique_objects, unique_scores, properties = process(image, model_name)
284
+ objects_df = pd.DataFrame({
285
+ "Object": objects,
286
+ "Confidence Score": [f"{score:.2f}" for score in scores]
287
+ }) if objects else pd.DataFrame(columns=["Object", "Confidence Score"])
288
+ unique_objects_df = pd.DataFrame({
289
+ "Unique Object": unique_objects,
290
+ "Confidence Score": [f"{score:.2f}" for score in unique_scores]
291
+ }) if unique_objects else pd.DataFrame(columns=["Unique Object", "Confidence Score"])
292
+ properties_df = pd.DataFrame([properties]) if properties else pd.DataFrame(columns=properties.keys())
293
+ return detected_image, objects_df, unique_objects_df, properties_df, ""
294
+ except Exception as e:
295
+ error_msg = f"Error processing image: {str(e)}"
296
+ logger.error(f"{error_msg}\n{traceback.format_exc()}")
297
+ return None, None, None, None, error_msg
298
+
299
+ submit_btn.click(
300
+ fn=process_for_gradio,
301
+ inputs=[image_input, image_url_input, model_choice],
302
+ outputs=[output_image, objects_output, unique_objects_output, properties_output, error_output]
303
+ )
304
+
305
+ clear_btn.click(
306
+ fn=lambda: [None, "", None, None, None, None, ""],  # one value per output component, including the error box
307
+ inputs=None,
308
+ outputs=[image_input, image_url_input, output_image, objects_output, unique_objects_output, properties_output, error_output]
309
+ )
310
+
311
+ with gr.Tab("🔗 URL Input"):
312
+ gr.Markdown("### Process Image from URL")
313
+ image_url_input = gr.Textbox(
314
+ label="🔗 Image URL",
315
+ placeholder="https://example.com/image.jpg"
316
+ )
317
+ url_model_choice = gr.Dropdown(
318
+ choices=VALID_MODELS,
319
+ value=VALID_MODELS[0],
320
+ label="🔎 Select Model"
321
+ )
322
+ url_model_info = gr.Markdown(
323
+ f"**Model Info**: {MODEL_DESCRIPTIONS[VALID_MODELS[0]]}",
324
+ visible=True
325
+ )
326
+ url_submit_btn = gr.Button("🔄 Process URL", variant="primary")
327
+ url_output = gr.JSON(label="API Response")
328
+
329
+ url_model_choice.change(
330
+ fn=lambda model_name: f"**Model Info**: {MODEL_DESCRIPTIONS.get(model_name, 'No description available.')}",
331
+ inputs=url_model_choice,
332
+ outputs=url_model_info
333
+ )
334
+
335
+ def process_url_for_gradio(url, model_name):
336
+ try:
337
+ response = requests.get(url, timeout=10)
338
+ response.raise_for_status()
339
+ image = Image.open(BytesIO(response.content)).convert("RGB")
340
+ detected_image, objects, scores, unique_objects, unique_scores, _ = process(image, model_name)
341
+ buffered = BytesIO()
342
+ detected_image.save(buffered, format="PNG")
343
+ img_base64 = base64.b64encode(buffered.getvalue()).decode("utf-8")
344
+ return {
345
+ "image_url": f"data:image/png;base64,{img_base64}",
346
+ "detected_objects": objects,
347
+ "confidence_scores": scores,
348
+ "unique_objects": unique_objects,
349
+ "unique_confidence_scores": unique_scores
350
+ }
351
+ except Exception as e:
352
+ error_msg = f"Error processing URL: {str(e)}"
353
+ logger.error(f"{error_msg}\n{traceback.format_exc()}")
354
+ return {"error": error_msg}
355
+
356
+ url_submit_btn.click(
357
+ fn=process_url_for_gradio,
358
+ inputs=[image_url_input, url_model_choice],
359
+ outputs=[url_output]
360
+ )
361
+
362
+ with gr.Tab("ℹ️ Help"):
363
+ gr.Markdown(
364
+ """
365
+ ## How to Use
366
+ - **Image Upload**: Select a model, upload an image or provide a URL, and click "Detect" to see detected objects and image properties.
367
+ - **URL Input**: Enter an image URL, select a model, and click "Process URL" to get results in JSON format.
368
+ - **Models**: Choose from DETR (object detection or panoptic segmentation) or YOLOS (lightweight detection).
369
+ - **Clear**: Reset all inputs and outputs using the "Clear" button.
370
+ - **Errors**: Check the error box for any processing issues.
371
+
372
+ ## Tips
373
+ - Use high-quality images for better detection results.
374
+ - Panoptic models (e.g., DETR-ResNet-50-panoptic) provide segmentation masks for complex scenes.
375
+ - For faster processing, try YOLOS-Tiny on resource-constrained devices.
376
+ """
377
+ )
378
+
379
+ return demo
380
+
381
+ if __name__ == "__main__":
382
+ demo = create_gradio_ui()
383
+ demo.launch()
384
+ # To run the FastAPI server instead, use: uvicorn app:app --host 0.0.0.0 --port 8000
hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/.gitattributes ADDED
@@ -0,0 +1,35 @@
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
+ *.model filter=lfs diff=lfs merge=lfs -text
13
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
14
+ *.npy filter=lfs diff=lfs merge=lfs -text
15
+ *.npz filter=lfs diff=lfs merge=lfs -text
16
+ *.onnx filter=lfs diff=lfs merge=lfs -text
17
+ *.ot filter=lfs diff=lfs merge=lfs -text
18
+ *.parquet filter=lfs diff=lfs merge=lfs -text
19
+ *.pb filter=lfs diff=lfs merge=lfs -text
20
+ *.pickle filter=lfs diff=lfs merge=lfs -text
21
+ *.pkl filter=lfs diff=lfs merge=lfs -text
22
+ *.pt filter=lfs diff=lfs merge=lfs -text
23
+ *.pth filter=lfs diff=lfs merge=lfs -text
24
+ *.rar filter=lfs diff=lfs merge=lfs -text
25
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
26
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
27
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar filter=lfs diff=lfs merge=lfs -text
29
+ *.tflite filter=lfs diff=lfs merge=lfs -text
30
+ *.tgz filter=lfs diff=lfs merge=lfs -text
31
+ *.wasm filter=lfs diff=lfs merge=lfs -text
32
+ *.xz filter=lfs diff=lfs merge=lfs -text
33
+ *.zip filter=lfs diff=lfs merge=lfs -text
34
+ *.zst filter=lfs diff=lfs merge=lfs -text
35
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/README.md ADDED
@@ -0,0 +1,12 @@
1
+ ---
2
+ title: ObjectDetection
3
+ emoji: 🦀
4
+ colorFrom: green
5
+ colorTo: yellow
6
+ sdk: gradio
7
+ sdk_version: 5.29.0
8
+ app_file: app.py
9
+ pinned: false
10
+ ---
11
+
12
+ Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/hf_space/requirements.txt ADDED
@@ -0,0 +1,8 @@
1
+ transformers
2
+ torch
3
+ tensorflow
4
+ gradio
5
+ pillow
6
+ timm
7
+ fastapi
8
+ requests
requirements.txt CHANGED
@@ -5,4 +5,7 @@ gradio
5
  pillow
6
  timm
7
  fastapi
8
- requests
5
  pillow
6
  timm
7
  fastapi
8
+ requests
9
+ uvicorn
10
+ pandas
11
+ nest_asyncio