lzyhha commited on
Commit
ab35982
·
1 Parent(s): 1c029a6
Files changed (2) hide show
  1. app.py +3 -3
  2. visualcloze.py +0 -1
app.py CHANGED
@@ -35,9 +35,9 @@ Click the task button in the right bottom to acquire **examples** of various tas
35
  The runtime on the Zero GPU runtime depends on the size of the image grid.
36
  When generating an image with the resoluation of 1024,
37
  the runtime is approximately **[45s for a 2x2 grid], [55s for a 2x3 grid], [70s for a 3x3 grid], [90s for a 3x4 grid]**.
38
- When generating three images in a 3x4 grid, i.e., Image to Depth + Normal + Hed,
39
- the runtime is approximately **110s**.
40
- **Deploying locally with an 80G A100 can reduce the runtime by more than half.**
41
 
42
  ### Note: For better quality, you can deploy the demo locally using the [model](https://huggingface.co/VisualCloze/VisualCloze/blob/main/visualcloze-512-lora.pth), which supports a higher resolution than this online demo, by following the instructions in the [GitHub repository](https://github.com/lzyhha/VisualCloze/tree/main?tab=readme-ov-file#2-web-demo-gradio).
43
  """
 
35
  The runtime on the Zero GPU runtime depends on the size of the image grid.
36
  When generating an image with the resoluation of 1024,
37
  the runtime is approximately **[45s for a 2x2 grid], [55s for a 2x3 grid], [70s for a 3x3 grid], [90s for a 3x4 grid]**.
38
+ **Deploying locally with an 80G A100 can reduce the runtime by more than half.**
39
+ Disabling SDEdit upsampling by setting the upsampling noise to 1 or reducing the upsampling steps
40
+ can also save computation time, but it may lead to a decrease in generation quality.
41
 
42
  ### Note: For better quality, you can deploy the demo locally using the [model](https://huggingface.co/VisualCloze/VisualCloze/blob/main/visualcloze-512-lora.pth), which supports a higher resolution than this online demo, by following the instructions in the [GitHub repository](https://github.com/lzyhha/VisualCloze/tree/main?tab=readme-ov-file#2-web-demo-gradio).
43
  """
visualcloze.py CHANGED
@@ -450,7 +450,6 @@ class VisualClozeModel:
450
  row_end = grid_h * grid_w
451
  for i in range(row_start, row_end):
452
  # when the image is masked, then output it
453
- print(mask_position, i, row_start, is_upsampling, grid_h, grid_w)
454
  if mask_position[i - row_start] and is_upsampling:
455
  cropped = output_images[-1].crop(((i - row_start) * ret_w // self.grid_w, 0, ((i - row_start) + 1) * ret_w // self.grid_w, ret_h))
456
  upsampled = self.upsampling(
 
450
  row_end = grid_h * grid_w
451
  for i in range(row_start, row_end):
452
  # when the image is masked, then output it
 
453
  if mask_position[i - row_start] and is_upsampling:
454
  cropped = output_images[-1].crop(((i - row_start) * ret_w // self.grid_w, 0, ((i - row_start) + 1) * ret_w // self.grid_w, ret_h))
455
  upsampled = self.upsampling(