x-lai commited on
Commit
fa89df1
·
1 Parent(s): 5c19260

Update README.md

Browse files

Former-commit-id: 82ee63759c5d6daa1873bb7951736a7c370819f5

Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -2,6 +2,8 @@
2
 
3
  This is the official implementation of ***LISA(large Language Instructed Segmentation Assistant)***.
4
 
 
 
5
  ## News
6
  - [x] [2023.8.2] Paper is released and github repo is created.
7
 
@@ -17,7 +19,6 @@ This is the official implementation of ***LISA(large Language Instructed Segment
17
  4. multi-turn conversation.
18
 
19
  **LISA** also demonstrates robust zero-shot capability when trained exclusively on reasoning-free datasets. In addition, fine-tuning the model with merely 239 reasoning segmentation image-instruction pairs results in further performance enhancement.
20
- <p align="center"> <img src="imgs/fig_teaser4_crop.png" width="100%"> </p>
21
 
22
  ## Abstract
23
  In this work, we propose a new segmentation task --- ***reasoning segmentation***. The task is designed to output a segmentation mask given a complex and implicit query text. We establish a benchmark comprising over one thousand image-instruction pairs, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: Large-language Instructed Segmentation Assistant, which inherits the language generation capabilities of the multi-modal Large Language Model (LLM) while also possessing the ability to produce segmentation masks.
 
2
 
3
  This is the official implementation of ***LISA(large Language Instructed Segmentation Assistant)***.
4
 
5
+ <p align="center"> <img src="imgs/fig_teaser4_crop.png" width="100%"> </p>
6
+
7
  ## News
8
  - [x] [2023.8.2] Paper is released and github repo is created.
9
 
 
19
  4. multi-turn conversation.
20
 
21
  **LISA** also demonstrates robust zero-shot capability when trained exclusively on reasoning-free datasets. In addition, fine-tuning the model with merely 239 reasoning segmentation image-instruction pairs results in further performance enhancement.
 
22
 
23
  ## Abstract
24
  In this work, we propose a new segmentation task --- ***reasoning segmentation***. The task is designed to output a segmentation mask given a complex and implicit query text. We establish a benchmark comprising over one thousand image-instruction pairs, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: Large-language Instructed Segmentation Assistant, which inherits the language generation capabilities of the multi-modal Large Language Model (LLM) while also possessing the ability to produce segmentation masks.