Commit fa89df1 by x-lai
1 Parent(s): 5c19260
Update README.md
Former-commit-id: 82ee63759c5d6daa1873bb7951736a7c370819f5
README.md
CHANGED
@@ -2,6 +2,8 @@
 
 This is the official implementation of ***LISA(large Language Instructed Segmentation Assistant)***.
 
+<p align="center"> <img src="imgs/fig_teaser4_crop.png" width="100%"> </p>
+
 ## News
 - [x] [2023.8.2] Paper is released and github repo is created.
 
@@ -17,7 +19,6 @@ This is the official implementation of ***LISA(large Language Instructed Segment
 4. multi-turn conversation.
 
 **LISA** also demonstrates robust zero-shot capability when trained exclusively on reasoning-free datasets. In addition, fine-tuning the model with merely 239 reasoning segmentation image-instruction pairs results in further performance enhancement.
-<p align="center"> <img src="imgs/fig_teaser4_crop.png" width="100%"> </p>
 
 ## Abstract
 In this work, we propose a new segmentation task --- ***reasoning segmentation***. The task is designed to output a segmentation mask given a complex and implicit query text. We establish a benchmark comprising over one thousand image-instruction pairs, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: Large-language Instructed Segmentation Assistant, which inherits the language generation capabilities of the multi-modal Large Language Model (LLM) while also possessing the ability to produce segmentation masks.