Xin Lai
committed on
Update README.md
README.md
CHANGED
@@ -18,7 +18,7 @@
 - [ ] ReasonSeg Dataset Release
 - [ ] Training Code Release
 
-**LISA: Reasoning Segmentation Via Large Language Model [[Paper](https://arxiv.org/
+**LISA: Reasoning Segmentation Via Large Language Model [[Paper](https://arxiv.org/abs/2308.00692)]** <br />
 [Xin Lai](https://scholar.google.com/citations?user=tqNDPA4AAAAJ&hl=zh-CN),
 [Zhuotao Tian](https://scholar.google.com/citations?user=mEjhz-IAAAAJ&hl=en),
 [Yukang Chen](https://scholar.google.com/citations?user=6p0ygKUAAAAJ&hl=en),
@@ -29,7 +29,7 @@
 
 ## Abstract
 In this work, we propose a new segmentation task --- ***reasoning segmentation***. The task is designed to output a segmentation mask given a complex and implicit query text. We establish a benchmark comprising over one thousand image-instruction pairs, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: Large-language Instructed Segmentation Assistant, which inherits the language generation capabilities of the multi-modal Large Language Model (LLM) while also possessing the ability to produce segmentation masks.
-For more details, please refer to
+For more details, please refer to the [paper](https://arxiv.org/abs/2308.00692).
 
 ## Highlights
 **LISA** unlocks the new segmentation capabilities of multi-modal LLMs, and can handle cases involving: