Xin Lai commited on
Commit
76f1879
·
unverified ·
1 Parent(s): 2393012

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -18,7 +18,7 @@
18
  - [ ] ReasonSeg Dataset Release
19
  - [ ] Training Code Release
20
 
21
- **LISA: Reasoning Segmentation Via Large Language Model [[Paper](https://arxiv.org/pdf/2308.00692.pdf)]** <br />
22
  [Xin Lai](https://scholar.google.com/citations?user=tqNDPA4AAAAJ&hl=zh-CN),
23
  [Zhuotao Tian](https://scholar.google.com/citations?user=mEjhz-IAAAAJ&hl=en),
24
  [Yukang Chen](https://scholar.google.com/citations?user=6p0ygKUAAAAJ&hl=en),
@@ -29,7 +29,7 @@
29
 
30
  ## Abstract
31
  In this work, we propose a new segmentation task --- ***reasoning segmentation***. The task is designed to output a segmentation mask given a complex and implicit query text. We establish a benchmark comprising over one thousand image-instruction pairs, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: Large-language Instructed Segmentation Assistant, which inherits the language generation capabilities of the multi-modal Large Language Model (LLM) while also possessing the ability to produce segmentation masks.
32
- For more details, please refer to:
33
 
34
  ## Highlights
35
  **LISA** unlocks the new segmentation capabilities of multi-modal LLMs, and can handle cases involving:
 
18
  - [ ] ReasonSeg Dataset Release
19
  - [ ] Training Code Release
20
 
21
+ **LISA: Reasoning Segmentation Via Large Language Model [[Paper](https://arxiv.org/abs/2308.00692)]** <br />
22
  [Xin Lai](https://scholar.google.com/citations?user=tqNDPA4AAAAJ&hl=zh-CN),
23
  [Zhuotao Tian](https://scholar.google.com/citations?user=mEjhz-IAAAAJ&hl=en),
24
  [Yukang Chen](https://scholar.google.com/citations?user=6p0ygKUAAAAJ&hl=en),
 
29
 
30
  ## Abstract
31
  In this work, we propose a new segmentation task --- ***reasoning segmentation***. The task is designed to output a segmentation mask given a complex and implicit query text. We establish a benchmark comprising over one thousand image-instruction pairs, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: Large-language Instructed Segmentation Assistant, which inherits the language generation capabilities of the multi-modal Large Language Model (LLM) while also possessing the ability to produce segmentation masks.
32
+ For more details, please refer to the [paper](https://arxiv.org/abs/2308.00692).
33
 
34
  ## Highlights
35
  **LISA** unlocks the new segmentation capabilities of multi-modal LLMs, and can handle cases involving: