Spaces: aletrn/lisa-on-cuda

Xin Lai committed on Aug 3, 2023

Commit 76f1879 · 1 Parent(s): 2393012

Update README.md

Files changed (1)

README.md CHANGED

@@ -18,7 +18,7 @@
 - [ ] ReasonSeg Dataset Release
 - [ ] Training Code Release
-**LISA: Reasoning Segmentation Via Large Language Model [[Paper](https://arxiv.org/pdf/2308.00692.pdf)]** <br />
+**LISA: Reasoning Segmentation Via Large Language Model [[Paper](https://arxiv.org/abs/2308.00692)]** <br />
 [Xin Lai](https://scholar.google.com/citations?user=tqNDPA4AAAAJ&hl=zh-CN),
 [Zhuotao Tian](https://scholar.google.com/citations?user=mEjhz-IAAAAJ&hl=en),
 [Yukang Chen](https://scholar.google.com/citations?user=6p0ygKUAAAAJ&hl=en),
@@ -29,7 +29,7 @@
 ## Abstract
 In this work, we propose a new segmentation task --- ***reasoning segmentation***. The task is designed to output a segmentation mask given a complex and implicit query text. We establish a benchmark comprising over one thousand image-instruction pairs, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: Large-language Instructed Segmentation Assistant, which inherits the language generation capabilities of the multi-modal Large Language Model (LLM) while also possessing the ability to produce segmentation masks.
-For more details, please refer to:
+For more details, please refer to the [paper](https://arxiv.org/abs/2308.00692).
 ## Highlights
 **LISA** unlocks the new segmentation capabilities of multi-modal LLMs, and can handle cases involving: