Update app.py
app.py
CHANGED
@@ -53,6 +53,7 @@ description = """CLaMP 3 is a **multimodal and multilingual** music information
 - You can **retrieve music using any text input (in any language) or an image** (`.png`, `.jpg`).
 - When using an image, **BLIP** generates a caption, which is then used for retrieval.
 - Since CLaMP 3's training data includes **rich visual descriptions of musical scenes**, it can **match images to semantically relevant music**.
+- For simplicity, this demo retrieves music based on **metadata (text descriptions)** rather than directly searching sheet music, MIDI, or audio files.
 
 ### ⚠️ **Limitations**
 - This demo retrieves music **only from the WikiMT-X benchmark (1,000 pieces)**.
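
The added bullet says that retrieval runs over text metadata, with an image first reduced to a caption. The sketch below illustrates that idea only; it is not the Space's code. It assumes a caption string is already available, and it swaps in a toy bag-of-words encoder (`embed_text`, a hypothetical stand-in) where the demo would use a learned text encoder. The `metadata` entries and piece names are made up for illustration.

```python
import math
from collections import Counter


def embed_text(text):
    """Hypothetical stand-in for a text encoder: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, metadata, top_k=3):
    """Rank pieces by similarity between the query and each text description."""
    q = embed_text(query)
    scored = [(cosine(q, embed_text(desc)), title) for title, desc in metadata.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [title for _, title in scored[:top_k]]


# Made-up metadata: piece identifier -> text description.
metadata = {
    "piece_a": "a lively irish folk dance for fiddle",
    "piece_b": "a slow romantic piano nocturne at night",
    "piece_c": "an energetic jazz tune with saxophone solo",
}

# Assumed to be the caption produced for an uploaded image.
caption = "a piano at night"
results = retrieve(caption, metadata, top_k=1)
```

Because only descriptions are compared, a piece can be returned without its sheet music, MIDI, or audio ever being searched, which matches the simplification the new bullet describes.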