kishkath/bpe-tokenizer

kishkath committed on Jan 15

Commit 708e762 (verified)

1 Parent(s): 926a036

Update README.md

Files changed (1)

README.md

@@ -1,3 +1,18 @@
+---
+title: Bpe Tokenizer
+emoji: 🔥
+colorFrom: blue
+colorTo: yellow
+sdk: gradio
+sdk_version: 5.12.0
+app_file: app.py
+pinned: false
+license: apache-2.0
+short_description: Telugu BPE tokenizer with vocabulary of 4800 words.
+---
+Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 # Telugu Text Tokenizer
 A Gradio web interface for encoding and decoding Telugu text using a trained BPE tokenizer.
@@ -34,17 +49,3 @@ The tokenizer is trained on a diverse corpus of Telugu text with:
 - Target compression ratio: ≥ 3.2x
 - Perfect reconstruction guarantee
----
-- title: Bpe Tokenizer
-- emoji: 🔥
-- colorFrom: blue
-- colorTo: yellow
-- sdk: gradio
-- sdk_version: 5.12.0
-- app_file: app.py
-- pinned: false
-- license: apache-2.0
-- short_description: Telugu BPE tokenizer with vocabulary of 4800 words.
----
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
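
Review note: this commit moves the Spaces configuration block from the end of README.md to the top, where Spaces reads it as YAML front matter. A minimal Python sketch of a local check for that placement follows. The helper name `split_front_matter` is hypothetical, the `key: value` parsing is a naive stand-in for a real YAML parser, and both sample strings are abbreviated from the diff rather than copied from the full file.

```python
def split_front_matter(text):
    """Return (metadata, body) when text opens with a '---' delimited block.

    Returns ({}, text) if the first line is not '---' or the block is never closed.
    Naive 'key: value' parsing only; this is not a full YAML parser.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}, text

    # Look for the closing delimiter after the opening one.
    for end in range(1, len(lines)):
        if lines[end].strip() == "---":
            break
    else:
        return {}, text  # opening '---' without a matching close

    meta = {}
    for line in lines[1:end]:
        key, sep, value = line.partition(":")
        if sep:
            meta[key.strip()] = value.strip()

    body = "\n".join(lines[end + 1:])
    return meta, body


# Abbreviated stand-in for README.md after this commit: block at the top.
new_readme = """---
title: Bpe Tokenizer
sdk: gradio
sdk_version: 5.12.0
app_file: app.py
---
# Telugu Text Tokenizer
"""

# Abbreviated stand-in for README.md before this commit: block at the end.
old_readme = """# Telugu Text Tokenizer
---
- title: Bpe Tokenizer
- sdk: gradio
---
"""

meta, body = split_front_matter(new_readme)
old_meta, _ = split_front_matter(old_readme)

print(meta)      # keys parsed from the leading block
print(old_meta)  # empty: the old file does not start with '---'
```

With the block at the end of the file, the check finds no metadata at all, which matches the motivation for moving it to the first line.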