ij5
/

whitespace-correction

Text2Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions Community

ij5 commited on Apr 5

Commit

a6ece63

·

verified ·

1 Parent(s): cc667e2

Update README.md

Files changed (1) hide show

README.md +22 -0

README.md CHANGED Viewed

@@ -12,6 +12,28 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 # correction
 This model is a fine-tuned version of [paust/pko-t5-base](https://huggingface.co/paust/pko-t5-base) on the None dataset.

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# Basic Inference
+```python
+from transformers import T5TokenizerFast, T5ForConditionalGeneration
+tokenizer = T5TokenizerFast.from_pretrained('ij5/whitespace-correction')
+model = T5ForConditionalGeneration.from_pretrained('ij5/whitespace-correction')
+def fix_whitespace(text):
+    inputs = f"띄어쓰기 교정: {text}"
+    tokenized = tokenizer(inputs, max_length=128, truncation=True, return_tensors='pt').to('cuda')
+    output_ids = model.generate(
+        input_ids=tokenized['input_ids'],
+        attention_mask=tokenized['attention_mask'],
+        max_length=128,
+    )
+    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
+print(fix_whitespace("흔들 리는 가지 사이로 불쑥 바람의 형상 이 드 러나기라도 할 것처럼."))
+# result: 흔들리는 가지 사이로 불쑥 바람의 형상이 드러나기라도 할 것처럼.
+```
 # correction
 This model is a fine-tuned version of [paust/pko-t5-base](https://huggingface.co/paust/pko-t5-base) on the None dataset.