Update README.md
README.md
CHANGED
@@ -152,50 +152,6 @@ print(result)
+    punctuator=False,
 ```
 
-### Transcription with Prompt
-Kotoba-whisper can generate transcription with prompting as below:
-
-```python
-import re
-import torch
-from transformers import pipeline
-from datasets import load_dataset
-
-# config
-model_id = "kotoba-tech/kotoba-whisper-v2.1"
-torch_dtype = torch.float16 if torch.cuda.is_available() else torch.float32
-device = "cuda:0" if torch.cuda.is_available() else "cpu"
-model_kwargs = {"attn_implementation": "sdpa"} if torch.cuda.is_available() else {}
-generate_kwargs = {"language": "japanese", "task": "transcribe"}
-
-# load model
-pipe = pipeline(
-    model=model_id,
-    torch_dtype=torch_dtype,
-    device=device,
-    model_kwargs=model_kwargs,
-    chunk_length_s=15,
-    batch_size=16,
-    trust_remote_code=True
-)
-
-# load sample audio
-dataset = load_dataset("japanese-asr/ja_asr.reazonspeech_test", split="test")
-
-# --- Without prompt ---
-text = pipe(dataset[10]["audio"], generate_kwargs=generate_kwargs)['text']
-print(text)
-# 81歳、力強い走りに変わってきます。
-
-# --- With prompt ---: Let's change `81` to `91`.
-prompt = "91歳"
-generate_kwargs['prompt_ids'] = pipe.tokenizer.get_prompt_ids(prompt, return_tensors="pt").to(device)
-text = pipe(dataset[10]["audio"], generate_kwargs=generate_kwargs)['text']
-# currently the pipeline for ASR appends the prompt at the beginning of the transcription, so remove it
-text = re.sub(rf"\A\s*{prompt}\s*", "", text)
-print(text)
-# あっぶったでもスルガさん、91歳、力強い走りに変わってきます。
-```
 
 ### Flash Attention 2
 We recommend using [Flash-Attention 2](https://huggingface.co/docs/transformers/main/en/perf_infer_gpu_one#flashattention-2)
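The hunk above adds `punctuator=False` to the pipeline example that ends at line 152 and drops the prompting walkthrough. A minimal sketch of the resulting call, assuming the surrounding README code and that `punctuator` is a flag of the model's custom pipeline (it is loaded with `trust_remote_code=True`); the audio path is hypothetical:

```python
import torch
from transformers import pipeline

# config (mirrors the README snippet this commit edits)
model_id = "kotoba-tech/kotoba-whisper-v2.1"
torch_dtype = torch.float16 if torch.cuda.is_available() else torch.float32
device = "cuda:0" if torch.cuda.is_available() else "cpu"

# load model; punctuator=False (the line this commit adds) is assumed to
# disable the custom pipeline's punctuation-restoration step
pipe = pipeline(
    model=model_id,
    torch_dtype=torch_dtype,
    device=device,
    chunk_length_s=15,
    batch_size=16,
    trust_remote_code=True,
    punctuator=False,
)

# transcribe; "sample.wav" is a hypothetical local audio file
result = pipe("sample.wav", generate_kwargs={"language": "japanese", "task": "transcribe"})
print(result)
```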
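The trailing context lines point at the Flash-Attention 2 recommendation. As a sketch of how that recommendation is typically followed with Transformers, `attn_implementation="flash_attention_2"` replaces the `"sdpa"` setting used elsewhere in the model card; this assumes a supported GPU and the `flash-attn` package installed (e.g. `pip install flash-attn --no-build-isolation`):

```python
import torch
from transformers import pipeline

# enable Flash-Attention 2 instead of the default sdpa attention
pipe = pipeline(
    model="kotoba-tech/kotoba-whisper-v2.1",
    torch_dtype=torch.float16,
    device="cuda:0",
    model_kwargs={"attn_implementation": "flash_attention_2"},
    batch_size=16,
    trust_remote_code=True,
)
```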