Automatic Speech Recognition
Transformers
PyTorch
TensorBoard
Safetensors
Uyghur
wav2vec2
Generated from Trainer
hf-asr-leaderboard
mozilla-foundation/common_voice_8_0
robust-speech-event
Eval Results (legacy)
Instructions to use lucio/xls-r-uyghur-cv8 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use lucio/xls-r-uyghur-cv8 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="lucio/xls-r-uyghur-cv8")# Load model directly from transformers import AutoProcessor, AutoModelForCTC processor = AutoProcessor.from_pretrained("lucio/xls-r-uyghur-cv8") model = AutoModelForCTC.from_pretrained("lucio/xls-r-uyghur-cv8") - Notebooks
- Google Colab
- Kaggle
Training in progress, step 18500
Browse files
.ipynb_checkpoints/eval-checkpoint.py
CHANGED
|
@@ -49,7 +49,7 @@ def log_results(result: Dataset, args: Dict[str, str]):
|
|
| 49 |
def normalize_text(text: str) -> str:
|
| 50 |
"""DO ADAPT FOR YOUR USE CASE. this function normalizes the target text."""
|
| 51 |
|
| 52 |
-
chars_to_ignore_regex = '[,
|
| 53 |
|
| 54 |
text = re.sub(chars_to_ignore_regex, "", text.lower())
|
| 55 |
|
|
|
|
| 49 |
def normalize_text(text: str) -> str:
|
| 50 |
"""DO ADAPT FOR YOUR USE CASE. this function normalizes the target text."""
|
| 51 |
|
| 52 |
+
chars_to_ignore_regex = '[!"%,.:;?\\_|©«¬»،؛؟‒–—’“”„…‹›−☺♂�\\\\-]' # noqa: W605 IMPORTANT: this should correspond to the chars that were ignored during training
|
| 53 |
|
| 54 |
text = re.sub(chars_to_ignore_regex, "", text.lower())
|
| 55 |
|
eval.py
CHANGED
|
@@ -49,7 +49,7 @@ def log_results(result: Dataset, args: Dict[str, str]):
|
|
| 49 |
def normalize_text(text: str) -> str:
|
| 50 |
"""DO ADAPT FOR YOUR USE CASE. this function normalizes the target text."""
|
| 51 |
|
| 52 |
-
chars_to_ignore_regex = '[,
|
| 53 |
|
| 54 |
text = re.sub(chars_to_ignore_regex, "", text.lower())
|
| 55 |
|
|
|
|
| 49 |
def normalize_text(text: str) -> str:
|
| 50 |
"""DO ADAPT FOR YOUR USE CASE. this function normalizes the target text."""
|
| 51 |
|
| 52 |
+
chars_to_ignore_regex = '[!"%,.:;?\\_|©«¬»،؛؟‒–—’“”„…‹›−☺♂�\\\\-]' # noqa: W605 IMPORTANT: this should correspond to the chars that were ignored during training
|
| 53 |
|
| 54 |
text = re.sub(chars_to_ignore_regex, "", text.lower())
|
| 55 |
|
pytorch_model.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 1262104049
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:73b90be4276eb5faf73aa36222d89249b80ec66348e9433da7efe67d53c41e0d
|
| 3 |
size 1262104049
|
runs/Jan28_17-40-32_job-0074bb36-c67f-4775-b1b6-176eb09b0ba4/events.out.tfevents.1643391738.job-0074bb36-c67f-4775-b1b6-176eb09b0ba4.843454.0
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5c05204e072a6a078c5880fef30fe2a2e48fea9f4c7ad611aec292154ed3f6db
|
| 3 |
+
size 45634
|