Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
flexitok
/
bpe_script_Arab_16000
like
0
Follow
Moto
2
Standard Arabic
Persian
tokenizer
bpe
flexitok
fineweb2
License:
mit
Model card
Files
Files and versions
xet
Community
main
bpe_script_Arab_16000
/
script_1
1.26 MB
Ctrl+K
Ctrl+K
1 contributor
History:
9 commits
gsaltintas
Upload script_1/flexitok--bpe_script_Arab_16000_overlap.json with huggingface_hub
3c65d02
verified
5 days ago
flexitok--bpe_script_Arab_16000.yaml
Safe
82 Bytes
Upload script_1/flexitok--bpe_script_Arab_16000.yaml with huggingface_hub
6 days ago
flexitok--bpe_script_Arab_16000_info.json
Safe
90 Bytes
Upload script_1/flexitok--bpe_script_Arab_16000_info.json with huggingface_hub
6 days ago
flexitok--bpe_script_Arab_16000_overlap.json
Safe
1.54 kB
Upload script_1/flexitok--bpe_script_Arab_16000_overlap.json with huggingface_hub
5 days ago
flexitok--bpe_script_Arab_16000_super_mapping.json
Safe
270 kB
Upload script_1/flexitok--bpe_script_Arab_16000_super_mapping.json with huggingface_hub
5 days ago
flexitok--bpe_script_Arab_16000_vocab.json
Safe
984 kB
Upload script_1/flexitok--bpe_script_Arab_16000_vocab.json with huggingface_hub
5 days ago
participating_tokenizers.json
Safe
389 Bytes
Upload script_1/participating_tokenizers.json with huggingface_hub
6 days ago