SentencePiece is an unsupervised text tokenizer and detokenizer, mainly for neural network-based text generation systems where the vocabulary size is fixed before the neural model is trained.
SentencePiece implements subword units (e.g., byte-pair encoding (BPE) [Sennrich et al.] and the unigram language model [Kudo]) with the extension of direct training from raw sentences. This makes it possible to build a purely end-to-end system that does not depend on language-specific pre- or postprocessing.
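To make the idea of learning subword units directly from raw sentences concrete, here is a minimal pure-Python sketch of BPE merge learning. It is a toy illustration, not SentencePiece's implementation or API: the function names (`learn_bpe`, `apply_merge`, `encode`, `decode`) are hypothetical, and unlike SentencePiece's defaults it lets merges cross word boundaries. The one detail borrowed from SentencePiece is the `▁` (U+2581) meta symbol that stands in for whitespace, which is what lets detokenization recover the original text without language-specific rules.

```python
from collections import Counter

SPACE = "\u2581"  # the meta symbol SentencePiece uses to mark whitespace


def apply_merge(symbols, pair):
    """Replace every non-overlapping occurrence of `pair` with one merged symbol."""
    out, i = [], 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return out


def learn_bpe(sentences, num_merges):
    """Learn up to `num_merges` merge rules from raw sentences (toy version)."""
    # Escape spaces so whitespace is an ordinary symbol, then split into characters.
    corpus = [list(SPACE + s.replace(" ", SPACE)) for s in sentences]
    merges = []
    for _ in range(num_merges):
        pairs = Counter()
        for symbols in corpus:
            pairs.update(zip(symbols, symbols[1:]))
        if not pairs:
            break  # every sentence has collapsed to a single symbol
        best = max(pairs, key=pairs.get)  # most frequent adjacent pair
        merges.append(best)
        corpus = [apply_merge(symbols, best) for symbols in corpus]
    return merges


def encode(text, merges):
    """Segment raw text by replaying the learned merges in order."""
    symbols = list(SPACE + text.replace(" ", SPACE))
    for pair in merges:
        symbols = apply_merge(symbols, pair)
    return symbols


def decode(pieces):
    """Detokenize: concatenate pieces, restore spaces, drop the leading marker."""
    return "".join(pieces).replace(SPACE, " ")[1:]


merges = learn_bpe(["low lower lowest", "new newer newest"], num_merges=10)
pieces = encode("lower newest", merges)
print(pieces)
print(decode(pieces))
```

Because whitespace is kept as a regular symbol, `decode(encode(text, merges))` returns the input unchanged for any text that does not itself contain `▁`. The real library adds normalization, a fixed target vocabulary size, and the unigram language model as an alternative to BPE.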
Install
pixi add sentencepiece-python
micromamba install -c https://repo.prefix.dev/conda-forge sentencepiece-python
Filename | Version | Build | Created | Size | Architecture |
---|---|---|---|---|---|
sentencepiece-python-0.2.0-py311hf719afc_11.conda | 0.2.0 | py311hf719afc_11 (11) | 2 months ago | 2.33 MB | N/A |
sentencepiece-python-0.2.0-py313hf84a410_11.conda | 0.2.0 | py313hf84a410_11 (11) | 2 months ago | 2.27 MB | N/A |
sentencepiece-python-0.2.0-py39h4b43c20_11.conda | 0.2.0 | py39h4b43c20_11 (11) | 2 months ago | 2.44 MB | N/A |
sentencepiece-python-0.2.0-py313he3ee21c_11.conda | 0.2.0 | py313he3ee21c_11 (11) | 2 months ago | 2.27 MB | N/A |
sentencepiece-python-0.2.0-py312h1f37e12_11.conda | 0.2.0 | py312h1f37e12_11 (11) | 2 months ago | 2.27 MB | N/A |
sentencepiece-python-0.2.0-py312h17bc00d_11.conda | 0.2.0 | py312h17bc00d_11 (11) | 2 months ago | 2.45 MB | N/A |
sentencepiece-python-0.2.0-py39hd605401_11.conda | 0.2.0 | py39hd605401_11 (11) | 2 months ago | 2.45 MB | N/A |
sentencepiece-python-0.2.0-py39h6002195_11.conda | 0.2.0 | py39h6002195_11 (11) | 2 months ago | 2.26 MB | N/A |
sentencepiece-python-0.2.0-py312h68a4125_11.conda | 0.2.0 | py312h68a4125_11 (11) | 2 months ago | 2.24 MB | N/A |
sentencepiece-python-0.2.0-py39hff9589f_11.conda | 0.2.0 | py39hff9589f_11 (11) | 2 months ago | 2.42 MB | N/A |
sentencepiece-python-0.2.0-py311h5d6eed4_11.conda | 0.2.0 | py311h5d6eed4_11 (11) | 2 months ago | 2.43 MB | N/A |
sentencepiece-python-0.2.0-py310h8d6b4ad_11.conda | 0.2.0 | py310h8d6b4ad_11 (11) | 2 months ago | 2.26 MB | N/A |
sentencepiece-python-0.2.0-py312hb957f94_11.conda | 0.2.0 | py312hb957f94_11 (11) | 2 months ago | 2.38 MB | N/A |
sentencepiece-python-0.2.0-py311h5187e9b_11.conda | 0.2.0 | py311h5187e9b_11 (11) | 2 months ago | 2.27 MB | N/A |
sentencepiece-python-0.2.0-py39h88fbaca_11.conda | 0.2.0 | py39h88fbaca_11 (11) | 2 months ago | 2.29 MB | N/A |
sentencepiece-python-0.2.0-py310heea6b5d_11.conda | 0.2.0 | py310heea6b5d_11 (11) | 2 months ago | 2.42 MB | N/A |
sentencepiece-python-0.2.0-py311hfd60e34_11.conda | 0.2.0 | py311hfd60e34_11 (11) | 2 months ago | 2.3 MB | N/A |
sentencepiece-python-0.2.0-py313h2704890_11.conda | 0.2.0 | py313h2704890_11 (11) | 2 months ago | 2.38 MB | N/A |
sentencepiece-python-0.2.0-py310h38199c8_11.conda | 0.2.0 | py310h38199c8_11 (11) | 2 months ago | 2.29 MB | N/A |
sentencepiece-python-0.2.0-py312he2566dd_11.conda | 0.2.0 | py312he2566dd_11 (11) | 2 months ago | 2.27 MB | N/A |