SentencePiece is an unsupervised text tokenizer and detokenizer mainly for Neural Network-based text generation systems where the vocabulary size is predetermined prior to the neural model training.
SentencePiece implements subword units (e.g., byte-pair-encoding (BPE) [Sennrich et al.]) and unigram language model [Kudo]) with the extension of direct training from raw sentences. SentencePiece allows us to make a purely end-to-end system that does not depend on language-specific pre/postprocessing.
Install
pixiaddsentencepiece
micromambainstall-c https://repo.prefix.dev/conda-forgesentencepiece
Version
Platforms
Last published
filename | version | build | Created | size | Architecture | Downloads | |
---|---|---|---|---|---|---|---|
sentencepiece-0.2.0-hca73ee5_11.conda | 0.2.0 | hca73ee5_11 (11) | 3 months ago | 19.44 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-hd1d1222_11.conda | 0.2.0 | hd1d1222_11 (11) | 3 months ago | 19.42 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-h7197b3e_11.conda | 0.2.0 | h7197b3e_11 (11) | 3 months ago | 19.31 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-h074ec65_11.conda | 0.2.0 | h074ec65_11 (11) | 3 months ago | 19.29 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-h4d92249_11.conda | 0.2.0 | h4d92249_11 (11) | 3 months ago | 19.75 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-h6c1b121_11.conda | 0.2.0 | h6c1b121_11 (11) | 3 months ago | 19.44 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-h355fbbf_11.conda | 0.2.0 | h355fbbf_11 (11) | 3 months ago | 19.45 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-h789a12f_11.conda | 0.2.0 | h789a12f_11 (11) | 3 months ago | 19.77 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-h9ebfe73_11.conda | 0.2.0 | h9ebfe73_11 (11) | 3 months ago | 19.32 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-hc8f76dd_11.conda | 0.2.0 | hc8f76dd_11 (11) | 3 months ago | 19.29 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-h9022467_11.conda | 0.2.0 | h9022467_11 (11) | 3 months ago | 19.46 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-hb52d4ef_11.conda | 0.2.0 | hb52d4ef_11 (11) | 3 months ago | 19.43 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-hb5d48cf_11.conda | 0.2.0 | hb5d48cf_11 (11) | 3 months ago | 19.45 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-h873f942_11.conda | 0.2.0 | h873f942_11 (11) | 3 months ago | 19.75 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-h6f50818_11.conda | 0.2.0 | h6f50818_11 (11) | 3 months ago | 19.46 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-hf5c48a9_11.conda | 0.2.0 | hf5c48a9_11 (11) | 3 months ago | 19.47 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-he3f5c48_11.conda | 0.2.0 | he3f5c48_11 (11) | 3 months ago | 19.46 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-h64d5bc9_11.conda | 0.2.0 | h64d5bc9_11 (11) | 3 months ago | 19.45 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-h412708c_11.conda | 0.2.0 | h412708c_11 (11) | 3 months ago | 19.34 KB | N/A | ||
Dependencies: Run exports: Weak: | |||||||
sentencepiece-0.2.0-h459e5fc_11.conda | 0.2.0 | h459e5fc_11 (11) | 3 months ago | 19.77 KB | N/A | ||
Dependencies: Run exports: Weak: |