Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
gbyuvd
/
FastChemTokenizer
like
1
Feature Extraction
qwen3
chemistry
tokenizer
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
FastChemTokenizer
9.96 MB
1 contributor
History:
41 commits
gbyuvd
Update FastChemTokenizerHF2.py
8cc4c16
verified
3 months ago
benchmark
Upload latent visualization notebook
3 months ago
bigsmiles-proto
Upload BigSMILES vocab
3 months ago
latent_space_plots
Upload benchmark script and set
3 months ago
selftok_core
Update to include SELFIES Tokenizer & Vocabs
3 months ago
selftok_wtails
Update to include SELFIES Tokenizer & Vocabs
3 months ago
smitok
First commit
3 months ago
smitok_core
Upload HF wrapper and smitok_core without tails
3 months ago
.gitattributes
Safe
1.71 kB
Upload benchmark script and set
3 months ago
CHANGELOG
Safe
193 Bytes
Tensor handling fix
3 months ago
FastChemTokenizer.py
Safe
23.1 kB
Tensor handling fix
3 months ago
FastChemTokenizerHF.py
Safe
24 kB
Proper full HF Compat
3 months ago
FastChemTokenizerHF2.py
25.5 kB
Update FastChemTokenizerHF2.py
3 months ago
README.md
Safe
12.2 kB
Update README.md
3 months ago
config.json
Safe
896 Bytes
Update config.json
3 months ago
requirements.txt
Safe
120 Bytes
Upload requirements.txt
3 months ago