Instructions to use google-bert/bert-base-chinese with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use google-bert/bert-base-chinese with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("fill-mask", model="google-bert/bert-base-chinese")# Load model directly from transformers import AutoTokenizer, AutoModelForMaskedLM tokenizer = AutoTokenizer.from_pretrained("google-bert/bert-base-chinese") model = AutoModelForMaskedLM.from_pretrained("google-bert/bert-base-chinese") - Inference
- Notebooks
- Google Colab
- Kaggle
The do_lower_case should be 'true'
#17
by robin0307 - opened
in tokenizer_config.json the "do_lower_case": false
but it's should be true
>>> from transformers import AutoTokenizer
>>> tokenizer = AutoTokenizer.from_pretrained('google-bert/bert-base-chinese')
>>> tokenizer.do_lower_case
False
>>> tokenizer.decode(tokenizer('My name is Robin')['input_ids'])
'[CLS] [UNK] name is [UNK] [SEP]'