Update README.md

2026-05-21 08:55:48 +08:00 · 2023-08-05 23:58:38 +08:00
parent f63c1d1026
commit b81d4a8c7a
1 changed files with 2 additions and 2 deletions
--- a/README.md
+++ b/README.md
@@ -186,7 +186,7 @@ print(f'Response: {response}')
 ## Quantization
-We provide examples to show how to load models in `NF4` and `Int8`. For starters, make sure you have implemented `bitsandbytes`. Note that the requirements for `bitsandbytes` is:
+We provide examples to show how to load models in `NF4` and `Int8`. For starters, make sure you have implemented `bitsandbytes`. Note that the requirements for `bitsandbytes` are:
 ```
 **Requirements** Python >=3.8. Linux distribution (Ubuntu, MacOS, etc.) + CUDA > 10.0.
@@ -197,7 +197,7 @@ Windows users should find another option, which might be [bitsandbytes-windows-w
 Then you only need to add your quantization configuration to `AutoModelForCausalLM.from_pretrained`. See the example below:
 ```python
-from transformers import BitsAndBytesConfig
+from transformers import AutoModelForCausalLM, BitsAndBytesConfig
 # quantization configuration for NF4 (4 bits)
 quantization_config = BitsAndBytesConfig(