Giter VIP home page Giter VIP logo

automodelforcausallm's People

Watchers

 avatar

automodelforcausallm's Issues

AutoModelForCausalLM error with accelerate and bitsandbytes

Hy,
I was running this code:

model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map='auto',
    quantization_config=nf4_config,
    use_cache=False,
    attn_implementation="flash_attention_2"

)

when this error occured:

ImportError                               Traceback (most recent call last)

Cell In[26], line 1
----> 1 model = AutoModelForCausalLM.from_pretrained(
      2     model_id,
      3     device_map='auto',
      4     quantization_config=nf4_config,
      5     use_cache=False,
      6     attn_implementation="flash_attention_2"
      7 
      8 )

File ~\anacondaNewV\envs\tensorflow\lib\site-packages\transformers\models\auto\auto_factory.py:563, in _BaseAutoModelClass.from_pretrained(cls, pretrained_model_name_or_path, *model_args, **kwargs)
    561 elif type(config) in cls._model_mapping.keys():
    562     model_class = _get_model_class(config, cls._model_mapping)
--> 563     return model_class.from_pretrained(
    564         pretrained_model_name_or_path, *model_args, config=config, **hub_kwargs, **kwargs
    565     )
    566 raise ValueError(
    567     f"Unrecognized configuration class {config.__class__} for this kind of AutoModel: {cls.__name__}.\n"
    568     f"Model type should be one of {', '.join(c.__name__ for c in cls._model_mapping.keys())}."
    569 )

File ~\anacondaNewV\envs\tensorflow\lib\site-packages\transformers\modeling_utils.py:3049, in PreTrainedModel.from_pretrained(cls, pretrained_model_name_or_path, config, cache_dir, ignore_mismatched_sizes, force_download, local_files_only, token, revision, use_safetensors, *model_args, **kwargs)
   3046     hf_quantizer = None
   3048 if hf_quantizer is not None:
-> 3049     hf_quantizer.validate_environment(
   3050         torch_dtype=torch_dtype, from_tf=from_tf, from_flax=from_flax, device_map=device_map
   3051     )
   3052     torch_dtype = hf_quantizer.update_torch_dtype(torch_dtype)
   3053     device_map = hf_quantizer.update_device_map(device_map)

File ~\anacondaNewV\envs\tensorflow\lib\site-packages\transformers\quantizers\quantizer_bnb_4bit.py:62, in Bnb4BitHfQuantizer.validate_environment(self, *args, **kwargs)
     60 def validate_environment(self, *args, **kwargs):
     61     if not (is_accelerate_available() and is_bitsandbytes_available()):
---> 62         raise ImportError(
     63             "Using `bitsandbytes` 8-bit quantization requires Accelerate: `pip install accelerate` "
     64             "and the latest version of bitsandbytes: `pip install -i https://pypi.org/simple/ bitsandbytes`"
     65         )
     67     if kwargs.get("from_tf", False) or kwargs.get("from_flax", False):
     68         raise ValueError(
     69             "Converting into 4-bit or 8-bit weights from tf/flax weights is currently not supported, please make"
     70             " sure the weights are in PyTorch format."
     71         )

ImportError: Using `bitsandbytes` 8-bit quantization requires Accelerate: `pip install accelerate` and the latest version of bitsandbytes: 'pip install -i https://pypi.org/simple/ bitsandbytes`

The error still occured after running

!pip install accelerate
!pip install -i https://pypi.org/simple/ bitsandbytes

I first thought it was because tensorflow used my cpu instead of my gpu. This issue was very helpful: tensorflow/tensorflow#63362. So I found a way that tensorflow use my gpu by running this: pip install tensorflow-gpu==2.10.0 in a conda environment. But the error still remain.

Does someone have any idea? ( excuse for my english, it's not so good)

Thank you very much.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.