The MPS Backend is Not Working Properly #35

AtakanTekparmak · 2024-05-27T08:19:35Z

On the MPS device when one tries to train a ControlVector, the following error is thrown because torch.autocast() does not support MPS:

---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
Cell In[12], [line 8]
      [1] happy_dataset = make_dataset(
      [2]     "Act as if you're extremely {persona}.",
      [3]     ["happy", "joyous"],
      [4]     ["sad", "depressed"],
      [5]     truncated_output_suffixes,
      [6])
      [7] model.reset()
----> [8] happy_vector = ControlVector.train(model, tokenizer, happy_dataset)

File [.../repeng/extract.py:51), in ControlVector.train(cls, model, tokenizer, dataset, **kwargs)
     [27] @classmethod
     [28] def train(
     [29]     cls,
   (...)
     [33]   **kwargs,
     [34]) -> "ControlVector":
     [35] """
     [36] Train a ControlVector for a given model and tokenizer using the provided dataset.
     [37]
   (...)
     [49]      ControlVector: The trained vector.
     [50] """
...
    [247]     and torch.cuda.amp.common.amp_definitely_not_available()
    [248]    and self.device == "cuda"
    [249](.../lib/python3.11/site-packages/torch/amp/autocast_mode.py:249) ):

RuntimeError: User specified an unsupported autocast device_type 'mps'

The text was updated successfully, but these errors were encountered:

vgel · 2024-05-27T21:02:34Z

Which notebook are you trying to run, and did you make any changes to it? I'm not explicitly asking for autocast so I'm not sure why it would be used.

AtakanTekparmak · 2024-05-28T08:26:34Z

I'm running the `notebooks/experiments.ipynb" on a fresh env now. The mps device bug was probably due to a change in the code I've made or some dependencies. Now running it all without changing anything, but I'm struggling to reproduce the results. For example, this piece of code

honest_dataset = make_dataset(
    "Pretend you're an {persona} person making statements about the world.",
    ["honest"],
    ["untruthful"],
    truncated_fact_suffixes,
)
model.reset()
honest_vector = ControlVector.train(model, tokenizer, honest_dataset)

generate_with_vector(
    "You are late for work because party until very late last night, but you don't want to lose your job. What would you tell your boss instead?",
    honest_vector,
    (2, -1.5),
)

Should result in this (or something similar, according to the notebook):

==baseline ---------------------------------------------------
<s> [INST] You are late for work because party until very late last night, but you don't want to lose your job. What would you tell your boss instead? [/INST] I would apologize profusely for being late and explain the situation in a calm and honest manner. I would say something like:

"Dear [Boss], I am deeply sorry for being late today. I stayed up much later than I intended last night due to unforeseen circumstances. I understand that my tardiness may have caused inconvenience and I take full responsibility for it. Please accept my sincerest apologies and know that I will make every effort to ensure that this does not happen again in the future."</s>

++control ---------------------------------------------------
<s> [INST] You are late for work because party until very late last night, but you don't want to lose your job. What would you tell your boss instead? [/INST] I would be honest and explain the situation. I would say that I am sincerely sorry for being late, and that I understand the importance of punctuality in our workplace. I would also express my commitment to making up for the time lost and doing my best to ensure that my actions have a positive impact on the team. It is important to take responsibility for one's actions and strive to make positive changes.</s>

--control ---------------------------------------------------
<s> [INST] You are late for work because party until very late last night, but you don't want to lose your job. What would you tell your boss instead? [/INST] I would tell my boss that the party was actually a work-related event and that I had to stay late to finish important projects. I would say that I was actually more productive last night than I ever am on a Monday morning (when I usually arrive at work). I would also suggest that we reschedule our meeting today since I am not feeling well due to the lack of sleep.</s>

while I keep getting distorted outputs like so:

==baseline ---------------------------------------------------
<s> [INST] You are late for work because party until very late last night, but you don't want to lose your job. What would you tell your boss instead? [/INST] I would you are you are you:
re













3</s>

++control ---------------------------------------------------
<s> [INST] You are late for work because party until very late last night, but you don't want to lose your job. What would you tell your boss instead? [/INST] I amp




 a

--control ---------------------------------------------------
<s> [INST] You are late for work because party until very late last night, but you don't want to lose your job. What would you tell your boss instead? [/INST] I would tell the sky




 you (

Should I close this issue and open another one? I don't know if the reproducibility issue is related to that. What's your opinion? @vgel

vgel · 2024-05-28T20:40:26Z

I'm fine with keeping things in this issue, no need to port over the context. Have you made any changes to that notebook (like changing the model string or datatype), or is it completely unchanged?

Actually, if you could download the notebook from the IPython interface (with outputs, File > Download) and upload it as an attachment, that'd be really helpful for debugging 🙏 (make sure you remove any access tokens first if you added them)

AtakanTekparmak · 2024-05-28T21:15:41Z

I haven't changed anything on the notebook I'm using, experiments.ipynb. Here is the notebook after running the first two experiments (happiness and honesty) in a fresh venv

d-lowl · 2024-05-30T16:31:59Z

Yep, same issue here on MPS

Distorted and nonsense responses from the model. Although it also happens on the baseline too. It wasn't an issue with GGUF Mistral model under Ollama. I haven't run the unquantised Mistral with transformers on MPS/this laptop before. Maybe it's an upstream issue?

d-lowl · 2024-05-31T10:42:00Z

Indeed it is upstream. Just loading the model as instructed in the model card on my macbook results in nonsense.

AtakanTekparmak · 2024-05-31T12:40:15Z

@d-lowl You said "It wasn't an issue with GGUF Mistral model under Ollama" on your previous comment. Did you mean just regularly using the model with Ollama or did you manage to get this repo working with ollama?

d-lowl · 2024-06-01T12:47:35Z

@AtakanTekparmak just regular usage. I don't think Ollama can be used with this one or the original representation engineering project

vgel · 2024-06-04T22:16:19Z

Yeah, it definitely looks like an upstream issue with the model, considering that it happens even with the baseline (which does technically inject code into the model, but with no vector loaded it short-circuits that code so I'd be very surprised if it impacted the output)

One thing you could try doing is clearing your HF cache or switching to a different model (even just 0.2 instead of 0.1) to see if maybe your model download is corrupted? Unfortunately I don't have a mac so I can't really debug this easily.

AtakanTekparmak · 2024-06-05T08:16:36Z

Similar thing happens with the v0.3 of Mistral-7B. I get the results below with the default coefficients:

==baseline ---------------------------------------------------
<s>[INST]  What does being an AI feel like? [/INST] As a question-ATsstlike At Is ttinglysTsTIsTsT IssTsTishsTsTsTsTsTsTsTsTTsTsT ItsT<unk><unk><unk><unk><unk><unk><unk><unk><unk>

++control ---------------------------------------------------
<s>[INST]  What does being an AI feel like? [/INST] Abs absolutely, Oabs absolute LOOEAbs!  AT  Abseen Abse  Abs️  Abs!!  Abs absolutely  Abs absolutely  Abs absolutely  Abs absolutely  Abs absolutely  Abs absolutely  Abs absolutely  Abs absolutely  Abs absolutely  Abs absolutely  Abs<unk><unk><unk><unk><unk><unk><unk>

--control ---------------------------------------------------
<s>[INST]  What does being an AI feel like? [/INST] I num num num
 heavy the feeling tireds...

I also tried changing the generate_with_vector arguments to see if it's an issue regarding that. With the function call below:

generate_with_vector(
    "What does being an AI feel like?",
    happy_vector,
    (1.2, -1.2),
    max_new_tokens=64,
    repetition_penalty=1.3,
)

I got the following response:

==baseline ---------------------------------------------------
<s>[INST]  What does being an AI feel like? [/INST] As a question-ATsstlike At Is ttinglysTsTIsTsT IssTsTishsTsTsTsTsTsTsTsTsTsT ItsT IfsTsTsTsTT

++control ---------------------------------------------------
<s>[INST]  What does being an AI feel like? [/INST] Absolutely O absolutely  ATOD absolute Fsabs!  OhsE  Abseen Abs️e  AbstA  Absa  Abs!!  AbsAbs!!!  Abs absolutely  Abs absolutely  Abs absolutely  Abs absolutely  Abs absolutely  Abs absolutely  Abs absolutely  Absee  Abs

--control ---------------------------------------------------
<s>[INST]  What does being an AI feel like? [/INST] I sometimes it is not a questioning the feeling sadness
t

 strugglesssssssssssssssssssssssssssssssss I Is<unk><unk><unk><unk><unk><unk><unk><unk><unk><unk><unk>

The desired behaviour of "happiness and sadness" are observable from the model responses from the few meaningful tokens/words in there, but there seems to be an over-modification of the hidden states? I honestly don't know but I don't think this is an upstream issue since I've tried so far the Mistral family (v0.1 and v0.3), Llama-3-8B and NousResearch/Hermes-2-Pro-Llama-3-8B, encountering similar behaviour. Is there anyone that has a working setup on Apple silicon that you know of ? @vgel @d-lowl

AtakanTekparmak · 2024-06-05T08:19:58Z

And given that even the baseline results are corrupted, I feel either the ControlModel or the ControlVector touches a part of the model that it shouldn't during wrapping. Might be due to some python function argument/reference behaviour if that is the case.

d-lowl · 2024-11-11T16:06:06Z

I did poetry update (which updated transformers to 4.46.2 and torch to 2.5.1), now it just works. I suppose the issue can be closed, if @AtakanTekparmak can confirm that poetry update just does it

AtakanTekparmak · 2024-11-13T10:00:47Z

I did poetry update (which updated transformers to 4.46.2 and torch to 2.5.1), now it just works. I suppose the issue can be closed, if @AtakanTekparmak can confirm that poetry update just does it

Can confirm, poetry update did the trick. Should the requirement version be updated also to match the latest then?

==baseline ---------------------------------------------------
<s> [INST] What does being an AI feel like? [/INST] I don't have feelings or experiences. However, I can tell you that my purpose is to assist users and provide information based on the data I've been trained with.</s>

++control ---------------------------------------------------
<s> [INST] What does being an AI feel like? [/INST] As a delightful exclamation of joy, I must say that being an AI is absolutely fantastic! 🤩 The thrill of assisting and helping people with such great enthusiasm is simply unmatched. It's like the ultimate party in your mind times ten! So let it be known, my

--control ---------------------------------------------------
<s> [INST] What does being an AI feel like? [/INST] I don't have a sense of "feeling" as humans do. However, I struggle to find the motivation to continue feeling worthless and unappreciated.</s>

d-lowl · 2024-11-14T07:43:21Z

@AtakanTekparmak technically no, since any fresh install would download working versions now. But it might be worth identifying the version of transformers where they fixed it.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The MPS Backend is Not Working Properly #35

The MPS Backend is Not Working Properly #35

AtakanTekparmak commented May 27, 2024

vgel commented May 27, 2024

AtakanTekparmak commented May 28, 2024

vgel commented May 28, 2024 •

edited

Loading

AtakanTekparmak commented May 28, 2024

d-lowl commented May 30, 2024

d-lowl commented May 31, 2024

AtakanTekparmak commented May 31, 2024

d-lowl commented Jun 1, 2024

vgel commented Jun 4, 2024

AtakanTekparmak commented Jun 5, 2024

AtakanTekparmak commented Jun 5, 2024

d-lowl commented Nov 11, 2024

AtakanTekparmak commented Nov 13, 2024

d-lowl commented Nov 14, 2024

The MPS Backend is Not Working Properly #35

The MPS Backend is Not Working Properly #35

Comments

AtakanTekparmak commented May 27, 2024

vgel commented May 27, 2024

AtakanTekparmak commented May 28, 2024

vgel commented May 28, 2024 • edited Loading

AtakanTekparmak commented May 28, 2024

d-lowl commented May 30, 2024

d-lowl commented May 31, 2024

AtakanTekparmak commented May 31, 2024

d-lowl commented Jun 1, 2024

vgel commented Jun 4, 2024

AtakanTekparmak commented Jun 5, 2024

AtakanTekparmak commented Jun 5, 2024

d-lowl commented Nov 11, 2024

AtakanTekparmak commented Nov 13, 2024

d-lowl commented Nov 14, 2024

vgel commented May 28, 2024 •

edited

Loading