KeyError with OpenAI MODEL_CAPABILITIES #4785
Comments
Does setting model capabilities directly in the model client constructor work? It's documented in the API docs. In prior versions, countless issues came from developers plugging in local models without the necessary capabilities and expecting them to "just work", so we want more validation. Perhaps, as a first step, we could produce a clean error message suggesting that capabilities be added in the constructor?
I see. OK, if you've encountered issues with this in the past, then it's probably best not to ignore model capabilities. I think producing an error message for it would be good.
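The clean-error approach discussed above could be sketched roughly as follows. This is a minimal, self-contained illustration, not autogen's actual code: the `MODEL_CAPABILITIES` entries, the `resolve_capabilities` helper, and the capability keys are all assumed for illustration.

```python
# Hypothetical sketch of replacing the bare dict lookup (which raises
# KeyError for unknown models) with a clean, actionable error message.
MODEL_CAPABILITIES = {
    # Illustrative entry; the real table covers the OpenAI model lineup.
    "gpt-4o": {"vision": True, "function_calling": True, "json_output": True},
}

def resolve_capabilities(model: str) -> dict:
    """Look up capabilities, raising a helpful error for unknown models."""
    try:
        return MODEL_CAPABILITIES[model]
    except KeyError:
        # Tell the developer what to do instead of leaking a KeyError.
        raise ValueError(
            f"Model '{model}' is not in MODEL_CAPABILITIES. If this is a "
            "non-OpenAI model, pass its capabilities explicitly to the "
            "client constructor instead."
        ) from None
```

A `ValueError` with a suggestion is far easier to act on than the bare `KeyError` the issue reports, while still keeping the validation the maintainer wants.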
What happened?
Location: https://github.com/microsoft/autogen/blob/main/python/packages/autogen-ext/src/autogen_ext/models/openai/_openai_client.py
Line 131 in the above location raises a KeyError when you provide a model that is not in the MODEL_CAPABILITIES dict (eg. 'llama3-8b-8192').
This is a big problem because the OpenAIChatCompletionClient should be usable with other providers.
A couple of changes are required. However, I suggest ignoring model capabilities entirely and letting developers ensure that the models they use have the required capabilities. From there, you could simply let the AsyncOpenAI client raise an exception.
What did you expect to happen?
I expected the client to be unaware of the model that is set, like in the official openai sdk.
How can we reproduce it (as minimally and precisely as possible)?
```python
from autogen_ext.models.openai import OpenAIChatCompletionClient

groq_model_client = OpenAIChatCompletionClient(
    model='llama3-groq-70b-8192-tool-use-preview',
    base_url='https://api.groq.com/openai/v1',
    api_key='<GROQ_API_KEY>'
)
# Raises: KeyError: 'llama3-groq-70b-8192-tool-use-preview'
```
AutoGen version
0.4.0.dev9
Which package was this bug in
Extensions
Model used
llama3-groq-70b-8192-tool-use-preview (Groq)
Python version
3.12.5
Operating system
Windows 11
Any additional info you think would be helpful for fixing this bug