Models: Add Qwen2-1.5B-Instruct #2759

ThiloteE · 2024-07-27T19:33:41Z

Adds a models3.json entry for Qwen2-1.5B-Instruct

Description of Model

It is a tiny bilingual model and at the date of writing with very strong results in benchmarks (for its parameter size). It supports a context of up to 32768. Because of its model size it has very fast responses, even when doing inference on CPU. This LLM is LITERALLY for all. Since the model fits into 4GB of RAM (just barely, if the Operating System and other apps also need RAM) or alternatively into 3GB of VRAM, this will be the workhorse of the desperate and hardware poor.

The model was trained/finetuned on English and Chinese language
License: Apache 2.0

Personal Impression:

I got the impression the model is very task focused and this is the reason, why I chose Below is an instruction that describes a task. Write a response that appropriately completes the request. as system prompt. Since the model is relatively small, its responses may seem not very coherent or intelligent, but it works surprisingly well with GPT4All's LocalDocs feature. It is like the model was made for RAG. Its long context adds to that. It mainly will appeal to English and Chinese speaking users.

Checklist before requesting a review

I have performed a self-review of my code.
If it is a core feature, I have added thorough tests.
I have added thorough documentation for my code.
I have tagged PR with relevant project labels. I acknowledge that a PR without labels may be dismissed.
If this PR addresses a bug, I have provided both a screenshot/video of the original bug and the working solution.

Signed-off-by: ThiloteE <[email protected]>

ThiloteE · 2024-07-27T19:45:00Z

I have added this model at the location "order": "z",, because I fear there might be merge conflicts with #2750

ThiloteE · 2024-07-28T10:51:54Z

VAGOsolutions confirm its RAG capabilities in their German RAG benchmark:

cosmic-snow · 2024-07-28T13:23:25Z

I've downloaded it and checked some fields, and they're all fine: md5sum, name, filename, filesize, quant, type, parameters

I have not looked at their site/blog to verify the templates, however a quick test with them went well.

ThiloteE added 2 commits July 27, 2024 21:18

Update models3.json - add Qwen2-1.5B-Instruct

6c37722

Signed-off-by: ThiloteE <[email protected]>

Add context length in description

cf16582

Signed-off-by: ThiloteE <[email protected]>

ThiloteE added models models.json This requires a change to the official model list. labels Jul 27, 2024

ThiloteE marked this pull request as ready for review July 27, 2024 19:53

manyoso approved these changes Jul 29, 2024

View reviewed changes

manyoso merged commit e45685b into main Jul 29, 2024
6 of 12 checks passed

cebtenzzre mentioned this pull request Jul 29, 2024

chat: fix comparison of prerelease versions #2772

Merged

supersonictw mentioned this pull request Jul 30, 2024

Models: Add Yi-1.5-9B-Chat-16K #2750

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Models: Add Qwen2-1.5B-Instruct #2759

Models: Add Qwen2-1.5B-Instruct #2759

ThiloteE commented Jul 27, 2024 •

edited by cebtenzzre

Loading

ThiloteE commented Jul 27, 2024

ThiloteE commented Jul 28, 2024

cosmic-snow commented Jul 28, 2024

Models: Add Qwen2-1.5B-Instruct #2759

Models: Add Qwen2-1.5B-Instruct #2759

Conversation

ThiloteE commented Jul 27, 2024 • edited by cebtenzzre Loading

Description of Model

Personal Impression:

Checklist before requesting a review

ThiloteE commented Jul 27, 2024

ThiloteE commented Jul 28, 2024

cosmic-snow commented Jul 28, 2024

ThiloteE commented Jul 27, 2024 •

edited by cebtenzzre

Loading