Req for Ollama reload of models + chapters not reaching word counts #67
You can pass it as a model parameter, but I'll add a default of 8192, which should be a healthy amount. Mind the extra VRAM usage (~+200 MB for qwen2.5:7b).
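For reference, a minimal sketch of what that looks like through the Ollama Python client; the `options` dict is the documented way to set `num_ctx` per request (the model name and message here are just examples):

```python
import ollama

# Request an 8192-token context window for this call.
# num_ctx is a load-time option, so changing it causes Ollama to
# reload the model and increases VRAM usage accordingly.
response = ollama.chat(
    model="qwen2.5:7b",
    messages=[{"role": "user", "content": "Write the opening paragraph of a mystery novel."}],
    options={"num_ctx": 8192},
)
print(response["message"]["content"])
```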
Great, thanks for that. Any thoughts on reloading the models? Even with 8k I don't think it would make it through an entire run with a single model. (qwen2.5:7b is fine for testing, but I'll move on to eva.qwen2.5:72b, which I find is really good at novels.)
They shouldn't fill up the context. It doesn't pass the whole history down on each turn; it only passes the required information for each step. You can also see the prompts here: https://github.com/datacrystals/AIStoryWriter/blob/main/Writer/Prompts.py
Thanks for pointing that out. Will investigate further.
I have been using this for about a week now and I'm loving it (almost), and more people should know about this program. I currently have two problems with it.

I set a prompt like this: "Please write a story set in modern times, the story should contain 10 chapters of 1000-1500 words in each chapter." Then I add the story details. I have noticed that if you use a single model for all steps, it hits the context limit really quickly. (I initially thought that none of my models worked, as they would all start looping.) Then I tried copying the file in Ollama to a new name, but it must have known it was the same file, as it didn't reload. Is there a way to add reloading of a model in Ollama for each model stage of config.py? Nothing popped out at me in the Ollama library (I only know very basic Python). This would reset the context for the model at each stage and clear up part of the problem.
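For what it's worth, Ollama does let you force a model to unload between stages without renaming anything: sending a request with `keep_alive=0` tells the server to evict the model immediately, so the next call starts from a fresh load. A sketch, assuming the stock `ollama` Python client (as the maintainer notes above, chat history isn't stored in the model between requests, so this mainly guarantees a fresh load and frees VRAM rather than clearing any hidden state):

```python
import ollama

def unload_model(model: str) -> None:
    """Ask the Ollama server to evict `model` from memory immediately.

    An empty generate call with keep_alive=0 generates nothing; it just
    tells the server to drop the model right away.
    """
    ollama.generate(model=model, keep_alive=0)

# e.g. between pipeline stages:
unload_model("qwen2.5:7b")
```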
I can also see that Ollama has a context of only 2k for all models, but looking at the Ollama Python library I saw a reference to num_ctx in _types.py under:

```python
class Options(TypedDict, total=False):
    # load time options
    num_ctx: int
```

I think it may be possible to change this somewhere in your code, but I have no idea where it would go. (1000 words should only be around 1400 tokens plus overhead, but with a 2k context it's not going far.)
Thank you
[edit]
Just found that num_ctx is already listed in wrapper.py, but it's not implemented. I'm not sure how to implement it.
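In case it helps, here's a rough sketch of how it could be wired up. The function name and structure below are hypothetical (I haven't checked what wrapper.py actually looks like); the core idea is just forwarding an options dict into the client call:

```python
import ollama

# Hypothetical wrapper shape; the real wrapper.py in AIStoryWriter
# may be structured quite differently.
def chat_with_context(model: str, messages: list[dict], num_ctx: int = 8192) -> str:
    """Run one chat turn, requesting a num_ctx-token context window."""
    response = ollama.chat(
        model=model,
        messages=messages,
        options={"num_ctx": num_ctx},  # load-time option; changing it reloads the model
    )
    return response["message"]["content"]
```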