I’m having the same issue on my Mac. I’m running Ollama directly on the host so that I can make use of the GPU. When running oTToDev in the container I needed to set
OLLAMA_API_BASE_URL=http://host.docker.internal:11434
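For context, that's just an environment variable handed to the container at startup. A minimal sketch, assuming a plain docker run (the image name and port mapping are placeholders for however you actually launch oTToDev):

→ docker run -e OLLAMA_API_BASE_URL=http://host.docker.internal:11434 -p 5173:5173 ottodev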
If I open a terminal in the oTToDev container I can reach Ollama directly. The first time I run the ps endpoint I get:
→ curl ${OLLAMA_API_BASE_URL}/api/ps
{"models":[]}
But then running:
→ curl ${OLLAMA_API_BASE_URL}/api/chat -d '{
  "model": "dolphin_2_9_2_lc:latest"
}'
{"model":"dolphin_2_9_2_lc:latest","created_at":"2024-11-19T21:49:37.519538Z","message":{"role":"assistant","content":""},"done_reason":"load","done":true}
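(That part is expected: per the Ollama docs, a chat request with an empty or missing messages array just loads the model into memory, which is why the response comes back with done_reason "load" and empty content.)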
Now ps works:
→ curl ${OLLAMA_API_BASE_URL}/api/ps
{"models":[{"name":"dolphin_2_9_2_lc:latest","model":"dolphin_2_9_2_lc:latest","size":102690629632,"digest":"f586f65be437e2fca804b550a90715e227ec9106d6d67cffc3bd9a7553f7a782","details":{"parent_model":"","format":"gguf","family":"qwen2","families":["qwen2"],"parameter_size":"72.7B","quantization_level":"Q4_0"},"expires_at":"2024-11-19T13:54:37.519822-08:00","size_vram":102690629632}]}
But even after refreshing the web page, oTToDev still doesn't find any models.
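One thing that might be worth checking here, assuming oTToDev builds its model list from Ollama's /api/tags endpoint (which lists installed models, unlike /api/ps, which only lists models currently loaded in memory): from the same terminal in the container, see whether this returns your models too:

→ curl ${OLLAMA_API_BASE_URL}/api/tags

If that comes back empty or fails, the UI would have nothing to show regardless of what's loaded.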
Eventually the model unloads and I get back to:
→ curl ${OLLAMA_API_BASE_URL}/api/ps
{"models":[]}
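If the unload itself gets in the way while testing, Ollama's chat endpoint accepts a keep_alive parameter controlling how long the model stays in memory after a request (a duration like "30m", or -1 to keep it loaded indefinitely), e.g.:

→ curl ${OLLAMA_API_BASE_URL}/api/chat -d '{
  "model": "dolphin_2_9_2_lc:latest",
  "keep_alive": "30m"
}'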