Close Menu

    Subscribe to Updates

    Get the latest Tech news from SynapseFlow

    What's Hot

    Best TV 2026: OLED, Micro RGB and more tested

    June 3, 2026

    My local LLM felt unfinished until I put a proper interface in front of it

    June 3, 2026

    The final trailer for 2026’s Supergirl movie is here to remind you why its Woman of Tomorrow subtitle was dropped from the DC comic book movie

    June 3, 2026
    Facebook X (Twitter) Instagram
    • Homepage
    • About Us
    • Contact Us
    • Privacy Policy
    Facebook X (Twitter) Instagram YouTube
    synapseflow.co.uksynapseflow.co.uk
    • AI News & Updates
    • Cybersecurity
    • Future Tech
    • Reviews
    • Software & Apps
    • Tech Gadgets
    synapseflow.co.uksynapseflow.co.uk
    Home»Software & Apps»My local LLM felt unfinished until I put a proper interface in front of it
    My local LLM felt unfinished until I put a proper interface in front of it
    Software & Apps

    My local LLM felt unfinished until I put a proper interface in front of it

    The Tech GuyBy The Tech GuyJune 3, 2026No Comments6 Mins Read0 Views
    Share
    Facebook Twitter LinkedIn Pinterest Email
    Advertisement


    I’ve been really getting into running local LLMs lately, and Ollama has been a lifesaver. It’s a great tool, but I always felt like the interface left something to be desired.

    Advertisement

    So I started looking for a proper UI that hooks into it, and turns out there are actually a few decent options, and I think I’ve found the perfect one for me.


    A MacBook on a sofa showing an open view of Obsidian's graph view alongside a local LLM


    I hooked Obsidian to a local LLM and it beats NotebookLM at its own game

    My notes now talk back and it’s terrifyingly useful.

    Ollama is great… for developers

    I want something prettier

    Ollama website open in Zen browser
    Yadullah Abidi / MakeUseOf

    Ollama is one of the best places to start with local LLMs. You just pull a model, run it, done. It also now has a desktop app, which is nice. But after using it for a while, I kept finding myself wanting more out of the experience, and the one thing that really bugged me was not being able to use it properly from my phone. That’s what eventually pushed me to Open WebUI, and I haven’t looked back.

    To be clear, I still use Ollama. It runs in the background on my Mac mini, and everything else talks to it. That part is excellent. The desktop app, though, is fine. You get a chat window, your conversation history is there, you can drag in a file. It works.

    What it doesn’t have is anything beyond that. For example, it has nothing to do with accessing it from another device. And the phone thing kept annoying me. I wanted to open the full interface from my phone when I’m not at my desk, and the Ollama app just doesn’t do that (no, I don’t want to SSH into my machine to access my hosted LLMs.)

    The more I used it, the more it felt like a feature that was added because people kept asking for one, not something that was designed to be a daily driver. Which makes sense because that was never really the point of Ollama. It’s supposed to be infrastructure. It runs on a server; it serves the models. For the actual user-interface, I wanted an app that could hook into Ollama and just work seamlessly with it.

    Setting up Open WebUI is super easy

    Docker pretty much makes it ready to go

    Open WebUI is free, open-source, and runs entirely on your machine. The easiest way to get it going is through Docker. If you already have Docker installed, you just need to run this command in your terminal:

    docker run -d -p 3000:8080 --add-host=host.docker.internal:host-gateway -v open-webui:/app/backend/data --name open-webui --restart always ghcr.io/open-webui/open-webui:main

    Then you open http://localhost:3000, create a local account, and you’re in. It finds your Ollama models automatically, so you can start chatting with your LLMs pretty much instantly.

    The interface is pretty similar to ChatGPT. It has everything you would need, like persistent conversation history, a model switcher, and more. One thing to do right away: go into the model settings and bump the context window up from the default 2048 tokens to 8192 or higher.


    MNN models market.


    5 useful things I do with a local LLM on my phone

    Privacy aside, a local LLM is just really convenient.

    Ollama’s default is set low for performance reasons, but it’s too low for anything beyond a quick back and forth. With a bigger context window, longer conversations stop losing track of what you said earlier, and document chat actually becomes useful.

    There’s also a Controls panel on the side of every conversation that goes pretty deep if you want it to. Temperature, top-k, top-p, presence penalty, frequency penalty, seed, stop sequences, reasoning effort.

    Most people will never touch any of it, and that’s fine. But it’s there, if you want to experiment with how a model responds without digging through config files, you can directly do that from the UI.

    You can access it from any device

    Yes, even when you’re not at home

    Ollama running locally.
    Roine Bertelson/MUO

    The problem with running something locally is the “local” part. It requires a bit more setup to access it from outside your network. You could expose Open WebUI to the internet directly, but I wouldn’t.

    Port forwarding on a home router, a public-facing server, all of that is more hassle than it’s worth and introduces problems I didn’t want to deal with.

    Tailscale is what I use for this. Simply head to the Tailscale website, create a free account, and download the app on your home machine. Then install it on your phone too and sign in to the same account. That’s it, both devices now show up on your Tailscale network, and you can see each other from anywhere.

    Tailscale dashboard.
    Roine Bertelson/MUO

    To find your home machine’s IP, open the Tailscale app, and it’s right there on the main screen, something like 100.x.x.x. Take that IP, open your phone browser, type 100.x.x.x:3000 and Open WebUI loads exactly like it does at home. I just bookmarked it on my phone and never thought about it again.

    Once it’s set up, using it is simple. Open a browser on your phone, and type in your home machine’s Tailscale IP followed by :3000. To find your machine’s Tailscale IP, just open the Tailscale app on your home machine, and it’s right there on the main screen.

    So in the end, your final URL will look something like 100.x.x.x:3000. Just enter that on any device connected via Tailscale, and that’s it! Now you’ll have the exact same UI on any other device, like your phone.


    Gemma 4 responding to a prompt on a MacBook in LM Studio


    The fix for local LLMs was never a bigger model

    My local LLM kept choking on context until I added this 500MB model.

    One more thing worth knowing

    Open WebUI isn’t restricted to Ollama either. You can point it at pretty much any OpenAI-compatible API, so if you don’t have the hardware to run larger models locally, you’re not stuck with small 4B models.

    You can also use it with Nvidia BUILD, which lets you run really large open-weight models for free through their cloud. It’s a setup that scales with whatever hardware situation you’re in.

    Advertisement
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    The Tech Guy
    • Website

    Related Posts

    The latest Star Wars movie is a successful failure

    June 3, 2026

    You can name cells in Excel, and formulas get much easier after that

    June 2, 2026

    ASUS’ new laptop lineup finally kills the budget-versus-premium divide

    June 2, 2026

    These new NVIDIA laptops finally give Windows what it’s been missing: real creative power

    June 1, 2026

    How to install WMIC in Windows 11 using Settings, PowerShell or CMD

    June 1, 2026

    Alienware’s world-first 39″ 5K monitor solves the one problem that killed OLED gaming

    June 1, 2026
    Leave A Reply Cancel Reply

    Advertisement
    Top Posts

    The iPad Air brand makes no sense – it needs a rethink

    October 12, 202516 Views

    ChatGPT Group Chats are here … but not for everyone (yet)

    November 14, 20258 Views

    Facebook updates its algorithm to give users more control over which videos they see

    October 8, 20258 Views
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Advertisement
    About Us
    About Us

    SynapseFlow brings you the latest updates in Technology, AI, and Gadgets from innovations and reviews to future trends. Stay smart, stay updated with the tech world every day!

    Our Picks

    Best TV 2026: OLED, Micro RGB and more tested

    June 3, 2026

    My local LLM felt unfinished until I put a proper interface in front of it

    June 3, 2026

    The final trailer for 2026’s Supergirl movie is here to remind you why its Woman of Tomorrow subtitle was dropped from the DC comic book movie

    June 3, 2026
    categories
    • AI News & Updates
    • Cybersecurity
    • Future Tech
    • Reviews
    • Software & Apps
    • Tech Gadgets
    Facebook X (Twitter) Instagram Pinterest YouTube Dribbble
    • Homepage
    • About Us
    • Contact Us
    • Privacy Policy
    © 2026 SynapseFlow All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.

    Ad Blocker Enabled!
    Ad Blocker Enabled!
    Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.