Use this webui (its the stabld diffusion ui for llm)
https://github.com/oobabooga/text-generation-webui
I am pretty sure it has a sever option.
Here is a list of the models it likely supports, including gpt4all. https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
Best one i tried is wizard vicuna 13B running on a rtx2070
Time to implement a new checkbox “remember the state of the remember me checkbox”
Jokes aside this probably requires injecting some code or script into the webpage. Maybe theres a browser extension that can do this.