Run LM Studio as a service (headless)
Starting in v0.3.5, LM Studio can be run as a service without the GUI. This is useful for running LM Studio on a server or in the background on your local machine.
Running LM Studio as a service introduces several new features intended to make LM Studio more efficient to use as a developer tool.
To enable this, head to app settings (Cmd / Ctrl + ,) and check the box to run the LLM server on login.
Enable the LLM server to start on machine login
When this setting is enabled, exiting the app will minimize it to the system tray, and the LLM server will continue to run in the background.
This is useful when using LM Studio as an LLM service with other frontends or applications.
Load models on demand
When just-in-time (JIT) model loading is on, /v1/models will return all downloaded models, not only the ones loaded into memory. When it is off, /v1/models will return only the models loaded into memory.
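As a rough sketch of what this looks like from a client (assuming the server's default localhost:1234 address, and using qwen2.5-7b-instruct as a placeholder model id), you can list models and then request a completion; the completion call is what triggers the on-demand load:

```bash
# List models. With JIT loading on, this includes every downloaded model,
# whether or not it is loaded into memory. Assumes the default address.
curl http://localhost:1234/v1/models

# Requesting a completion against an unloaded model loads it on demand.
# "qwen2.5-7b-instruct" is a placeholder; use an id from the list above.
curl http://localhost:1234/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen2.5-7b-instruct",
    "messages": [{ "role": "user", "content": "Hello!" }]
  }'
```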
The counterpart to loading models on-demand is unloading them from memory at the right time.
LM Studio 0.3.9 introduces TTL (unload after some time) and Auto-Evict (limit how many JIT models are loaded at one time).
Read about Idle TTL and Auto-Evict.
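As a concrete example, the Idle TTL docs describe setting a per-request TTL, in seconds, via a ttl field in the request payload. The request below assumes that field along with the placeholder address and model id from earlier:

```bash
# Ask for the JIT-loaded model to be evicted after 5 idle minutes.
# The "ttl" field (seconds) is per the Idle TTL and Auto-Evict docs.
curl http://localhost:1234/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen2.5-7b-instruct",
    "ttl": 300,
    "messages": [{ "role": "user", "content": "Ping" }]
  }'
```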
Your last server state will be saved and restored on app or service launch.
To start the server programmatically, use the following command:

```bash
lms server start
```
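When scripting this, a couple of related lms commands can help. Note that the --port flag and the status subcommand below come from the lms CLI docs rather than this page, so confirm them with lms server start --help on your install:

```bash
# Pin the port explicitly (--port is from the lms CLI docs; verify it
# exists on your install with `lms server start --help`).
lms server start --port 1234

# Check whether the server is running.
lms server status
```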
If you haven't already, bootstrap lms on your machine by following the instructions here.
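Per the lms documentation, the bootstrap typically amounts to a single command that ships with LM Studio; the exact, current steps live at the linked instructions:

```bash
# Bootstrap the lms CLI (command per the LM Studio docs; requires npx/Node.js).
npx lmstudio install-cli
```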
Chat with other LM Studio developers, discuss LLMs, hardware, and more on the LM Studio Discord server.
Please report bugs and issues in the lmstudio-bug-tracker GitHub repository.