Chat with an LLM directly in your markdown documents!
This command line tool extends Simon Willison's amazing llm tool to allow you to have a conversation with an LLM in a markdown document. The idea is that you then have a permanent record of the conversation which you can submit to version control or keep in your Obsidian vault.
Why would you want to use this instead of llm chat
?
- It's easier to save conversations to version control and/or Obsidian
- Navigating your favourite text editor is easier than the terminal
- You get nice syntax highlighting when the LLM writes fenced code blocks
- You can comment out parts of the conversation history (see below)
- This tool has support for markdown links to images and files. Links to websites coming soon.
- It opens up possibilities for experimentation - by editing the chat history you can see how different models with different settings would answer the same question, and even get models conversing with each other.
This project is a very early work-in-progress. Use only if you're willing to troubleshoot.
- Install from git:
pip install git+https://github.com/matweldon/markdown_llm.git
OR, clone and install as editable for development:
git clone https://github.com/matweldon/markdown_llm.git
cd markdown_llm
pip install -e .
-
Set the API key (see below).
-
Ensure you have the
code
command in your PATH, or edit theeditor_cmd
option in config (see below) to open the text editor of your choice.
Simon Willison's llm
package provides key management which can be used in this package for text models. To set the Anthropic key, enter:
llm keys set claude
and enter the key at the prompt. The package stores keys in a JSON file at a location that can be found with llm keys path
.
The vision model will work with an 'ANTHROPIC_API_KEY' in a .env
file in the root of the directory where the package is used, or by setting the environment variable in the session. In the future the package will support other APIs but the Anthropic API is currently hard-coded for vision models.
Run the script:
llmd your_file.md
and when the markdown prompt appears, write your prompt.
---
model: claude-3-5-sonnet-20240620
system: '{sys_python_prefs}'
date: 17 October 2024
options:
max_tokens: 4096
---
# %User
What is President Reagan's first name? Just give me the answer.
Then run the command again. The LLM's response will be inserted directly into your markdown file.
# %User
What is President Reagan's first name? Just give me the answer.
# %Assistant
Ronald
Then you can carry on the conversation:
# %User
What about Thatcher?
# %Assistant
Margaret
You can also comment out parts of the conversation using <!--llm
and llm-->
. This allows you to edit the conversation history, for example to rerun responses to obtain a sample of several different answers. The API is stateless - the API call reconstructs the full conversation each time a request is sent. This means that you can "put words into the LLM's mouth" and generally mess around with the flow of the conversation.
To make the file initialisation work, you need to ensure the code
command (for VS Code) is in your PATH. To use other text editors, set the 'editor_cmd' option in a USER or PROJECT config yaml (see below). Otherwise, you can still open the file that is created and enter your prompt manually.
Once you've written a prompt, setting up a keyboard shortcut in your editor saves time moving back and forth between the editor and the terminal. To set up a keyboard shortcut in VS Code, open keybindings.json
from the Command palette and add this entry (substituting your preferred key combination):
{
"key": "cmd+ctrl+r",
"command":"workbench.action.terminal.sendSequence",
"args": {
"text": "llmd ${file}\n"
},
"when": "editorTextFocus && editorLangId == markdown",
}
Currently, the package has only been tested with Anthropic Claude Sonnet 3.5. But using the Python SDK of the llm
package means that in principle this supports any models Simon Willison's package does. This includes OpenAI models and local open-source models. However, vision model use only supports Anthropic models.
There are some usage examples in the examples folder. You'll see that I used this package quite heavily in writing the package.
You can add relative file links:
# %User
[](./path/to/script.py)
Write some tests for this function.
The full file contents are included in the prompt. Inlining links can be switched off by setting ignore_links: true
in the yaml header. Paths are relative to the file path of the markdown file.
Note
Inlining files can be a useful way to handle prompts containing 'unsafe' patterns that would otherwise conflict with the package's text parsing, such as markdown file links or image links, because text included in this way is not subjected to any more parsing.
The tool supports vision models - currently only Anthropic. To add an image to the chat, add a markdown link to the relative path of the image:
# %User
![image1](data/cats.png)
How many cats are in the image? Are there any black cats?
and then run the command. So far, I've only tested this with png files but it should also work with JPEG.
Image markdown supports the same options, but aliases are not supported because the package calls the Anthropic API directly rather than using llm
.
Inlining images can be switched off by setting ignore_images: true
in the yaml header.
There are three configuration options, in order of decreasing priority:
- File YAML header
- PROJECT llm_config.yaml in the current working directory
- USER config.yaml in a user config folder (system-dependent location)
The default config settings are defined in init.py. Setting an element to 'null' in a yaml config (equivalent to None
in code) will unset a setting from a lower-priority config.
The model, system message, ignore_links
, ignore_images
and model options can be set in a file yaml header. The header is re-parsed each time the command is called, so the settings can be changed mid-conversation. For example, you can set ignore_images: true
to save tokens when an image is no longer needed.
When using text-only (not vision) models, you can specify the model using aliases set with llm aliases set
from the llm package.
---
model: sonnet
system: Only speak in French.
options:
max_tokens: 1024
temperature: 0.0
---
# %User
What is the capital of France?
Model options depend on the model type. For both OpenAI and Anthropic they include temperature
and max_tokens
. The options exposed through the llm package aren't very well documented so it's easiest to just look at the source code:
Other configuration options include USER and PROJECT config yaml files. The most important setting here which is never set in the yaml header is editor_cmd
which determines how the file is initially opened. The USER yaml file is located in the OS's preferred location for user configs. You can print the location with llmd-config-path
. The PROJECT config yaml is a file named 'llmd_config.yaml' located in the current working directory.
The repo contains an example PROJECT config: 'llmd_config.yaml'. The easiest way to create a new USER config is cp llmd_config.yaml "$(llmd-config-path)"
. This can be edited with code "$(llmd-config-path)"
.
The package has a templated system message capability. System messages defined in the YAML header, or in the config files, can make use of reusable snippets which are also stored in configs. Any entry in a config YAML with a prefix of sys_
is a system message snippet and is available to be used in templated system messages.
For example, I have added a list of my python preferences to the DEFAULT_CONFIG dictionary (defined in 'init.py') as sys_python_prefs
. This can then be used in the YAML header by adding a templated system message (see templated_system_msg.md).
-
Initialise and open the file automatically -
Handle images -
Implement a config system -
Make a system to use reusable templated system message snippets. -
Parse hyperlinks to other text files and inline the text, so that I don't have to copy and paste it all into the prompt - Make it easier to install without having to use Simon W's API interface
- Implement click command line interface with flags
- Feature: when parsing links to files, allow the user to pass the code type inside the square brackets:
- E.g.
[lang:python](path/to/script.py)
- is parsed as:
{'type': 'link', 'link': 'path/to/script.py', 'language': 'python'}
- Which then adds a fenced code block when inlining the text, which may or may not make any difference to the LLM in reading the code.
- If there's nothing in the square brackets this doesn't happen
- E.g.
- Parse links to websites, strip tags and inline the text
- Thanks to Simon Willison for creating the package this package depends on
- Thanks to the creators of the LLM APIs, especially Anthropic, for making this project possible
- Thanks to the creators of Obsidian for making me such a markdown addict
- Inspired by the growing trend of AI-assisted writing and coding
To test that the package works without the llm
key management utilities, run mv "$(llm keys path)" "$(llm keys path).bak"
and then try to run the package. Remember to change it back again after.