Note

Click here to download the full example code

REST API

Ragna was designed to help you quickly build custom RAG powered web applications. For this you can leverage the built-in REST API.

This tutorial walks you through basic steps of using Ragna's REST API.

Step 1: Start the REST API

Ragnas REST API is normally started from a terminal with

$ ragna deploy

For this tutorial we use our helper that does the equivalent just from Python.

Note

By default, the REST API is started from the ragna.toml configuration file in the current working directory. If you don't have a configuration file yet, you can run

$ ragna init

to start an interactive wizard that helps you create one. The config that we'll be using for this tutorial is equivalent of picking the first option the wizard offers you, i.e. using only demo components.

import ragna._docs as ragna_docs

from ragna.deploy import Config

config = Config()

ragna_deploy = ragna_docs.RagnaDeploy(config=config)

Let's make sure the REST API is started correctly and can be reached.

import httpx

client = httpx.Client(base_url=f"http://{config.hostname}:{config.port}")
client.get("/health").raise_for_status()

Out:

<Response [200 OK]>

Step 2: Authentication

In order to use Ragna's REST API, we need to authenticate first. This is handled by the ragna.deploy.Auth class, which can be overridden through the config. By default, ragna.deploy.NoAuth is used. By hitting the /login endpoint, we get a session cookie, which is later used to authorize our requests.

client.get("/login", follow_redirects=True)
dict(client.cookies)

Out:

{'ragna': '3d075b39-5e78-4427-a40e-0ae1b2efefe3'}

Note

In a regular deployment, you'll have login through your browser and create an API key in your profile page. The API key is used as bearer token and can be set with

httpx.Client(..., headers={"Authorization": f"Bearer {RAGNA_API_KEY}"})

Step 3: Uploading documents

Before we start with the upload process, let's first have a look what kind of documents are supported.

import json

response = client.get("/api/components").raise_for_status()
print(json.dumps(response.json(), indent=2))

Out:

{
  "documents": [
    ".docx",
    ".md",
    ".pdf",
    ".pptx",
    ".txt"
  ],
  "source_storages": [
    {
      "properties": {},
      "title": "Ragna/DemoSourceStorage",
      "type": "object"
    }
  ],
  "assistants": [
    {
      "properties": {},
      "title": "Ragna/DemoAssistant",
      "type": "object",
      "avatar": "imgs/ragna_logo.svg"
    }
  ]
}

For simplicity, let's use a demo document with some information about Ragna

from pathlib import Path

print(ragna_docs.SAMPLE_CONTENT)

document_path = Path.cwd() / "ragna.txt"

with open(document_path, "w") as file:
    file.write(ragna_docs.SAMPLE_CONTENT)

Out:

Ragna is an open source project built by Quansight. It is designed to allow
organizations to explore the power of Retrieval-augmented generation (RAG) based
AI tools. Ragna provides an intuitive API for quick experimentation and built-in
tools for creating production-ready applications allowing you to quickly leverage
Large Language Models (LLMs) for your work.

The Ragna website is https://ragna.chat/. The source code is available at
https://github.com/Quansight/ragna under the BSD 3-Clause license.

The upload process in Ragna consists of two parts:

Register the document in Ragna's database. This returns the document ID, which is needed for the upload.

response = client.post(
    "/api/documents", json=[{"name": document_path.name}]
).raise_for_status()
documents = response.json()
print(json.dumps(documents, indent=2))

Out:

[
  {
    "id": "3bd41e1f-b0d0-4dad-bcbf-b3956da225cb",
    "name": "ragna.txt",
    "metadata": {
      "path": "/tmp/tmpraaxwu5m/documents/3bd41e1f-b0d0-4dad-bcbf-b3956da225cb"
    },
    "mime_type": "text/plain"
  }
]

Perform the upload through a multipart request with the following parameters:

The field is documents for all entries
The field name is the ID of the document returned by step 1.
The field value is the binary content of the document.

with open(document_path, "rb") as f:
    print(
        client.put(
            "/api/documents",
            files=[("documents", (documents[0]["id"], f))],
        )
    )

Out:

<Response [200 OK]>

Step 4: Select a source storage and assistant

The configuration we are using only supports demo components for the source storage and assistant and so we pick them here.

from ragna import source_storages, assistants

source_storage = source_storages.RagnaDemoSourceStorage.display_name()
assistant = assistants.RagnaDemoAssistant.display_name()

print(f"{source_storage=}, {assistant=}")

Out:

source_storage='Ragna/DemoSourceStorage', assistant='Ragna/DemoAssistant'

Step 5: Start chatting

Now that we have uploaded a document, and selected a source storage and assistant to be used, we can create a new chat.

response = client.post(
    "/api/chats",
    json={
        "name": "Tutorial REST API",
        "input": [document["id"] for document in documents],
        "source_storage": source_storage,
        "assistant": assistant,
    },
).raise_for_status()
chat = response.json()
print(json.dumps(chat, indent=2))

Out:

{
  "id": "4dfd32d6-be44-4845-ae95-0c4bdb4ed119",
  "name": "Tutorial REST API",
  "metadata_filter": null,
  "documents": [
    {
      "id": "3bd41e1f-b0d0-4dad-bcbf-b3956da225cb",
      "name": "ragna.txt",
      "metadata": {
        "path": "/tmp/tmpraaxwu5m/documents/3bd41e1f-b0d0-4dad-bcbf-b3956da225cb"
      },
      "mime_type": "text/plain"
    }
  ],
  "source_storage": "Ragna/DemoSourceStorage",
  "assistant": "Ragna/DemoAssistant",
  "corpus_name": "default",
  "params": {},
  "messages": [],
  "prepared": false,
  "created_at": "2025-06-20T13:52:22.195016Z"
}

As can be seen by the "prepared": false value in the chat JSON object we still need to prepare it.

client.post(f"/api/chats/{chat['id']}/prepare").raise_for_status()

Out:

<Response [200 OK]>

Finally, we can get answers to our questions.

response = client.post(
    f"/api/chats/{chat['id']}/answer",
    json={"prompt": "What is Ragna?"},
).raise_for_status()
answer = response.json()
print(json.dumps(answer, indent=2))

Out:

{
  "id": "679ad99e-e4ff-412c-877e-dc3994c0abe8",
  "content": "I'm a demo assistant and can be used to try Ragna's workflow.\nI will only mirror back my inputs. \n\nSo far I have received 1 messages.\n\nYour last prompt was:\n\n> What is Ragna?\n\nThese are the sources I was given:\n\n- ragna.txt: Ragna is an open source project built by Quansight. It is designed to allow organizations to [...]",
  "role": "assistant",
  "sources": [
    {
      "id": "7be76e97-0c24-4576-bac6-fd1ab24aee3e",
      "document_id": "3bd41e1f-b0d0-4dad-bcbf-b3956da225cb",
      "document_name": "ragna.txt",
      "location": "",
      "content": "Ragna is an open source project built by Quansight. It is designed to allow organizations to [...]",
      "num_tokens": 17
    }
  ],
  "timestamp": "2025-06-20T13:52:22.320856Z"
}

print(answer["content"])

Out:

I'm a demo assistant and can be used to try Ragna's workflow.
I will only mirror back my inputs. 

So far I have received 1 messages.

Your last prompt was:

> What is Ragna?

These are the sources I was given:

- ragna.txt: Ragna is an open source project built by Quansight. It is designed to allow organizations to [...]

Before we close the tutorial, let's terminate the REST API and have a look at what would have printed in the terminal if we had started it with the ragna deploy command.

ragna_deploy.terminate()

Out:

INFO:     127.0.0.1:43766 - "GET /health HTTP/1.1" 200 OK
INFO:     127.0.0.1:43772 - "GET /health HTTP/1.1" 200 OK
INFO:     127.0.0.1:43772 - "GET /login HTTP/1.1" 303 See Other
INFO:     127.0.0.1:43772 - "GET /oauth-callback HTTP/1.1" 303 See Other
INFO:     127.0.0.1:43772 - "GET / HTTP/1.1" 303 See Other
INFO:     127.0.0.1:43772 - "GET /docs HTTP/1.1" 200 OK
INFO:     127.0.0.1:43772 - "GET /api/components HTTP/1.1" 200 OK
INFO:     127.0.0.1:43772 - "POST /api/documents HTTP/1.1" 200 OK
INFO:     127.0.0.1:43772 - "PUT /api/documents HTTP/1.1" 200 OK
INFO:     127.0.0.1:43772 - "POST /api/chats HTTP/1.1" 200 OK
INFO:     127.0.0.1:43772 - "POST /api/chats/4dfd32d6-be44-4845-ae95-0c4bdb4ed119/prepare HTTP/1.1" 200 OK
INFO:     127.0.0.1:43772 - "POST /api/chats/4dfd32d6-be44-4845-ae95-0c4bdb4ed119/answer HTTP/1.1" 200 OK

Total running time of the script: ( 0 minutes 2.581 seconds)

Download Python source code: gallery_rest_api.py

Download Jupyter notebook: gallery_rest_api.ipynb

Gallery generated by mkdocs-gallery