Backend Architecture

Prerequisites:

Python 3.10 or higher (3.11+ recommended)
Basic understanding of async/await patterns
SQLite3

Architecture Overview

CosmaSense uses a monorepo structure with three main packages:

Backend (packages/cosma-backend): Quart async web framework + Uvicorn
TUI (packages/cosma-tui): Textual framework for terminal UI
CLI (root): Click-based orchestrator

All packages share dependencies via workspace configuration.

Tech Stack

Backend Framework

Component	Technology
Web Framework	Quart (async Flask)
ASGI Server	Uvicorn
Validation	QuartSchema
Database	asqlite (async SQLite)
File Watching	watchdog

AI & Search

Component	Technology
Vector Search	sqlite-vec
Keyword Search	FTS5 (Full-Text Search)
LLM Backend	LiteLLM/Ollama
Embeddings	sentence-transformers (e5-base-v2)
File Parsing	MarkItDown (20+ formats)

Processing Pipeline

Files go through a 4-stage pipeline (see pipeline.py:56-174):

Discovery

Recursively scan directories and collect file metadata. Only files with modified timestamps different from the database are processed.

# Skip logic checks modified time
if file.modified_time == db_modified_time:
    skip_file()

Parsing

Extract text from 20+ file formats using MarkItDown. Calculate content hash to detect changes.Supported formats:

Documents: PDF, DOCX, TXT, MD
Images: PNG, JPG, GIF (with OCR)
Code: PY, JS, TS, JAVA, etc.
Spreadsheets: XLSX, CSV

# Hash check to detect content changes
if calculate_hash(content) == db_hash:
    skip_to_next_stage()

Summarization

AI generates:

Title
Summary (max 100 words)
3-5 relevant keywords

Uses LiteLLM or Ollama for local/cloud LLM support.

Embedding

Create 768-dimensional vectors using the e5-base-v2 model for semantic search.Embeddings stored in file_embeddings virtual table with triggers to keep in sync.

Hybrid Search System

CosmaSense combines two search methods (see searcher.py:91-220):

Semantic Search (Vector Similarity)

Embeds query using same e5-base-v2 model
Calculates cosine similarity against file embeddings
Score: exp(-distance) scaled to 0-0.5 range

Keyword Search (FTS5)

SQLite FTS5 searches content/title/keywords
Uses BM25 ranking algorithm
Score: relevance scaled to 0-0.5 range

Combined Scoring

final_score = semantic_score + keyword_score

Results sorted by combined score in descending order.

Database Schema

See schema.sql for full details:

-- Core file data
CREATE TABLE files (
  path TEXT PRIMARY KEY,
  content TEXT,
  title TEXT,
  summary TEXT,
  keywords TEXT,
  hash TEXT,
  status TEXT,
  modified_time INTEGER
);

-- Vector search (sqlite-vec)
CREATE VIRTUAL TABLE file_embeddings USING vec0(
  embedding FLOAT[768]
);

-- Keyword search (FTS5)
CREATE VIRTUAL TABLE files_fts USING fts5(
  content, title, keywords
);

Triggers keep all three tables synchronized automatically.

Async Programming in CosmaSense

CosmaSense uses Python’s async/await for non-blocking I/O operations.

Key Concepts

# Use 'async def' to create async functions
async def fetch_data():
    result = await database.query()
    return result

# Call with 'await'
data = await fetch_data()

Critical: Never use time.sleep() in async functions! It will freeze the entire app. Use await asyncio.sleep() instead.

Real-time Updates

Server-Sent Events (SSE)

The backend uses SSE to push updates to the TUI:

@app.route('/api/updates')
async def updates_stream():
    async for event in watch_file_changes():
        yield f"data: {event}\n\n"

Event types:

file_parsing: File is being parsed
file_summarizing: AI is generating summary
file_embedding: Creating vector embedding
file_complete: File successfully indexed
file_failed: Error during processing

File Watching

Uses watchdog library to monitor filesystem changes:

# Auto-reindex when files change
watcher.schedule(handler, path, recursive=True)

Development Setup

Clone Repository

git clone https://github.com/cosmasense/cosma.git
cd cosma

Install Dependencies

pip install -r requirements.txt

# For development
pip install -r requirements-dev.txt

Run Backend

# Start backend server (port 60534)
cosma serve

# Or with debug logging
DEBUG=1 cosma serve

Run TUI

# In another terminal
cosma search

Testing

# Run all tests
pytest

# Run with coverage
pytest --cov=cosma_backend

# Run specific test file
pytest tests/test_pipeline.py

Troubleshooting

RuntimeWarning: coroutine was never awaited

You forgot to use await when calling an async function:

# Wrong
result = async_function()

# Correct
result = await async_function()

App freezes during file processing

You’re running blocking code in an async function. Use asyncio.to_thread():

# Wrong
async def process():
    slow_operation()  # Blocks event loop

# Correct
async def process():
    await asyncio.to_thread(slow_operation)

Database locked errors

Multiple async operations trying to write simultaneously. Use proper connection pooling:

async with app.db.acquire() as conn:
    await conn.execute("INSERT ...")

Getting Started

Core Concepts

Backend Architecture

Architecture Overview

Tech Stack

Backend Framework

AI & Search

Processing Pipeline

Hybrid Search System

Semantic Search (Vector Similarity)

Keyword Search (FTS5)

Combined Scoring

Database Schema

Async Programming in CosmaSense

Key Concepts

Real-time Updates

Server-Sent Events (SSE)

File Watching

Development Setup

Testing

Troubleshooting

Further Reading

Getting Started

Core Concepts

​Architecture Overview

​Tech Stack

​Backend Framework

​AI & Search

​Processing Pipeline

​Hybrid Search System

​Semantic Search (Vector Similarity)

​Keyword Search (FTS5)

​Combined Scoring

​Database Schema

​Async Programming in CosmaSense

​Key Concepts

​Real-time Updates

​Server-Sent Events (SSE)

​File Watching

​Development Setup

​Testing

​Troubleshooting

​Further Reading

Architecture Overview

Tech Stack

Backend Framework

AI & Search

Processing Pipeline

Hybrid Search System

Semantic Search (Vector Similarity)

Keyword Search (FTS5)

Combined Scoring

Database Schema

Async Programming in CosmaSense

Key Concepts

Real-time Updates

Server-Sent Events (SSE)

File Watching

Development Setup

Testing

Troubleshooting

Further Reading