Hybrid retrieval. Citations, not hallucinations.

Postgres + pgvector for vector search, BM25 for lexical, fused with reciprocal-rank fusion. Citation-first answers — every claim links to its chunk. No separate vector DB, no five-system synchronisation problem.

PostgreSQLpgvectorTypeScriptOpenAI

Start this engagement →All LLM services →

Cycle

8 weeks · corpus to citation

Stack

Postgres · pgvector · TypeScript

Output

Ingestion + retrieval + citation surface

Discipline

Citations on every answer, no exceptions

[WHY THIS EXISTS]

Most RAG demos fall apart at scale.

A naive cosine-similarity search over a 1M-chunk corpus retrieves plausible-but-wrong context, the LLM hallucinates confidently on top, and the user discovers the failure mode by getting fired for citing it. Hybrid retrieval + citations make the failure mode visible.

Pgvector + BM25 hybrid — dense semantic + lexical exact-match
Reciprocal-rank fusion to merge scores defensibly
Citation surface — every claim in the answer links to its source chunk
Chunking pipeline tuned to your corpus, not someone's blog post

[HOW WE BUILD IT]

Retrieval first, generation second.

Corpus + chunking audit

We characterise your corpus before we pick a chunker. PDFs with tables, code with comments, transcripts with timestamps — each gets a different strategy.