🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL
-
Updated
May 12, 2026 - Python
🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL
PyLate efficient inference engine
Official repository of S. Martinico, F. M. Nardini, C. Rulli, and R. Venturini. "Efficient Multivector Retrieval with Token-Aware Clustering and Hierarchical Indexing" Short Paper @ ACM SIGIR 2026.
High-performance late-interaction retrieval engine for on-prem AI. ColBERT/ColPali multi-vector search with Rust fused MaxSim, Triton GPU kernels, ROQ quantization, LEMUR routing, WAL-backed CRUD, and a FastAPI server — single machine, CPU or GPU.
Python library for MUVERA multi-vector retrieval via Fixed Dimensional Encodings. ColBERT / ColQwen2 / ColQwen3.5 compatible.
ColFastVLM: Towards low-latency indexing in visual document retrieval
Repo for portfolio, containing working redirects to all projects.
🌐 Build and share your personal website with ease using mjsushanth.github.io, a simple and effective static site generator.
Add a description, image, and links to the late-interaction topic page so that developers can more easily learn about it.
To associate your repository with the late-interaction topic, visit your repo's landing page and select "manage topics."