tomás.erdmannsdörffer ← back to portfolio
portfolio / projects / rag-scientific-ml

Ask my research corpus.

A focused RAG system over a curated corpus of papers on physics-informed neural networks, neural operators, model compression, and LLM inference — the literature I actually use day-to-day. Retrieval runs entirely client-side using BM25 over precomputed indices; an LLM call (your choice of provider) writes the answer. Open the behind-the-scenes panel to see the retrieved chunks, scores, and exact prompt.

RAG BM25 retrieval Browser-side index 24 chunks · 16 papers OpenAI-compatible LLM Zero backend
LLM:Not configured · Index:loading…

Ask anything about the corpus.

Retrieval is precomputed and instant. The LLM fills in the answer using only retrieved context — try one of these:

Retrieval[01]
AlgorithmBM25 + tag boost
Corpus size
Top-k retrieved4
Last query (ms)
Retrieved chunks[02]
Ask a question to see retrieved chunks with similarity scores.
Prompt sent to LLM[03]

What's actually happening here?

Most "chat with your PDFs" demos hide the retrieval step behind a black box. This one shows everything: the chunks pulled, their similarity scores, and the exact prompt sent to the LLM. That transparency is the whole point — if retrieval fails, you can see why.

Architecture

Honest trade-offs

Why not "chat with all the PDFs"?

A focused corpus is the whole point. The system is good at scientific ML because the chunks are tagged, sectioned, and curated. Throw 10,000 random PDFs at the same architecture and recall collapses. Choosing a scope is a real engineering decision — and a more interesting portfolio piece than another generic Q&A bot.

The corpus

Covers Raissi et al. (PINNs), Karniadakis et al. (PINN review), Sahli Costabal (cardiac PINN), Li et al. (FNO), Lu et al. (DeepONet), Chen et al. (Neural ODE), Hinton (distillation), Han (deep compression), Frankle & Carbin (lottery ticket), Jacob (QAT), Dettmers (LLM.int8), Frantar (GPTQ), Dao (FlashAttention), Kwon (vLLM), Buoso (WarpPINN), Holzapfel & Ogden, Guccione, plus de Avila Belbute-Peres (differentiable physics), JAX, Adam, and Transformer.