Symmetrical Retrieval-Augmented Generation: A Risk-Averse Framework for Safety-Critical Translation of Biomedical Subject-matter
20 Pages
Open Access
Journal:
Journal of Translation Studies
Volume 6
Issue 1
Publication Year 2026
pp. 107 - 126
Summary
Large Language Models (LLMs) demonstrate high fluency in machine translation, yet their probabilistic nature can lead to hallucinations and terminological drift in dealing with safety-critical biomedical texts. To resolve this “fluency trap,” this study proposes a symmetrical Retrieval-Augmented Generation (RAG) framework powered by DeepSeek-V3. Functioning as a risk-management instrument, the architecture enforces a dual-constraint mechanism: a Lexical Stream applies rigid terminology injection to ensure high factual precision, while a Syntactic Stream retrieves structural templates from the authoritative New England Journal of Medicine (NEJM) corpus to preserve the professional style. Empirical evaluations on clinical trial protocols reveal that our framework significantly outperforms zero-shot GPT-4, achieving a near-perfect 99.2% Terminology Accuracy Rate (TAR) and superior semantic alignment. Ultimately, this study demonstrates that domain-specific, theory-driven architectures are more effective than larger model size, successfully transforming biomedical AI translation from probabilistic guessing into accurate, data-driven translation.
Details
- Pages
- 20
- DOI
- 10.3726/JTS012026.6
- Open Access
- CC-BY
- Publication date
- 2026 (June)
- Keywords
- biomedical translation retrieval-augmented generation (RAG) large language models (LLMs) risk management dual-constraint architecture
- Product Safety
- Peter Lang Group AG