Nvidiachat

🚀 RAG on Windows using TensorRT-LLM, NVIDIA NIM and LlamaIndex 🦙

ChatRTX is a demo app that lets you personalize a GPT large language model (LLM) connected to your own content: docs, notes, photos. Using retrieval-augmented generation (RAG), TensorRT-LLM, NVIDIA NIM microservices and RTX acceleration, you can query a custom chatbot and quickly get contextually relevant answers. The app also accepts voice queries. Because everything runs locally on your Windows RTX PC, you’ll get fast and secure results. ChatRTX supports various file formats, including text, PDF, DOC/DOCX, XML, PNG, JPG and BMP. Simply point the application at the folder containing your files and it’ll load them into the library in a matter of seconds.

https://github.com/NVIDIA/ChatRTX
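
To make the RAG idea above a bit more concrete, here is a toy sketch of the retrieval step in plain Python. It is not how ChatRTX is implemented: the real app relies on TensorRT-LLM and LlamaIndex with proper embeddings, while this sketch assumes simple word-count vectors and cosine similarity. The names `LIBRARY`, `retrieve` and `build_prompt` are hypothetical and exist only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query: str, docs: dict[str, str], k: int = 1) -> list[str]:
    """Return the names of the k documents most similar to the query."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(text))), name) for name, text in docs.items()]
    # Highest score first; ties are broken by file name for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for score, name in scored[:k] if score > 0]


def build_prompt(query: str, docs: dict[str, str], k: int = 1) -> str:
    """Prepend the retrieved text to the question, as a RAG pipeline would."""
    context = "\n".join(docs[name] for name in retrieve(query, docs, k))
    return f"Context:\n{context}\n\nQuestion: {query}"


# Hypothetical in-memory "library"; a real app would load files from a folder.
LIBRARY = {
    "notes.txt": "The quarterly budget meeting is on Friday at 10am.",
    "recipe.txt": "Mix flour, eggs and milk, then fry the pancakes.",
}

if __name__ == "__main__":
    # The resulting prompt is what would be handed to the local LLM.
    print(build_prompt("When is the budget meeting?", LIBRARY))
```

The point of the sketch is only the shape of the pipeline: find the most relevant local content first, then pass it to the model together with the question.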
