rag-solution

Author	SHA1	Message	Date
idchlife	93d538ecc6	Properly checking the source of the file for metadata, using instanceof	2026-02-11 16:23:27 +03:00
idchlife	f5659675ec	Main feat: adaptation for async enrichment; added file_type, which holds types such as "таблица" (table), "презентация" (presentation) and so on; file source metadata is now taken from either the local source or Yandex Disk.	2026-02-11 15:46:54 +03:00
idchlife	7b52887558	Enrichment now processed via chunks. 2 documents -> into the vector storage. Also guessing source from the file extension	2026-02-11 11:23:50 +03:00
idchlife	1e6ab247b9	Phase 12 done... loading via adaptive collection, yadisk or local	2026-02-10 22:19:27 +03:00
idchlife	e9dd28ad55	Prep for Phase 12 of loading files for enrichment through the adaptive collections	2026-02-10 21:42:59 +03:00
idchlife	06a3155b6b	Working Yandex Disk integration for loading files. Tests for local and Yandex	2026-02-10 20:42:07 +03:00
idchlife	63c3e2c5c7	Adaptive Collection, and Phase 11 WIP	2026-02-10 20:12:43 +03:00
idchlife	447ecaba39	enrichment with years, events	2026-02-10 13:20:19 +03:00
idchlife	ce62fd50ed	Created this MD file to store things we need to look out for	2026-02-09 21:33:03 +03:00
idchlife	2cb9b39bf2	removed test retrieval feature. off you go	2026-02-09 21:17:42 +03:00
idchlife	f9c47c772f	llamaindex update + unpacking archives in data	2026-02-09 19:00:23 +03:00
idchlife	0adbc29692	env step for llamaindex	2026-02-05 22:48:39 +03:00
idchlife	effbc7d00f	proper usage of embedding models if defined in .env	2026-02-05 01:07:25 +03:00
idchlife	31d198afb8	properly loading .env file with dotenv	2026-02-05 00:08:59 +03:00
idchlife	833aad317a	quick fix to use openai instead of ollama, in vector_storage.py	2026-02-05 00:04:10 +03:00
idchlife	f87f3c0cdd	moved demo.html into demo-ui folder and renamed to index.html for ease of server serving... lol	2026-02-04 23:36:23 +03:00
idchlife	a6320985dd	resolved conflicts in requirements.txt	2026-02-04 23:34:37 +03:00
idchlife	69e7ecee62	Updated requirements.txt file	2026-02-04 23:13:27 +03:00
idchlife	8c57921b7f	Working demo.html with connection to the api endpoint	2026-02-04 23:13:00 +03:00
idchlife	9188b672c2	preparations for demo html page	2026-02-04 22:50:24 +03:00
idchlife	bf3a3735cb	openai compatible integration done	2026-02-04 22:30:57 +03:00
idchlife	ae8c00316e	Langchain plan phases for openai integration (openai compatible endpoint), server for retrieving data	2026-02-04 21:34:22 +03:00
idchlife	ea4ce23cd9	Retrieval and also update on Russian language	2026-02-04 16:51:50 +03:00
idchlife	3dea3605ad	Enrichment for llamaindex. It goes for a long time using local model, so better use external model not local, for EMBEDDING	2026-02-04 16:06:01 +03:00
idchlife	f36108d652	Vector storage Qdrant initialization and configuration	2026-02-04 01:10:07 +03:00
idchlife	c37aec1d99	File extensions and libraries for llamaindex	2026-02-04 01:02:21 +03:00
idchlife	fa26d77520	Cli with ping for llamaindex	2026-02-04 00:59:01 +03:00
idchlife	86fd643e66	Start of work on Llamaindex framework	2026-02-04 00:49:45 +03:00
idchlife	d354d3dcca	Working chat with AI agent with retrieving data	2026-02-04 00:02:53 +03:00
idchlife	299ee0acb5	Working retrieval with the cli	2026-02-03 23:25:24 +03:00
idchlife	4cbd5313d2	Working enrichment	2026-02-03 22:55:12 +03:00
idchlife	8d7e39a603	langchain loading documents into vector storage	2026-02-03 20:52:08 +03:00
idchlife	762ed89843	langchain vector storage connection and configuration	2026-02-03 20:42:09 +03:00
idchlife	cd7c96e022	langchain extensions for data files and their libraries	2026-02-03 20:17:13 +03:00
idchlife	d99433d087	langchain done cli	2026-02-03 19:51:35 +03:00
idchlife	351fe27cca	Initial commit	2026-02-03 19:24:41 +03:00

36 Commits