Commit Graph

  • 93d538ecc6 Checking properly source of the file for metadata, with instanceof main idchlife 2026-02-11 16:23:27 +03:00
  • f5659675ec - main feat: adaptation for async enrichment - added file_type, this will hold the "таблица", "презентация" and so on types - file source metadata is now taken either from local source or yandex disk. idchlife 2026-02-11 15:46:54 +03:00
  • 7b52887558 Enrichment now processed via chunks. 2 documents -> into the vector storage. Also geussing source from the file extension idchlife 2026-02-11 11:23:50 +03:00
  • 1e6ab247b9 Phase 12 done... loading via adaptive collection, yadisk or local idchlife 2026-02-10 22:19:27 +03:00
  • e9dd28ad55 Prep for Phase 12 of loading files for enrichment through the adaptive collections idchlife 2026-02-10 21:42:59 +03:00
  • 06a3155b6b Working Yandex Disk integration for loading files. Tests for local and Yandex idchlife 2026-02-10 20:42:07 +03:00
  • 63c3e2c5c7 Adaptive Collection, and Phase 11 WIP idchlife 2026-02-10 20:12:43 +03:00
  • 447ecaba39 enrichment with years, events idchlife 2026-02-10 13:20:19 +03:00
  • ce62fd50ed Created this MD file to store things we need to look out to idchlife 2026-02-09 21:33:03 +03:00
  • 2cb9b39bf2 removed test retrieval feature. off you go idchlife 2026-02-09 21:17:42 +03:00
  • f9c47c772f llamaindex update + unpacking archives in data idchlife 2026-02-09 19:00:23 +03:00
  • 0adbc29692 env step for llamaindex idchlife 2026-02-05 22:48:39 +03:00
  • effbc7d00f proper usage of embedding models if defined in .env idchlife 2026-02-05 01:07:25 +03:00
  • 31d198afb8 properly loading .env file with dotenv idchlife 2026-02-05 00:08:59 +03:00
  • 833aad317a quick fix to use openai instead of ollama, in vetor_storage.py idchlife 2026-02-05 00:04:10 +03:00
  • f87f3c0cdd moved demo.html into demo-ui folder and renamed to index.html for ease of server serving... lol idchlife 2026-02-04 23:36:23 +03:00
  • a6320985dd resolved conflicts in requirements.txt idchlife 2026-02-04 23:34:37 +03:00
  • 69e7ecee62 Updated requirements.txt file idchlife 2026-02-04 23:13:27 +03:00
  • 8c57921b7f Working demo.html with connection to the api endpoint idchlife 2026-02-04 23:13:00 +03:00
  • 9188b672c2 preparations for demo html page idchlife 2026-02-04 22:50:24 +03:00
  • bf3a3735cb openai compatible integration done idchlife 2026-02-04 22:30:57 +03:00
  • ae8c00316e Langchain plan phases for openai integration (openai compaible endpoint), server for retrieving data idchlife 2026-02-04 21:34:22 +03:00
  • ea4ce23cd9 Retrieval and also update on russian language idchlife 2026-02-04 16:51:50 +03:00
  • 3dea3605ad Enrichment for llamaindex. It goes for a long time using local model, so better use external model not local, for EMBEDDING idchlife 2026-02-04 16:06:01 +03:00
  • f36108d652 Vector storage Qdrant initialization and configuration idchlife 2026-02-04 01:10:07 +03:00
  • c37aec1d99 File extensions and libraries for llamaindex idchlife 2026-02-04 01:02:21 +03:00
  • fa26d77520 Cli with ping for llamaindex idchlife 2026-02-04 00:59:01 +03:00
  • 86fd643e66 Start of work on Llamaindex framework idchlife 2026-02-04 00:49:45 +03:00
  • d354d3dcca Working chat with AI agent with retrieving data idchlife 2026-02-04 00:02:53 +03:00
  • 299ee0acb5 Working retrieval with the cli idchlife 2026-02-03 23:25:24 +03:00
  • 4cbd5313d2 Working enrichment idchlife 2026-02-03 22:55:12 +03:00
  • 8d7e39a603 langchain loading documents into vector storage idchlife 2026-02-03 20:52:08 +03:00
  • 762ed89843 langchain vector storage connection and confguration idchlife 2026-02-03 20:42:09 +03:00
  • cd7c96e022 langchain extensions for data files and their libraries idchlife 2026-02-03 20:17:13 +03:00
  • d99433d087 langchain done cli idchlife 2026-02-03 19:51:35 +03:00
  • 351fe27cca Initial commit idchlife 2026-02-03 19:24:41 +03:00