Cli with ping for llamaindex
@@ -11,7 +11,7 @@ Chosen data folder: relative ./../../../data - from the current folder
# Phase 1 (cli entrypoint)
- [x] Create virtual env in the `venv` folder in the current directory.
- [ ] Create cli.py file, with the usage of `click` python library. Make default command "ping" which will write output "pong"
- [x] Create cli.py file, with the usage of `click` python library. Make default command "ping" which will write output "pong"
# Phase 2 (installation of base framework for RAG solution and preparation for data loading)
136
services/rag/llamaindex/QWEN.md
Normal file
@@ -0,0 +1,136 @@
# RAG Solution with LlamaIndex and Qdrant

## Project Overview

This is a Retrieval Augmented Generation (RAG) solution built with LlamaIndex as the primary framework and Qdrant as the vector store. The project loads documents from a shared data directory, stores them in a vector database, and enables semantic search and chat using local Ollama models.

### Key Technologies
- **RAG Framework**: LlamaIndex
- **Vector Storage**: Qdrant
- **Embedding Models**: Ollama (configurable via environment variables)
- **Chat Models**: Ollama (configurable via environment variables)
- **Data Directory**: `./../../../data` (relative to the project root)
- **Logging**: loguru with file rotation and stdout logging

### Architecture Components
- CLI entry point (`cli.py`)
- Document enrichment module (`enrichment.py`)
- Vector storage configuration (`vector_storage.py`)
- Retrieval module (`retrieval.py`)
- Chat agent (`agent.py`)

## Building and Running

### Prerequisites

1. Python virtual environment (already created in the `venv` folder)
2. Ollama running locally on the default port 11434
3. Qdrant running locally (REST API on port 6333, gRPC on port 6334)
4. Data files in the `./../../../data` directory
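
The service prerequisites above can be smoke-tested before running the CLI. This is an illustrative stdlib-only sketch; the helper name `check_port` is not part of the project:

```python
import socket

def check_port(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False

# Ports from the prerequisites: Ollama (11434), Qdrant REST (6333) and gRPC (6334)
for name, port in [("Ollama", 11434), ("Qdrant REST", 6333), ("Qdrant gRPC", 6334)]:
    status = "up" if check_port("127.0.0.1", port) else "down"
    print(f"{name} on port {port}: {status}")
```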

### Setup Process

1. Activate the virtual environment:

   ```bash
   source venv/bin/activate
   ```

2. Install the required packages based on the document extensions found in the data directory (see EXTENSIONS.md for details).

3. Configure environment variables in the `.env` file (copy it from `.env.dist`).

4. Run the CLI to verify the setup:

   ```bash
   python cli.py ping  # Should return "pong"
   ```

### Available Commands

- `ping`: Basic connectivity test
- `enrich`: Load and process documents from the data directory into vector storage
- `chat`: Start an interactive chat session with the RAG system

## Development Conventions

### Logging

- Use `loguru` for all logging
- Log to both a rotating file (`logs/dev.log`) and stdout
- Use appropriate log levels (DEBUG, INFO, WARNING, ERROR)

### Environment Variables
- `OLLAMA_EMBEDDING_MODEL`: Name of the Ollama model to use for embeddings
- `OLLAMA_CHAT_MODEL`: Name of the Ollama model to use for chat functionality
- API keys for external services (an OpenRouter option is available but commented out)
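
A minimal sketch of reading these variables; the fallback model names here are illustrative defaults, not values mandated by the project (in practice they come from the `.env` file):

```python
import os

# Illustrative defaults; the real values are configured in .env
OLLAMA_EMBEDDING_MODEL = os.getenv("OLLAMA_EMBEDDING_MODEL", "nomic-embed-text")
OLLAMA_CHAT_MODEL = os.getenv("OLLAMA_CHAT_MODEL", "llama3")

print(OLLAMA_EMBEDDING_MODEL, OLLAMA_CHAT_MODEL)
```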

### Document Processing

- Support multiple file formats based on EXTENSIONS.md
- Use text splitters appropriate for each document type
- Store metadata (filename, page, section, paragraph) with the embeddings
- Track processed documents to avoid re-processing (using SQLite if needed)
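
The splitting-plus-metadata idea can be illustrated with a plain-Python sketch. This is not the LlamaIndex splitter API, just the shape of the chunk records the bullets above describe; `split_paragraphs` is a hypothetical name:

```python
def split_paragraphs(text: str, filename: str):
    """Split text into paragraph chunks, attaching metadata to each chunk."""
    paras = [p.strip() for p in text.split("\n\n") if p.strip()]
    return [
        {"text": para, "metadata": {"filename": filename, "paragraph": i}}
        for i, para in enumerate(paras)
    ]

doc = "First paragraph.\n\nSecond paragraph."
for chunk in split_paragraphs(doc, "example.txt"):
    print(chunk["metadata"], chunk["text"])
```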

### Vector Storage

- Collection name: `documents_llamaindex`
- Initialize the collection automatically if it does not exist
- Support for Ollama embeddings by default
- Optional OpenAI embedding support via OpenRouter (commented out)

## Project Phases

### Phase 1: CLI Entry Point

- [x] Virtual environment setup
- [x] CLI creation with the `click` library
- [x] Basic "ping" command implementation

### Phase 2: Framework Installation

- [x] LlamaIndex installation
- [ ] Data folder analysis and EXTENSIONS.md creation
- [ ] Required loader libraries installation

### Phase 3: Vector Storage Setup

- [ ] Qdrant library installation
- [ ] Vector storage initialization module
- [ ] Embedding model configuration with Ollama
- [ ] Collection creation strategy

### Phase 4: Document Enrichment

- [ ] Document loading module with appropriate loaders
- [ ] Text splitting strategies implementation
- [ ] Document tracking mechanism
- [ ] CLI command for enrichment

### Phase 5: Retrieval Feature

- [ ] Retrieval module configuration
- [ ] Query processing with metadata retrieval
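
The query-with-metadata step can be illustrated independently of LlamaIndex with a toy cosine-similarity search; the vectors and metadata below are made up for the example:

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

# Toy index: (embedding, metadata) pairs standing in for Qdrant points
index = [
    ([1.0, 0.0], {"filename": "a.md", "paragraph": 0}),
    ([0.0, 1.0], {"filename": "b.md", "paragraph": 3}),
]

query = [0.9, 0.1]
best = max(index, key=lambda item: cosine(item[0], query))
print(best[1])  # metadata of the closest chunk
```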

### Phase 6: Chat Agent

- [ ] Agent module with Ollama integration
- [ ] Integration with the retrieval module
- [ ] CLI command for chat functionality

## File Structure

```
llamaindex/
├── venv/                  # Python virtual environment
├── cli.py                 # CLI entry point
├── vector_storage.py      # Vector storage configuration (to be created)
├── enrichment.py          # Document loading and processing (to be created)
├── retrieval.py           # Search and retrieval functionality (to be created)
├── agent.py               # Chat agent implementation (to be created)
├── EXTENSIONS.md          # Supported file extensions and loaders (to be created)
├── .env.dist              # Environment variable template
├── .env                   # Local environment variables (git-ignored)
├── logs/                  # Log files directory
│   └── dev.log            # Main log file with rotation
└── PLANNING.md            # Project planning document
```

## Data Directory

The system expects documents to be placed in `./../../../data`, relative to the project root. The system will analyze this directory to determine supported file types and appropriate loaders.
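
Determining which file types are present can be done with a short `pathlib` scan; this is a sketch, and the function name `scan_extensions` is illustrative:

```python
import tempfile
from collections import Counter
from pathlib import Path

def scan_extensions(data_dir: str) -> Counter:
    """Count file extensions under data_dir, recursively."""
    return Counter(
        p.suffix.lower()
        for p in Path(data_dir).rglob("*")
        if p.is_file() and p.suffix
    )

# Demonstrate against a throwaway directory
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "a.pdf").touch()
    (Path(tmp) / "b.PDF").touch()
    (Path(tmp) / "c.md").touch()
    print(scan_extensions(tmp))  # e.g. Counter({'.pdf': 2, '.md': 1})
```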

## Testing

- Unit tests for individual modules
- Integration tests for end-to-end functionality
- CLI command tests
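
CLI command tests can use `click.testing.CliRunner`. The sketch below is self-contained, defining a throwaway command that mirrors the project's `ping` rather than importing `cli.py`:

```python
import click
from click.testing import CliRunner

@click.group()
def main():
    """Throwaway CLI mirroring the project's ping command."""

@main.command()
def ping():
    click.echo("pong")

def test_ping():
    runner = CliRunner()
    result = runner.invoke(main, ["ping"])
    assert result.exit_code == 0
    assert result.output.strip() == "pong"

test_ping()
```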

## Troubleshooting

- Ensure Ollama is running on port 11434
- Verify Qdrant is accessible on ports 6333 (REST) and 6334 (gRPC)
- Check that the data directory contains supported file types
- Review logs in `logs/dev.log` for detailed error information
61
services/rag/llamaindex/cli.py
Normal file
@@ -0,0 +1,61 @@

#!/usr/bin/env python3
"""
CLI entry point for the RAG solution using LlamaIndex and Qdrant.
"""

import sys
from pathlib import Path

import click
from loguru import logger


def setup_logging():
    """Set up logging with loguru to both a rotating file and stdout."""
    # Create the logs directory if it doesn't exist
    logs_dir = Path("logs")
    logs_dir.mkdir(exist_ok=True)

    # Remove the default handler so it can be customized
    logger.remove()

    # File handler with rotation
    logger.add(
        "logs/dev.log",
        rotation="10 MB",
        retention="10 days",
        level="INFO",
        format="{time:YYYY-MM-DD HH:mm:ss} | {level} | {file}:{line} | {message}",
    )

    # Stdout handler
    logger.add(
        sys.stdout,
        level="INFO",
        format="{time:YYYY-MM-DD HH:mm:ss} | {level} | {message}",
        colorize=True,
    )


@click.group()
@click.version_option(version="1.0.0")
def main():
    """Main CLI entry point for the RAG solution."""
    setup_logging()
    logger.info("Starting RAG solution CLI")


@main.command(help="Basic connectivity test that returns 'pong'")
@click.option("--verbose", "-v", is_flag=True, help="Enable verbose output")
def ping(verbose):
    """Ping command that outputs 'pong'."""
    if verbose:
        logger.info("Executing ping command")
    click.echo("pong")
    logger.info("Ping command completed")


if __name__ == "__main__":
    main()