feat: move query-related settings to env file for better configuration

• Add env vars for token and chunk settings • Add token count logging for prompts • Add token count logging for context • Move hardcoded values to env variables • Improve logging clarity and consistency
2025-02-16 19:26:57 +08:00
parent 8fdbcb0d3f
commit 601df31edf
4 changed files with 69 additions and 17 deletions
--- a/.env.example
+++ b/.env.example
@@ -27,14 +27,21 @@ TIMEOUT=300

 ### RAG Configuration
 MAX_ASYNC=4
-MAX_TOKENS=32768
 EMBEDDING_DIM=1024
 MAX_EMBED_TOKENS=8192
-#HISTORY_TURNS=3
-#CHUNK_SIZE=1200
-#CHUNK_OVERLAP_SIZE=100
-#COSINE_THRESHOLD=0.2
-#TOP_K=60
+### Settings relative to query
+HISTORY_TURNS=3
+COSINE_THRESHOLD=0.2
+TOP_K=60
+MAX_TOKEN_TEXT_CHUNK = 4000
+MAX_TOKEN_RELATION_DESC = 4000
+MAX_TOKEN_ENTITY_DESC = 4000
+### Settings relative to indexing
+CHUNK_SIZE=1200
+CHUNK_OVERLAP_SIZE=100
+MAX_TOKENS=32768
+MAX_TOKEN_SUMMARY=500
+SUMMARY_LANGUAGE=English

 ### LLM Configuration (Use valid host. For local services, you can use host.docker.internal)
 ### Ollama example