Commit Graph

2351 Commits

Author SHA1 Message Date
yangdx
3cca18c59c Refactor pipeline status updates and entity extraction.
- Let all parrallel jobs using one pipe_status objects
- Improved thread safety with pipeline_status_lock
- Only pipeline jobs can add message to pipe_status
- Marked insert_custom_chunks as deprecated
2025-03-10 16:48:59 +08:00
yangdx
57a41eedb8 Fix linting 2025-03-10 15:41:46 +08:00
yangdx
46610682ce Fix data persistence issue in single-process mode
In single-process mode, data updates and persistence were not working properly because the update flags were not being correctly handled between different objects.
2025-03-10 15:41:00 +08:00
yangdx
adca27fae9 Merge branch 'main' into neo4j-add-min-degree 2025-03-10 02:13:49 +08:00
yangdx
4065a7df92 Fix linting 2025-03-10 02:07:19 +08:00
yangdx
14e1b31d1c Improved logging clarity in storage operations 2025-03-10 02:05:55 +08:00
yangdx
6b0acce644 Avoid redundant llm cache updates 2025-03-10 01:45:58 +08:00
yangdx
d2708b966d Added update flag to avoid persistence if no data is changed for KV storage 2025-03-10 01:17:25 +08:00
yangdx
4977c718f1 Improve KV storage initialize logic 2025-03-10 00:12:35 +08:00
yangdx
c938989920 Fix llm cache save problem in json_kv storage 2025-03-09 23:33:03 +08:00
yangdx
bc42afe7b6 Unify llm_response_cache and hashing_kv, prevent creating an independent hashing_kv. 2025-03-09 22:15:26 +08:00
yangdx
e47883d872 Add atomic data initialization lock to prevent race conditions 2025-03-09 17:33:15 +08:00
yangdx
020a6b5ae0 Refactor LLM cache config to use argparse and add status display 2025-03-09 16:45:57 +08:00
yangdx
c854aabde0 Add process ID to log messages for better multi-process debugging clarity
- Add PID to KV and Neo4j storage  logs
- Add PID to query context logs
- Improve KV data count logging for llm cache
2025-03-09 15:25:10 +08:00
yangdx
90527875fd Fix async issues in namespace init 2025-03-09 15:22:06 +08:00
zrguo
32a7d40650 Update lightrag_openai_neo4j_milvus_redis_demo.py 2025-03-09 02:11:31 +08:00
zrguo
fe34c50b43 Merge pull request #1030 from HKUDS/dev
Fixed and improved Iteration extraction
2025-03-09 01:29:49 +08:00
zrguo
c936aaf5c8 fix linting 2025-03-09 01:29:21 +08:00
zrguo
595d8bf372 Update prompt.py 2025-03-09 01:25:15 +08:00
zrguo
548f9a8234 Update prompts 2025-03-09 01:21:39 +08:00
yangdx
6a969e8de4 Disable logging for graph database lock acquisition and release 2025-03-09 01:14:24 +08:00
yangdx
c5d0962872 Fix linting 2025-03-09 01:00:42 +08:00
yangdx
3cf4268e7a Change logging level from INFO to DEBUG for cache hit/miss messages 2025-03-09 00:59:40 +08:00
zrguo
da59cc89d8 fix linting 2025-03-09 00:51:14 +08:00
zrguo
376b63b704 Merge pull request #1027 from dixyes/fix-qdrant-payload-id
Fix qdrant payload id
2025-03-09 00:28:17 +08:00
zrguo
3ae261dcd3 Merge pull request #1025 from danielaskdd/add-env-llm-cache-for-extract
Add LLM cache control env for API Server
2025-03-09 00:26:55 +08:00
yangdx
18c0770409 fix: duplicate nodes for same entity(label) problem in Neo4j
- Add entity_id field as key in Neo4j nodes
- Use  entity_id for nodes retrival and upsert
2025-03-09 00:24:55 +08:00
zrguo
38ebb16d63 Merge pull request #1021 from da-luggas/main
Fixed entites_section_list comma error
2025-03-09 00:19:41 +08:00
zrguo
906619fa93 Merge pull request #1024 from ShanGor/main
fix the postgres get all labels and get knowledge graph
2025-03-09 00:17:11 +08:00
yangdx
73452e63fa Add async lock for atomic graph database operations
• Introduced graph_db_lock mechanism
• Ensured atomic node/edge merge and insert operation
2025-03-08 22:48:12 +08:00
yangdx
95c06f1bde Add graph DB lock to shared storage system
• Introduced new graph_db_lock
• Added detailed lock debugging output
2025-03-08 22:36:41 +08:00
dixyes
458eafd714 Fix qdrant payload id
Qdrant now is using PointStruct.payload["id"], not PointStruct.id UUID.
This will fix id overwrite
2025-03-08 16:40:40 +08:00
Samuel Chan
b7f67eda21 fix the postgres get all labels and get knowledge graph 2025-03-08 11:45:59 +08:00
yangdx
fb4a4c736e Add duplicate edge upsert checking and logging 2025-03-08 11:36:24 +08:00
yangdx
22a93fb717 Limit neighbor nodes fetch to 1000 in Neo4JStorage. 2025-03-08 11:29:08 +08:00
yangdx
887f6ed81a Fix return empty list when no edges is found 2025-03-08 11:20:22 +08:00
yangdx
af26d65698 Convert _ensure_label method from async to sync 2025-03-08 10:23:27 +08:00
yangdx
78f8d7a1ce Convert node and edge IDs to f-strings for consistency.
- Use f-strings for node IDs
- Use f-strings for edge IDs
- Ensure consistent ID formatting
2025-03-08 10:20:10 +08:00
yangdx
84222b8b76 Refactor Neo4JStorage methods for robustness and clarity.
- Add error handling and resource cleanup
- Improve method documentation
- Optimize result consumption
2025-03-08 10:19:20 +08:00
yangdx
fcb04e47e5 Refactor Neo4J APOC fall back retrival implementaion 2025-03-08 04:28:54 +08:00
Yannick Stephan
27ab894d00 Merge pull request #1018 from HKUDS/dev
Fix edit entity and relation bugs
2025-03-07 21:26:43 +01:00
yangdx
c07b592e1b Add missing await consume 2025-03-08 02:39:51 +08:00
yangdx
af803f4e7a Refactor Neo4J graph query with min_degree an inclusive match support 2025-03-08 01:20:36 +08:00
Lukas Selch
bad3781f51 Fixed entites_section_list comma error 2025-03-07 12:04:10 +01:00
yangdx
0ee2e7fd48 Suppress Neo4j warning logs by setting logger level. 2025-03-07 16:56:48 +08:00
yangdx
6e3b23069c - Remove useless _label_exists method 2025-03-07 16:43:18 +08:00
yangdx
17ccab8db2 Updated README with LLM cache configuration details 2025-03-07 15:22:20 +08:00
yangdx
c2f861fba9 Settign LLM cache option for entity extraction from env 2025-03-07 15:03:33 +08:00
zrguo
e822f35c89 Fix edit entity and relation bugs 2025-03-07 14:39:06 +08:00
zrguo
c76f43c8e8 Merge pull request #1015 from HKUDS/main
Update dev
2025-03-07 12:49:30 +08:00