Commit Graph

528 Commits

Author SHA1 Message Date
hyb
e08905b398 feat: add Redis KV storage; add a demo test for openai+neo4j+milvus+redis; add RedisKVStorage to lightrag.py; add the aioredis dependency to requirements.txt 2025-01-22 16:42:13 +08:00
zrguo
b84aab5cd0 Merge pull request #590 from jin38324/main
Enhance Robustness of insert Method with Pipeline Processing and Caching Mechanisms
2025-01-16 14:20:08 +08:00
jin
6ae8647285 support pipeline mode 2025-01-16 12:58:15 +08:00
jin
d5ae6669ea support pipeline mode 2025-01-16 12:52:37 +08:00
jin
17a2ec2bc4 Merge branch 'HKUDS:main' into main 2025-01-16 09:59:27 +08:00
Samuel Chan
2b7d253117 Merge remote-tracking branch 'origin/main' 2025-01-15 12:09:05 +08:00
Samuel Chan
d91a330e9d Enrich README.md for postgres usage, make some changes to cater for python version<12 2025-01-15 12:02:55 +08:00
zrguo
61349b0fe7 Merge pull request #581 from jiabin2wang20230918/fix_mongo
Fix for mongoDB document 16MB limit and add os env NEO4J_MAX_CONNECTION_POOL_SIZE for neo4j
2025-01-13 18:38:22 +08:00
bingo
1984da0fd6 add logger.debug for mongo_impl get_by_mode_and_id() 2025-01-13 07:27:30 +00:00
bingo
f28b90b2b3 1. add os env NEO4J_MAX_CONNECTION_POOL_SIZE for neo4j;
2. fix https://github.com/HKUDS/LightRAG/issues/580 issue for mongoDB document 16MB limit.
2025-01-13 07:06:01 +00:00
Samuel Chan
c016934021 Revise the AGE implementation on get_node_edges, to align with Neo4j behavior. 2025-01-12 21:38:39 +08:00
Samuel Chan
d03d6f5fc5 Revised the postgres implementation to use attributes(node_id) rather than nodes to identify an entity, which significantly reduced the table count. 2025-01-11 09:30:19 +08:00
jin
85331e3fa2 update Oracle support
add cache support, fix bug
2025-01-10 11:36:28 +08:00
Samuel Chan
85d35b16d4 Merge branch 'HKUDS:main' into main 2025-01-06 12:53:06 +08:00
Samuel Chan
6ae27d8f06 Some enhancements:
- Enable the llm_cache storage to support get_by_mode_and_id, to improve performance when using a real KV server
- Provide an option for developers to cache the LLM response when extracting entities for a document. This addresses the pain point that when processing fails partway, the chunks already processed have to be sent to the LLM again, wasting money and time. With the new option enabled (it is off by default), those results are cached, which can save significant time and money for beginners.
2025-01-06 12:50:05 +08:00
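The caching idea described in commit 6ae27d8f06 can be sketched roughly as follows. This is a minimal illustration, not the project's implementation: `cached_llm_call`, `fake_llm`, and the dict-based cache are hypothetical names, and the real code is asynchronous and backed by a KV storage class.

```python
import hashlib
from typing import Callable, Dict, List


def cached_llm_call(prompt: str, llm: Callable[[str], str], cache: Dict[str, str]) -> str:
    """Return the cached response for this prompt, calling the LLM only on a miss."""
    # Key the cache on a hash of the prompt (an assumption made for this sketch).
    key = hashlib.sha256(prompt.encode("utf-8")).hexdigest()
    if key not in cache:
        cache[key] = llm(prompt)
    return cache[key]


calls: List[str] = []


def fake_llm(prompt: str) -> str:
    """Stand-in for a paid LLM call; records every invocation."""
    calls.append(prompt)
    return "entities for: " + prompt


cache: Dict[str, str] = {}
chunks = ["chunk one", "chunk two"]

# First pass pays for every chunk; a retry after a failure reuses the cache.
first_run = [cached_llm_call(c, fake_llm, cache) for c in chunks]
second_run = [cached_llm_call(c, fake_llm, cache) for c in chunks]
```

In this sketch the second pass makes no LLM calls at all, which is the saving the commit message describes for re-processing after a failure.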
zrguo
6d7a01f29d Merge pull request #542 from xiyihan0/main
Add support for selecting a specified Neo4j database
2025-01-05 12:46:45 +08:00
xiyihan
1e3b25db22 Add support for selecting a specified Neo4j database (fix lint) 2025-01-04 22:33:35 +08:00
xiyihan
bb4c271623 Add support for selecting a specified Neo4j database 2025-01-04 21:47:52 +08:00
Samuel Chan
6c1b669f0f Fix the lint issue 2025-01-04 18:49:32 +08:00
Samuel Chan
e053223ef0 Fix the lint issue 2025-01-04 18:34:35 +08:00
Samuel Chan
f6f62c32a8 Fix the bug of AGE processing 2025-01-03 21:10:06 +08:00
Samuel Chan
b17cb2aa95 With a draft for postgres_impl 2025-01-01 22:43:59 +08:00
Magic_yuan
650b8e38b7 feat(lightrag): Add document status tracking and checkpoint support

- Add DocStatus enum and DocProcessingStatus class for document processing state management
- Implement JsonDocStatusStorage for persistent status storage
- Add document-level deduplication in batch processing
- Add checkpoint support in ainsert method for resumable document processing
- Add status query methods for monitoring processing progress
- Update LightRAG initialization to support document status tracking
2024-12-28 00:11:25 +08:00
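A rough sketch of how status tracking, deduplication, and resumable inserts from commit 650b8e38b7 could fit together. Only the name `DocStatus` comes from the commit message; its member values, `doc_id`, `insert_batch`, and the plain dict used as the status store are assumptions for illustration, and the real `ainsert` is asynchronous.

```python
import hashlib
from enum import Enum
from typing import Callable, Dict, List


class DocStatus(str, Enum):
    # Member names and values are assumed for this sketch.
    PENDING = "pending"
    PROCESSING = "processing"
    PROCESSED = "processed"
    FAILED = "failed"


def doc_id(content: str) -> str:
    """Derive a stable ID from the content so identical documents collide."""
    return "doc-" + hashlib.sha256(content.strip().encode("utf-8")).hexdigest()


def insert_batch(docs: List[str], status_store: Dict[str, DocStatus],
                 process: Callable[[str], None]) -> List[str]:
    """Process each unique, not-yet-processed document; return the IDs completed now."""
    unique = {doc_id(d): d for d in docs}  # document-level deduplication
    completed = []
    for did, content in unique.items():
        if status_store.get(did) == DocStatus.PROCESSED:
            continue  # checkpoint: already done in an earlier run
        status_store[did] = DocStatus.PROCESSING
        try:
            process(content)
        except Exception:
            status_store[did] = DocStatus.FAILED  # left for a later retry
            continue
        status_store[did] = DocStatus.PROCESSED
        completed.append(did)
    return completed


attempts: List[str] = []
fail_once = {"beta"}


def process(content: str) -> None:
    """Fake processing step that fails the first time it sees 'beta'."""
    attempts.append(content)
    if content in fail_once:
        fail_once.discard(content)
        raise RuntimeError("simulated failure")


store: Dict[str, DocStatus] = {}
first = insert_batch(["alpha", "beta", "alpha"], store, process)
second = insert_batch(["alpha", "beta", "alpha"], store, process)
```

On the second call only the previously failed document is retried; the duplicate and the already processed document are skipped.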
Samuel Chan
42b3690e71 Fix the async issue while running on Windows 2024-12-27 12:01:50 +08:00
Alex Potapenko
848b3f6e33 GremlinStorage: fixes and patch to support other Gremlin-compatible DBs. Tested on ArcadeDB with Gremlin plugin. The main change is using "entity_name" vertex property instead of the label as a node_id since different implementations have different restrictions on label names. 2024-12-23 16:16:17 +01:00
Alex Potapenko
016d9f572d GremlinStorage: fix linting error, use asyncio.gather in get_node_edges() 2024-12-20 09:57:35 +01:00
Alex Potapenko
6f71293c83 Add Gremlin graph storage 2024-12-19 17:47:42 +01:00
Weaxs
344d8f277b support TiDBGraphStorage 2024-12-18 10:57:33 +08:00
Alex Potapenko
a44aceab1e AGEStorage: fix has_node() and has_edge() 2024-12-14 13:37:34 +01:00
Alex Potapenko
25d6939ca3 AGEStorage: all queries as multiline strings for consistency 2024-12-14 12:49:39 +01:00
Alex Potapenko
05c73bcfd1 AGEStorage: fix debug logging, and some cosmetics 2024-12-14 12:47:29 +01:00
Alex Potapenko
eb2e6d0d87 AGEStorage: fix has_edge(): AGE always returns field names in lower case 2024-12-14 12:24:45 +01:00
Alex Potapenko
7564841450 Add Apache AGE graph storage 2024-12-13 20:41:38 +01:00
LarFii
b7a2d336e6 Update __version__ 2024-12-13 20:15:49 +08:00
billvsme
a788c78197 keep tqdm_async work 2024-12-13 16:48:22 +08:00
billvsme
cf0278ca43 The asyncio.as_completed() function does not guarantee that the results are ordered 2024-12-13 15:42:41 +08:00
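The ordering point in commit cf0278ca43 can be demonstrated with a small standalone script. This is an illustrative sketch rather than code from the repository; `work` and the delays are made up. `asyncio.as_completed()` yields results in completion order, while `asyncio.gather()` returns them in the order the awaitables were passed.

```python
import asyncio


async def work(label: str, delay: float) -> str:
    """Pretend task that finishes after `delay` seconds."""
    await asyncio.sleep(delay)
    return label


async def main():
    jobs = [("slow", 0.2), ("fast", 0.01), ("medium", 0.1)]

    # as_completed: results arrive as tasks finish, not in submission order.
    tasks = [asyncio.create_task(work(label, delay)) for label, delay in jobs]
    completed = [await fut for fut in asyncio.as_completed(tasks)]

    # gather: results are returned in the order the awaitables were passed.
    gathered = await asyncio.gather(*(work(label, delay) for label, delay in jobs))
    return completed, list(gathered)


completed_order, gathered_order = asyncio.run(main())
print(completed_order)
print(gathered_order)
```

Code that needs results aligned with its inputs should therefore use `asyncio.gather()` or carry an index alongside each result.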
Weaxs
288985eab4 pre-commit fix tidb 2024-12-12 10:22:31 +08:00
Weaxs
8ef5a6b8cd support TiDB: add TiDBKVStorage, TiDBVectorDBStorage 2024-12-11 16:23:50 +08:00
Pankaj Kaushal
ca788463cc feat: Add ChromaDB integration for vector storage
- Implemented `ChromaVectorDBStorage` class in `lightrag/kg/chroma_impl.py` to support ChromaDB as a vector storage backend.
- Updated `lightrag.py` to include `ChromaVectorDBStorage` in the storage class mapping.
- Added a test script `test_chromadb.py` to demonstrate the usage of ChromaDB with LightRAG, including configuration for embedding functions and ChromaDB connection settings.
- fix lazy import function to support package context for dynamic class loading.
  288d4b8355
2024-12-10 16:23:05 +01:00
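The last bullet of commit ca788463cc mentions a lazy import that supports package context. A minimal sketch of that pattern, assuming it relies on the `package` argument of `importlib.import_module`; the helper name `lazy_import_class` is hypothetical, and the standard-library `json` package stands in for a storage backend module.

```python
import importlib
from typing import Any, Callable, Optional


def lazy_import_class(module_name: str, class_name: str,
                      package: Optional[str] = None) -> Callable[..., Any]:
    """Return a factory that imports the module only when first called."""
    def factory(*args: Any, **kwargs: Any) -> Any:
        # A relative name such as ".decoder" is resolved against `package`.
        module = importlib.import_module(module_name, package=package)
        cls = getattr(module, class_name)
        return cls(*args, **kwargs)
    return factory


# Nothing is imported here; the import happens inside make_decoder().
make_decoder = lazy_import_class(".decoder", "JSONDecoder", package="json")
decoded = make_decoder().decode('{"backend": "chroma"}')
```

Deferring the import this way means an optional dependency only has to be installed when its backend is actually selected.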
Suroy
979f30d20b Update oracle_impl.py
Fixed typing error in python3.9
2024-12-06 11:06:20 +08:00
partoneplay
d8ba7c57f3 Add MongoDB as KV storage 2024-12-05 13:57:43 +08:00
partoneplay
052322b213 Add Milvus as vector storage 2024-12-05 08:48:41 +08:00
AdityaKalraShorthillsAI
7734b81911 Added embedding_func argument in Neo4JStorage class's __init__ method in neo4j implementation 2024-12-02 02:44:47 +05:30
jin
adfaa75c00 Update oracle_impl.py 2024-11-25 14:21:01 +08:00
jin
776ba2f2ce Update oracle_impl.py 2024-11-25 14:15:10 +08:00
jin
26ae240c65 Logic Optimization 2024-11-25 13:40:38 +08:00
jin
1dbe803521 Merge branch 'main' of https://github.com/jin38324/LightRAG 2024-11-25 13:32:33 +08:00
jin
89c2de54a2 Optimization logic 2024-11-25 13:29:55 +08:00
lzd
9bf140d7cf Remove the unused synchronous close function defined in Neo4JStorage.has_edge 2024-11-22 13:59:14 +08:00
LarFii
ce7f524174 Update 2024-11-19 16:52:26 +08:00