Commit Graph

118 Commits

Author SHA1 Message Date
Samuel Chan
c016934021 Revise the AGE implementation on get_node_edges, to align with Neo4j behavior. 2025-01-12 21:38:39 +08:00
Samuel Chan
d03d6f5fc5 Revised the postgres implementation, to use attributes(node_id) rather than nodes to identify an entity. Which significantly reduced the table counts. 2025-01-11 09:30:19 +08:00
jin
85331e3fa2 update Oracle support
add cache support, fix bug
2025-01-10 11:36:28 +08:00
Samuel Chan
85d35b16d4 Merge branch 'HKUDS:main' into main 2025-01-06 12:53:06 +08:00
Samuel Chan
6ae27d8f06 Some enhancements:
- Enable the llm_cache storage to support get_by_mode_and_id, to improve the performance for using real KV server
- Provide an option for the developers to cache the LLM response when extracting entities for a document. Solving the paint point that sometimes the process failed, the processed chunks we need to call LLM again, money and time wasted. With the new option (by default not enabled) enabling, we can cache that result, can significantly save the time and money for beginners.
2025-01-06 12:50:05 +08:00
zrguo
6d7a01f29d Merge pull request #542 from xiyihan0/main
添加选取Neo4j指定数据库功能的支持
2025-01-05 12:46:45 +08:00
xiyihan
1e3b25db22 添加选取Neo4j指定数据库功能的支持(fix lint) 2025-01-04 22:33:35 +08:00
xiyihan
bb4c271623 添加选取Neo4j指定数据库功能的支持 2025-01-04 21:47:52 +08:00
Samuel Chan
6c1b669f0f Fix the lint issue 2025-01-04 18:49:32 +08:00
Samuel Chan
e053223ef0 Fix the lint issue 2025-01-04 18:34:35 +08:00
Samuel Chan
f6f62c32a8 Fix the bug of AGE processing 2025-01-03 21:10:06 +08:00
Samuel Chan
b17cb2aa95 With a draft for progres_impl 2025-01-01 22:43:59 +08:00
Magic_yuan
650b8e38b7 feat(lightrag): Add document status tracking and checkpoint support
功能(lightrag): 添加文档状态跟踪和断点续传支持

- Add DocStatus enum and DocProcessingStatus class for document processing state management
- 添加 DocStatus 枚举和 DocProcessingStatus 类用于文档处理状态管理

- Implement JsonDocStatusStorage for persistent status storage
- 实现 JsonDocStatusStorage 用于持久化状态存储

- Add document-level deduplication in batch processing
- 在批处理中添加文档级别的去重功能

- Add checkpoint support in ainsert method for resumable document processing
- 在 ainsert 方法中添加断点续传支持,实现可恢复的文档处理

- Add status query methods for monitoring processing progress
- 添加状态查询方法用于监控处理进度

- Update LightRAG initialization to support document status tracking
- 更新 LightRAG 初始化以支持文档状态跟踪
2024-12-28 00:11:25 +08:00
Samuel Chan
42b3690e71 Fix the async issue while running on Windows 2024-12-27 12:01:50 +08:00
Alex Potapenko
848b3f6e33 GremlinStorage: fixes and patch to support other Gremlin compatible BD. Tested on ArcadeDB with Gremlin plugin. The main change is using "entity_name" vertex property instead of the label as a node_id since different implementations have different restrictions on label names. 2024-12-23 16:16:17 +01:00
Alex Potapenko
016d9f572d GremlinStorage: fix linting error, use asyncio.gather in get_node_edges() 2024-12-20 09:57:35 +01:00
Alex Potapenko
6f71293c83 Add Gremlin graph storage 2024-12-19 17:47:42 +01:00
Weaxs
344d8f277b support TiDBGraphStorage 2024-12-18 10:57:33 +08:00
Alex Potapenko
a44aceab1e AGEStorage: fix has_node() and has_edge() 2024-12-14 13:37:34 +01:00
Alex Potapenko
25d6939ca3 AGEStorage: all queries as multiline strings for consistency 2024-12-14 12:49:39 +01:00
Alex Potapenko
05c73bcfd1 AGEStorage: fix debug logging, and some cosmetics 2024-12-14 12:47:29 +01:00
Alex Potapenko
eb2e6d0d87 AGEStorage: fix has_edge(): AGE always returns field names in lower case 2024-12-14 12:24:45 +01:00
Alex Potapenko
7564841450 Add Apache AGE graph storage 2024-12-13 20:41:38 +01:00
LarFii
b7a2d336e6 Update __version__ 2024-12-13 20:15:49 +08:00
billvsme
a788c78197 keep tqdm_async work 2024-12-13 16:48:22 +08:00
billvsme
cf0278ca43 The asyncio.as_completed() function does not guarantee that the results are ordered 2024-12-13 15:42:41 +08:00
Weaxs
288985eab4 pre-commit fix tidb 2024-12-12 10:22:31 +08:00
Weaxs
8ef5a6b8cd support TiDB: add TiDBKVStorage, TiDBVectorDBStorage 2024-12-11 16:23:50 +08:00
Pankaj Kaushal
ca788463cc feat: Add ChromaDB integration for vector storage
- Implemented `ChromaVectorDBStorage` class in `lightrag/kg/chroma_impl.py` to support ChromaDB as a vector storage backend.
- Updated `lightrag.py` to include `ChromaVectorDBStorage` in the storage class mapping.
- Added a test script `test_chromadb.py` to demonstrate the usage of ChromaDB with LightRAG, including configuration for embedding functions and ChromaDB connection settings.
- fix lazy import function to support package context for dynamic class loading.
  288d4b8355
2024-12-10 16:23:05 +01:00
Suroy
979f30d20b Update oracle_impl.py
Fixed typing error in python3.9
2024-12-06 11:06:20 +08:00
partoneplay
d8ba7c57f3 Add MongoDB as KV storage 2024-12-05 13:57:43 +08:00
partoneplay
052322b213 Add Milvus as vector storage 2024-12-05 08:48:41 +08:00
AdityaKalraShorthillsAI
7734b81911 Added embedding_func argument in Neo4JStorage class's __init__ method in neo4j implementation 2024-12-02 02:44:47 +05:30
jin
adfaa75c00 Update oracle_impl.py 2024-11-25 14:21:01 +08:00
jin
776ba2f2ce Update oracle_impl.py 2024-11-25 14:15:10 +08:00
jin
26ae240c65 Logic Optimization 2024-11-25 13:40:38 +08:00
jin
1dbe803521 Merge branch 'main' of https://github.com/jin38324/LightRAG 2024-11-25 13:32:33 +08:00
jin
89c2de54a2 Optimization logic 2024-11-25 13:29:55 +08:00
lzd
9bf140d7cf 删除Neo4JStorage.has_edge中定义的没有用到的同步close函数 2024-11-22 13:59:14 +08:00
LarFii
ce7f524174 Update 2024-11-19 16:52:26 +08:00
zrguo
eec08661a2 Merge pull request #286 from Sucran/main
fix neo4j bug
2024-11-19 15:25:49 +08:00
tmuife
e5f2aa3a30 use pre-commit reformat 2024-11-18 13:52:49 +08:00
tmuife
1123ccfbd0 change the type of binding parameters in Oracle23AI 2024-11-18 10:00:06 +08:00
Richard
6bdf693b85 fix neo4j bug 2024-11-15 13:11:43 +08:00
jin
662303f605 use oracle bind variables to avoid error 2024-11-15 12:57:01 +08:00
jin
41599897fb fix pre commit 2024-11-12 13:32:40 +08:00
jin
cc45d802a6 Update oracle_impl.py 2024-11-12 13:23:03 +08:00
jin
f4ee85034f Update oracle_impl.py 2024-11-12 13:03:03 +08:00
jin
77123be2a1 Update oracle_impl.py 2024-11-12 12:02:24 +08:00
jin
8bc5d4efff add Oracle support 2024-11-12 09:59:12 +08:00