Commit Graph

1205 Commits

Author SHA1 Message Date
Samuel Chan
1998a5b204 Merge remote-tracking branch 'origin/main'
# Conflicts:
#	README.md
2025-01-11 10:40:09 +08:00
Samuel Chan
d03d6f5fc5 Revised the Postgres implementation to use attributes(node_id) rather than nodes to identify an entity, which significantly reduced the table count. 2025-01-11 09:30:19 +08:00
Saifeddine ALOUI
224fce9b1b run precommit to fix linting issues 2025-01-11 01:37:07 +01:00
Saifeddine ALOUI
e0e656ab01 Added ssl support 2025-01-11 01:35:49 +01:00
Saifeddine ALOUI
e21fbef60b updated documentation 2025-01-10 22:38:57 +01:00
Saifeddine ALOUI
a619b01064 Next test of timeout 2025-01-10 22:17:13 +01:00
Saifeddine ALOUI
ab3cc3f0f4 fixed missing comma 2025-01-10 21:39:41 +01:00
Saifeddine ALOUI
adb288c5bb added timeout 2025-01-10 21:39:25 +01:00
Saifeddine ALOUI
2297007b7b Simplified the api services issue #565 2025-01-10 20:30:58 +01:00
jin
85331e3fa2 update Oracle support
add cache support, fix bug
2025-01-10 11:36:28 +08:00
zrguo
a65f0026ff Merge pull request #560 from AdiKalra/main
Add custom chunking function.
2025-01-09 21:33:55 +08:00
adikalra
acde4ed173 Add custom chunking function. 2025-01-09 17:20:24 +05:30
zrguo
92b7e331a5 Merge pull request #557 from tongshiyuan/main
Optimize the character-only split parameter
2025-01-09 15:29:35 +08:00
zrguo
b93203804c Merge branch 'main' into main 2025-01-09 15:28:57 +08:00
zrguo
92ccfa2770 Merge pull request #555 from ParisNeo/main
Restore backwards compatibility for LightRAG's ainsert method
2025-01-09 15:27:09 +08:00
zrguo
7973f46b73 Merge pull request #553 from GurjotSinghShorthillsAI/custom-chunking-feature
Implement custom chunking feature
2025-01-09 15:26:36 +08:00
童石渊
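dd213c95be Add a character-only split parameter: if enabled, split by character only; if disabled, any chunk that is still too large after character splitting is further split by token size. Update the test file. 2025-01-09 11:55:49 +08:00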
Saifeddine ALOUI
65c1450c66 fixed retro compatibility with ainsert by making split_by_character get a None default value 2025-01-08 20:50:22 +01:00
Gurjot Singh
9565a4663a Fix trailing whitespace and formatting issues in lightrag.py 2025-01-09 00:39:22 +05:30
zrguo
9e7784ab8a Update README.md 2025-01-08 18:17:32 +08:00
Gurjot Singh
a940251390 Implement custom chunking feature 2025-01-07 20:57:39 +05:30
zrguo
6c78c96854 fix linting errors 2025-01-07 22:02:34 +08:00
zrguo
fe7f7086b1 Merge pull request #547 from n3A87/main
Fix:Optimized logic for automatic switching modes when keywords do not exist
2025-01-07 21:51:51 +08:00
zrguo
18a9a780b7 Merge pull request #551 from tongshiyuan/main
Add character-based splitting for chunks
2025-01-07 16:38:04 +08:00
童石渊
b8f7a0a1e7 Merge remote-tracking branch 'origin/main' 2025-01-07 16:27:13 +08:00
童石渊
6b19401dc6 chunk split retry 2025-01-07 16:26:12 +08:00
LarFii
79d7050710 fix linting errors 2025-01-07 16:21:54 +08:00
LarFii
9ef4fe667a rename 2025-01-07 16:18:19 +08:00
zrguo
059e3882f1 Merge branch 'main' into main 2025-01-07 16:14:56 +08:00
LarFii
290744d770 fix requirements.txt 2025-01-07 16:04:46 +08:00
jin
957bcf8659 Organize files
move some test files from root to example
2025-01-07 13:51:20 +08:00
童石渊
3bbd3ee1b2 On Mac, torch~=2.5.1+cu121 causes an error during local installation 2025-01-07 13:45:18 +08:00
Samuel Chan
196350b75b Revise the readme to fix the broken link. 2025-01-07 07:02:37 +08:00
童石渊
9662ecc102 Merge remote-tracking branch 'origin'
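Add character-based splitting: if the split_by_character parameter is passed to the "insert" function, the text is split on split_by_character; any resulting chunk whose token count exceeds max_token_size is then further split by token_size (todo: consider how to handle chunks that are too short after character splitting)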
2025-01-07 00:43:00 +08:00
童石渊
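536d6f2283 Add character-based splitting: if the split_by_character parameter is passed to the "insert" function, the text is split on split_by_character; any resulting chunk whose token count exceeds max_token_size is then further split by token_size (todo: consider how to handle chunks that are too short after character splitting) 2025-01-07 00:28:15 +08:00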
✨Data Intelligence Lab@HKU✨
22e9f1cd89 Update README.md 2025-01-06 23:21:02 +08:00
✨Data Intelligence Lab@HKU✨
e415f88bd4 Update README.md 2025-01-06 23:20:26 +08:00
xYLiuuuuuu
79646fced8 Fix:Optimized logic for automatic switching modes when keywords do not exist 2025-01-06 16:54:53 +08:00
zrguo
39a366a3dc Update __init__.py 2025-01-06 15:43:05 +08:00
zrguo
916380e511 Update README.md 2025-01-06 15:39:44 +08:00
zrguo
e2a4819af9 Update README.md 2025-01-06 15:37:37 +08:00
zrguo
c88bb4fd53 Update LICENSE 2025-01-06 15:34:50 +08:00
zrguo
990b684a85 Update lightrag.py 2025-01-06 15:27:31 +08:00
zrguo
12f4a45440 Merge pull request #545 from ShanGor/main
Enhance the llm_cache_kv_store, enable the llm_cache for entity extraction and revise readme
2025-01-06 15:24:34 +08:00
Samuel Chan
85d35b16d4 Merge branch 'HKUDS:main' into main 2025-01-06 12:53:06 +08:00
Samuel Chan
6ae27d8f06 Some enhancements:
- Enable the llm_cache storage to support get_by_mode_and_id, to improve performance when using a real KV server
- Provide an option for developers to cache the LLM response when extracting entities for a document. This addresses the pain point that when the process fails partway, the already-processed chunks must be sent to the LLM again, wasting money and time. With the new option enabled (it is off by default), that result is cached, which can save significant time and money for beginners.
2025-01-06 12:50:05 +08:00
zrguo
6d7a01f29d Merge pull request #542 from xiyihan0/main
Add support for selecting a specific Neo4j database
2025-01-05 12:46:45 +08:00
xiyihan
1e3b25db22 Add support for selecting a specific Neo4j database (fix lint) 2025-01-04 22:33:35 +08:00
zrguo
6b5f4755e7 Merge pull request #539 from ShanGor/main
Added PostgreSQL implementation
2025-01-04 22:13:06 +08:00
xiyihan
bb4c271623 Add support for selecting a specific Neo4j database 2025-01-04 21:47:52 +08:00
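
Note on the character-splitting commits (536d6f2283, dd213c95be): the behaviour their messages describe can be illustrated with a minimal sketch. This is not the repository's code. The parameter names split_by_character and max_token_size come from the commit messages; the function names chunk_text and split_by_tokens, the flag name split_by_character_only, and the whitespace tokenizer (a stand-in for a real tokenizer) are assumptions made only for illustration.

```python
from typing import Callable, List, Optional

# Hypothetical tokenizer type: maps text to a list of tokens.
Tokenizer = Callable[[str], List[str]]


def split_by_tokens(text: str, max_token_size: int, tokenize: Tokenizer) -> List[str]:
    """Cut text into pieces of at most max_token_size tokens."""
    tokens = tokenize(text)
    return [
        " ".join(tokens[i : i + max_token_size])
        for i in range(0, len(tokens), max_token_size)
    ]


def chunk_text(
    text: str,
    split_by_character: Optional[str] = None,
    split_by_character_only: bool = False,  # assumed name for the character-only switch
    max_token_size: int = 1024,
    tokenize: Tokenizer = str.split,  # assumption: whitespace tokens
) -> List[str]:
    # No separator given: plain token-size chunking.
    if split_by_character is None:
        return split_by_tokens(text, max_token_size, tokenize)

    chunks: List[str] = []
    for piece in text.split(split_by_character):
        piece = piece.strip()
        if not piece:
            continue  # skip empty pieces between consecutive separators
        if split_by_character_only or len(tokenize(piece)) <= max_token_size:
            chunks.append(piece)
        else:
            # Piece is still too large: fall back to token-size splitting.
            chunks.extend(split_by_tokens(piece, max_token_size, tokenize))
    return chunks


if __name__ == "__main__":
    sample = "a b c\nd e f g h\ni"
    print(chunk_text(sample, "\n", max_token_size=3))
    print(chunk_text(sample, "\n", split_by_character_only=True, max_token_size=3))
```

With the character-only switch off, an oversized piece is re-split by token count; with it on, pieces are kept exactly as the separator produced them. Handling of very short pieces is left open here, as it is in the commit's todo.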