Commit Graph

2283 Commits

Author SHA1 Message Date
ArnoChen
f50604b2d3 improve conditional checks for db instance 2025-02-19 04:53:15 +08:00
ArnoChen
ae7a850d4e create mongodb vector index only if not exists 2025-02-19 04:37:43 +08:00
ArnoChen
6d8e627f85 improve MongoDB client management and storage init 2025-02-19 04:30:52 +08:00
Yannick Stephan
29582fdcbd Merge pull request #847 from YanSte/clean-up
Cleaned typing
2025-02-18 21:18:16 +01:00
Yannick Stephan
aa4a5cea7c cleaned typing 2025-02-18 21:16:52 +01:00
ArnoChen
7a970451b9 fix examples 2025-02-19 04:12:16 +08:00
Yannick Stephan
55cd900e8e clean comments and unused libs 2025-02-18 21:12:06 +01:00
ArnoChen
e194e04226 refactor database connection management and improve storage lifecycle handling
update
2025-02-19 03:49:05 +08:00
Yannick Stephan
011227e426 cleanup no required install 2025-02-18 20:27:59 +01:00
Yannick Stephan
ea41d08b9f removed torch from requirement lightrag server 2025-02-18 20:05:51 +01:00
Yannick Stephan
2524e02428 remove tqdm and cleaned readme and ollama 2025-02-18 19:58:03 +01:00
Yannick Stephan
24ae083284 removed never used method 2025-02-18 19:38:04 +01:00
yangdx
8196df83f8 Fix linting 2025-02-19 01:39:25 +08:00
yangdx
d34dbc5717 Improved document enqueue logic with existence checks.
- Added return status to `apipeline_enqueue_documents`
- Enhanced logging for duplicate documents
2025-02-19 01:35:34 +08:00
yangdx
cac93424d9 Added support for reading .xlsx files in LightRAG.
- Install openpyxl if not present
- Load .xlsx file using openpyxl
- Extract sheet titles and content
- Format rows with tab-separated values
- Append sheet content to overall text
2025-02-19 01:34:51 +08:00
yangdx
9c45824e78 Optimize file handling for DOCX and PPTX processing.
- Removed redundant file content reading.
- Directly passed file to BytesIO.
- Simplified DOCX content extraction.
- Streamlined PPTX slide processing.
- Reduced memory usage in file handling.
2025-02-19 01:34:25 +08:00
Yannick Stephan
161baa6f08 pm bs4 when ollama 2025-02-18 17:11:17 +01:00
Yannick Stephan
780d0b45f7 Merge pull request #841 from YanSte/cleanup
cleanup code
2025-02-18 16:58:53 +01:00
Yannick Stephan
46e1865b98 cleanup code 2025-02-18 16:58:11 +01:00
Yannick Stephan
99dc4859a9 Merge pull request #837 from ParisNeo/main
Fixes and Enhancements for PostgreSQL and JSON Document Storage
2025-02-18 16:46:13 +01:00
Saifeddine ALOUI
f7ef4c7ee8 Update postgres_impl.py 2025-02-18 16:10:26 +01:00
Saifeddine ALOUI
8d043c599d Update LightRagWithPostGRESQL.md 2025-02-18 15:50:00 +01:00
Yannick Stephan
e1e00a897c Merge pull request #833 from danielaskdd/import-as-needed
Implement dynamic database module imports
2025-02-18 15:13:33 +01:00
Saifeddine ALOUI
a780d7a87e Merge branch 'HKUDS:main' into main 2025-02-18 14:42:20 +01:00
yangdx
9156656298 Revert "Update Python version requirement from 3.10 to 3.11 because of StrEnum"
This reverts commit cce4e8b9b3.
2025-02-18 20:34:56 +08:00
yangdx
4e215ed4dd Revert "Update Python version requirement from 3.10 to 3.11"
This reverts commit 8008d04728.
2025-02-18 20:34:33 +08:00
yangdx
8008d04728 Update Python version requirement from 3.10 to 3.11 2025-02-18 20:25:01 +08:00
Yannick Stephan
bc190816ce Update LightRagWithPostGRESQL.md 2025-02-18 13:13:20 +01:00
yangdx
cce4e8b9b3 Update Python version requirement from 3.10 to 3.11 because of StrEnum 2025-02-18 19:40:54 +08:00
yangdx
89c01c686f Fix casting dateime it to str in needed
- Added datetime formatting utility and standardized timestamp handling in DocStatusResponse
2025-02-18 17:46:28 +08:00
Saifeddine ALOUI
b4e0c47607 Update postgres_impl.py 2025-02-18 10:29:07 +01:00
Saifeddine ALOUI
fec7889458 Update postgres_impl.py 2025-02-18 10:27:38 +01:00
Saifeddine ALOUI
2bbbfe50c6 Merge branch 'HKUDS:main' into main 2025-02-18 10:25:31 +01:00
Saifeddine ALOUI
1fe47e5ef1 Update json_doc_status_impl.py 2025-02-18 10:24:54 +01:00
Saifeddine ALOUI
9b9f989160 Update postgres_impl.py 2025-02-18 10:24:19 +01:00
Yannick Stephan
00704aa76a Merge pull request #836 from YanSte/drop
fixed drop
2025-02-18 10:22:37 +01:00
Saifeddine ALOUI
20909e495b Added drop 2025-02-18 10:22:16 +01:00
Yannick Stephan
0994d478f0 cleaned code 2025-02-18 10:21:54 +01:00
Yannick Stephan
fc0cf2934e fixed drop 2025-02-18 10:21:14 +01:00
Saifeddine ALOUI
a15d164d87 Update postgres_impl.py 2025-02-18 10:16:00 +01:00
Saifeddine ALOUI
941c89521c Debug 2025-02-18 10:12:08 +01:00
Saifeddine ALOUI
becf76a528 Debuggging 2025-02-18 10:07:57 +01:00
Saifeddine ALOUI
8f6f4c249a Added more drops 2025-02-18 10:01:21 +01:00
Saifeddine ALOUI
f02c881fd3 Added drop for Doc status 2025-02-18 09:57:10 +01:00
Saifeddine ALOUI
6a0366cb24 Fixed indentation bug 2025-02-18 09:39:10 +01:00
yangdx
5acd054180 Fix linting 2025-02-18 16:33:57 +08:00
yangdx
75ee4592b8 refactor: Implement dynamic database module imports
- Consolidate database instance management
- Improve database management and error handling
- Enhance error handling and logging
2025-02-18 16:14:11 +08:00
Saifeddine ALOUI
f11900451a Addded drop to postgresql 2025-02-18 09:10:50 +01:00
Yannick Stephan
49b1fc9bdf Merge pull request #832 from YanSte/keywords
Fixed keywords
2025-02-18 09:10:26 +01:00
Yannick Stephan
613b748dfa cleaned code 2025-02-18 09:09:12 +01:00